Collins, Richard A; Stajich, Jason E; Field, Deborah J; Olive, Joan E; DeAbreu, Diane M
2015-05-01
When we expressed a small (0.9 kb) nonprotein-coding transcript derived from the mitochondrial VS plasmid in the nucleus of Neurospora we found that it was efficiently spliced at one or more of eight 5' splice sites and ten 3' splice sites, which are present apparently by chance in the sequence. Further experimental and bioinformatic analyses of other mitochondrial plasmids, random sequences, and natural nuclear genes in Neurospora and other fungi indicate that fungal spliceosomes recognize a wide range of 5' splice site and branchpoint sequences and predict introns to be present at high frequency in random sequence. In contrast, analysis of intronless fungal nuclear genes indicates that branchpoint, 5' splice site and 3' splice site consensus sequences are underrepresented compared with random sequences. This underrepresentation of splicing signals is sufficient to deplete the nuclear genome of splice sites at locations that do not comprise biologically relevant introns. Thus, the splicing machinery can recognize a wide range of splicing signal sequences, but splicing still occurs with great accuracy, not because the splicing machinery distinguishes correct from incorrect introns, but because incorrect introns are substantially depleted from the genome. © 2015 Collins et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Multiple splicing defects in an intronic false exon.
Sun, H; Chasin, L A
2000-09-01
Splice site consensus sequences alone are insufficient to dictate the recognition of real constitutive splice sites within the typically large transcripts of higher eukaryotes, and large numbers of pseudoexons flanked by pseudosplice sites with good matches to the consensus sequences can be easily designated. In an attempt to identify elements that prevent pseudoexon splicing, we have systematically altered known splicing signals, as well as immediately adjacent flanking sequences, of an arbitrarily chosen pseudoexon from intron 1 of the human hprt gene. The substitution of a 5' splice site that perfectly matches the 5' consensus combined with mutation to match the CAG/G sequence of the 3' consensus failed to get this model pseudoexon included as the central exon in a dhfr minigene context. Provision of a real 3' splice site and a consensus 5' splice site and removal of an upstream inhibitory sequence were necessary and sufficient to confer splicing on the pseudoexon. This activated context also supported the splicing of a second pseudoexon sequence containing no apparent enhancer. Thus, both the 5' splice site sequence and the polypyrimidine tract of the pseudoexon are defective despite their good agreement with the consensus. On the other hand, the pseudoexon body did not exert a negative influence on splicing. The introduction into the pseudoexon of a sequence selected for binding to ASF/SF2 or its replacement with beta-globin exon 2 only partially reversed the effect of the upstream negative element and the defective polypyrimidine tract. These results support the idea that exon-bridging enhancers are not a prerequisite for constitutive exon definition and suggest that intrinsically defective splice sites and negative elements play important roles in distinguishing the real splicing signal from the vast number of false splicing signals.
Suh, E R; Waring, R B
1990-01-01
It has been proposed that recognition of the 3' splice site in many group I introns involves base pairing between the start of the 3' exon and a region of the intron known as the internal guide sequence (R. W. Davies, R. B. Waring, J. Ray, T. A. Brown, and C. Scazzocchio, Nature [London] 300:719-724, 1982). We have examined this hypothesis, using the self-splicing rRNA intron from Tetrahymena thermophila. Mutations in the 3' exon that weaken this proposed pairing increased use of a downstream cryptic 3' splice site. Compensatory mutations in the guide sequence that restore this pairing resulted in even stronger selection of the normal 3' splice site. These changes in 3' splice site usage were more pronounced in the background of a mutation (414A) which resulted in an adenine instead of a guanine being the last base of the intron. These results show that the proposed pairing (P10) plays an important role in ensuring that cryptic 3' splice sites are selected against. Surprisingly, the 414A mutation alone did not result in activation of the cryptic 3' splice site. Images PMID:2342465
Widespread alternative and aberrant splicing revealed by lariat sequencing
Stepankiw, Nicholas; Raghavan, Madhura; Fogarty, Elizabeth A.; Grimson, Andrew; Pleiss, Jeffrey A.
2015-01-01
Alternative splicing is an important and ancient feature of eukaryotic gene structure, the existence of which has likely facilitated eukaryotic proteome expansions. Here, we have used intron lariat sequencing to generate a comprehensive profile of splicing events in Schizosaccharomyces pombe, amongst the simplest organisms that possess mammalian-like splice site degeneracy. We reveal an unprecedented level of alternative splicing, including alternative splice site selection for over half of all annotated introns, hundreds of novel exon-skipping events, and thousands of novel introns. Moreover, the frequency of these events is far higher than previous estimates, with alternative splice sites on average activated at ∼3% the rate of canonical sites. Although a subset of alternative sites are conserved in related species, implying functional potential, the majority are not detectably conserved. Interestingly, the rate of aberrant splicing is inversely related to expression level, with lowly expressed genes more prone to erroneous splicing. Although we validate many events with RNAseq, the proportion of alternative splicing discovered with lariat sequencing is far greater, a difference we attribute to preferential decay of aberrantly spliced transcripts. Together, these data suggest the spliceosome possesses far lower fidelity than previously appreciated, highlighting the potential contributions of alternative splicing in generating novel gene structures. PMID:26261211
Human Splice-Site Prediction with Deep Neural Networks.
Naito, Tatsuhiko
2018-04-18
Accurate splice-site prediction is essential to delineate gene structures from sequence data. Several computational techniques have been applied to create a system to predict canonical splice sites. For classification tasks, deep neural networks (DNNs) have achieved record-breaking results and often outperformed other supervised learning techniques. In this study, a new method of splice-site prediction using DNNs was proposed. The proposed system receives an input sequence data and returns an answer as to whether it is splice site. The length of input is 140 nucleotides, with the consensus sequence (i.e., "GT" and "AG" for the donor and acceptor sites, respectively) in the middle. Each input sequence model is applied to the pretrained DNN model that determines the probability that an input is a splice site. The model consists of convolutional layers and bidirectional long short-term memory network layers. The pretraining and validation were conducted using the data set tested in previously reported methods. The performance evaluation results showed that the proposed method can outperform the previous methods. In addition, the pattern learned by the DNNs was visualized as position frequency matrices (PFMs). Some of PFMs were very similar to the consensus sequence. The trained DNN model and the brief source code for the prediction system are uploaded. Further improvement will be achieved following the further development of DNNs.
SpliceRover: Interpretable Convolutional Neural: Networks for Improved Splice Site Prediction.
Zuallaert, Jasper; Godin, Fréderic; Kim, Mijung; Soete, Arne; Saeys, Yvan; De Neve, Wesley
2018-06-21
During the last decade, improvements in high-throughput sequencing have generated a wealth of genomic data. Functionally interpreting these sequences and finding the biological signals that are hallmarks of gene function and regulation is currently mostly done using automated genome annotation platforms, which mainly rely on integrated machine learning frameworks to identify different functional sites of interest, including splice sites. Splicing is an essential step in the gene regulation process, and the correct identification of splice sites is a major cornerstone in a genome annotation system. In this paper, we present SpliceRover, a predictive deep learning approach that outperforms the state-of-the-art in splice site prediction. SpliceRover uses convolutional neural networks (CNNs), which have been shown to obtain cutting edge performance on a wide variety of prediction tasks. We adapted this approach to deal with genomic sequence inputs, and show it consistently outperforms already existing approaches, with relative improvements in prediction effectiveness of up to 80.9% when measured in terms of false discovery rate. However, a major criticism of CNNs concerns their "black box" nature, as mechanisms to obtain insight into their reasoning processes are limited. To facilitate interpretability of the SpliceRover models, we introduce an approach to visualize the biologically relevant information learnt. We show that our visualization approach is able to recover features known to be important for splice site prediction (binding motifs around the splice site, presence of polypyrimidine tracts and branch points), as well as reveal new features (e.g., several types of exclusion patterns near splice sites). SpliceRover is available as a web service. The prediction tool and instructions can be found at http://bioit2.irc.ugent.be/splicerover/. Supplementary materials are available at Bioinformatics online.
A 5′ Splice Site-Proximal Enhancer Binds SF1 and Activates Exon Bridging of a Microexon
Carlo, Troy; Sierra, Rebecca; Berget, Susan M.
2000-01-01
Internal exon size in vertebrates occurs over a narrow size range. Experimentally, exons shorter than 50 nucleotides are poorly included in mRNA unless accompanied by strengthened splice sites or accessory sequences that act as splicing enhancers, suggesting steric interference between snRNPs and other splicing factors binding simultaneously to the 3′ and 5′ splice sites of microexons. Despite these problems, very small naturally occurring exons exist. Here we studied the factors and mechanism involved in recognizing a constitutively included six-nucleotide exon from the cardiac troponin T gene. Inclusion of this exon is dependent on an enhancer located downstream of the 5′ splice site. This enhancer contains six copies of the simple sequence GGGGCUG. The enhancer activates heterologous microexons and will work when located either upstream or downstream of the target exon, suggesting an ability to bind factors that bridge splicing units. A single copy of this sequence is sufficient for in vivo exon inclusion and is the binding site for the known bridging mammalian splicing factor 1 (SF1). The enhancer and its bound SF1 act to increase recognition of the upstream exon during exon definition, such that competition of in vitro reactions with RNAs containing the GGGGCUG repeated sequence depress splicing of the upstream intron, assembly of the spliceosome on the 3′ splice site of the exon, and cross-linking of SF1. These results suggest a model in which SF1 bridges the small exon during initial assembly, thereby effectively extending the domain of the exon. PMID:10805741
Rogan, P K; Schneider, T D
1995-01-01
Predicting the effects of nucleotide substitutions in human splice sites has been based on analysis of consensus sequences. We used a graphic representation of sequence conservation and base frequency, the sequence logo, to demonstrate that a change in a splice acceptor of hMSH2 (a gene associated with familial nonpolyposis colon cancer) probably does not reduce splicing efficiency. This confirms a population genetic study that suggested that this substitution is a genetic polymorphism. The information theory-based sequence logo is quantitative and more sensitive than the corresponding splice acceptor consensus sequence for detection of true mutations. Information analysis may potentially be used to distinguish polymorphisms from mutations in other types of transcriptional, translational, or protein-coding motifs.
[Deregulation of pre-messenger RNA splicing and rare diseases].
de la Grange, Pierre
2016-12-01
Most of protein-coding human genes are subjected to alternative pre-mRNA splicing. This mechanism is highly regulated to precisely modulate detection of specific splice sites. This regulation is under control of the spliceosome and several splicing factors are also required to modulate the alternative usage of splice sites. Splicing factors and spliceosome components recognize splicing signals and regulatory sequences of the pre-mRNAs. These splicing sequences make splicing susceptible to polymorphisms and mutations. Examples of associations between human rare diseases and defects in pre-messenger RNA splicing are accumulating. Although many alterations are caused by mutations in splicing sequence (i.e., cis acting mutations), recent studies described the disruptive impact of mutations within spliceosome components or splicing factors (i.e., trans acting mutations). Following growing of knowledge regarding splicing regulation, several approaches have been developed to compensate for the effect of deleterious mutations and to restore sufficient amounts of functional protein. © 2016 médecine/sciences – Inserm.
Human Splicing Finder: an online bioinformatics tool to predict splicing signals.
Desmet, François-Olivier; Hamroun, Dalil; Lalande, Marine; Collod-Béroud, Gwenaëlle; Claustres, Mireille; Béroud, Christophe
2009-05-01
Thousands of mutations are identified yearly. Although many directly affect protein expression, an increasing proportion of mutations is now believed to influence mRNA splicing. They mostly affect existing splice sites, but synonymous, non-synonymous or nonsense mutations can also create or disrupt splice sites or auxiliary cis-splicing sequences. To facilitate the analysis of the different mutations, we designed Human Splicing Finder (HSF), a tool to predict the effects of mutations on splicing signals or to identify splicing motifs in any human sequence. It contains all available matrices for auxiliary sequence prediction as well as new ones for binding sites of the 9G8 and Tra2-beta Serine-Arginine proteins and the hnRNP A1 ribonucleoprotein. We also developed new Position Weight Matrices to assess the strength of 5' and 3' splice sites and branch points. We evaluated HSF efficiency using a set of 83 intronic and 35 exonic mutations known to result in splicing defects. We showed that the mutation effect was correctly predicted in almost all cases. HSF could thus represent a valuable resource for research, diagnostic and therapeutic (e.g. therapeutic exon skipping) purposes as well as for global studies, such as the GEN2PHEN European Project or the Human Variome Project.
Human Splicing Finder: an online bioinformatics tool to predict splicing signals
Desmet, François-Olivier; Hamroun, Dalil; Lalande, Marine; Collod-Béroud, Gwenaëlle; Claustres, Mireille; Béroud, Christophe
2009-01-01
Thousands of mutations are identified yearly. Although many directly affect protein expression, an increasing proportion of mutations is now believed to influence mRNA splicing. They mostly affect existing splice sites, but synonymous, non-synonymous or nonsense mutations can also create or disrupt splice sites or auxiliary cis-splicing sequences. To facilitate the analysis of the different mutations, we designed Human Splicing Finder (HSF), a tool to predict the effects of mutations on splicing signals or to identify splicing motifs in any human sequence. It contains all available matrices for auxiliary sequence prediction as well as new ones for binding sites of the 9G8 and Tra2-β Serine-Arginine proteins and the hnRNP A1 ribonucleoprotein. We also developed new Position Weight Matrices to assess the strength of 5′ and 3′ splice sites and branch points. We evaluated HSF efficiency using a set of 83 intronic and 35 exonic mutations known to result in splicing defects. We showed that the mutation effect was correctly predicted in almost all cases. HSF could thus represent a valuable resource for research, diagnostic and therapeutic (e.g. therapeutic exon skipping) purposes as well as for global studies, such as the GEN2PHEN European Project or the Human Variome Project. PMID:19339519
Conservation of CD44 exon v3 functional elements in mammals
Vela, Elena; Hilari, Josep M; Delclaux, María; Fernández-Bellon, Hugo; Isamat, Marcos
2008-01-01
Background The human CD44 gene contains 10 variable exons (v1 to v10) that can be alternatively spliced to generate hundreds of different CD44 protein isoforms. Human CD44 variable exon v3 inclusion in the final mRNA depends on a multisite bipartite splicing enhancer located within the exon itself, which we have recently described, and provides the protein domain responsible for growth factor binding to CD44. Findings We have analyzed the sequence of CD44v3 in 95 mammalian species to report high conservation levels for both its splicing regulatory elements (the 3' splice site and the exonic splicing enhancer), and the functional glycosaminglycan binding site coded by v3. We also report the functional expression of CD44v3 isoforms in peripheral blood cells of different mammalian taxa with both consensus and variant v3 sequences. Conclusion CD44v3 mammalian sequences maintain all functional splicing regulatory elements as well as the GAG binding site with the same relative positions and sequence identity previously described during alternative splicing of human CD44. The sequence within the GAG attachment site, which in turn contains the Y motif of the exonic splicing enhancer, is more conserved relative to the rest of exon. Amplification of CD44v3 sequence from mammalian species but not from birds, fish or reptiles, may lead to classify CD44v3 as an exclusive mammalian gene trait. PMID:18710510
Spinelli, Roberta; Pirola, Alessandra; Redaelli, Sara; Sharma, Nitesh; Raman, Hima; Valletta, Simona; Magistroni, Vera; Piazza, Rocco; Gambacorti-Passerini, Carlo
2013-11-01
Point mutations in intronic regions near mRNA splice junctions can affect the splicing process. To identify novel splicing variants from exome sequencing data, we developed a bioinformatics splice-site prediction procedure to analyze next-generation sequencing (NGS) data (SpliceFinder). SpliceFinder integrates two functional annotation tools for NGS, ANNOVAR and MutationTaster and two canonical splice site prediction programs for single mutation analysis, SSPNN and NetGene2. By SpliceFinder, we identified somatic mutations affecting RNA splicing in a colon cancer sample, in eight atypical chronic myeloid leukemia (aCML), and eight CML patients. A novel homozygous splicing mutation was found in APC (NM_000038.4:c.1312+5G>A) and six heterozygous in GNAQ (NM_002072.2:c.735+1C>T), ABCC 3 (NM_003786.3:c.1783-1G>A), KLHDC 1 (NM_172193.1:c.568-2A>G), HOOK 1 (NM_015888.4:c.1662-1G>A), SMAD 9 (NM_001127217.2:c.1004-1C>T), and DNAH 9 (NM_001372.3:c.10242+5G>A). Integrating whole-exome and RNA sequencing in aCML and CML, we assessed the phenotypic effect of mutations on mRNA splicing for GNAQ, ABCC 3, HOOK 1. In ABCC 3 and HOOK 1, RNA-Seq showed the presence of aberrant transcripts with activation of a cryptic splice site or intron retention, validated by the reverse transcription-polymerase chain reaction (RT-PCR) in the case of HOOK 1. In GNAQ, RNA-Seq showed 22% of wild-type transcript and 78% of mRNA skipping exon 5, resulting in a 4-6 frameshift fusion confirmed by RT-PCR. The pipeline can be useful to identify intronic variants affecting RNA sequence by complementing conventional exome analysis.
Splicing predictions reliably classify different types of alternative splicing
Busch, Anke; Hertel, Klemens J.
2015-01-01
Alternative splicing is a key player in the creation of complex mammalian transcriptomes and its misregulation is associated with many human diseases. Multiple mRNA isoforms are generated from most human genes, a process mediated by the interplay of various RNA signature elements and trans-acting factors that guide spliceosomal assembly and intron removal. Here, we introduce a splicing predictor that evaluates hundreds of RNA features simultaneously to successfully differentiate between exons that are constitutively spliced, exons that undergo alternative 5′ or 3′ splice-site selection, and alternative cassette-type exons. Surprisingly, the splicing predictor did not feature strong discriminatory contributions from binding sites for known splicing regulators. Rather, the ability of an exon to be involved in one or multiple types of alternative splicing is dictated by its immediate sequence context, mainly driven by the identity of the exon's splice sites, the conservation around them, and its exon/intron architecture. Thus, the splicing behavior of human exons can be reliably predicted based on basic RNA sequence elements. PMID:25805853
Katz, R A; Kotler, M; Skalka, A M
1988-01-01
The full-length retroviral RNA transcript serves as (i) mRNA for the gag and pol gene products, (ii) genomic RNA that is assembled into progeny virions, and (iii) a pre-mRNA for spliced subgenomic mRNAs. Therefore, a balance of spliced and unspliced RNA is required to generate the appropriate levels of protein and RNA products for virion production. We have introduced an insertion mutation near the avian sarcoma virus env splice acceptor site that results in a significant increase in splicing to form functional env mRNA. The mutant virus is replication defective, but phenotypic revertant viruses that have acquired second-site mutations near the splice acceptor site can be isolated readily. Detailed analysis of one of these viruses revealed that a single nucleotide change at -20 from the splice acceptor site, within the original mutagenic insert, was sufficient to restore viral growth and significantly decrease splicing efficiency compared with the original mutant and wild-type viruses. Thus, minor sequence alterations near the env splice acceptor site can produce major changes in the balance of spliced and unspliced RNAs. Our results suggest a mechanism of control in which splicing is modulated by cis-acting sequences at the env splice acceptor site. Furthermore, this retroviral system provides a powerful genetic method for selection and analysis of mutations that affect splicing control. Images PMID:2839694
iSS-PseDNC: identifying splicing sites using pseudo dinucleotide composition.
Chen, Wei; Feng, Peng-Mian; Lin, Hao; Chou, Kuo-Chen
2014-01-01
In eukaryotic genes, exons are generally interrupted by introns. Accurately removing introns and joining exons together are essential processes in eukaryotic gene expression. With the avalanche of genome sequences generated in the postgenomic age, it is highly desired to develop automated methods for rapid and effective detection of splice sites that play important roles in gene structure annotation and even in RNA splicing. Although a series of computational methods were proposed for splice site identification, most of them neglected the intrinsic local structural properties. In the present study, a predictor called "iSS-PseDNC" was developed for identifying splice sites. In the new predictor, the sequences were formulated by a novel feature-vector called "pseudo dinucleotide composition" (PseDNC) into which six DNA local structural properties were incorporated. It was observed by the rigorous cross-validation tests on two benchmark datasets that the overall success rates achieved by iSS-PseDNC in identifying splice donor site and splice acceptor site were 85.45% and 87.73%, respectively. It is anticipated that iSS-PseDNC may become a useful tool for identifying splice sites and that the six DNA local structural properties described in this paper may provide novel insights for in-depth investigations into the mechanism of RNA splicing.
Designing oligo libraries taking alternative splicing into account
NASA Astrophysics Data System (ADS)
Shoshan, Avi; Grebinskiy, Vladimir; Magen, Avner; Scolnicov, Ariel; Fink, Eyal; Lehavi, David; Wasserman, Alon
2001-06-01
We have designed sequences for DNA microarrays and oligo libraries, taking alternative splicing into account. Alternative splicing is a common phenomenon, occurring in more than 25% of the human genes. In many cases, different splice variants have different functions, are expressed in different tissues or may indicate different stages of disease. When designing sequences for DNA microarrays or oligo libraries, it is very important to take into account the sequence information of all the mRNA transcripts. Therefore, when a gene has more than one transcript (as a result of alternative splicing, alternative promoter sites or alternative poly-adenylation sites), it is very important to take all of them into account in the design. We have used the LEADS transcriptome prediction system to cluster and assemble the human sequences in GenBank and design optimal oligonucleotides for all the human genes with a known mRNA sequence based on the LEADS predictions.
Bonizzoni, Paola; Rizzi, Raffaella; Pesole, Graziano
2005-10-05
Currently available methods to predict splice sites are mainly based on the independent and progressive alignment of transcript data (mostly ESTs) to the genomic sequence. Apart from often being computationally expensive, this approach is vulnerable to several problems--hence the need to develop novel strategies. We propose a method, based on a novel multiple genome-EST alignment algorithm, for the detection of splice sites. To avoid limitations of splice sites prediction (mainly, over-predictions) due to independent single EST alignments to the genomic sequence our approach performs a multiple alignment of transcript data to the genomic sequence based on the combined analysis of all available data. We recast the problem of predicting constitutive and alternative splicing as an optimization problem, where the optimal multiple transcript alignment minimizes the number of exons and hence of splice site observations. We have implemented a splice site predictor based on this algorithm in the software tool ASPIC (Alternative Splicing PredICtion). It is distinguished from other methods based on BLAST-like tools by the incorporation of entirely new ad hoc procedures for accurate and computationally efficient transcript alignment and adopts dynamic programming for the refinement of intron boundaries. ASPIC also provides the minimal set of non-mergeable transcript isoforms compatible with the detected splicing events. The ASPIC web resource is dynamically interconnected with the Ensembl and Unigene databases and also implements an upload facility. Extensive bench marking shows that ASPIC outperforms other existing methods in the detection of novel splicing isoforms and in the minimization of over-predictions. ASPIC also requires a lower computation time for processing a single gene and an EST cluster. The ASPIC web resource is available at http://aspic.algo.disco.unimib.it/aspic-devel/.
SplicingTypesAnno: annotating and quantifying alternative splicing events for RNA-Seq data.
Sun, Xiaoyong; Zuo, Fenghua; Ru, Yuanbin; Guo, Jiqiang; Yan, Xiaoyan; Sablok, Gaurav
2015-04-01
Alternative splicing plays a key role in the regulation of the central dogma. Four major types of alternative splicing have been classified as intron retention, exon skipping, alternative 5 splice sites or alternative donor sites, and alternative 3 splice sites or alternative acceptor sites. A few algorithms have been developed to detect splice junctions from RNA-Seq reads. However, there are few tools targeting at the major alternative splicing types at the exon/intron level. This type of analysis may reveal subtle, yet important events of alternative splicing, and thus help gain deeper understanding of the mechanism of alternative splicing. This paper describes a user-friendly R package, extracting, annotating and analyzing alternative splicing types for sequence alignment files from RNA-Seq. SplicingTypesAnno can: (1) provide annotation for major alternative splicing at exon/intron level. By comparing the annotation from GTF/GFF file, it identifies the novel alternative splicing sites; (2) offer a convenient two-level analysis: genome-scale annotation for users with high performance computing environment, and gene-scale annotation for users with personal computers; (3) generate a user-friendly web report and additional BED files for IGV visualization. SplicingTypesAnno is a user-friendly R package for extracting, annotating and analyzing alternative splicing types at exon/intron level for sequence alignment files from RNA-Seq. It is publically available at https://sourceforge.net/projects/splicingtypes/files/ or http://genome.sdau.edu.cn/research/software/SplicingTypesAnno.html. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Spinelli, Roberta; Pirola, Alessandra; Redaelli, Sara; Sharma, Nitesh; Raman, Hima; Valletta, Simona; Magistroni, Vera; Piazza, Rocco; Gambacorti-Passerini, Carlo
2013-01-01
Point mutations in intronic regions near mRNA splice junctions can affect the splicing process. To identify novel splicing variants from exome sequencing data, we developed a bioinformatics splice-site prediction procedure to analyze next-generation sequencing (NGS) data (SpliceFinder). SpliceFinder integrates two functional annotation tools for NGS, ANNOVAR and MutationTaster and two canonical splice site prediction programs for single mutation analysis, SSPNN and NetGene2. By SpliceFinder, we identified somatic mutations affecting RNA splicing in a colon cancer sample, in eight atypical chronic myeloid leukemia (aCML), and eight CML patients. A novel homozygous splicing mutation was found in APC (NM_000038.4:c.1312+5G>A) and six heterozygous in GNAQ (NM_002072.2:c.735+1C>T), ABCC3 (NM_003786.3:c.1783-1G>A), KLHDC1 (NM_172193.1:c.568-2A>G), HOOK1 (NM_015888.4:c.1662-1G>A), SMAD9 (NM_001127217.2:c.1004-1C>T), and DNAH9 (NM_001372.3:c.10242+5G>A). Integrating whole-exome and RNA sequencing in aCML and CML, we assessed the phenotypic effect of mutations on mRNA splicing for GNAQ, ABCC3, HOOK1. In ABCC3 and HOOK1, RNA-Seq showed the presence of aberrant transcripts with activation of a cryptic splice site or intron retention, validated by the reverse transcription-polymerase chain reaction (RT-PCR) in the case of HOOK1. In GNAQ, RNA-Seq showed 22% of wild-type transcript and 78% of mRNA skipping exon 5, resulting in a 4–6 frameshift fusion confirmed by RT-PCR. The pipeline can be useful to identify intronic variants affecting RNA sequence by complementing conventional exome analysis. PMID:24498620
DOE Office of Scientific and Technical Information (OSTI.GOV)
Willing, M.; Deschenes, S.
We have identified a G to A substitution in the 5{prime} donor splice site of intron 18 of one COL1A1 allele in two unrelated families with osteogenesis imperfecta (OI) type I. A third OI type I family has a G to A substitution at the identical position in intron 48 of one COL1A1 allele. Both mutations abolish normal splicing and lead to reduced steady-state levels of mRNA from the mutant COL1A1 allele. The intron 18 mutation leads to both exon 18 skipping in the mRNA and to utilization of a single alternative splice site near the 3{prime} end of exonmore » 18. The latter results in deletion of the last 8 nucleotides of exon 18 from the mRNA, a shift in the translational reading-frame, and the creation of a premature termination codon in exon 19. Of the potential alternative 5{prime} splice sites in exon 18 and intron 18, the one utilized has a surrounding nucleotide sequence which most closely resembles that of the natural splice site. Although a G to A mutation was detected at the identical position in intron 48 of one COL1A1 allele in another OI type I family, nine complex alternative splicing patterns were identified by sequence analysis of cDNA clones derived from fibroblast mRNA from this cell strain. All result in partial or complete skipping of exon 48, with in-frame deletions of portions of exons 47 and/or 49. The different patterns of RNA splicing were not explained by their sequence homology with naturally occuring 5{prime} splice sites, but rather by recombination between highly homologous exon sequences, suggesting that we may not have identified the major splicing alternative(s) in this cell strain. Both G to A mutations result in decreased production of type I collagen, the common biochemical correlate of OI type I.« less
Kawarai, Toshitaka; Miyamoto, Ryosuke; Mori, Atsuko; Oki, Ryosuke; Tsukamoto-Miyashiro, Ai; Matsui, Naoko; Miyazaki, Yoshimichi; Orlacchio, Antonio; Izumi, Yuishin; Nishida, Yoshihiko; Kaji, Ryuji
2015-12-15
We identified a novel homozygous mutation in the splice site donor (SSD) of intron 30 (c.5866+1G>A) in consanguineous Japanese SPG11 siblings showing late-onset spastic paraplegia using the whole-exome sequencing. Phenotypic variability was observed, including age-at-onset, dysarthria and pes cavus. Coding DNA sequencing revealed that the mutation affected the recognition of the constitutive SSD of intron 30, splicing upstream onto a nearby cryptic SSD in exon 30. The use of constitutive splice sites of intron 29 was confirmed by sequencing. The mutant transcripts are mostly subject to degradation by the nonsense-mediated mRNA decay system. SPG11 transcripts, escaping from the nonsense-mediated mRNA decay pathway, would generate a truncated protein (p.Tyr1900Phefs5X) containing the first 1899 amino acids and followed by 4 aberrant amino acids. This study showed a successful clinical application of whole-exome sequencing in spastic paraplegia and demonstrated a further evidence of allelic heterogeneity in SPG11. The confirmation of aberrant transcript by splice site mutation is a prerequisite for a more precise molecular diagnosis. Copyright © 2015 Elsevier B.V. All rights reserved.
The Human Splicing Factor ASF/SF2 can Specifically Recognize Pre-mRNA 5' Splice Sites
NASA Astrophysics Data System (ADS)
Zuo, Ping; Manley, James L.
1994-04-01
ASF/SF2 is a human protein previously shown to function in in vitro pre-mRNA splicing as an essential factor necessary for all splices and also as an alternative splicing factor, capable of switching selection of 5' splice sites. To begin to study the protein's mechanism of action, we have investigated the RNA binding properties of purified recombinant ASF/SF2. Using UV crosslinking and gel shift assays, we demonstrate that the RNA binding region of ASF/SF2 can interact with RNA in a sequence-specific manner, recognizing the 5' splice site in each of two different pre-mRNAs. Point mutations in the 5' splice site consensus can reduce binding by as much as a factor of 100, with the largest effects observed in competition assays. These findings support a model in which ASF/SF2 aids in the recognition of pre-mRNA 5' splice sites.
Detection of Splice Sites Using Support Vector Machine
NASA Astrophysics Data System (ADS)
Varadwaj, Pritish; Purohit, Neetesh; Arora, Bhumika
Automatic identification and annotation of exon and intron region of gene, from DNA sequences has been an important research area in field of computational biology. Several approaches viz. Hidden Markov Model (HMM), Artificial Intelligence (AI) based machine learning and Digital Signal Processing (DSP) techniques have extensively and independently been used by various researchers to cater this challenging task. In this work, we propose a Support Vector Machine based kernel learning approach for detection of splice sites (the exon-intron boundary) in a gene. Electron-Ion Interaction Potential (EIIP) values of nucleotides have been used for mapping character sequences to corresponding numeric sequences. Radial Basis Function (RBF) SVM kernel is trained using EIIP numeric sequences. Furthermore this was tested on test gene dataset for detection of splice site by window (of 12 residues) shifting. Optimum values of window size, various important parameters of SVM kernel have been optimized for a better accuracy. Receiver Operating Characteristic (ROC) curves have been utilized for displaying the sensitivity rate of the classifier and results showed 94.82% accuracy for splice site detection on test dataset.
A Predictive Model of Intein Insertion Site for Use in the Engineering of Molecular Switches
Apgar, James; Ross, Mary; Zuo, Xiao; Dohle, Sarah; Sturtevant, Derek; Shen, Binzhang; de la Vega, Humberto; Lessard, Philip; Lazar, Gabor; Raab, R. Michael
2012-01-01
Inteins are intervening protein domains with self-splicing ability that can be used as molecular switches to control activity of their host protein. Successfully engineering an intein into a host protein requires identifying an insertion site that permits intein insertion and splicing while allowing for proper folding of the mature protein post-splicing. By analyzing sequence and structure based properties of native intein insertion sites we have identified four features that showed significant correlation with the location of the intein insertion sites, and therefore may be useful in predicting insertion sites in other proteins that provide native-like intein function. Three of these properties, the distance to the active site and dimer interface site, the SVM score of the splice site cassette, and the sequence conservation of the site showed statistically significant correlation and strong predictive power, with area under the curve (AUC) values of 0.79, 0.76, and 0.73 respectively, while the distance to secondary structure/loop junction showed significance but with less predictive power (AUC of 0.54). In a case study of 20 insertion sites in the XynB xylanase, two features of native insertion sites showed correlation with the splice sites and demonstrated predictive value in selecting non-native splice sites. Structural modeling of intein insertions at two sites highlighted the role that the insertion site location could play on the ability of the intein to modulate activity of the host protein. These findings can be used to enrich the selection of insertion sites capable of supporting intein splicing and hosting an intein switch. PMID:22649521
Crotti, Lia; Lewandowska, Marzena A; Schwartz, Peter J; Insolia, Roberto; Pedrazzini, Matteo; Bussani, Erica; Dagradi, Federica; George, Alfred L; Pagani, Franco
2009-02-01
Genetic screening of long QT syndrome (LQTS) fails to identify disease-causing mutations in about 30% of patients. So far, molecular screening has focused mainly on coding sequence mutations or on substitutions at canonical splice sites. The purpose of this study was to explore the possibility that intronic variants not at canonical splice sites might affect splicing regulatory elements, lead to aberrant transcripts, and cause LQTS. Molecular screening was performed through DHPLC and sequence analysis. The role of the intronic mutation identified was assessed with a hybrid minigene splicing assay. A three-generation LQTS family was investigated. Molecular screening failed to identify an obvious disease-causing mutation in the coding sequences of the major LQTS genes but revealed an intronic A-to-G substitution in KCNH2 (IVS9-28A/G) cosegregating with the clinical phenotype in family members. In vitro analysis proved that the mutation disrupts the acceptor splice site definition by affecting the branch point (BP) sequence and promoting intron retention. We further demonstrated a tight functional relationship between the BP and the polypyrimidine tract, whose weakness is responsible for the pathological effect of the IVS9-28A/G mutation. We identified a novel BP mutation in KCNH2 that disrupts the intron 9 acceptor splice site definition and causes LQT2. The present finding demonstrates that intronic mutations affecting pre-mRNA processing may contribute to the failure of traditional molecular screening in identifying disease-causing mutations in LQTS subjects and offers a rationale strategy for the reduction of genotype-negative cases.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Solovyev, V.V.; Salamov, A.A.; Lawrence, C.B.
1994-12-31
Discriminant analysis is applied to the problem of recognition 5`-, internal and 3`-exons in human DNA sequences. Specific recognition functions were developed for revealing exons of particular types. The method based on a splice site prediction algorithm that uses the linear Fisher discriminant to combine the information about significant triplet frequencies of various functional parts of splice site regions and preferences of oligonucleotide in protein coding and nation regions. The accuracy of our splice site recognition function is about 97%. A discriminant function for 5`-exon prediction includes hexanucleotide composition of upstream region, triplet composition around the ATG codon, ORF codingmore » potential, donor splice site potential and composition of downstream introit region. For internal exon prediction, we combine in a discriminant function the characteristics describing the 5`- intron region, donor splice site, coding region, acceptor splice site and Y-intron region for each open reading frame flanked by GT and AG base pairs. The accuracy of precise internal exon recognition on a test set of 451 exon and 246693 pseudoexon sequences is 77% with a specificity of 79% and a level of pseudoexon ORF prediction of 99.96%. The recognition quality computed at the level of individual nucleotides is 89%, for exon sequences and 98% for intron sequences. A discriminant function for 3`-exon prediction includes octanucleolide composition of upstream nation region, triplet composition around the stop codon, ORF coding potential, acceptor splice site potential and hexanucleotide composition of downstream region. We unite these three discriminant functions in exon predicting program FEX (find exons). FEX exactly predicts 70% of 1016 exons from the test of 181 complete genes with specificity 73%, and 89% exons are exactly or partially predicted. On the average, 85% of nucleotides were predicted accurately with specificity 91%.« less
Reading the tea leaves: Dead transposon copies reveal novel host and transposon biology.
McLaughlin, Richard N
2018-03-01
Transposable elements comprise a huge portion of most animal genomes. Unlike many pathogens, these elements leave a mark of their impact via their insertion into host genomes. With proper teasing, these sequences can relay information about the evolutionary history of transposons and their hosts. In a new publication, Larson and colleagues describe a previously unappreciated density of long interspersed element-1 (LINE-1) sequences that have been spliced (LINE-1 and other reverse transcribing elements are necessarily intronless). They provide data to suggest that the retention of these potentially deleterious splice sites in LINE-1 results from the sites' overlap with an important transcription factor binding site. These spliced LINE-1s (i.e., spliced integrated retrotransposed elements [SpiREs]) lose their ability to replicate, suggesting they are evolutionary dead ends. However, the lethality of this splicing could be an efficient means of blocking continued replication of LINE-1. In this way, the record of inactive LINE-1 sequences in the human genome revealed a new, though infrequent, event in the LINE-1 replication cycle and motivates future studies to test whether splicing might be another weapon in the anti-LINE-1 arsenal of host genomes.
Fu, X Y; Colgan, J D; Manley, J L
1988-01-01
We have determined the effects of a number of mutations in the small-t antigen mRNA intron on the alternative splicing pattern of the simian virus 40 early transcript. Expansion of the distance separating the small-t pre-mRNA lariat branch point and the shared large T-small t 3' splice site from 18 to 29 nucleotides (nt) resulted in a relative enhancement of small-t splicing in vivo. This finding, coupled with the observation that large-T pre-RNA splicing in vitro was not affected by this expansion, suggests that small-t splicing is specifically constrained by a short branch point-3' splice site distance. Similarly, the distance separating the 5' splice site and branch point (48 nt) was found to be at or near a minimum for small-t splicing, because deletions in this region as small as 2 nt dramatically reduced the ratio of small-t to large-T mRNA that accumulated in transfected cells. Finally, a specific sequence within the small-t intron, encompassing the upstream branch sites used in large-T splicing, was found to be an important element in the cell-specific pattern of early alternative splicing. Substitutions within this region reduced the ratio of small-t to large-T mRNA produced in HeLa cells but had only minor effects in human 293 cells. Images PMID:2851720
Coordinated tissue-specific regulation of adjacent alternative 3′ splice sites in C. elegans
Ragle, James Matthew; Katzman, Sol; Akers, Taylor F.; Barberan-Soler, Sergio; Zahler, Alan M.
2015-01-01
Adjacent alternative 3′ splice sites, those separated by ≤18 nucleotides, provide a unique problem in the study of alternative splicing regulation; there is overlap of the cis-elements that define the adjacent sites. Identification of the intron's 3′ end depends upon sequence elements that define the branchpoint, polypyrimidine tract, and terminal AG dinucleotide. Starting with RNA-seq data from germline-enriched and somatic cell-enriched Caenorhabditis elegans samples, we identify hundreds of introns with adjacent alternative 3′ splice sites. We identify 203 events that undergo tissue-specific alternative splicing. For these, the regulation is monodirectional, with somatic cells preferring to splice at the distal 3′ splice site (furthest from the 5′ end of the intron) and germline cells showing a distinct shift toward usage of the adjacent proximal 3′ splice site (closer to the 5′ end of the intron). Splicing patterns in somatic cells follow C. elegans consensus rules of 3′ splice site definition; a short stretch of pyrimidines preceding an AG dinucleotide. Splicing in germline cells occurs at proximal 3′ splice sites that lack a preceding polypyrimidine tract, and in three instances the germline-specific site lacks the AG dinucleotide. We provide evidence that use of germline-specific proximal 3′ splice sites is conserved across Caenorhabditis species. We propose that there are differences between germline and somatic cells in the way that the basal splicing machinery functions to determine the intron terminus. PMID:25922281
Förch, Patrik; Merendino, Livia; Martínez, Concepción; Valcárcel, Juan
2003-01-01
The splicing factor U2AF(65), U2 small nuclear ribonucleoprotein particle (snRNP) auxillary factor of 65 kDa, binds to pyrimidine-rich sequences at 3' splice sites to recruit U2 snRNP to pre-mRNAs. We report that U2AF(65) can also promote the recruitment of U1 snRNP to weak 5' splice sites that are followed by uridine-rich sequences. The arginine- and serine-rich domain of U2AF(65) is critical for U1 recruitment, and we discuss the role of its RNA-RNA annealing activity in this novel function of U2AF(65). PMID:12558503
Schernthaner-Reiter, Marie Helene; Adams, David; Trivellin, Giampaolo; Ramnitz, Mary Scott; Raygada, Margarita; Golas, Gretchen; Faucz, Fabio R; Nilsson, Ola; Nella, Aikaterini A; Dileepan, Kavitha; Lodish, Maya; Lee, Paul; Tifft, Cynthia; Markello, Thomas; Gahl, William; Stratakis, Constantine A
2016-05-01
X-linked nephrogenic diabetes insipidus (NDI, OMIM#304800) is caused by mutations in the arginine vasopressin (AVP, OMIM*192340) receptor type 2 (AVPR2, OMIM*300538) gene. A 20-month-old boy and his 8-year-old brother presented with polyuria, polydipsia, and failure to thrive. Both boys demonstrated partial DDAVP (1-desamino-8-D AVP or desmopressin) responses; thus, NDI diagnosis was delayed. While routine sequencing of AVPR2 showed a potential splice site variant, it was not until exome sequencing confirmed the AVPR2 splice site variant and did not reveal any more likely candidates that the patients' diagnosis was made and proper treatment was instituted. Both patients were hemizygous for two AVPR2 variants predicted in silico to affect AVPR2 messenger RNA (mRNA) splicing. A minigene assay revealed that the novel AVPR2 c.276A>G mutation creates a novel splice acceptor site leading to 5' truncation of AVPR2 exon 2 in HEK293 human kidney cells. Both patients have been treated with high-dose DDAVP with a remarkable improvement of their symptoms and accelerated linear growth and weight gain. We present here a unique case of partial X-linked NDI due to an AVPR2 splice site mutation; patients with diabetes insipidus of unknown etiology may harbor splice site mutations that are initially underestimated in their pathogenicity on sequence analysis. • X-linked nephrogenic diabetes insipidus is caused by AVPR2 mutations, and disease severity can vary depending on the functional effect of the mutation. What is New: • We demonstrate here that a splice site mutation in AVPR2 leads to partial X-linked NDI in two brothers. • Treatment with high-dose DDAVP led to improvement of polyuria and polydipsia, weight gain, and growth.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Conrad, R.; Thomas, J.; Spieth, J.
In nematodes, the RNA products of some genes are trans-spliced to a 22-nucleotide spliced leader (SL), while the RNA products of other genes are not. In Caenorhabditis elegans, there are two SLs, Sl1 and SL2, donated by two distinct small nuclear ribonucleoprotein particles in a process functionally quite similar to nuclear intron removal. The authors demonstrate here that it is possible to convert a non-trans-spliced gene into a trans-spliced gene by placement of an intron missing only the 5[prime] splice site into the 5[prime] untranslated region. Stable transgenic strains were isolated expressing a gene in which 69 nucleotides of amore » vit-5 intron, including the 3[prime] splice site, were inserted into the 5[prime] untranslated region of a vit-2/vit-6 fusion gene. The RNA product of this gene was examined by primer extension and PCR amplification. Although the vit-2/vit-6 transgene product is not normally trans-spliced, the majority of transcripts from this altered gene were trans-spliced to SL1. They termed the region of a trans-spliced mRNA precursor between the 5[prime] end and the first 3[prime] splice site an 'outrun'. The results suggest that if a transcript begins with intronlike sequence followed by a 3[prime] splice site, this alone may constitute an outrun and be sufficient to demarcate a transcript as a trans-splice acceptor. These findings leave open the possibility that specific sequences are required to increase the efficiency of trans-splicing.« less
Qu, Wen; Cingolani, Pablo; Zeeberg, Barry R; Ruden, Douglas M
2017-01-01
Deep sequencing of cDNAs made from spliced mRNAs indicates that most coding genes in many animals and plants have pre-mRNA transcripts that are alternatively spliced. In pre-mRNAs, in addition to invariant exons that are present in almost all mature mRNA products, there are at least 6 additional types of exons, such as exons from alternative promoters or with alternative polyA sites, mutually exclusive exons, skipped exons, or exons with alternative 5' or 3' splice sites. Our bioinformatics-based hypothesis is that, in analogy to the genetic code, there is an "alternative-splicing code" in introns and flanking exon sequences, analogous to the genetic code, that directs alternative splicing of many of the 36 types of introns. In humans, we identified 42 different consensus sequences that are each present in at least 100 human introns. 37 of the 42 top consensus sequences are significantly enriched or depleted in at least one of the 36 types of introns. We further supported our hypothesis by showing that 96 out of 96 analyzed human disease mutations that affect RNA splicing, and change alternative splicing from one class to another, can be partially explained by a mutation altering a consensus sequence from one type of intron to that of another type of intron. Some of the alternative splicing consensus sequences, and presumably their small-RNA or protein targets, are evolutionarily conserved from 50 plant to animal species. We also noticed the set of introns within a gene usually share the same splicing codes, thus arguing that one sub-type of splicesosome might process all (or most) of the introns in a given gene. Our work sheds new light on a possible mechanism for generating the tremendous diversity in protein structure by alternative splicing of pre-mRNAs.
The in vivo use of alternate 3'-splice sites in group I introns.
Sellem, C H; Belcour, L
1994-04-11
Alternative splicing of group I introns has been postulated as a possible mechanism that would ensure the translation of proteins encoded into intronic open reading frames, discontinuous with the upstream exon and lacking an initiation signal. Alternate splice sites were previously depicted according to secondary structures of several group I introns. We present here strong evidence that, in the case of Podospora anserina nad 1-i4 and cox1-i7 mitochondrial introns, alternative splicing events do occur in vivo. Indeed, by PCR experiments we have detected molecules whose sequence is precisely that expected if the predicted alternate 3'-splice sites were used.
Ono, Hiroyuki; Saitsu, Hirotomo; Horikawa, Reiko; Nakashima, Shinichi; Ohkubo, Yumiko; Yanagi, Kumiko; Nakabayashi, Kazuhiko; Fukami, Maki; Fujisawa, Yasuko; Ogata, Tsutomu
2018-02-02
Although partial androgen insensitivity syndrome (PAIS) is caused by attenuated responsiveness to androgens, androgen receptor gene (AR) mutations on the coding regions and their splice sites have been identified only in <25% of patients with a diagnosis of PAIS. We performed extensive molecular studies including whole exome sequencing in a Japanese family with PAIS, identifying a deep intronic variant beyond the branch site at intron 6 of AR (NM_000044.4:c.2450-42 G > A). This variant created the splice acceptor motif that was accompanied by pyrimidine-rich sequence and two candidate branch sites. Consistent with this, reverse transcriptase (RT)-PCR experiments for cycloheximide-treated lymphoblastoid cell lines revealed a relatively large amount of aberrant mRNA produced by the newly created splice acceptor site and a relatively small amount of wildtype mRNA produced by the normal splice acceptor site. Furthermore, most of the aberrant mRNA was shown to undergo nonsense mediated decay (NMD) and, if a small amount of aberrant mRNA may have escaped NMD, such mRNA was predicted to generate a truncated AR protein missing some functional domains. These findings imply that the deep intronic mutation creating an alternative splice acceptor site resulted in the production of a relatively small amount of wildtype AR mRNA, leading to PAIS.
Ge, H; Noble, J; Colgan, J; Manley, J L
1990-01-01
We have studied splicing of the polyoma virus early region pre-mRNA in vitro. This RNA is alternatively spliced in vivo to produce mRNA encoding the large, middle-sized (MTAg), and small (StAg) tumor antigens. Our primary interest was to learn how the 48-nucleotide StAg intron is excised, because the length of this intron is significantly less than the apparent minimum established for mammalian introns. Although the products of all three splices are detected in vitro, characterization of the pathway and sequence requirements of StAg splicing suggests that splicing factors interact with the precursor RNA in an unexpected way to catalyze removal of this intron. Specifically, StAg splicing uses either of two lariat branch points, one of which is located only 4 nucleotides from the 3' splice site. Furthermore, the StAg splice absolutely requires that the alternative MTAg 3' splice site, located 14 nucleotides downstream of the StAg 3' splice site, be intact. Insertion mutations that increase or decrease the quality of the MTAg pyrimidine stretch enhance or repress StAg as well as MTAg splicing, and a single-base change in the MTAg AG splice acceptor totally blocks both splices. These results demonstrate the ability of two 3' splice sites to cooperate with each other to bring about removal of a single intron. Images PMID:2159146
Cryptic splice site in the complementary DNA of glucocerebrosidase causes inefficient expression.
Bukovac, Scott W; Bagshaw, Richard D; Rigat, Brigitte A; Callahan, John W; Clarke, Joe T R; Mahuran, Don J
2008-10-15
The low levels of human lysosomal glucocerebrosidase activity expressed in transiently transfected Chinese hamster ovary (CHO) cells were investigated. Reverse transcription PCR (RT-PCR) demonstrated that a significant portion of the transcribed RNA was misspliced owing to the presence of a cryptic splice site in the complementary DNA (cDNA). Missplicing results in the deletion of 179 bp of coding sequence and a premature stop codon. A repaired cDNA was constructed abolishing the splice site without changing the amino acid sequence. The level of glucocerebrosidase expression was increased sixfold. These data demonstrate that for maximum expression of any cDNA construct, the transcription products should be examined.
Regulation of alternative mRNA splicing: old players and new perspectives.
Dvinge, Heidi
2018-06-01
Nearly all human multi-exon genes are subject to alternative splicing in one or more cell types. The splicing machinery, therefore, has to select between multiple splice sites in a context-dependent manner, relying on sequence features in cis and trans-acting splicing regulators that either promote or repress splice site recognition and spliceosome assembly. However, the functional coupling between multiple gene regulatory layers signifies that splicing can also be modulated by transcriptional or epigenetic characteristics. Other, less obvious, aspects of alternative splicing have come to light in recent years, often involving core components of the spliceosome previously thought to perform a basal rather than a regulatory role in splicing. Together this paints a highly dynamic picture of splicing regulation, where the final splice site choice is governed by the entire transcriptional environment of a gene and its cellular context. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
Sharma, Neeraj; Sosnay, Patrick R.; Ramalho, Anabela S.; Douville, Christopher; Franca, Arianna; Gottschalk, Laura B.; Park, Jeenah; Lee, Melissa; Vecchio-Pagan, Briana; Raraigh, Karen S.; Amaral, Margarida D.; Karchin, Rachel; Cutting, Garry R.
2015-01-01
Assessment of the functional consequences of variants near splice sites is a major challenge in the diagnostic laboratory. To address this issue, we created expression minigenes (EMGs) to determine the RNA and protein products generated by splice site variants (n = 10) implicated in cystic fibrosis (CF). Experimental results were compared with the splicing predictions of eight in silico tools. EMGs containing the full-length Cystic Fibrosis Transmembrane Conductance Regulator (CFTR) coding sequence and flanking intron sequences generated wild-type transcript and fully processed protein in Human Embryonic Kidney (HEK293) and CF bronchial epithelial (CFBE41o-) cells. Quantification of variant induced aberrant mRNA isoforms was concordant using fragment analysis and pyrosequencing. The splicing patterns of c.1585−1G>A and c.2657+5G>A were comparable to those reported in primary cells from individuals bearing these variants. Bioinformatics predictions were consistent with experimental results for 9/10 variants (MES), 8/10 variants (NNSplice), and 7/10 variants (SSAT and Sroogle). Programs that estimate the consequences of mis-splicing predicted 11/16 (HSF and ASSEDA) and 10/16 (Fsplice and SplicePort) experimentally observed mRNA isoforms. EMGs provide a robust experimental approach for clinical interpretation of splice site variants and refinement of in silico tools. PMID:25066652
A mutational analysis of U12-dependent splice site dinucleotides
DIETRICH, ROSEMARY C.; FULLER, JOHN D.; PADGETT, RICHARD A.
2005-01-01
Introns spliced by the U12-dependent minor spliceosome are divided into two classes based on their splice site dinucleotides. The /AU-AC/ class accounts for about one-third of U12-dependent introns in humans, while the /GU-AG/ class accounts for the other two-thirds. We have investigated the in vivo and in vitro splicing phenotypes of mutations in these dinucleotide sequences. A 5′ A residue can splice to any 3′ residue, although C is preferred. A 5′ G residue can splice to 3′ G or U residues with a preference for G. Little or no splicing was observed to 3′ A or C residues. A 5′ U or C residue is highly deleterious for U12-dependent splicing, although some combinations, notably 5′ U to 3′ U produced detectable spliced products. The dependence of 3′ splice site activity on the identity of the 5′ residue provides evidence for communication between the first and last nucleotides of the intron. Most mutants in the second position of the 5′ splice site and the next to last position of the 3′ splice site were defective for splicing. Double mutants of these residues showed no evidence of communication between these nucleotides. Varying the distance between the branch site and the 3′ splice site dinucleotide in the /GU-AG/ class showed that a somewhat larger range of distances was functional than for the /AU-AC/ class. The optimum branch site to 3′ splice site distance of 11–12 nucleotides appears to be the same for both classes. PMID:16043500
Alternative Splicing as a Target for Cancer Treatment.
Martinez-Montiel, Nancy; Rosas-Murrieta, Nora Hilda; Anaya Ruiz, Maricruz; Monjaraz-Guzman, Eduardo; Martinez-Contreras, Rebeca
2018-02-11
Alternative splicing is a key mechanism determinant for gene expression in metazoan. During alternative splicing, non-coding sequences are removed to generate different mature messenger RNAs due to a combination of sequence elements and cellular factors that contribute to splicing regulation. A different combination of splicing sites, exonic or intronic sequences, mutually exclusive exons or retained introns could be selected during alternative splicing to generate different mature mRNAs that could in turn produce distinct protein products. Alternative splicing is the main source of protein diversity responsible for 90% of human gene expression, and it has recently become a hallmark for cancer with a full potential as a prognostic and therapeutic tool. Currently, more than 15,000 alternative splicing events have been associated to different aspects of cancer biology, including cell proliferation and invasion, apoptosis resistance and susceptibility to different chemotherapeutic drugs. Here, we present well established and newly discovered splicing events that occur in different cancer-related genes, their modification by several approaches and the current status of key tools developed to target alternative splicing with diagnostic and therapeutic purposes.
Identifying RNA splicing factors using IFT genes in Chlamydomonas reinhardtii.
Lin, Huawen; Zhang, Zhengyan; Iomini, Carlo; Dutcher, Susan K
2018-03-01
Intraflagellar transport moves proteins in and out of flagella/cilia and it is essential for the assembly of these organelles. Using whole-genome sequencing, we identified splice site mutations in two IFT genes, IFT81 ( fla9 ) and IFT121 ( ift121-2 ), which lead to flagellar assembly defects in the unicellular green alga Chlamydomonas reinhardtii The splicing defects in these ift mutants are partially corrected by mutations in two conserved spliceosome proteins, DGR14 and FRA10. We identified a dgr14 deletion mutant, which suppresses the 3' splice site mutation in IFT81 , and a frameshift mutant of FRA10 , which suppresses the 5' splice site mutation in IFT121 Surprisingly, we found dgr14-1 and fra10 mutations suppress both splice site mutations. We suggest these two proteins are involved in facilitating splice site recognition/interaction; in their absence some splice site mutations are tolerated. Nonsense mutations in SMG1 , which is involved in nonsense-mediated decay, lead to accumulation of aberrant transcripts and partial restoration of flagellar assembly in the ift mutants. The high density of introns and the conservation of noncore splicing factors, together with the ease of scoring the ift mutant phenotype, make Chlamydomonas an attractive organism to identify new proteins involved in splicing through suppressor screening. © 2018 The Authors.
LEDGF/p75 interacts with mRNA splicing factors and targets HIV-1 integration to highly spliced genes
Singh, Parmit Kumar; Plumb, Matthew R.; Ferris, Andrea L.; Iben, James R.; Wu, Xiaolin; Fadel, Hind J.; Luke, Brian T.; Esnault, Caroline; Poeschla, Eric M.; Hughes, Stephen H.; Kvaratskhelia, Mamuka; Levin, Henry L.
2015-01-01
The host chromatin-binding factor LEDGF/p75 interacts with HIV-1 integrase and directs integration to active transcription units. To understand how LEDGF/p75 recognizes transcription units, we sequenced 1 million HIV-1 integration sites isolated from cultured HEK293T cells. Analysis of integration sites showed that cancer genes were preferentially targeted, raising concerns about using lentivirus vectors for gene therapy. Additional analysis led to the discovery that introns and alternative splicing contributed significantly to integration site selection. These correlations were independent of transcription levels, size of transcription units, and length of the introns. Multivariate analysis with five parameters previously found to predict integration sites showed that intron density is the strongest predictor of integration density in transcription units. Analysis of previously published HIV-1 integration site data showed that integration density in transcription units in mouse embryonic fibroblasts also correlated strongly with intron number, and this correlation was absent in cells lacking LEDGF. Affinity purification showed that LEDGF/p75 is associated with a number of splicing factors, and RNA sequencing (RNA-seq) analysis of HEK293T cells lacking LEDGF/p75 or the LEDGF/p75 integrase-binding domain (IBD) showed that LEDGF/p75 contributes to splicing patterns in half of the transcription units that have alternative isoforms. Thus, LEDGF/p75 interacts with splicing factors, contributes to exon choice, and directs HIV-1 integration to transcription units that are highly spliced. PMID:26545813
An RNAi-Enhanced Logic Circuit for Cancer Specific Detection and Destruction
2013-02-01
monomeric protein secreted by Corynebacterium diphtheriae, and pro-apoptotic members of Bcl-2 family: mBax (Mus musculus), hBax ( Homo sapiens ), and its...Gata3 mStaple. Intron- feature sequences – donor site, branch point, poly- pyrimidine tract, and acceptor site – were selected based on previously...sequences found in literature our intron features were chosen according SplicePort [4], an online analyzer that detects the likelihood of splicing to
Khan, Shahid Y.; Ali, Shahbaz; Naeem, Muhammad Asif; Khan, Shaheen N.; Husnain, Tayyab; Butt, Nadeem H.; Qazi, Zaheeruddin A.; Akram, Javed; Riazuddin, Sheikh; Ayyagari, Radha; Hejtmancik, J. Fielding
2015-01-01
Purpose This study was conducted to localize and identify causal mutations associated with autosomal recessive retinitis pigmentosa (RP) in consanguineous familial cases of Pakistani origin. Methods Ophthalmic examinations that included funduscopy and electroretinography (ERG) were performed to confirm the affectation status. Blood samples were collected from all participating individuals, and genomic DNA was extracted. A genome-wide scan was performed, and two-point logarithm of odds (LOD) scores were calculated. Sanger sequencing was performed to identify the causative variants. Subsequently, we performed whole exome sequencing to rule out the possibility of a second causal variant within the linkage interval. Sequence conservation was performed with alignment analyses of PDE6A orthologs, and in silico splicing analysis was completed with Human Splicing Finder version 2.4.1. Results A large multigenerational consanguineous family diagnosed with early-onset RP was ascertained. An ophthalmic clinical examination consisting of fundus photography and electroretinography confirmed the diagnosis of RP. A genome-wide scan was performed, and suggestive two-point LOD scores were observed with markers on chromosome 5q. Haplotype analyses identified the region; however, the region did not segregate with the disease phenotype in the family. Subsequently, we performed a second genome-wide scan that excluded the entire genome except the chromosome 5q region harboring PDE6A. Next-generation whole exome sequencing identified a splice acceptor site mutation in intron 16: c.2028–1G>A, which was completely conserved in PDE6A orthologs and was absent in ethnically matched 350 control chromosomes, the 1000 Genomes database, and the NHLBI Exome Sequencing Project. Subsequently, we investigated our entire cohort of RP familial cases and identified a second family who harbored a splice acceptor site mutation in intron 10: c.1408–2A>G. In silico analysis suggested that these mutations will result in the elimination of wild-type splice acceptor sites that would result in either skipping of the respective exon or the creation of a new cryptic splice acceptor site; both possibilities would result in retinal photoreceptor cells that lack PDE6A wild-type protein. Conclusions we report two splice acceptor site variations in PDE6A in consanguineous Pakistani families who manifested cardinal symptoms of RP. Taken together with our previously published work, our data suggest that mutations in PDE6A account for about 2% of the total genetic load of RP in our cohort and possibly in the Pakistani population as well. PMID:26321862
Quantitation of normal CFTR mRNA in CF patients with splice-site mutations
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhou, Z.; Olsen, J.C.; Silverman, L.M.
Previously we identified two mutations in introns of the CFTR gene associated with partially active splice sites and unusual clinical phenotypes. One mutation in intron 19 (3849+10 kb C to T) is common in CF patients with normal sweat chloride values; an 84 bp sequence from intron 19, which contains a stop codon, is inserted between exon 19 and exon 20 in most nasal CFTR transcripts. The other mutation in intron 14B (2789+5 G to A) is associated with elevated sweat chloride levels, but mild pulmonary disease; exon 14B (38 bp) is spliced out of most nasal CFTR transcipts. Themore » remaining CFTR cDNA sequences, other than the 84 bp insertion of exon 14B deletion, are identical to the published sequence. To correlate genotype and phenotype, we used quantitative RT-PCR to determine the levels of normally-spliced CFTR mRNA in nasal epithelia from these patients. CFTR cDNA was amplified (25 cycles) by using primers specific for normally-spliced species, {gamma}-actin cDNA was amplified as a standard.« less
iSS-PC: Identifying Splicing Sites via Physical-Chemical Properties Using Deep Sparse Auto-Encoder.
Xu, Zhao-Chun; Wang, Peng; Qiu, Wang-Ren; Xiao, Xuan
2017-08-15
Gene splicing is one of the most significant biological processes in eukaryotic gene expression, such as RNA splicing, which can cause a pre-mRNA to produce one or more mature messenger RNAs containing the coded information with multiple biological functions. Thus, identifying splicing sites in DNA/RNA sequences is significant for both the bio-medical research and the discovery of new drugs. However, it is expensive and time consuming based only on experimental technique, so new computational methods are needed. To identify the splice donor sites and splice acceptor sites accurately and quickly, a deep sparse auto-encoder model with two hidden layers, called iSS-PC, was constructed based on minimum error law, in which we incorporated twelve physical-chemical properties of the dinucleotides within DNA into PseDNC to formulate given sequence samples via a battery of cross-covariance and auto-covariance transformations. In this paper, five-fold cross-validation test results based on the same benchmark data-sets indicated that the new predictor remarkably outperformed the existing prediction methods in this field. Furthermore, it is expected that many other related problems can be also studied by this approach. To implement classification accurately and quickly, an easy-to-use web-server for identifying slicing sites has been established for free access at: http://www.jci-bioinfo.cn/iSS-PC.
Diverse alternative back-splicing and alternative splicing landscape of circular RNAs
Zhang, Xiao-Ou; Dong, Rui; Zhang, Yang; Zhang, Jia-Lin; Luo, Zheng; Zhang, Jun; Chen, Ling-Ling; Yang, Li
2016-01-01
Circular RNAs (circRNAs) derived from back-spliced exons have been widely identified as being co-expressed with their linear counterparts. A single gene locus can produce multiple circRNAs through alternative back-splice site selection and/or alternative splice site selection; however, a detailed map of alternative back-splicing/splicing in circRNAs is lacking. Here, with the upgraded CIRCexplorer2 pipeline, we systematically annotated different types of alternative back-splicing and alternative splicing events in circRNAs from various cell lines. Compared with their linear cognate RNAs, circRNAs exhibited distinct patterns of alternative back-splicing and alternative splicing. Alternative back-splice site selection was correlated with the competition of putative RNA pairs across introns that bracket alternative back-splice sites. In addition, all four basic types of alternative splicing that have been identified in the (linear) mRNA process were found within circRNAs, and many exons were predominantly spliced in circRNAs. Unexpectedly, thousands of previously unannotated exons were detected in circRNAs from the examined cell lines. Although these novel exons had similar splice site strength, they were much less conserved than known exons in sequences. Finally, both alternative back-splicing and circRNA-predominant alternative splicing were highly diverse among the examined cell lines. All of the identified alternative back-splicing and alternative splicing in circRNAs are available in the CIRCpedia database (http://www.picb.ac.cn/rnomics/circpedia). Collectively, the annotation of alternative back-splicing and alternative splicing in circRNAs provides a valuable resource for depicting the complexity of circRNA biogenesis and for studying the potential functions of circRNAs in different cells. PMID:27365365
Another heritage from the RNA world: self-excision of intron sequence from nuclear pre-tRNAs.
Weber, U; Beier, H; Gross, H J
1996-06-15
The intervening sequences of nuclear tRNA precursors are known to be excised by tRNA splicing endonuclease. We show here that a T7 transcript corresponding to a pre-tRNA(Tyr) from Arabidopsis thaliana has a highly specific activity for autolytic intron excision. Self-cleavage occurs precisely at the authentic 3'-splice site and at the phosphodiester bond one nucleotide downstream of the authentic 5'-splice site. The reaction results in fragments with 2',3'-cyclic phosphate and 5'-OH termini. It is resistant to proteinase K and/or SDS treatment and is not inhibited by added tRNA. The self-cleavage depends on Mg2+ and is stimulated by spermine and Triton X-100. A set of sequence variants at the cleavage sites has been analysed for autolytic intron excision and, in parallel, for enzymatic in vitro splicing in wheat germ S23 extract. Single-stranded loops are a prerequisite for both reactions. Self-cleavage not only occurs at pyrimidine-A but also at U-U bonds. Since intron self-excision is only about five times slower than the enzymatic intron excision in a wheat germ S23 extract, we propose that the splicing endonuclease may function by improving the preciseness and efficiency of an inherent pre-tRNA self-cleavage activity.
Unusual Intron Conservation near Tissue-Regulated Exons Found by Splicing Microarrays
Sugnet, Charles W; Srinivasan, Karpagam; Clark, Tyson A; O'Brien, Georgeann; Cline, Melissa S; Wang, Hui; Williams, Alan; Kulp, David; Blume, John E; Haussler, David; Ares, Manuel
2006-01-01
Alternative splicing contributes to both gene regulation and protein diversity. To discover broad relationships between regulation of alternative splicing and sequence conservation, we applied a systems approach, using oligonucleotide microarrays designed to capture splicing information across the mouse genome. In a set of 22 adult tissues, we observe differential expression of RNA containing at least two alternative splice junctions for about 40% of the 6,216 alternative events we could detect. Statistical comparisons identify 171 cassette exons whose inclusion or skipping is different in brain relative to other tissues and another 28 exons whose splicing is different in muscle. A subset of these exons is associated with unusual blocks of intron sequence whose conservation in vertebrates rivals that of protein-coding exons. By focusing on sets of exons with similar regulatory patterns, we have identified new sequence motifs implicated in brain and muscle splicing regulation. Of note is a motif that is strikingly similar to the branchpoint consensus but is located downstream of the 5′ splice site of exons included in muscle. Analysis of three paralogous membrane-associated guanylate kinase genes reveals that each contains a paralogous tissue-regulated exon with a similar tissue inclusion pattern. While the intron sequences flanking these exons remain highly conserved among mammalian orthologs, the paralogous flanking intron sequences have diverged considerably, suggesting unusually complex evolution of the regulation of alternative splicing in multigene families. PMID:16424921
The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture.
Pai, Athma A; Henriques, Telmo; McCue, Kayla; Burkholder, Adam; Adelman, Karen; Burge, Christopher B
2017-12-27
Production of most eukaryotic mRNAs requires splicing of introns from pre-mRNA. The splicing reaction requires definition of splice sites, which are initially recognized in either intron-spanning ('intron definition') or exon-spanning ('exon definition') pairs. To understand how exon and intron length and splice site recognition mode impact splicing, we measured splicing rates genome-wide in Drosophila , using metabolic labeling/RNA sequencing and new mathematical models to estimate rates. We found that the modal intron length range of 60-70 nt represents a local maximum of splicing rates, but that much longer exon-defined introns are spliced even faster and more accurately. We observed unexpectedly low variation in splicing rates across introns in the same gene, suggesting the presence of gene-level influences, and we identified multiple gene level variables associated with splicing rate. Together our data suggest that developmental and stress response genes may have preferentially evolved exon definition in order to enhance the rate or accuracy of splicing.
The power of fission: yeast as a tool for understanding complex splicing.
Fair, Benjamin Jung; Pleiss, Jeffrey A
2017-06-01
Pre-mRNA splicing is an essential component of eukaryotic gene expression. Many metazoans, including humans, regulate alternative splicing patterns to generate expansions of their proteome from a limited number of genes. Importantly, a considerable fraction of human disease causing mutations manifest themselves through altering the sequences that shape the splicing patterns of genes. Thus, understanding the mechanistic bases of this complex pathway will be an essential component of combating these diseases. Dating almost to the initial discovery of splicing, researchers have taken advantage of the genetic tractability of budding yeast to identify the components and decipher the mechanisms of splicing. However, budding yeast lacks the complex splicing machinery and alternative splicing patterns most relevant to humans. More recently, many researchers have turned their efforts to study the fission yeast, Schizosaccharomyces pombe, which has retained many features of complex splicing, including degenerate splice site sequences, the usage of exonic splicing enhancers, and SR proteins. Here, we review recent work using fission yeast genetics to examine pre-mRNA splicing, highlighting its promise for modeling the complex splicing seen in higher eukaryotes.
Lisbin, Michael J.; Qiu, Jan; White, Kalpana
2001-01-01
Drosophila melanogaster neural-specific protein, ELAV, has been shown to regulate the neural-specific splicing of three genes: neuroglian (nrg), erect wing, and armadillo. Alternative splicing of the nrg transcript involves alternative inclusion of a 3′-terminal exon. Here, using a minigene reporter, we show that the nrg alternatively spliced intron (nASI) has all the determinants required to recreate proper neural-specific RNA processing seen with the endogenous nrg transcript, including regulation by ELAV. An in vitro UV cross-linking assay revealed that ELAV from nuclear extracts cross-links to four distinct sites along the 3200 nucleotide long nASI; one EXS is positioned at the polypyrimidine tract of the default 3′ splice site. ELAV cross-linking sites (EXSs) have in common long tracts of (U)-rich sequence rather than a precise consensus; moreover, each tract has at least two 8/10U elements; their importance is validated by mutant transgene reporter analysis. Further, we propose criteria for ELAV target sequence recognition based on the four EXSs, sites within the nASI that are (U) rich but do not cross-link with ELAV, and predicted EXSs from a phylogenetic comparison with Drosophila virilis nASI. These results suggest that ELAV regulates nrg alternative splicing by direct interaction with the nASI. PMID:11581160
Lisbin, M J; Qiu, J; White, K
2001-10-01
Drosophila melanogaster neural-specific protein, ELAV, has been shown to regulate the neural-specific splicing of three genes: neuroglian (nrg), erect wing, and armadillo. Alternative splicing of the nrg transcript involves alternative inclusion of a 3'-terminal exon. Here, using a minigene reporter, we show that the nrg alternatively spliced intron (nASI) has all the determinants required to recreate proper neural-specific RNA processing seen with the endogenous nrg transcript, including regulation by ELAV. An in vitro UV cross-linking assay revealed that ELAV from nuclear extracts cross-links to four distinct sites along the 3200 nucleotide long nASI; one EXS is positioned at the polypyrimidine tract of the default 3' splice site. ELAV cross-linking sites (EXSs) have in common long tracts of (U)-rich sequence rather than a precise consensus; moreover, each tract has at least two 8/10U elements; their importance is validated by mutant transgene reporter analysis. Further, we propose criteria for ELAV target sequence recognition based on the four EXSs, sites within the nASI that are (U) rich but do not cross-link with ELAV, and predicted EXSs from a phylogenetic comparison with Drosophila virilis nASI. These results suggest that ELAV regulates nrg alternative splicing by direct interaction with the nASI.
A Novel Subgenomic Murine Leukemia Virus RNA Transcript Results from Alternative Splicing
Déjardin, Jérôme; Bompard-Maréchal, Guillaume; Audit, Muriel; Hope, Thomas J.; Sitbon, Marc; Mougel, Marylène
2000-01-01
Here we show the existence of a novel subgenomic 4.4-kb RNA in cells infected with the prototypic replication-competent Friend or Moloney murine leukemia viruses (MuLV). This RNA derives by splicing from an alternative donor site (SD′) within the capsid-coding region to the canonical envelope splice acceptor site. The position and the sequence of SD′ was highly conserved among mammalian type C and D oncoviruses. Point mutations used to inactivate SD′ without changing the capsid-coding ability affected viral RNA splicing and reduced viral replication in infected cells. PMID:10729146
NASA Astrophysics Data System (ADS)
Shih, Shin-Ru; Nemeroff, Martin E.; Krug, Robert M.
1995-07-01
The influenza virus M1 mRNA has two alternative 5' splice sites: a distal 5' splice site producing mRNA_3 that has the coding potential for 9 amino acids and a proximal 5' splice site producing M2 mRNA encoding the essential M2 ion-channel protein. Only mRNA_3 was made in uninfected cells transfected with DNA expressing M1 mRNA. Similarly, using nuclear extracts from uninfected cells, in vitro splicing of M1 mRNA yielded only mRNA_3. Only when the mRNA_3 5' splice site was inactivated by mutation was M2 mRNA made in uninfected cells and in uninfected cell extracts. In influenza virus-infected cells, M2 mRNA was made, but only after a delay, suggesting that newly synthesized viral gene product(s) were needed to activate the M2 5' splice site. We present strong evidence that these gene products are the complex of the three polymerase proteins, the same complex that functions in the transcription and replication of the viral genome. Gel shift experiments showed that the viral polymerase complex bound to the 5' end of the viral M1 mRNA in a sequence-specific and cap-dependent manner. During in vitro splicing catalyzed by uninfected cell extracts, the binding of the viral polymerase complex blocked the mRNA_3 5' splice site, resulting in the switch to the M2 mRNA 5' splice site and the production of M2 mRNA.
Novel BRCA1 splice-site mutation in ovarian cancer patients of Slavic origin.
Krivokuca, Ana; Dragos, Vita Setrajcic; Stamatovic, Ljiljana; Blatnik, Ana; Boljevic, Ivana; Stegel, Vida; Rakobradovic, Jelena; Skerl, Petra; Jovandic, Stevo; Krajc, Mateja; Magic, Mirjana Brankovic; Novakovic, Srdjan
2018-04-01
Mutations in breast cancer susceptibility gene 1 (BRCA1) lead to defects in a number of cellular pathways including DNA damage repair and transcriptional regulation, resulting in the elevated genome instability and predisposing to breast and ovarian cancers. We report a novel mutation LRG_292t1:c.4356delA,p.(Ala1453Glnfs*3) in the 12th exon of BRCA1, in the splice site region near the donor site of intron 12. It is a frameshift mutation with the termination codon generated on the third amino acid position from the site of deletion. Human Splice Finder 3.0 and MutationTaster have assessed this variation as disease causing, based on the alteration of splicing, creation of premature stop codon and other potential alterations initiated by nucleotide deletion. Among the most important alterations are frameshift and splice site changes (score of the newly created donor splice site: 0.82). c.4356delA was associated with two ovarian cancer cases in two families of Slavic origin. It was detected by next generation sequencing, and confirmed with Sanger sequencing in both cases. Because of the fact that it changes the reading frame of the protein, novel mutation c.4356delA p.(Ala1453Glnfs*3) in BRCA1 gene might be of clinical significance for hereditary ovarian cancer. Further functional as well as segregation analyses within the families are necessary for appropriate clinical classification of this variant. Since it has been detected in two ovarian cancer patients of Slavic origin, it is worth investigating founder effect of this mutation in Slavic populations.
Sequence variation between 462 human individuals fine-tunes functional sites of RNA processing
NASA Astrophysics Data System (ADS)
Ferreira, Pedro G.; Oti, Martin; Barann, Matthias; Wieland, Thomas; Ezquina, Suzana; Friedländer, Marc R.; Rivas, Manuel A.; Esteve-Codina, Anna; Estivill, Xavier; Guigó, Roderic; Dermitzakis, Emmanouil; Antonarakis, Stylianos; Meitinger, Thomas; Strom, Tim M.; Palotie, Aarno; François Deleuze, Jean; Sudbrak, Ralf; Lerach, Hans; Gut, Ivo; Syvänen, Ann-Christine; Gyllensten, Ulf; Schreiber, Stefan; Rosenstiel, Philip; Brunner, Han; Veltman, Joris; Hoen, Peter A. C. T.; Jan van Ommen, Gert; Carracedo, Angel; Brazma, Alvis; Flicek, Paul; Cambon-Thomsen, Anne; Mangion, Jonathan; Bentley, David; Hamosh, Ada; Rosenstiel, Philip; Strom, Tim M.; Lappalainen, Tuuli; Guigó, Roderic; Sammeth, Michael
2016-09-01
Recent advances in the cost-efficiency of sequencing technologies enabled the combined DNA- and RNA-sequencing of human individuals at the population-scale, making genome-wide investigations of the inter-individual genetic impact on gene expression viable. Employing mRNA-sequencing data from the Geuvadis Project and genome sequencing data from the 1000 Genomes Project we show that the computational analysis of DNA sequences around splice sites and poly-A signals is able to explain several observations in the phenotype data. In contrast to widespread assessments of statistically significant associations between DNA polymorphisms and quantitative traits, we developed a computational tool to pinpoint the molecular mechanisms by which genetic markers drive variation in RNA-processing, cataloguing and classifying alleles that change the affinity of core RNA elements to their recognizing factors. The in silico models we employ further suggest RNA editing can moonlight as a splicing-modulator, albeit less frequently than genomic sequence diversity. Beyond existing annotations, we demonstrate that the ultra-high resolution of RNA-Seq combined from 462 individuals also provides evidence for thousands of bona fide novel elements of RNA processing—alternative splice sites, introns, and cleavage sites—which are often rare and lowly expressed but in other characteristics similar to their annotated counterparts.
Graveley, Brenton R.
2008-01-01
Summary Drosophila Dscam encodes 38,016 distinct axon guidance receptors through the mutually exclusive alternative splicing of 95 variable exons. Importantly, known mechanisms that ensure the mutually exclusive splicing of pairs of exons cannot explain this phenomenon in Dscam. I have identified two classes of conserved elements in the Dscam exon 6 cluster, which contains 48 alternative exons—the docking site, located in the intron downstream of constitutive exon 5, and the selector sequences, which are located upstream of each exon 6 variant. Strikingly, each selector sequence is complementary to a portion of the docking site, and this pairing juxtaposes one, and only one, alternative exon to the upstream constitutive exon. The mutually exclusive nature of the docking site:selector sequence interactions suggests that the formation of these competing RNA structures is a central component of the mechanism guaranteeing that only one exon 6 variant is included in each Dscam mRNA. PMID:16213213
Xu, Dong-Qing; Mattox, William
2006-01-01
Exonic splicing enhancers (ESEs) are sequences that facilitate recognition of splice sites and prevent exon-skipping. Because ESEs are often embedded within proteincoding sequences, alterations in them can also often be interpreted as nonsense, missense or silent mutations. To correctly interpret exonic mutations and their roles in disease, it is important to develop strategies that identify ESE mutations. Potential ESEs can be found computationally in many exons but it has proven difficult to predict if a given mutation will have effects on splicing based on sequence alone. Here we describe a flexible in vitro method that can be used to functionally compare the effects of multiple sequence variants on ESE activity in a single in vitro splicing reaction. We have applied this method in parallel with conventional splicing assays to test for a splicing enhancer in exon 17 of the human MLH1 gene. Point mutations associated with hereditary nonpolyposis colorectal cancer (HNPCC) have previously been found to correlate with exon-skipping in both lymphocytes and tumors from patients. We show that sequences from this exon can replace an ESE from the mouse IgM gene to support RNA splicing in HeLa nuclear extracts. ESE activity was reduced by HNPCC point mutations in codon 659 indicating that their primary effect is on splicing. Surprisingly the strongest enhancer function mapped to a different region of the exon upstream of this codon. Together our results indicate that HNPCC point mutations in codon 659 affect an auxillary element that augments the enhancer function to ensure exon inclusion. PMID:16357104
Chee, Gab-Joo; Takami, Hideto
2011-01-01
Group II introns inserted into genes often undergo splicing at unexpected sites, and participate in the transcription of host genes. We identified five copies of a group II intron, designated Oi.Int, in the genome of an extremely halotolerant and alkaliphilic bacillus, Oceanobacillus iheyensis. The Oi.Int4 differs from the Oi.Int3 at four bases. The ligated exons of the Oi.Int4 could not be detected by RT-PCR assays in vivo or in vitro although group II introns can generally self-splice in vitro without the involvement of an intron-encoded open reading frame (ORF). In the Oi.Int4 mutants with base substitutions within the ORF, ligated exons were detected by in vitro self-splicing. It was clear that the ligation of exons during splicing is affected by the sequence of the intron-encoded ORF since the splice sites corresponded to the joining sites of the intron. In addition, the mutant introns showed unexpected multiple products with alternative 5' splice sites. These findings imply that alternative 5' splicing which causes a functional change of ligated exons presumably has influenced past adaptations of O. iheyensis to various environmental changes.
New Splice Site Acceptor Mutation in AIRE Gene in Autoimmune Polyendocrine Syndrome Type 1
Mora, Mireia; Hanzu, Felicia A.; Pradas-Juni, Marta; Aranda, Gloria B.; Halperin, Irene; Puig-Domingo, Manuel; Aguiló, Sira; Fernández-Rebollo, Eduardo
2014-01-01
Autoimmune polyglandular syndrome type 1 (APS-1, OMIM 240300) is a rare autosomal recessive disorder, characterized by the presence of at least two of three major diseases: hypoparathyroidism, Addison’s disease, and chronic mucocutaneous candidiasis. We aim to identify the molecular defects and investigate the clinical and mutational characteristics in an index case and other members of a consanguineous family. We identified a novel homozygous mutation in the splice site acceptor (SSA) of intron 5 (c.653-1G>A) in two siblings with different clinical outcomes of APS-1. Coding DNA sequencing revealed that this AIRE mutation potentially compromised the recognition of the constitutive SSA of intron 5, splicing upstream onto a nearby cryptic SSA in intron 5. Surprisingly, the use of an alternative SSA entails the uncovering of a cryptic donor splice site in exon 5. This new transcript generates a truncated protein (p.A214fs67X) containing the first 213 amino acids and followed by 68 aberrant amino acids. The mutation affects the proper splicing, not only at the acceptor but also at the donor splice site, highlighting the complexity of recognizing suitable splicing sites and the importance of sequencing the intron-exon junctions for a more precise molecular diagnosis and correct genetic counseling. As both siblings were carrying the same mutation but exhibited a different APS-1 onset, and one of the brothers was not clinically diagnosed, our finding highlights the possibility to suspect mutations in the AIRE gene in cases of childhood chronic candidiasis and/or hypoparathyroidism otherwise unexplained, especially when the phenotype is associated with other autoimmune diseases. PMID:24988226
U2AF1 mutations alter splice site recognition in hematological malignancies.
Ilagan, Janine O; Ramakrishnan, Aravind; Hayes, Brian; Murphy, Michele E; Zebari, Ahmad S; Bradley, Philip; Bradley, Robert K
2015-01-01
Whole-exome sequencing studies have identified common mutations affecting genes encoding components of the RNA splicing machinery in hematological malignancies. Here, we sought to determine how mutations affecting the 3' splice site recognition factor U2AF1 alter its normal role in RNA splicing. We find that U2AF1 mutations influence the similarity of splicing programs in leukemias, but do not give rise to widespread splicing failure. U2AF1 mutations cause differential splicing of hundreds of genes, affecting biological pathways such as DNA methylation (DNMT3B), X chromosome inactivation (H2AFY), the DNA damage response (ATR, FANCA), and apoptosis (CASP8). We show that U2AF1 mutations alter the preferred 3' splice site motif in patients, in cell culture, and in vitro. Mutations affecting the first and second zinc fingers give rise to different alterations in splice site preference and largely distinct downstream splicing programs. These allele-specific effects are consistent with a computationally predicted model of U2AF1 in complex with RNA. Our findings suggest that U2AF1 mutations contribute to pathogenesis by causing quantitative changes in splicing that affect diverse cellular pathways, and give insight into the normal function of U2AF1's zinc finger domains. © 2015 Ilagan et al.; Published by Cold Spring Harbor Laboratory Press.
Can the HIV-1 splicing machinery be targeted for drug discovery?
Dlamini, Zodwa; Hull, Rodney
2017-01-01
HIV-1 is able to express multiple protein types and isoforms from a single 9 kb mRNA transcript. These proteins are also expressed at particular stages of viral development, and this is achieved through the control of alternative splicing and the export of these transcripts from the nucleus. The nuclear export is controlled by the HIV protein Rev being required to transport incompletely spliced and partially spliced mRNA from the nucleus where they are normally retained. This implies a close relationship between the control of alternate splicing and the nuclear export of mRNA in the control of HIV-1 viral proliferation. This review discusses both the processes. The specificity and regulation of splicing in HIV-1 is controlled by the use of specific splice sites as well as exonic splicing enhancer and exonic splicing silencer sequences. The use of these silencer and enhancer sequences is dependent on the serine arginine family of proteins as well as the heterogeneous nuclear ribonucleoprotein family of proteins that bind to these sequences and increase or decrease splicing. Since alternative splicing is such a critical factor in viral development, it presents itself as a promising drug target. This review aims to discuss the inhibition of splicing, which would stall viral development, as an anti-HIV therapeutic strategy. In this review, the most recent knowledge of splicing in human immunodeficiency viral development and the latest therapeutic strategies targeting human immunodeficiency viral splicing are discussed. PMID:28331370
The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture
Pai, Athma A; Henriques, Telmo; McCue, Kayla; Burkholder, Adam; Adelman, Karen
2017-01-01
Production of most eukaryotic mRNAs requires splicing of introns from pre-mRNA. The splicing reaction requires definition of splice sites, which are initially recognized in either intron-spanning (‘intron definition’) or exon-spanning (‘exon definition’) pairs. To understand how exon and intron length and splice site recognition mode impact splicing, we measured splicing rates genome-wide in Drosophila, using metabolic labeling/RNA sequencing and new mathematical models to estimate rates. We found that the modal intron length range of 60–70 nt represents a local maximum of splicing rates, but that much longer exon-defined introns are spliced even faster and more accurately. We observed unexpectedly low variation in splicing rates across introns in the same gene, suggesting the presence of gene-level influences, and we identified multiple gene level variables associated with splicing rate. Together our data suggest that developmental and stress response genes may have preferentially evolved exon definition in order to enhance the rate or accuracy of splicing. PMID:29280736
The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture
Pai, Athma A.; Henriques, Telmo; McCue, Kayla; ...
2017-12-27
Production of most eukaryotic mRNAs requires splicing of introns from pre-mRNA. The splicing reaction requires definition of splice sites, which are initially recognized in either intron-spanning (‘intron definition’) or exon-spanning (‘exon definition’) pairs. To understand how exon and intron length and splice site recognition mode impact splicing, we measured splicing rates genome-wide in Drosophila, using metabolic labeling/RNA sequencing and new mathematical models to estimate rates. We found that the modal intron length range of 60–70 nt represents a local maximum of splicing rates, but that much longer exon-defined introns are spliced even faster and more accurately. We observed unexpectedly lowmore » variation in splicing rates across introns in the same gene, suggesting the presence of gene-level influences, and we identified multiple gene level variables associated with splicing rate. Together our data suggest that developmental and stress response genes may have preferentially evolved exon definition in order to enhance the rate or accuracy of splicing.« less
The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pai, Athma A.; Henriques, Telmo; McCue, Kayla
Production of most eukaryotic mRNAs requires splicing of introns from pre-mRNA. The splicing reaction requires definition of splice sites, which are initially recognized in either intron-spanning (‘intron definition’) or exon-spanning (‘exon definition’) pairs. To understand how exon and intron length and splice site recognition mode impact splicing, we measured splicing rates genome-wide in Drosophila, using metabolic labeling/RNA sequencing and new mathematical models to estimate rates. We found that the modal intron length range of 60–70 nt represents a local maximum of splicing rates, but that much longer exon-defined introns are spliced even faster and more accurately. We observed unexpectedly lowmore » variation in splicing rates across introns in the same gene, suggesting the presence of gene-level influences, and we identified multiple gene level variables associated with splicing rate. Together our data suggest that developmental and stress response genes may have preferentially evolved exon definition in order to enhance the rate or accuracy of splicing.« less
Xue, Yuan; Schoser, Benedikt; Rao, Aliz R; Quadrelli, Roberto; Vaglio, Alicia; Rupp, Verena; Beichler, Christine; Nelson, Stanley F; Schapacher-Tilp, Gudrun; Windpassinger, Christian; Wilcox, William R
2016-04-01
Previously, we reported a rare X-linked disorder, Uruguay syndrome in a single family. The main features are pugilistic facies, skeletal deformities, and muscular hypertrophy despite a lack of exercise and cardiac ventricular hypertrophy leading to premature death. An ≈19 Mb critical region on X chromosome was identified through identity-by-descent analysis of 3 affected males. Exome sequencing was conducted on one affected male to identify the disease-causing gene and variant. A splice site variant (c.502-2A>G) in the FHL1 gene was highly suspicious among other candidate genes and variants. FHL1A is the predominant isoform of FHL1 in cardiac and skeletal muscle. Sequencing cDNA showed the splice site variant led to skipping of exons 6 of the FHL1A isoform, equivalent to the FHL1C isoform. Targeted analysis showed that this splice site variant cosegregated with disease in the family. Western blot and immunohistochemical analysis of muscle from the proband showed a significant decrease in protein expression of FHL1A. Real-time polymerase chain reaction analysis of different isoforms of FHL1 demonstrated that the FHL1C is markedly increased. Mutations in the FHL1 gene have been reported in disorders with skeletal and cardiac myopathy but none has the skeletal or facial phenotype seen in patients with Uruguay syndrome. Our data suggest that a novel FHL1 splice site variant results in the absence of FHL1A and the abundance of FHL1C, which may contribute to the complex and severe phenotype. Mutation screening of the FHL1 gene should be considered for patients with uncharacterized myopathies and cardiomyopathies. © 2016 American Heart Association, Inc.
Analysis and recognition of 5′ UTR intron splice sites in human pre-mRNA
Eden, E.; Brunak, S.
2004-01-01
Prediction of splice sites in non-coding regions of genes is one of the most challenging aspects of gene structure recognition. We perform a rigorous analysis of such splice sites embedded in human 5′ untranslated regions (UTRs), and investigate correlations between this class of splice sites and other features found in the adjacent exons and introns. By restricting the training of neural network algorithms to ‘pure’ UTRs (not extending partially into protein coding regions), we for the first time investigate the predictive power of the splicing signal proper, in contrast to conventional splice site prediction, which typically relies on the change in sequence at the transition from protein coding to non-coding. By doing so, the algorithms were able to pick up subtler splicing signals that were otherwise masked by ‘coding’ noise, thus enhancing significantly the prediction of 5′ UTR splice sites. For example, the non-coding splice site predicting networks pick up compositional and positional bias in the 3′ ends of non-coding exons and 5′ non-coding intron ends, where cytosine and guanine are over-represented. This compositional bias at the true UTR donor sites is also visible in the synaptic weights of the neural networks trained to identify UTR donor sites. Conventional splice site prediction methods perform poorly in UTRs because the reading frame pattern is absent. The NetUTR method presented here performs 2–3-fold better compared with NetGene2 and GenScan in 5′ UTRs. We also tested the 5′ UTR trained method on protein coding regions, and discovered, surprisingly, that it works quite well (although it cannot compete with NetGene2). This indicates that the local splicing pattern in UTRs and coding regions is largely the same. The NetUTR method is made publicly available at www.cbs.dtu.dk/services/NetUTR. PMID:14960723
Validation of Splicing Events in Transcriptome Sequencing Data
Kaisers, Wolfgang; Ptok, Johannes; Schwender, Holger; Schaal, Heiner
2017-01-01
Genomic alignments of sequenced cellular messenger RNA contain gapped alignments which are interpreted as consequence of intron removal. The resulting gap-sites, genomic locations of alignment gaps, are landmarks representing potential splice-sites. As alignment algorithms report gap-sites with a considerable false discovery rate, validations are required. We describe two quality scores, gap quality score (gqs) and weighted gap information score (wgis), developed for validation of putative splicing events: While gqs solely relies on alignment data wgis additionally considers information from the genomic sequence. FASTQ files obtained from 54 human dermal fibroblast samples were aligned against the human genome (GRCh38) using TopHat and STAR aligner. Statistical properties of gap-sites validated by gqs and wgis were evaluated by their sequence similarity to known exon-intron borders. Within the 54 samples, TopHat identifies 1,000,380 and STAR reports 6,487,577 gap-sites. Due to the lack of strand information, however, the percentage of identified GT-AG gap-sites is rather low. While gap-sites from TopHat contain ≈89% GT-AG, gap-sites from STAR only contain ≈42% GT-AG dinucleotide pairs in merged data from 54 fibroblast samples. Validation with gqs yields 156,251 gap-sites from TopHat alignments and 166,294 from STAR alignments. Validation with wgis yields 770,327 gap-sites from TopHat alignments and 1,065,596 from STAR alignments. Both alignment algorithms, TopHat and STAR, report gap-sites with considerable false discovery rate, which can drastically be reduced by validation with gqs and wgis. PMID:28545234
DeVry, C G; Tsai, W; Clarke, S
1996-11-15
The protein L-isoaspartyl/D-aspartyl O-methyltransferase (EC 2.1.1.77) catalyzes the first step in the repair of proteins damaged in the aging process by isomerization or racemization reactions at aspartyl and asparaginyl residues. A single gene has been localized to human chromosome 6 and multiple transcripts arising through alternative splicing have been identified. Restriction enzyme mapping, subcloning, and DNA sequence analysis of three overlapping clones from a human genomic library in bacteriophage P1 indicate that the gene spans approximately 60 kb and is composed of 8 exons interrupted by 7 introns. Analysis of intron/exon splice junctions reveals that all of the donor and acceptor splice sites are in agreement with the mammalian consensus splicing sequence. Determination of transcription initiation sites by primer extension analysis of poly(A)+ mRNA from human brain identifies multiple start sites, with a major site 159 nucleotides upstream from the ATG start codon. Sequence analysis of the 5'-untranslated region demonstrates several potential cis-acting DNA elements including SP1, ETF, AP1, AP2, ARE, XRE, CREB, MED-1, and half-palindromic ERE motifs. The promoter of this methyltransferase gene lacks an identifiable TATA box but is characterized by a CpG island which begins approximately 723 nucleotides upstream of the major transcriptional start site and extends through exon 1 and into the first intron. These features are characteristic of housekeeping genes and are consistent with the wide tissue distribution observed for this methyltransferase activity.
Pseudoexon activation increases phenotype severity in a Becker muscular dystrophy patient.
Greer, Kane; Mizzi, Kayla; Rice, Emily; Kuster, Lukas; Barrero, Roberto A; Bellgard, Matthew I; Lynch, Bryan J; Foley, Aileen Reghan; O Rathallaigh, Eoin; Wilton, Steve D; Fletcher, Sue
2015-07-01
We report a dystrophinopathy patient with an in-frame deletion of DMD exons 45-47, and therefore a genetic diagnosis of Becker muscular dystrophy, who presented with a more severe than expected phenotype. Analysis of the patient DMD mRNA revealed an 82 bp pseudoexon, derived from intron 44, that disrupts the reading frame and is expected to yield a nonfunctional dystrophin. Since the sequence of the pseudoexon and canonical splice sites does not differ from the reference sequence, we concluded that the genomic rearrangement promoted recognition of the pseudoexon, causing a severe dystrophic phenotype. We characterized the deletion breakpoints and identified motifs that might influence selection of the pseudoexon. We concluded that the donor splice site was strengthened by juxtaposition of intron 47, and loss of intron 44 silencer elements, normally located downstream of the pseudoexon donor splice site, further enhanced pseudoexon selection and inclusion in the DMD transcript in this patient.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Vidaud, M.; Vidaud, D.; Amselem, S.
The authors have characterized a Mediterranean {beta}-thalassemia allele containing a sequence change at codon 30 that alters both {beta}-globin pre-mRNA splicing and the structure of the homoglobin product. Presumably, this G {yields} C transversion at position {minus}1 of intron 1 reduces severely the utilization of the normal 5{prime} splice site since the level of the Arg {yields} Thr mutant hemoglobin (designated hemoglobin Kairouan) found in the erythrocytes of the patient is very low (2% of total hemoglobin). Since no natural mutations of the guanine located at position {minus}1 of the CAG/GTAAGT consensus sequence had been isolated previously. They investigated themore » role of this nucleotide in the constitution of an active 5{prime} splice site by studying the splicing of the pre-mRNA in cell-free extracts. They demonstrate that correct splicing of the mutant pre-mRNA is 98% inhibited. Their results provide further insights into the mechanisms of pre-mRNA maturation by revealing that the last residue of the exon plays a role at least equivalent to that of the intron residue at position +5.« less
Dynamic ASXL1 Exon Skipping and Alternative Circular Splicing in Single Human Cells
Natarajan, Sivaraman; Carter, Robert; Brown, Patrick O.
2016-01-01
Circular RNAs comprise a poorly understood new class of noncoding RNA. In this study, we used a combination of targeted deletion, high-resolution splicing detection, and single-cell sequencing to deeply probe ASXL1 circular splicing. We found that efficient circular splicing required the canonical transcriptional start site and inverted AluSx elements. Sequencing-based interrogation of isoforms after ASXL1 overexpression identified promiscuous linear splicing between all exons, with the two most abundant non-canonical linear products skipping the exons that produced the circular isoforms. Single-cell sequencing revealed a strong preference for either the linear or circular ASXL1 isoforms in each cell, and found the predominant exon skipping product is frequently co-expressed with its reciprocal circular isoform. Finally, absolute quantification of ASXL1 isoforms confirmed our findings and suggests that standard methods overestimate circRNA abundance. Taken together, these data reveal a dynamic new view of circRNA genesis, providing additional framework for studying their roles in cellular biology. PMID:27736885
Matos, Liliana; Canals, Isaac; Dridi, Larbi; Choi, Yoo; Prata, Maria João; Jordan, Peter; Desviat, Lourdes R; Pérez, Belén; Pshezhetsky, Alexey V; Grinberg, Daniel; Alves, Sandra; Vilageliu, Lluïsa
2014-12-10
Mutations affecting RNA splicing represent more than 20% of the mutant alleles in Sanfilippo syndrome type C, a rare lysosomal storage disorder that causes severe neurodegeneration. Many of these mutations are localized in the conserved donor or acceptor splice sites, while few are found in the nearby nucleotides. In this study we tested several therapeutic approaches specifically designed for different splicing mutations depending on how the mutations affect mRNA processing. For three mutations that affect the donor site (c.234 + 1G > A, c.633 + 1G > A and c.1542 + 4dupA), different modified U1 snRNAs recognizing the mutated donor sites, have been developed in an attempt to rescue the normal splicing process. For another mutation that affects an acceptor splice site (c.372-2A > G) and gives rise to a protein lacking four amino acids, a competitive inhibitor of the HGSNAT protein, glucosamine, was tested as a pharmacological chaperone to correct the aberrant folding and to restore the normal trafficking of the protein to the lysosome. Partial correction of c.234 + 1G > A mutation was achieved with a modified U1 snRNA that completely matches the splice donor site suggesting that these molecules may have a therapeutic potential for some splicing mutations. Furthermore, the importance of the splice site sequence context is highlighted as a key factor in the success of this type of therapy. Additionally, glucosamine treatment resulted in an increase in the enzymatic activity, indicating a partial recovery of the correct folding. We have assayed two therapeutic strategies for different splicing mutations with promising results for the future applications.
Lücke, S; Xu, G L; Palfi, Z; Cross, M; Bellofatto, V; Bindereif, A
1996-01-01
In trypanosomes mRNAs are generated through trans splicing. The spliced leader (SL) RNA, which donates the 5'-terminal mini-exon to each of the protein coding exons, plays a central role in the trans splicing process. We have established in vivo assays to study in detail trans splicing, cap4 modification, and RNP assembly of the SL RNA in the trypanosomatid species Leptomonas seymouri. First, we found that extensive sequences within the mini-exon are required for SL RNA function in vivo, although a conserved length of 39 nt is not essential. In contrast, the intron sequence appears to be surprisingly tolerant to mutation; only the stem-loop II structure is indispensable. The asymmetry of the sequence requirements in the stem I region suggests that this domain may exist in different functional conformations. Second, distinct mini-exon sequences outside the modification site are important for efficient cap4 formation. Third, all SL RNA mutations tested allowed core RNP assembly, suggesting flexible requirements for core protein binding. In sum, the results of our mutational analysis provide evidence for a discrete domain structure of the SL RNA and help to explain the strong phylogenetic conservation of the mini-exon sequence and of the overall SL RNA secondary structure; they also suggest that there may be certain differences between trans splicing in nematodes and trypanosomes. This approach provides a basis for studying RNA-RNA interactions in the trans spliceosome. Images PMID:8861965
A survey of the sorghum transcriptome using single-molecule long reads
Abdel-Ghany, Salah E.; Hamilton, Michael; Jacobi, Jennifer L.; ...
2016-06-24
Alternative splicing and alternative polyadenylation (APA) of pre-mRNAs greatly contribute to transcriptome diversity, coding capacity of a genome and gene regulatory mechanisms in eukaryotes. Second-generation sequencing technologies have been extensively used to analyse transcriptomes. However, a major limitation of short-read data is that it is difficult to accurately predict full-length splice isoforms. Here we sequenced the sorghum transcriptome using Pacific Biosciences single-molecule real-time long-read isoform sequencing and developed a pipeline called TAPIS (Transcriptome Analysis Pipeline for Isoform Sequencing) to identify full-length splice isoforms and APA sites. Our analysis reveals transcriptome-wide full-length isoforms at an unprecedented scale with over 11,000 novelmore » splice isoforms. Additionally, we uncover APA ofB11,000 expressed genes and more than 2,100 novel genes. Lastly, these results greatly enhance sorghum gene annotations and aid in studying gene regulation in this important bioenergy crop. The TAPIS pipeline will serve as a useful tool to analyse Iso-Seq data from any organism.« less
A survey of the sorghum transcriptome using single-molecule long reads
Abdel-Ghany, Salah E.; Hamilton, Michael; Jacobi, Jennifer L.; Ngam, Peter; Devitt, Nicholas; Schilkey, Faye; Ben-Hur, Asa; Reddy, Anireddy S. N.
2016-01-01
Alternative splicing and alternative polyadenylation (APA) of pre-mRNAs greatly contribute to transcriptome diversity, coding capacity of a genome and gene regulatory mechanisms in eukaryotes. Second-generation sequencing technologies have been extensively used to analyse transcriptomes. However, a major limitation of short-read data is that it is difficult to accurately predict full-length splice isoforms. Here we sequenced the sorghum transcriptome using Pacific Biosciences single-molecule real-time long-read isoform sequencing and developed a pipeline called TAPIS (Transcriptome Analysis Pipeline for Isoform Sequencing) to identify full-length splice isoforms and APA sites. Our analysis reveals transcriptome-wide full-length isoforms at an unprecedented scale with over 11,000 novel splice isoforms. Additionally, we uncover APA of ∼11,000 expressed genes and more than 2,100 novel genes. These results greatly enhance sorghum gene annotations and aid in studying gene regulation in this important bioenergy crop. The TAPIS pipeline will serve as a useful tool to analyse Iso-Seq data from any organism. PMID:27339290
Short intronic repeat sequences facilitate circular RNA production
Liang, Dongming
2014-01-01
Recent deep sequencing studies have revealed thousands of circular noncoding RNAs generated from protein-coding genes. These RNAs are produced when the precursor messenger RNA (pre-mRNA) splicing machinery “backsplices” and covalently joins, for example, the two ends of a single exon. However, the mechanism by which the spliceosome selects only certain exons to circularize is largely unknown. Using extensive mutagenesis of expression plasmids, we show that miniature introns containing the splice sites along with short (∼30- to 40-nucleotide) inverted repeats, such as Alu elements, are sufficient to allow the intervening exons to circularize in cells. The intronic repeats must base-pair to one another, thereby bringing the splice sites into close proximity to each other. More than simple thermodynamics is clearly at play, however, as not all repeats support circularization, and increasing the stability of the hairpin between the repeats can sometimes inhibit circular RNA biogenesis. The intronic repeats and exonic sequences must collaborate with one another, and a functional 3′ end processing signal is required, suggesting that circularization may occur post-transcriptionally. These results suggest detailed and generalizable models that explain how the splicing machinery determines whether to produce a circular noncoding RNA or a linear mRNA. PMID:25281217
Mechanisms and Regulation of Alternative Pre-mRNA Splicing
Lee, Yeon
2015-01-01
Precursor messenger RNA (pre-mRNA) splicing is a critical step in the posttranscriptional regulation of gene expression, providing significant expansion of the functional proteome of eukaryotic organisms with limited gene numbers. Split eukaryotic genes contain intervening sequences or introns disrupting protein-coding exons, and intron removal occurs by repeated assembly of a large and highly dynamic ribonucleoprotein complex termed the spliceosome, which is composed of five small nuclear ribonucleoprotein particles, U1, U2, U4/U6, and U5. Biochemical studies over the past 10 years have allowed the isolation as well as compositional, functional, and structural analysis of splicing complexes at distinct stages along the spliceosome cycle. The average human gene contains eight exons and seven introns, producing an average of three or more alternatively spliced mRNA isoforms. Recent high-throughput sequencing studies indicate that 100% of human genes produce at least two alternative mRNA isoforms. Mechanisms of alternative splicing include RNA–protein interactions of splicing factors with regulatory sites termed silencers or enhancers, RNA–RNA base-pairing interactions, or chromatin-based effects that can change or determine splicing patterns. Disease-causing mutations can often occur in splice sites near intron borders or in exonic or intronic RNA regulatory silencer or enhancer elements, as well as in genes that encode splicing factors. Together, these studies provide mechanistic insights into how spliceosome assembly, dynamics, and catalysis occur; how alternative splicing is regulated and evolves; and how splicing can be disrupted by cis- and trans-acting mutations leading to disease states. These findings make the spliceosome an attractive new target for small-molecule, antisense, and genome-editing therapeutic interventions. PMID:25784052
Sun, Xiaoyong; Wang, Lin; Ding, Jiechao; Wang, Yanru; Wang, Jiansheng; Zhang, Xiaoyang; Che, Yulei; Liu, Ziwei; Zhang, Xinran; Ye, Jiazhen; Wang, Jie; Sablok, Gaurav; Deng, Zhiping; Zhao, Hongwei
2016-10-01
A new regulatory class of small endogenous RNAs called circular RNAs (circRNAs) has been described as miRNA sponges in animals. Using 16 Arabidopsis thaliana RNA-Seq data sets, we identified 803 circRNAs in RNase R-/non-RNase R-treated samples. The results revealed the following features: Canonical and noncanonical splicing can generate circRNAs; chloroplasts are a hotspot for circRNA generation; furthermore, limited complementary sequences exist not only in introns, but also in the sequences flanking splice sites. The latter finding suggests that multiple combinations between complementary sequences may facilitate the formation of the circular structure. Our results contribute to a better understanding of this novel class of plant circRNAs. © 2016 Federation of European Biochemical Societies.
RNA editing in nascent RNA affects pre-mRNA splicing
Hsiao, Yun-Hua Esther; Bahn, Jae Hoon; Yang, Yun; Lin, Xianzhi; Tran, Stephen; Yang, Ei-Wen; Quinones-Valdez, Giovanni
2018-01-01
In eukaryotes, nascent RNA transcripts undergo an intricate series of RNA processing steps to achieve mRNA maturation. RNA editing and alternative splicing are two major RNA processing steps that can introduce significant modifications to the final gene products. By tackling these processes in isolation, recent studies have enabled substantial progress in understanding their global RNA targets and regulatory pathways. However, the interplay between individual steps of RNA processing, an essential aspect of gene regulation, remains poorly understood. By sequencing the RNA of different subcellular fractions, we examined the timing of adenosine-to-inosine (A-to-I) RNA editing and its impact on alternative splicing. We observed that >95% A-to-I RNA editing events occurred in the chromatin-associated RNA prior to polyadenylation. We report about 500 editing sites in the 3′ acceptor sequences that can alter splicing of the associated exons. These exons are highly conserved during evolution and reside in genes with important cellular function. Furthermore, we identified a second class of exons whose splicing is likely modulated by RNA secondary structures that are recognized by the RNA editing machinery. The genome-wide analyses, supported by experimental validations, revealed remarkable interplay between RNA editing and splicing and expanded the repertoire of functional RNA editing sites. PMID:29724793
RNA editing in nascent RNA affects pre-mRNA splicing.
Hsiao, Yun-Hua Esther; Bahn, Jae Hoon; Yang, Yun; Lin, Xianzhi; Tran, Stephen; Yang, Ei-Wen; Quinones-Valdez, Giovanni; Xiao, Xinshu
2018-06-01
In eukaryotes, nascent RNA transcripts undergo an intricate series of RNA processing steps to achieve mRNA maturation. RNA editing and alternative splicing are two major RNA processing steps that can introduce significant modifications to the final gene products. By tackling these processes in isolation, recent studies have enabled substantial progress in understanding their global RNA targets and regulatory pathways. However, the interplay between individual steps of RNA processing, an essential aspect of gene regulation, remains poorly understood. By sequencing the RNA of different subcellular fractions, we examined the timing of adenosine-to-inosine (A-to-I) RNA editing and its impact on alternative splicing. We observed that >95% A-to-I RNA editing events occurred in the chromatin-associated RNA prior to polyadenylation. We report about 500 editing sites in the 3' acceptor sequences that can alter splicing of the associated exons. These exons are highly conserved during evolution and reside in genes with important cellular function. Furthermore, we identified a second class of exons whose splicing is likely modulated by RNA secondary structures that are recognized by the RNA editing machinery. The genome-wide analyses, supported by experimental validations, revealed remarkable interplay between RNA editing and splicing and expanded the repertoire of functional RNA editing sites. © 2018 Hsiao et al.; Published by Cold Spring Harbor Laboratory Press.
Kalyna, Maria; Lopato, Sergiy; Voronin, Viktor; Barta, Andrea
2006-01-01
Alternative splicing is an important mechanism for fine tuning of gene expression at the post-transcriptional level. SR proteins govern splice site selection and spliceosome assembly. The Arabidopsis genome encodes 19 SR proteins, several of which have no orthologues in metazoan. Three of the plant specific subfamilies are characterized by the presence of a relatively long alternatively spliced intron located in their first RNA recognition motif, which potentially results in an extremely truncated protein. In atRSZ33, a member of the RS2Z subfamily, this alternative splicing event was shown to be autoregulated. Here we show that atRSp31, a member of the RS subfamily, does not autoregulate alternative splicing of its similarily positioned intron. Interestingly, this alternative splicing event is regulated by atRSZ33. We demonstrate that the positions of these long introns and their capability for alternative splicing are conserved from green algae to flowering plants. Moreover, in particular alternative splicing events the splicing signals are embedded into highly conserved sequences. In different taxa, these conserved sequences occur in at least one gene within a subfamily. The evolutionary preservation of alternative splice forms together with highly conserved intron features argues for additional functions hidden in the genes of these plant-specific SR proteins. PMID:16936312
Kim, Dong Seon; Hahn, Yoonsoo
2012-11-13
Evolution of splice sites is a well-known phenomenon that results in transcript diversity during human evolution. Many novel splice sites are derived from repetitive elements and may not contribute to protein products. Here, we analyzed annotated human protein-coding exons and identified human-specific splice sites that arose after the human-chimpanzee divergence. We analyzed multiple alignments of the annotated human protein-coding exons and their respective orthologous mammalian genome sequences to identify 85 novel splice sites (50 splice acceptors and 35 donors) in the human genome. The novel protein-coding exons, which are expressed either constitutively or alternatively, produce novel protein isoforms by insertion, deletion, or frameshift. We found three cases in which the human-specific isoform conferred novel molecular function in the human cells: the human-specific IMUP protein isoform induces apoptosis of the trophoblast and is implicated in pre-eclampsia; the intronization of a part of SMOX gene exon produces inactive spermine oxidase; the human-specific NUB1 isoform shows reduced interaction with ubiquitin-like proteins, possibly affecting ubiquitin pathways. Although the generation of novel protein isoforms does not equate to adaptive evolution, we propose that these cases are useful candidates for a molecular functional study to identify proteomic changes that might bring about novel phenotypes during human evolution.
An RRM–ZnF RNA recognition module targets RBM10 to exonic sequences to promote exon exclusion
Collins, Katherine M.; Kainov, Yaroslav A.; Christodolou, Evangelos; Ray, Debashish; Morris, Quaid; Hughes, Timothy; Taylor, Ian A.
2017-01-01
Abstract RBM10 is an RNA-binding protein that plays an essential role in development and is frequently mutated in the context of human disease. RBM10 recognizes a diverse set of RNA motifs in introns and exons and regulates alternative splicing. However, the molecular mechanisms underlying this seemingly relaxed sequence specificity are not understood and functional studies have focused on 3΄ intronic sites only. Here, we dissect the RNA code recognized by RBM10 and relate it to the splicing regulatory function of this protein. We show that a two-domain RRM1–ZnF unit recognizes a GGA-centered motif enriched in RBM10 exonic sites with high affinity and specificity and test that the interaction with these exonic sequences promotes exon skipping. Importantly, a second RRM domain (RRM2) of RBM10 recognizes a C-rich sequence, which explains its known interaction with the intronic 3΄ site of NUMB exon 9 contributing to regulation of the Notch pathway in cancer. Together, these findings explain RBM10's broad RNA specificity and suggest that RBM10 functions as a splicing regulator using two RNA-binding units with different specificities to promote exon skipping. PMID:28379442
An RRM-ZnF RNA recognition module targets RBM10 to exonic sequences to promote exon exclusion.
Collins, Katherine M; Kainov, Yaroslav A; Christodolou, Evangelos; Ray, Debashish; Morris, Quaid; Hughes, Timothy; Taylor, Ian A; Makeyev, Eugene V; Ramos, Andres
2017-06-20
RBM10 is an RNA-binding protein that plays an essential role in development and is frequently mutated in the context of human disease. RBM10 recognizes a diverse set of RNA motifs in introns and exons and regulates alternative splicing. However, the molecular mechanisms underlying this seemingly relaxed sequence specificity are not understood and functional studies have focused on 3΄ intronic sites only. Here, we dissect the RNA code recognized by RBM10 and relate it to the splicing regulatory function of this protein. We show that a two-domain RRM1-ZnF unit recognizes a GGA-centered motif enriched in RBM10 exonic sites with high affinity and specificity and test that the interaction with these exonic sequences promotes exon skipping. Importantly, a second RRM domain (RRM2) of RBM10 recognizes a C-rich sequence, which explains its known interaction with the intronic 3΄ site of NUMB exon 9 contributing to regulation of the Notch pathway in cancer. Together, these findings explain RBM10's broad RNA specificity and suggest that RBM10 functions as a splicing regulator using two RNA-binding units with different specificities to promote exon skipping. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Arman, Ahmet; Ozon, Alev; Isguven, Pinar S; Coker, Ajda; Peker, Ismail; Yordam, Nursen
2008-01-01
Growth hormone (GH) is involved in growth, and fat and carbohydrate metabolism. Interaction of GH with the GH receptor (GHR) is necessary for systemic and local production of insulin-like growth factor-I (IGF-I) which mediates GH actions. Mutations in the GHR cause severe postnatal growth failure; the disorder is an autosomal recessive genetic disease resulting in GH insensitivity, called Laron syndrome. It is characterized by dwarfism with elevated serum GH and low levels of IGF-I. We analyzed the GHR gene for mutations and polymorphisms in eight patients with Laron-type dwarfism from six families. We found three missense mutations (S40L, V125A, I526L), one nonsense mutation (W157X), and one splice site mutation in the extracellular domain of GHR. Furthermore, G168G and exon 3 deletion polymorphisms were detected in patients with Laron syndrome. The splice site mutation, which is a novel mutation, was located at the donor splice site of exon 2/ intron 2 within GHR. Although this mutation changed the highly conserved donor splice site consensus sequence GT to GGT by insertion of a G residue, the intron splicing between exon 2 and exon 3 was detected in the patient. These results imply that the splicing occurs arthe GT site in intron 2, leaving the extra inserted G residue at the end of exon 2, thus changing the open reading frame of GHR resulting in a premature termination codon in exon 3.
[Analysis of USH2A gene mutation in a Chinese family affected with Usher syndrome].
Li, Pengcheng; Liu, Fei; Zhang, Mingchang; Wang, Qiufen; Liu, Mugen
2015-08-01
To investigate the disease-causing mutation in a Chinese family affected with Usher syndrome type II. All of the 11 members from the family underwent comprehensive ophthalmologic examination and hearing test, and their genomic DNA were isolated from venous leukocytes. PCR and direct sequencing of USH2A gene were performed for the proband. Wild type and mutant type minigene vectors containing exon 42, intron 42 and exon 43 of the USH2A gene were constructed and transfected into Hela cells by lipofectamine reagent. Reverse transcription (RT)-PCR was carried out to verify the splicing of the minigenes. Pedigree analysis and clinical diagnosis indicated that the patients have suffered from autosomal recessive Usher syndrome type II. DNA sequencing has detected a homozygous c.8559-2A>G mutation of the USH2A gene in the proband, which has co-segregated with the disease in the family. The mutation has affected a conserved splice site in intron 42, which has led to inactivation of the splice site. Minigene experiment has confirmed the retaining of intron 42 in mature mRNA. The c.8559-2A>G mutation in the USH2A gene probably underlies the Usher syndrome type II in this family. The splice site mutation has resulted in abnormal splicing of USH2A pre-mRNA.
Iqbal, Muhammad; Hayat, Maqsood
2016-05-01
Gene splicing is a vital source of protein diversity. Perfectly eradication of introns and joining exons is the prominent task in eukaryotic gene expression, as exons are usually interrupted by introns. Identification of splicing sites through experimental techniques is complicated and time-consuming task. With the avalanche of genome sequences generated in the post genomic age, it remains a complicated and challenging task to develop an automatic, robust and reliable computational method for fast and effective identification of splicing sites. In this study, a hybrid model "iSS-Hyb-mRMR" is proposed for quickly and accurately identification of splicing sites. Two sample representation methods namely; pseudo trinucleotide composition (PseTNC) and pseudo tetranucleotide composition (PseTetraNC) were used to extract numerical descriptors from DNA sequences. Hybrid model was developed by concatenating PseTNC and PseTetraNC. In order to select high discriminative features, minimum redundancy maximum relevance algorithm was applied on the hybrid feature space. The performance of these feature representation methods was tested using various classification algorithms including K-nearest neighbor, probabilistic neural network, general regression neural network, and fitting network. Jackknife test was used for evaluation of its performance on two benchmark datasets S1 and S2, respectively. The predictor, proposed in the current study achieved an accuracy of 93.26%, sensitivity of 88.77%, and specificity of 97.78% for S1, and the accuracy of 94.12%, sensitivity of 87.14%, and specificity of 98.64% for S2, respectively. It is observed, that the performance of proposed model is higher than the existing methods in the literature so for; and will be fruitful in the mechanism of RNA splicing, and other research academia. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Niimi, Hideki; Ogawa, Tomomi; Note, Rhougou; Hayashi, Shirou; Ueno, Tomohiro; Harada, Kenu; Uji, Yoshinori; Kitajima, Isao
2010-12-01
In recent years, genetic diagnostics of pathogenic splicing abnormalities are increasingly recognized as critically important in the clinical genetic diagnostics. It is reported that approximately 10% of pathogenic mutations causing human inherited diseases are splicing mutations. Nonetheless, it is still difficult to identify splicing abnormalities in routine genetic diagnostic settings. Here, we studied two different kinds of cases with splicing abnormalities. The first case is a protein S deficiency. Nucleotide analyses revealed that the proband had a previously reported G to C substitution in the invariant AG dinucleotide at the splicing acceptor site of intronl/exon2, which produces multiple splicing abnormalities resulting in protein S deficiency. The second case is an antithrombin (AT) deficiency. This proband had a previously reported G to A substitution, at nucleotide position 9788 in intron 4, 14 bp in front of exon 5, which created a de novo exon 5 splice site and resulted in AT deficiency. From a practical standpoint, we discussed the pitfalls, attentions, and screening approaches in genetic diagnostics of pathogenic splicing abnormalities. Due to the difficulty with full-length sequence analysis of introns, and the lack of RNA samples, splicing mutations may escape identification. Although current genetic testing remains to be improved, to screen for splicing abnormalities more efficiently, it is significant to use an appropriate combination of various approaches such as DNA and/or RNA samples, splicing mutation databases, bioinformatic tools to detect splice sites and cis-regulatory elements, and in vitro and/or in vivo experimentally methods as needed.
Genetic therapies for RNA mis-splicing diseases.
Hammond, Suzan M; Wood, Matthew J A
2011-05-01
RNA mis-splicing diseases account for up to 15% of all inherited diseases, ranging from neurological to myogenic and metabolic disorders. With greatly increased genomic sequencing being performed for individual patients, the number of known mutations affecting splicing has risen to 50-60% of all disease-causing mutations. During the past 10years, genetic therapy directed toward correction of RNA mis-splicing in disease has progressed from theoretical work in cultured cells to promising clinical trials. In this review, we discuss the use of antisense oligonucleotides to modify splicing as well as the principles and latest work in bifunctional RNA, trans-splicing and modification of U1 and U7 snRNA to target splice sites. The success of clinical trials for modifying splicing to treat Duchenne muscular dystrophy opens the door for the use of splicing modification for most of the mis-splicing diseases. Copyright © 2011 Elsevier Ltd. All rights reserved.
Härter, Bettina; Fuchs, Irene; Müller, Thomas; Akbulut, Ulas Emre; Cakir, Murat; Janecke, Andreas R
2016-04-01
Autosomal recessive proprotein convertase 1/3 (PC1/3) deficiency, caused by mutations in the PCSK1 gene, is characterized by severe congenital malabsorptive diarrhea, early-onset obesity, and certain endocrine abnormalities. We suspected PC1/3 deficiency in a 4-month-old girl based on the presence of congenital diarrhea and polyuria. Sequencing the whole coding region and splice sites detected a novel homozygous PCSK1 splice-site mutation, c.544-2A>G, in the patient. The mutation resulted in the skipping of exon 5, the generation of a premature termination codon, and nonsense-mediated PCSK1 messenger ribonucleic acid decay, which was demonstrated in complementary DNA derived from fibroblasts.
Hereditary cancer genes are highly susceptible to splicing mutations
Soemedi, Rachel; Maguire, Samantha; Murray, Michael F.; Monaghan, Sean F.
2018-01-01
Substitutions that disrupt pre-mRNA splicing are a common cause of genetic disease. On average, 13.4% of all hereditary disease alleles are classified as splicing mutations mapping to the canonical 5′ and 3′ splice sites. However, splicing mutations present in exons and deeper intronic positions are vastly underreported. A recent re-analysis of coding mutations in exon 10 of the Lynch Syndrome gene, MLH1, revealed an extremely high rate (77%) of mutations that lead to defective splicing. This finding is confirmed by extending the sampling to five other exons in the MLH1 gene. Further analysis suggests a more general phenomenon of defective splicing driving Lynch Syndrome. Of the 36 mutations tested, 11 disrupted splicing. Furthermore, analyzing past reports suggest that MLH1 mutations in canonical splice sites also occupy a much higher fraction (36%) of total mutations than expected. When performing a comprehensive analysis of splicing mutations in human disease genes, we found that three main causal genes of Lynch Syndrome, MLH1, MSH2, and PMS2, belonged to a class of 86 disease genes which are enriched for splicing mutations. Other cancer genes were also enriched in the 86 susceptible genes. The enrichment of splicing mutations in hereditary cancers strongly argues for additional priority in interpreting clinical sequencing data in relation to cancer and splicing. PMID:29505604
Short intronic repeat sequences facilitate circular RNA production.
Liang, Dongming; Wilusz, Jeremy E
2014-10-15
Recent deep sequencing studies have revealed thousands of circular noncoding RNAs generated from protein-coding genes. These RNAs are produced when the precursor messenger RNA (pre-mRNA) splicing machinery "backsplices" and covalently joins, for example, the two ends of a single exon. However, the mechanism by which the spliceosome selects only certain exons to circularize is largely unknown. Using extensive mutagenesis of expression plasmids, we show that miniature introns containing the splice sites along with short (∼ 30- to 40-nucleotide) inverted repeats, such as Alu elements, are sufficient to allow the intervening exons to circularize in cells. The intronic repeats must base-pair to one another, thereby bringing the splice sites into close proximity to each other. More than simple thermodynamics is clearly at play, however, as not all repeats support circularization, and increasing the stability of the hairpin between the repeats can sometimes inhibit circular RNA biogenesis. The intronic repeats and exonic sequences must collaborate with one another, and a functional 3' end processing signal is required, suggesting that circularization may occur post-transcriptionally. These results suggest detailed and generalizable models that explain how the splicing machinery determines whether to produce a circular noncoding RNA or a linear mRNA. © 2014 Liang and Wilusz; Published by Cold Spring Harbor Laboratory Press.
Jin, Lirong; Li, Guanglin; Yu, Dazhao; Huang, Wei; Cheng, Chao; Liao, Shengjie; Wu, Qijia; Zhang, Yi
2017-02-06
Alternative splicing (AS) regulation is extensive and shapes the functional complexity of higher organisms. However, the contribution of alternative splicing to fungal biology is not well studied. This study provides sequences of the transcriptomes of the plant wilt pathogen Verticillium dahliae, using two different strains and multiple methods for cDNA library preparations. We identified alternatively spliced mRNA isoforms in over a half of the multi-exonic fungal genes. Over one-thousand isoforms involve TopHat novel splice junction; multiple types of combinatory alternative splicing patterns were identified. We showed that one Verticillium gene could use four different 5' splice sites and two different 3' donor sites to produce up to five mature mRNAs, representing one of the most sophisticated alternative splicing model in eukaryotes other than animals. Hundreds of novel intron types involving a pair of new splice sites were identified in the V. dahliae genome. All the types of AS events were validated by using RT-PCR. Functional enrichment analysis showed that AS genes are involved in most known biological functions and enriched in ATP biosynthesis, sexual/asexual reproduction, morphogenesis, signal transduction etc., predicting that the AS regulation modulates mRNA isoform output and shapes the V. dahliae proteome plasticity of the pathogen in response to the environmental and developmental changes. These findings demonstrate the comprehensive alternative splicing mechanisms in a fungal plant pathogen, which argues the importance of this fungus in developing complicate genome regulation strategies in eukaryotes.
TopHat: discovering splice junctions with RNA-Seq
Trapnell, Cole; Pachter, Lior; Salzberg, Steven L.
2009-01-01
Motivation: A new protocol for sequencing the messenger RNA in a cell, known as RNA-Seq, generates millions of short sequence fragments in a single run. These fragments, or ‘reads’, can be used to measure levels of gene expression and to identify novel splice variants of genes. However, current software for aligning RNA-Seq data to a genome relies on known splice junctions and cannot identify novel ones. TopHat is an efficient read-mapping algorithm designed to align reads from an RNA-Seq experiment to a reference genome without relying on known splice sites. Results: We mapped the RNA-Seq reads from a recent mammalian RNA-Seq experiment and recovered more than 72% of the splice junctions reported by the annotation-based software from that study, along with nearly 20 000 previously unreported junctions. The TopHat pipeline is much faster than previous systems, mapping nearly 2.2 million reads per CPU hour, which is sufficient to process an entire RNA-Seq experiment in less than a day on a standard desktop computer. We describe several challenges unique to ab initio splice site discovery from RNA-Seq reads that will require further algorithm development. Availability: TopHat is free, open-source software available from http://tophat.cbcb.umd.edu Contact: cole@cs.umd.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:19289445
Stanescu, Ana; Caragea, Doina
2015-01-01
Recent biochemical advances have led to inexpensive, time-efficient production of massive volumes of raw genomic data. Traditional machine learning approaches to genome annotation typically rely on large amounts of labeled data. The process of labeling data can be expensive, as it requires domain knowledge and expert involvement. Semi-supervised learning approaches that can make use of unlabeled data, in addition to small amounts of labeled data, can help reduce the costs associated with labeling. In this context, we focus on the problem of predicting splice sites in a genome using semi-supervised learning approaches. This is a challenging problem, due to the highly imbalanced distribution of the data, i.e., small number of splice sites as compared to the number of non-splice sites. To address this challenge, we propose to use ensembles of semi-supervised classifiers, specifically self-training and co-training classifiers. Our experiments on five highly imbalanced splice site datasets, with positive to negative ratios of 1-to-99, showed that the ensemble-based semi-supervised approaches represent a good choice, even when the amount of labeled data consists of less than 1% of all training data. In particular, we found that ensembles of co-training and self-training classifiers that dynamically balance the set of labeled instances during the semi-supervised iterations show improvements over the corresponding supervised ensemble baselines. In the presence of limited amounts of labeled data, ensemble-based semi-supervised approaches can successfully leverage the unlabeled data to enhance supervised ensembles learned from highly imbalanced data distributions. Given that such distributions are common for many biological sequence classification problems, our work can be seen as a stepping stone towards more sophisticated ensemble-based approaches to biological sequence annotation in a semi-supervised framework.
2015-01-01
Background Recent biochemical advances have led to inexpensive, time-efficient production of massive volumes of raw genomic data. Traditional machine learning approaches to genome annotation typically rely on large amounts of labeled data. The process of labeling data can be expensive, as it requires domain knowledge and expert involvement. Semi-supervised learning approaches that can make use of unlabeled data, in addition to small amounts of labeled data, can help reduce the costs associated with labeling. In this context, we focus on the problem of predicting splice sites in a genome using semi-supervised learning approaches. This is a challenging problem, due to the highly imbalanced distribution of the data, i.e., small number of splice sites as compared to the number of non-splice sites. To address this challenge, we propose to use ensembles of semi-supervised classifiers, specifically self-training and co-training classifiers. Results Our experiments on five highly imbalanced splice site datasets, with positive to negative ratios of 1-to-99, showed that the ensemble-based semi-supervised approaches represent a good choice, even when the amount of labeled data consists of less than 1% of all training data. In particular, we found that ensembles of co-training and self-training classifiers that dynamically balance the set of labeled instances during the semi-supervised iterations show improvements over the corresponding supervised ensemble baselines. Conclusions In the presence of limited amounts of labeled data, ensemble-based semi-supervised approaches can successfully leverage the unlabeled data to enhance supervised ensembles learned from highly imbalanced data distributions. Given that such distributions are common for many biological sequence classification problems, our work can be seen as a stepping stone towards more sophisticated ensemble-based approaches to biological sequence annotation in a semi-supervised framework. PMID:26356316
DOE Office of Scientific and Technical Information (OSTI.GOV)
Solera, J.; Magallon, M.; Martin-Villar, J.
1992-02-01
DNA from a patient with severe hemophilia B was evaluated by RFLP analysis, producing results which suggested the existence of a partial deletion within the factor IX gene. The deletion was further localized and characterized by PCR amplification and sequencing. The altered allele has a 4,442-bp deletion which removes both the donor splice site located at the 5[prime] end of intron d and the two last coding nucleotides located at the 3[prime] end of exon IV in the normal factor IX gene; this fragment has been inserted in inverted orientation. Two homologous sequences have been discovered at the ends ofmore » the deleted DNA fragment.« less
Changes in exon–intron structure during vertebrate evolution affect the splicing pattern of exons
Gelfman, Sahar; Burstein, David; Penn, Osnat; Savchenko, Anna; Amit, Maayan; Schwartz, Schraga; Pupko, Tal; Ast, Gil
2012-01-01
Exon–intron architecture is one of the major features directing the splicing machinery to the short exons that are located within long flanking introns. However, the evolutionary dynamics of exon–intron architecture and its impact on splicing is largely unknown. Using a comparative genomic approach, we analyzed 17 vertebrate genomes and reconstructed the ancestral motifs of both 3′ and 5′ splice sites, as also the ancestral length of exons and introns. Our analyses suggest that vertebrate introns increased in length from the shortest ancestral introns to the longest primate introns. An evolutionary analysis of splice sites revealed that weak splice sites act as a restrictive force keeping introns short. In contrast, strong splice sites allow recognition of exons flanked by long introns. Reconstruction of the ancestral state suggests these phenomena were not prevalent in the vertebrate ancestor, but appeared during vertebrate evolution. By calculating evolutionary rate shifts in exons, we identified cis-acting regulatory sequences that became fixed during the transition from early vertebrates to mammals. Experimental validations performed on a selection of these hexamers confirmed their regulatory function. We additionally revealed many features of exons that can discriminate alternative from constitutive exons. These features were integrated into a machine-learning approach to predict whether an exon is alternative. Our algorithm obtains very high predictive power (AUC of 0.91), and using these predictions we have identified and successfully validated novel alternatively spliced exons. Overall, we provide novel insights regarding the evolutionary constraints acting upon exons and their recognition by the splicing machinery. PMID:21974994
Yeakley, J M; Hedjran, F; Morfin, J P; Merillat, N; Rosenfeld, M G; Emeson, R B
1993-01-01
The calcitonin/calcitonin gene-related peptide (CGRP) primary transcript is alternatively spliced in thyroid C cells and neurons, resulting in the tissue-specific production of calcitonin and CGRP mRNAs. Analyses of mutated calcitonin/CGRP transcription units in permanently transfected cell lines have indicated that alternative splicing is regulated by a differential capacity to utilize the calcitonin-specific splice acceptor. The analysis of an extensive series of mutations suggests that tissue-specific regulation of calcitonin mRNA production does not depend on the presence of a single, unique cis-active element but instead appears to be a consequence of suboptimal constitutive splicing signals. While only those mutations that altered constitutive splicing signals affected splice choices, the action of multiple regulatory sequences cannot be formally excluded. Further, we have identified a 13-nucleotide purine-rich element from a constitutive exon that, when placed in exon 4, entirely switches splice site usage in CGRP-producing cells. These data suggest that specific exon recruitment sequences, in combination with other constitutive elements, serve an important function in exon recognition. These results are consistent with the hypothesis that tissue-specific alternative splicing of the calcitonin/CGRP primary transcript is mediated by cell-specific differences in components of the constitutive splicing machinery. Images PMID:8413203
2012-01-01
Background Evolution of splice sites is a well-known phenomenon that results in transcript diversity during human evolution. Many novel splice sites are derived from repetitive elements and may not contribute to protein products. Here, we analyzed annotated human protein-coding exons and identified human-specific splice sites that arose after the human-chimpanzee divergence. Results We analyzed multiple alignments of the annotated human protein-coding exons and their respective orthologous mammalian genome sequences to identify 85 novel splice sites (50 splice acceptors and 35 donors) in the human genome. The novel protein-coding exons, which are expressed either constitutively or alternatively, produce novel protein isoforms by insertion, deletion, or frameshift. We found three cases in which the human-specific isoform conferred novel molecular function in the human cells: the human-specific IMUP protein isoform induces apoptosis of the trophoblast and is implicated in pre-eclampsia; the intronization of a part of SMOX gene exon produces inactive spermine oxidase; the human-specific NUB1 isoform shows reduced interaction with ubiquitin-like proteins, possibly affecting ubiquitin pathways. Conclusions Although the generation of novel protein isoforms does not equate to adaptive evolution, we propose that these cases are useful candidates for a molecular functional study to identify proteomic changes that might bring about novel phenotypes during human evolution. PMID:23148531
Splicing factor SFRS1 recognizes a functionally diverse landscape of RNA transcripts.
Sanford, Jeremy R; Wang, Xin; Mort, Matthew; Vanduyn, Natalia; Cooper, David N; Mooney, Sean D; Edenberg, Howard J; Liu, Yunlong
2009-03-01
Metazoan genes are encrypted with at least two superimposed codes: the genetic code to specify the primary structure of proteins and the splicing code to expand their proteomic output via alternative splicing. Here, we define the specificity of a central regulator of pre-mRNA splicing, the conserved, essential splicing factor SFRS1. Cross-linking immunoprecipitation and high-throughput sequencing (CLIP-seq) identified 23,632 binding sites for SFRS1 in the transcriptome of cultured human embryonic kidney cells. SFRS1 was found to engage many different classes of functionally distinct transcripts including mRNA, miRNA, snoRNAs, ncRNAs, and conserved intergenic transcripts of unknown function. The majority of these diverse transcripts share a purine-rich consensus motif corresponding to the canonical SFRS1 binding site. The consensus site was not only enriched in exons cross-linked to SFRS1 in vivo, but was also enriched in close proximity to splice sites. mRNAs encoding RNA processing factors were significantly overrepresented, suggesting that SFRS1 may broadly influence the post-transcriptional control of gene expression in vivo. Finally, a search for the SFRS1 consensus motif within the Human Gene Mutation Database identified 181 mutations in 82 different genes that disrupt predicted SFRS1 binding sites. This comprehensive analysis substantially expands the known roles of human SR proteins in the regulation of a diverse array of RNA transcripts.
In silico study of breast cancer associated gene 3 using LION Target Engine and other tools.
León, Darryl A; Cànaves, Jaume M
2003-12-01
Sequence analysis of individual targets is an important step in annotation and validation. As a test case, we investigated human breast cancer associated gene 3 (BCA3) with LION Target Engine and with other bioinformatics tools. LION Target Engine confirmed that the BCA3 gene is located on 11p15.4 and that the two most likely splice variants (lacking exon 3 and exons 3 and 5, respectively) exist. Based on our manual curation of sequence data, it is proposed that an additional variant (missing only exon 5) published in a public sequence repository, is a prediction artifact. A significant number of new orthologs were also identified, and these were the basis for a high-quality protein secondary structure prediction. Moreover, our research confirmed several distinct functional domains as described in earlier reports. Sequence conservation from multiple sequence alignments, splice variant identification, secondary structure predictions, and predicted phosphorylation sites suggest that the removal of interaction sites through alternative splicing might play a modulatory role in BCA3. This in silico approach shows the depth and relevance of an analysis that can be accomplished by including a variety of publicly available tools with an integrated and customizable life science informatics platform.
Cheriyan, Manoj; Chan, Siu-Hong; Perler, Francine
2014-12-12
Inteins self-catalytically cleave out of precursor proteins while ligating the surrounding extein fragments with a native peptide bond. Much attention has been lavished on these molecular marvels with the hope of understanding and harnessing their chemistry for novel biochemical transformations including coupling peptides from synthetic or biological origins and controlling protein function. Despite an abundance of powerful applications, the use of inteins is still hampered by limitations in our understanding of their specificity (defined as flanking sequences that permit splicing) and the challenge of inserting inteins into target proteins. We examined the frequently used Nostoc punctiforme Npu DnaE intein after the C-extein cysteine nucleophile (Cys+1) was mutated to serine or threonine. Previous studies demonstrated reduced rates and/or splicing yields with the Npu DnaE intein after mutation of Cys+1 to Ser+1. In this study, genetic selection identified extein sequences with Ser+1 that enabled the Npu DnaE intein to splice with only a 5-fold reduction in rate compared to the wild-type Cys+1 intein and without mutation of the intein itself to activate Ser+1 as a nucleophile. Three different proteins spliced efficiently after insertion of the intein flanked by the selected sequences. We then used this selected specificity to achieve traceless splicing in a targeted enzyme at a location predicted by primary sequence similarity to only the selected C-extein sequence. This study highlights the latent catalytic potential of the Npu DnaE intein to splice with an alternative nucleophile and enables broader intein utility by increasing insertion site choices. Copyright © 2014. Published by Elsevier Ltd.
Unusual splice site mutations disrupt FANCA exon 8 definition.
Mattioli, Chiara; Pianigiani, Giulia; De Rocco, Daniela; Bianco, Anna Monica Rosaria; Cappelli, Enrico; Savoia, Anna; Pagani, Franco
2014-07-01
The pathological role of mutations that affect not conserved splicing regulatory sequences can be difficult to determine. In a patient with Fanconi anemia, we identified two unpredictable splicing mutations that act on either sides of FANCA exon 8. In patients-derived cells and in minigene splicing assay, we showed that both an apparently benign intronic c.710-5T>C transition and the nonsense c.790C>T substitution induce almost complete exon 8 skipping. Site-directed mutagenesis experiments indicated that the c.710-5T>C transition affects a polypyrimidine tract where most of the thymidines cannot be compensated by cytidines. The c.790C>T mutation located in position -3 relative to the donor site induce exon 8 skipping in an NMD-independent manner and complementation experiments with modified U1 snRNAs showed that U1 snRNP is only partially involved in the splicing defect. Our results highlight the importance of performing splicing functional assay for correct identification of disease-causing mechanism of genomic variants and provide mechanistic insights on how these two FANCA mutations affect exon 8 definition. Copyright © 2014 Elsevier B.V. All rights reserved.
Novel C8orf37 mutations cause retinitis pigmentosa in consanguineous families of Pakistani origin
Ravesh, Zeinab; El Asrag, Mohammed E.; Weisschuh, Nicole; McKibbin, Martin; Reuter, Peggy; Watson, Christopher M.; Baumann, Britta; Poulter, James A.; Sajid, Sundus; Panagiotou, Evangelia S.; O’Sullivan, James; Abdelhamed, Zakia; Bonin, Michael; Soltanifar, Mehdi; Black, Graeme C.M.; Din, Muhammad Amin-ud; Toomes, Carmel; Ansar, Muhammad; Inglehearn, Chris F.; Wissinger, Bernd
2015-01-01
Purpose To investigate the molecular basis of retinitis pigmentosa in two consanguineous families of Pakistani origin with multiple affected members. Methods Homozygosity mapping and Sanger sequencing of candidate genes were performed in one family while the other was analyzed with whole exome next-generation sequencing. A minigene splicing assay was used to confirm the splicing defects. Results In family MA48, a novel homozygous nucleotide substitution in C8orf37, c.244–2A>C, that disrupted the consensus splice acceptor site of exon 3 was found. The minigene splicing assay revealed that this mutation activated a cryptic splice site within exon 3, causing a 22 bp deletion in the transcript that is predicted to lead to a frameshift followed by premature protein truncation. In family MA13, a novel homozygous null mutation in C8orf37, c.555G>A, p.W185*, was identified. Both mutations segregated with the disease phenotype as expected in a recessive manner and were absent in 8,244 unrelated individuals of South Asian origin. Conclusions In this report, we describe C8orf37 mutations that cause retinal dystrophy in two families of Pakistani origin, contributing further data on the phenotype and the spectrum of mutations in this form of retinitis pigmentosa. PMID:25802487
Prchalova, Darina; Havlovicova, Marketa; Sterbova, Katalin; Stranecky, Viktor; Hancarova, Miroslava; Sedlacek, Zdenek
2017-06-02
Whole exome sequencing is a powerful tool for the analysis of genetically heterogeneous conditions. The prioritization of variants identified often focuses on nonsense, frameshift and canonical splice site mutations, and highly deleterious missense variants, although other defects can also play a role. The definition of the phenotype range and course of rare genetic conditions requires long-term clinical follow-up of patients. We report an adult female patient with severe intellectual disability, severe speech delay, epilepsy, autistic features, aggressiveness, sleep problems, broad-based clumsy gait and constipation. Whole exome sequencing identified a de novo mutation in the SYNGAP1 gene. The variant was located in the broader splice donor region of intron 10 and replaced G by A at position +5 of the splice site. The variant was predicted in silico and shown experimentally to abolish the regular splice site and to activate a cryptic donor site within exon 10, causing frameshift and premature termination. The overall clinical picture of the patient corresponded well with the characteristic SYNGAP1-associated phenotype observed in previously reported patients. However, our patient was 31 years old which contrasted with most other published SYNGAP1 cases who were much younger. Our patient had a significant growth delay and microcephaly. Both features normalised later, although the head circumference stayed only slightly above the lower limit of the norm. The patient had a delayed puberty. Her cognitive and language performance remained at the level of a one-year-old child even in adulthood and showed a slow decline. Myopathic facial features and facial dysmorphism became more pronounced with age. Although the gait of the patient was unsteady in childhood, more severe gait problems developed in her teens. While the seizures remained well-controlled, her aggressive behaviour worsened with age and required extensive medication. The finding in our patient underscores the notion that the interpretation of variants identified using whole exome sequencing should focus not only on variants in the canonical splice dinucleotides GT and AG, but also on broader splice regions. The long-term clinical follow-up of our patient contributes to the knowledge of the developmental trajectory in individuals with SYNGAP1 gene defects.
Novel splice mutation in microthalmia-associated transcription factor in Waardenburg Syndrome.
Brenner, Laura; Burke, Kelly; Leduc, Charles A; Guha, Saurav; Guo, Jiancheng; Chung, Wendy K
2011-01-01
Waardenburg Syndrome (WS) is a syndromic form of hearing loss associated with mutations in six different genes. We identified a large family with WS that had previously undergone clinical testing, with no reported pathogenic mutation. Using linkage analysis, a region on 3p14.1 with an LOD score of 6.6 was identified. Microthalmia-Associated Transcription Factor, a gene known to cause WS, is located within this region of linkage. Sequencing of Microthalmia-Associated Transcription Factor demonstrated a c.1212 G>A synonymous variant that segregated with the WS in the family and was predicted to cause a novel splicing site that was confirmed with expression analysis of the mRNA. This case illustrates the need to computationally analyze novel synonymous sequence variants for possible effects on splicing to maximize the clinical sensitivity of sequence-based genetic testing.
Ma, Nina S; Malloy, Peter J; Pitukcheewanont, Pisit; Dreimane, Daina; Geffner, Mitchell E; Feldman, David
2009-10-01
To study the vitamin D receptor (VDR) gene in a young girl with severe rickets and clinical features of hereditary vitamin D resistant rickets, including hypocalcemia, hypophosphatemia, partial alopecia, and elevated serum levels of 1,25-dihydroxyvitamin D. We amplified and sequenced DNA samples from blood from the patient, her mother, and the patient's two siblings. We also amplified and sequenced the VDR cDNA from RNA isolated from the patient's blood. DNA sequence analyses of the VDR gene showed that the patient was homozygous for a novel guanine to thymine substitution in the 5'-splice site in the exon 8-intron J junction. Analysis of the VDR cDNA using reverse transcriptase-polymerase chain reaction showed that exons 7 and 9 were fused, and that exon 8 was skipped. The mother was heterozygous for the mutation and the two siblings were unaffected. A novel splice site mutation was identified in the VDR gene that caused exon 8 to be skipped. The mutation deleted amino acids 303-341 in the VDR ligand-binding domain, which is expected to render the VDR non-functional. Nevertheless, successful outpatient treatment was achieved with frequent high doses of oral calcium.
Khoo, Bernard; Roca, Xavier; Chew, Shern L; Krainer, Adrian R
2007-01-17
Apolipoprotein B (APOB) is an integral part of the LDL, VLDL, IDL, Lp(a) and chylomicron lipoprotein particles. The APOB pre-mRNA consists of 29 constitutively-spliced exons. APOB exists as two natural isoforms: the full-length APOB100 isoform, assembled into LDL, VLDL, IDL and Lp(a) and secreted by the liver in humans; and the C-terminally truncated APOB48, assembled into chylomicrons and secreted by the intestine in humans. Down-regulation of APOB100 is a potential therapy to lower circulating LDL and cholesterol levels. We investigated the ability of 2'O-methyl RNA antisense oligonucleotides (ASOs) to induce the skipping of exon 27 in endogenous APOB mRNA in HepG2 cells. These ASOs are directed towards the 5' and 3' splice-sites of exon 27, the branch-point sequence (BPS) of intron 26-27 and several predicted exonic splicing enhancers within exon 27. ASOs targeting either the 5' or 3' splice-site, in combination with the BPS, are the most effective. The splicing of other alternatively spliced genes are not influenced by these ASOs, suggesting that the effects seen are not due to non-specific changes in alternative splicing. The skip 27 mRNA is translated into a truncated isoform, APOB87SKIP27. The induction of APOB87SKIP27 expression in vivo should lead to decreased LDL and cholesterol levels, by analogy to patients with hypobetalipoproteinemia. As intestinal APOB mRNA editing and APOB48 expression rely on sequences within exon 26, exon 27 skipping should not affect APOB48 expression unlike other methods of down-regulating APOB100 expression which also down-regulate APOB48.
Matsumoto, Jun; Dewar, Ken; Wasserscheid, Jessica; Wiley, Graham B; Macmil, Simone L; Roe, Bruce A; Zeller, Robert W; Satou, Yutaka; Hastings, Kenneth E M
2010-05-01
Pre-mRNA 5' spliced-leader (SL) trans-splicing occurs in some metazoan groups but not in others. Genome-wide characterization of the trans-spliced mRNA subpopulation has not yet been reported for any metazoan. We carried out a high-throughput analysis of the SL trans-spliced mRNA population of the ascidian tunicate Ciona intestinalis by 454 Life Sciences (Roche) pyrosequencing of SL-PCR-amplified random-primed reverse transcripts of tailbud embryo RNA. We obtained approximately 250,000 high-quality reads corresponding to 8790 genes, approximately 58% of the Ciona total gene number. The great depth of this data revealed new aspects of trans-splicing, including the existence of a significant class of "infrequently trans-spliced" genes, accounting for approximately 28% of represented genes, that generate largely non-trans-spliced mRNAs, but also produce trans-spliced mRNAs, in part through alternative promoter use. Thus, the conventional qualitative dichotomy of trans-spliced versus non-trans-spliced genes should be supplanted by a more accurate quantitative view recognizing frequently and infrequently trans-spliced gene categories. Our data include reads representing approximately 80% of Ciona frequently trans-spliced genes. Our analysis also revealed significant use of closely spaced alternative trans-splice acceptor sites which further underscores the mechanistic similarity of cis- and trans-splicing and indicates that the prevalence of +/-3-nt alternative splicing events at tandem acceptor sites, NAGNAG, is driven by spliceosomal mechanisms, and not nonsense-mediated decay, or selection at the protein level. The breadth of gene representation data enabled us to find new correlations between trans-splicing status and gene function, namely the overrepresentation in the frequently trans-spliced gene class of genes associated with plasma/endomembrane system, Ca(2+) homeostasis, and actin cytoskeleton.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Doerk, T.; Wulbrand, U.; Tuemmler, B.
1993-03-01
Single cases of the four novel splice site mutations 1525[minus]1 G [r arrow] A (intron 9), 3601[minus]2 A [r arrow] G (intron 18), 3850[minus]3 T [r arrow] G (intron 19), and 4374+1 G [r arrow] T (intron 23) were detected in the CFTR gene of cystic fibrosis patients of Indo-Iranian, Turkish, Polish, and Germany descent. The nucleotide substitutions at the +1, [minus]1, and [minus]2 positions all destroy splice sites and lead to severe disease alleles associated with features typical of gastrointestinal and pulmonary cystic fibrosis disease. The 3850[minus]3 T-to-G change was discovered in a very mildly affected 33-year-old [Delta]F508 compoundmore » heterozygote, suggesting that the T-to-G transversion at the less conserved [minus]3 position of the acceptor splice site may retain some wildtype function. 13 refs., 1 fig., 2 tabs.« less
Kurose, Kouichi; Koyano, Satoru; Ikeda, Shinobu; Tohkin, Masahiro; Hasegawa, Ryuichi; Sawada, Jun-Ichi
2005-05-01
The human pregnane X receptor (PXR) is a crucial regulator of the genes encoding several major cytochrome P450 enzymes and transporters, such as CYP3A4 and MDR1, but its own transcriptional regulation remains unclear. To elucidate the transcriptional mechanisms of human PXR gene, we first endeavored to identify the transcription initiation site of human PXR using 5'-RACE. Five types of 5'-variable transcripts (a, b, c, d, and e) with common exon 2 sequence were found, and comparison of these sequences with the genomic sequence suggested that their 5' diversity is derived from initiation by alternative promoters and alternative splicing. None of the exons found in our study contain any new in-frame coding regions. Newly identified introns IVS-a and IVS-b were found to have CT-AC splice sites that do not follow the GT-AG rule of conventional donor and acceptor splice sites. Of the five types of 5' variable transcripts identified, RT-PCR showed that type-a was the major transcript type. Four transcription initiation sites (A-D) for type-a transcript were identified by 5'-RACE using GeneRacer RACE Ready cDNA (human liver) constructed by the oligo-capping method. Putative TATA boxes were located approximately 30 bp upstream from the transcriptional start sites of the major transcript (C) and the longest minor transcript (A) expressed in the human liver. These results indicate that the initiation of transcription of human PXR is more complex than previously reported.
Purifying Selection on Exonic Splice Enhancers in Intronless Genes
Savisaar, Rosina; Hurst, Laurence D.
2016-01-01
Exonic splice enhancers (ESEs) are short nucleotide motifs, enriched near exon ends, that enhance the recognition of the splice site and thus promote splicing. Are intronless genes under selection to avoid these motifs so as not to attract the splicing machinery to an mRNA that should not be spliced, thereby preventing the production of an aberrant transcript? Consistent with this possibility, we find that ESEs in putative recent retrocopies are at a higher density and evolving faster than those in other intronless genes, suggesting that they are being lost. Moreover, intronless genes are less dense in putative ESEs than intron-containing ones. However, this latter difference is likely due to the skewed base composition of intronless sequences, a skew that is in line with the general GC richness of few exon genes. Indeed, after controlling for such biases, we find that both intronless and intron-containing genes are denser in ESEs than expected by chance. Importantly, nucleotide-controlled analysis of evolutionary rates at synonymous sites in ESEs indicates that the ESEs in intronless genes are under purifying selection in both human and mouse. We conclude that on the loss of introns, some but not all, ESE motifs are lost, the remainder having functions beyond a role in splice promotion. These results have implications for the design of intronless transgenes and for understanding the causes of selection on synonymous sites. PMID:26802218
Long-read sequencing of nascent RNA reveals coupling among RNA processing events.
Herzel, Lydia; Straube, Korinna; Neugebauer, Karla M
2018-06-14
Pre-mRNA splicing is accomplished by the spliceosome, a megadalton complex that assembles de novo on each intron. Because spliceosome assembly and catalysis occur cotranscriptionally, we hypothesized that introns are removed in the order of their transcription in genomes dominated by constitutive splicing. Remarkably little is known about splicing order and the regulatory potential of nascent transcript remodeling by splicing, due to the limitations of existing methods that focus on analysis of mature splicing products (mRNAs) rather than substrates and intermediates. Here, we overcome this obstacle through long-read RNA sequencing of nascent, multi-intron transcripts in the fission yeast Schizosaccharomyces pombe Most multi-intron transcripts were fully spliced, consistent with rapid cotranscriptional splicing. However, an unexpectedly high proportion of transcripts were either fully spliced or fully unspliced, suggesting that splicing of any given intron is dependent on the splicing status of other introns in the transcript. Supporting this, mild inhibition of splicing by a temperature-sensitive mutation in prp2 , the homolog of vertebrate U2AF65, increased the frequency of fully unspliced transcripts. Importantly, fully unspliced transcripts displayed transcriptional read-through at the polyA site and were degraded cotranscriptionally by the nuclear exosome. Finally, we show that cellular mRNA levels were reduced in genes with a high number of unspliced nascent transcripts during caffeine treatment, showing regulatory significance of cotranscriptional splicing. Therefore, overall splicing of individual nascent transcripts, 3' end formation, and mRNA half-life depend on the splicing status of neighboring introns, suggesting crosstalk among spliceosomes and the polyA cleavage machinery during transcription elongation. © 2018 Herzel et al.; Published by Cold Spring Harbor Laboratory Press.
Rice, Michael; Gladstone, William; Weir, Michael
2004-01-01
We discuss how relational databases constitute an ideal framework for representing and analyzing large-scale genomic data sets in biology. As a case study, we describe a Drosophila splice-site database that we recently developed at Wesleyan University for use in research and teaching. The database stores data about splice sites computed by a custom algorithm using Drosophila cDNA transcripts and genomic DNA and supports a set of procedures for analyzing splice-site sequence space. A generic Web interface permits the execution of the procedures with a variety of parameter settings and also supports custom structured query language queries. Moreover, new analytical procedures can be added by updating special metatables in the database without altering the Web interface. The database provides a powerful setting for students to develop informatic thinking skills.
2004-01-01
We discuss how relational databases constitute an ideal framework for representing and analyzing large-scale genomic data sets in biology. As a case study, we describe a Drosophila splice-site database that we recently developed at Wesleyan University for use in research and teaching. The database stores data about splice sites computed by a custom algorithm using Drosophila cDNA transcripts and genomic DNA and supports a set of procedures for analyzing splice-site sequence space. A generic Web interface permits the execution of the procedures with a variety of parameter settings and also supports custom structured query language queries. Moreover, new analytical procedures can be added by updating special metatables in the database without altering the Web interface. The database provides a powerful setting for students to develop informatic thinking skills. PMID:15592597
Liu, Kaiyu; Li, Yi; Jousset, Françoise-Xavière; Zadori, Zoltan; Szelei, Jozsef; Yu, Qian; Pham, Hanh Thi; Lépine, François; Bergoin, Max; Tijssen, Peter
2011-01-01
The Acheta domesticus densovirus (AdDNV), isolated from crickets, has been endemic in Europe for at least 35 years. Severe epizootics have also been observed in American commercial rearings since 2009 and 2010. The AdDNV genome was cloned and sequenced for this study. The transcription map showed that splicing occurred in both the nonstructural (NS) and capsid protein (VP) multicistronic RNAs. The splicing pattern of NS mRNA predicted 3 nonstructural proteins (NS1 [576 codons], NS2 [286 codons], and NS3 [213 codons]). The VP gene cassette contained two VP open reading frames (ORFs), of 597 (ORF-A) and 268 (ORF-B) codons. The VP2 sequence was shown by N-terminal Edman degradation and mass spectrometry to correspond with ORF-A. Mass spectrometry, sequencing, and Western blotting of baculovirus-expressed VPs versus native structural proteins demonstrated that the VP1 structural protein was generated by joining ORF-A and -B via splicing (splice II), eliminating the N terminus of VP2. This splice resulted in a nested set of VP1 (816 codons), VP3 (467 codons), and VP4 (429 codons) structural proteins. In contrast, the two splices within ORF-B (Ia and Ib) removed the donor site of intron II and resulted in VP2, VP3, and VP4 expression. ORF-B may also code for several nonstructural proteins, of 268, 233, and 158 codons. The small ORF-B contains the coding sequence for a phospholipase A2 motif found in VP1, which was shown previously to be critical for cellular uptake of the virus. These splicing features are unique among parvoviruses and define a new genus of ambisense densoviruses. PMID:21775445
Ajiro, Masahiko; Jia, Rong; Zhang, Lifang; Liu, Xuefeng; Zheng, Zhi-Ming
2012-01-01
HPV16 E6 and E7, two viral oncogenes, are expressed from a single bicistronic pre-mRNA. In this report, we provide the evidence that the bicistronic pre-mRNA intron 1 contains three 5′ splice sites (5′ ss) and three 3′ splice sites (3′ ss) normally used in HPV16+ cervical cancer and its derived cell lines. The choice of two novel alternative 5′ ss (nt 221 5′ ss and nt 191 5′ ss) produces two novel isoforms of E6E7 mRNAs (E6*V and E6*VI). The nt 226 5′ ss and nt 409 3′ ss is preferentially selected over the other splice sites crossing over the intron to excise a minimal length of the intron in RNA splicing. We identified AACAAAC as the preferred branch point sequence (BPS) and an adenosine at nt 385 (underlined) in the BPS as a branch site to dictate the selection of the nt 409 3′ ss for E6*I splicing and E7 expression. Introduction of point mutations into the mapped BPS led to reduced U2 binding to the BPS and thereby inhibition of the second step of E6E7 splicing at the nt 409 3′ ss. Importantly, the E6E7 bicistronic RNA with a mutant BPS and inefficient splicing makes little or no E7 and the resulted E6 with mutations of 91QYNK94 to 91PSFW94 displays attenuate activity on p53 degradation. Together, our data provide structural basis of the E6E7 intron 1 for better understanding of how viral E6 and E7 expression is regulated by alternative RNA splicing. This study elucidates for the first time a mapped branch point in HPV16 genome involved in viral oncogene expression. PMID:23056301
Saravanaperumal, Siva Arumugam; Pediconi, Dario; Renieri, Carlo; La Terza, Antonietta
2012-01-01
Stem cell factor (SCF) is a growth factor, essential for haemopoiesis, mast cell development and melanogenesis. In the hematopoietic microenvironment (HM), SCF is produced either as a membrane-bound (−) or soluble (+) forms. Skin expression of SCF stimulates melanocyte migration, proliferation, differentiation, and survival. We report for the first time, a novel mRNA splice variant of SCF from the skin of white merino sheep via cloning and sequencing. Reverse transcriptase (RT)-PCR and molecular prediction revealed two different cDNA products of SCF. Full-length cDNA libraries were enriched by the method of rapid amplification of cDNA ends (RACE-PCR). Nucleotide sequencing and molecular prediction revealed that the primary 1519 base pair (bp) cDNA encodes a precursor protein of 274 amino acids (aa), commonly known as ‘soluble’ isoform. In contrast, the shorter (835 and/or 725 bp) cDNA was found to be a ‘novel’ mRNA splice variant. It contains an open reading frame (ORF) corresponding to a truncated protein of 181 aa (vs 245 aa) with an unique C-terminus lacking the primary proteolytic segment (28 aa) right after the D175G site which is necessary to produce ‘soluble’ form of SCF. This alternative splice (AS) variant was explained by the complete nucleotide sequencing of splice junction covering exon 5-intron (5)-exon 6 (948 bp) with a premature termination codon (PTC) whereby exons 6 to 9/10 are skipped (Cassette Exon, CE 6–9/10). We also demonstrated that the Northern blot analysis at transcript level is mediated via an intron-5 splicing event. Our data refine the structure of SCF gene; clarify the presence (+) and/or absence (−) of primary proteolytic-cleavage site specific SCF splice variants. This work provides a basis for understanding the functional role and regulation of SCF in hair follicle melanogenesis in sheep beyond what was known in mice, humans and other mammals. PMID:22719917
Shen, Yingfang; Wu, Xiaopei; Liu, Demei; Song, Shengjing; Liu, Dengcai; Wang, Haiqing
2016-05-27
Histone methylation is an epigenetic modification mechanism that regulates gene expression in eukaryotic cells. Jumonji C domain-containing demethylases are involved in removal of methyl groups at lysine or arginine residues. The JmjC domain-only member, JMJ30/JMJD5 of Arabidopsis, is a component of the plant circadian clock. Although some plant circadian clock genes undergo alternative splicing in response to external cues, there is no evidence that JMJ30/JMJD5 is regulated by alternative splicing. In this study, the expression of an Arabidopsis JMJ30/JMJD5 ortholog in Medicago truncatula, MtJMJC5, in response to circadian clock and abiotic stresses were characterized. The results showed that MtJMJC5 oscillates with a circadian rhythm, and undergoes cold specifically induced alternative splicing. The cold-induced alternative splicing could be reversed after ambient temperature returning to the normal. Sequencing results revealed four alternative splicing RNA isoforms including a full-length authentic protein encoding variant, and three premature termination condon-containing variants due to alternative 3' splice sites at the first and second intron. Under cold treatment, the variants that share a common 3' alternative splicing site at the second intron were intensively up-regulated while the authentic protein encoding variant and the premature termination condon-containing variant only undergoing a 3' alternative splicing at the first intron were down regulated. Although all the premature termination condon-harboring alternative splicing variants were sensitive to nonsense-mediated decay, the premature termination codon-harboring alternative splicing variants sharing the 3' alternative splicing site at the second intron showed less sensitivity than the one only containing the 3' alternative slicing site at the first intron under cold treatment. These results suggest that the cold-dependent alternative splicing of MtJMJC5 is likely a species or genus-specific mechanism of gene expression regulation on RNA levels, and might play a role in epigenetic regulation of the link between the circadian clock and ambient temperature fluctuation in Medicago. Copyright © 2016 Elsevier Inc. All rights reserved.
A mechanism underlying position-specific regulation of alternative splicing
Hamid, Fursham M.
2017-01-01
Abstract Many RNA-binding proteins including a master regulator of splicing in developing brain and muscle, polypyrimidine tract-binding protein 1 (PTBP1), can either activate or repress alternative exons depending on the pre-mRNA recruitment position. When bound upstream or within regulated exons PTBP1 tends to promote their skipping, whereas binding to downstream sites often stimulates inclusion. How this switch is orchestrated at the molecular level is poorly understood. Using bioinformatics and biochemical approaches we show that interaction of PTBP1 with downstream intronic sequences can activate natural cassette exons by promoting productive docking of the spliceosomal U1 snRNP to a suboptimal 5′ splice site. Strikingly, introducing upstream PTBP1 sites to this circuitry leads to a potent splicing repression accompanied by the assembly of an exonic ribonucleoprotein complex with a tightly bound U1 but not U2 snRNP. Our data suggest a molecular mechanism underlying the transition between a better-known repressive function of PTBP1 and its role as a bona fide splicing activator. More generally, we argue that the functional outcome of individual RNA contacts made by an RNA-binding protein is subject to extensive context-specific modulation.
Permanent Neonatal Diabetes Caused by Creation of an Ectopic Splice Site within the INS Gene
Gastaldo, Elena; Harries, Lorna W.; Rubio-Cabezas, Oscar; Castaño, Luis
2012-01-01
Background The aim of this study was to characterize the genetic etiology in a patient who presented with permanent neonatal diabetes at 2 months of age. Methodology/Principal Findings Regulatory elements and coding exons 2 and 3 of the INS gene were amplified and sequenced from genomic and complementary DNA samples. A novel heterozygous INS mutation within the terminal intron of the gene was identified in the proband and her affected father. This mutation introduces an ectopic splice site leading to the insertion of 29 nucleotides from the intronic sequence into the mature mRNA, which results in a longer and abnormal transcript. Conclusions/Significance This study highlights the importance of routinely sequencing the exon-intron boundaries and the need to carry out additional studies to confirm the pathogenicity of any identified intronic genetic variants. PMID:22235272
In silico prediction of splice-altering single nucleotide variants in the human genome.
Jian, Xueqiu; Boerwinkle, Eric; Liu, Xiaoming
2014-12-16
In silico tools have been developed to predict variants that may have an impact on pre-mRNA splicing. The major limitation of the application of these tools to basic research and clinical practice is the difficulty in interpreting the output. Most tools only predict potential splice sites given a DNA sequence without measuring splicing signal changes caused by a variant. Another limitation is the lack of large-scale evaluation studies of these tools. We compared eight in silico tools on 2959 single nucleotide variants within splicing consensus regions (scSNVs) using receiver operating characteristic analysis. The Position Weight Matrix model and MaxEntScan outperformed other methods. Two ensemble learning methods, adaptive boosting and random forests, were used to construct models that take advantage of individual methods. Both models further improved prediction, with outputs of directly interpretable prediction scores. We applied our ensemble scores to scSNVs from the Catalogue of Somatic Mutations in Cancer database. Analysis showed that predicted splice-altering scSNVs are enriched in recurrent scSNVs and known cancer genes. We pre-computed our ensemble scores for all potential scSNVs across the human genome, providing a whole genome level resource for identifying splice-altering scSNVs discovered from large-scale sequencing studies.
Spliced RNA of woodchuck hepatitis virus.
Ogston, C W; Razman, D G
1992-07-01
Polymerase chain reaction was used to investigate RNA splicing in liver of woodchucks infected with woodchuck hepatitis virus (WHV). Two spliced species were detected, and the splice junctions were sequenced. The larger spliced RNA has an intron of 1300 nucleotides, and the smaller spliced sequence shows an additional downstream intron of 1104 nucleotides. We did not detect singly spliced sequences from which the smaller intron alone was removed. Control experiments showed that spliced sequences are present in both RNA and DNA in infected liver, showing that the viral reverse transcriptase can use spliced RNA as template. Spliced sequences were detected also in virion DNA prepared from serum. The upstream intron produces a reading frame that fuses the core to the polymerase polypeptide, while the downstream intron causes an inframe deletion in the polymerase open reading frame. Whereas the splicing patterns in WHV are superficially similar to those reported recently in hepatitis B virus, we detected no obvious homology in the coding capacity of spliced RNAs from these two viruses.
Arnaud, Lionel; Salachas, François; Lucien, Nicole; Maisonobe, Thierry; Le Pennec, Pierre-Yves; Babinet, Jérôme; Cartron, Jean-Pierre
2009-03-01
McLeod syndrome is a rare X-linked neuroacanthocytosis syndrome with hematologic, muscular, and neurologic manifestations. McLeod syndrome is caused by mutations in the XK gene whose product is expressed at the red blood cell (RBC) surface but whose function is currently unknown. A variety of XK mutations has been reported but no clear phenotype-genotype correlation has been found, especially for the point mutations affecting splicing sites. A man suspected of neuroacanthocytosis was evaluated by neurologic examination, electromyography, muscle biopsy, muscle computed tomography, and cerebral magnetic resonance imaging. The McLeod RBC phenotype was disclosed by blood smear and immunohematology analyses and then confirmed at the biochemical level by Western blot analysis. The responsible XK mutation was characterized at the mRNA level by reverse transcription-polymerase chain reaction (PCR), identified by genomic DNA sequencing, and verified by allele-specific PCR. A novel XK splice site mutation (IVS1-1G>A) has been identified in a McLeod patient who has developed hematologic, neuromuscular, and neurologic symptoms. This is the first reported example of a XK point mutation affecting the 3' acceptor splice site of Intron 1, and it was demonstrated that this mutation indeed induces aberrant splicing of XK RNA and lack of XK protein at the RBC membrane. The detailed characterization at the molecular biology level of this novel XK splice site mutation associated with the clinical description of the patient contributes to a better understanding of the phenotype-genotype correlation in the McLeod syndrome.
López-Urrutia, Eduardo; Valdés, Jesús; Bonilla-Moreno, Raúl; Martínez-Salazar, Martha; Martínez-Garcia, Martha; Berumen, Jaime; Villegas-Sepúlveda, Nicolás
2012-06-01
The HPV-16 E6/E7 genes, which contain intron 1, are processed by alternative splicing and its transcripts are detected with a heterogeneous profile in tumours cells. Frequently, the HPV-16 positive carcinoma cells bear viral variants that contain single nucleotide polymorphisms into its DNA sequence. We were interested in analysing the contribution of this polymorphism to the heterogeneity in the pattern of the E6/E7 spliced transcripts. Using the E6/E7 sequences from three closely related HPV-16 variants, we have shown that a few nucleotide changes are sufficient to produce heterogeneity in the splicing profile. Furthermore, using mutants that contained a single SNP, we also showed that one nucleotide change was sufficient to reproduce the heterogeneous splicing profile. Additionally, a difference of two or three SNPs among these viral sequences was sufficient to recruit differentially several splicing factors to the polymorphic E6/E7 transcripts. Moreover, only one SNP was sufficient to alter the binding site of at least one splicing factor, changing the ability of splicing factors to bind the transcript. Finally, the factors that were differentially bound to the short form of intron 1 of one of these E6/E7 variants were identified as TIA1 and/or TIAR and U1-70k, while U2AF65, U5-52k and PTB were preferentially bound to the transcript of the other variants. Copyright © 2012 Elsevier B.V. All rights reserved.
RAP: RNA-Seq Analysis Pipeline, a new cloud-based NGS web application
2015-01-01
Background The study of RNA has been dramatically improved by the introduction of Next Generation Sequencing platforms allowing massive and cheap sequencing of selected RNA fractions, also providing information on strand orientation (RNA-Seq). The complexity of transcriptomes and of their regulative pathways make RNA-Seq one of most complex field of NGS applications, addressing several aspects of the expression process (e.g. identification and quantification of expressed genes and transcripts, alternative splicing and polyadenylation, fusion genes and trans-splicing, post-transcriptional events, etc.). Moreover, the huge volume of data generated by NGS platforms introduces unprecedented computational and technological challenges to efficiently analyze and store sequence data and results. Methods In order to provide researchers with an effective and friendly resource for analyzing RNA-Seq data, we present here RAP (RNA-Seq Analysis Pipeline), a cloud computing web application implementing a complete but modular analysis workflow. This pipeline integrates both state-of-the-art bioinformatics tools for RNA-Seq analysis and in-house developed scripts to offer to the user a comprehensive strategy for data analysis. RAP is able to perform quality checks (adopting FastQC and NGS QC Toolkit), identify and quantify expressed genes and transcripts (with Tophat, Cufflinks and HTSeq), detect alternative splicing events (using SpliceTrap) and chimeric transcripts (with ChimeraScan). This pipeline is also able to identify splicing junctions and constitutive or alternative polyadenylation sites (implementing custom analysis modules) and call for statistically significant differences in genes and transcripts expression, splicing pattern and polyadenylation site usage (using Cuffdiff2 and DESeq). Results Through a user friendly web interface, the RAP workflow can be suitably customized by the user and it is automatically executed on our cloud computing environment. This strategy allows to access to bioinformatics tools and computational resources without specific bioinformatics and IT skills. RAP provides a set of tabular and graphical results that can be helpful to browse, filter and export analyzed data, according to the user needs. PMID:26046471
X-linked Alport syndrome caused by splicing mutations in COL4A5.
Nozu, Kandai; Vorechovsky, Igor; Kaito, Hiroshi; Fu, Xue Jun; Nakanishi, Koichi; Hashimura, Yuya; Hashimoto, Fusako; Kamei, Koichi; Ito, Shuichi; Kaku, Yoshitsugu; Imasawa, Toshiyuki; Ushijima, Katsumi; Shimizu, Junya; Makita, Yoshio; Konomoto, Takao; Yoshikawa, Norishige; Iijima, Kazumoto
2014-11-07
X-linked Alport syndrome is caused by mutations in the COL4A5 gene. Although many COL4A5 mutations have been detected, the mutation detection rate has been unsatisfactory. Some men with X-linked Alport syndrome show a relatively mild phenotype, but molecular basis investigations have rarely been conducted to clarify the underlying mechanism. In total, 152 patients with X-linked Alport syndrome who were suspected of having Alport syndrome through clinical and pathologic investigations and referred to the hospital for mutational analysis between January of 2006 and January of 2013 were genetically diagnosed. Among those patients, 22 patients had suspected splice site mutations. Transcripts are routinely examined when suspected splice site mutations for abnormal transcripts are detected; 11 of them showed expected exon skipping, but others showed aberrant splicing patterns. The mutation detection strategy had two steps: (1) genomic DNA analysis using PCR and direct sequencing and (2) mRNA analysis using RT-PCR to detect RNA processing abnormalities. Six splicing consensus site mutations resulting in aberrant splicing patterns, one exonic mutation leading to exon skipping, and four deep intronic mutations producing cryptic splice site activation were identified. Interestingly, one case produced a cryptic splice site with a single nucleotide substitution in the deep intron that led to intronic exonization containing a stop codon; however, the patient showed a clearly milder phenotype for X-linked Alport syndrome in men with a truncating mutation. mRNA extracted from the kidney showed both normal and abnormal transcripts, with the normal transcript resulting in the milder phenotype. This novel mechanism leads to mild clinical characteristics. This report highlights the importance of analyzing transcripts to enhance the mutation detection rate and provides insight into genotype-phenotype correlations. This approach can clarify the cause of atypically mild phenotypes in X-linked Alport syndrome. Copyright © 2014 by the American Society of Nephrology.
Tang, Rongying; Prosser, Debra O.; Love, Donald R.
2016-01-01
The increasing diagnostic use of gene sequencing has led to an expanding dataset of novel variants that lie within consensus splice junctions. The challenge for diagnostic laboratories is the evaluation of these variants in order to determine if they affect splicing or are merely benign. A common evaluation strategy is to use in silico analysis, and it is here that a number of programmes are available online; however, currently, there are no consensus guidelines on the selection of programmes or protocols to interpret the prediction results. Using a collection of 222 pathogenic mutations and 50 benign polymorphisms, we evaluated the sensitivity and specificity of four in silico programmes in predicting the effect of each variant on splicing. The programmes comprised Human Splice Finder (HSF), Max Entropy Scan (MES), NNSplice, and ASSP. The MES and ASSP programmes gave the highest performance based on Receiver Operator Curve analysis, with an optimal cut-off of score reduction of 10%. The study also showed that the sensitivity of prediction is affected by the level of conservation of individual positions, with in silico predictions for variants at positions −4 and +7 within consensus splice sites being largely uninformative. PMID:27313609
Legendre, Marine; Rodriguez-Ballesteros, Montserrat; Rossi, Massimiliano; Abadie, Véronique; Amiel, Jeanne; Revencu, Nicole; Blanchet, Patricia; Brioude, Frédéric; Delrue, Marie-Ange; Doubaj, Yassamine; Sefiani, Abdelaziz; Francannet, Christine; Holder-Espinasse, Muriel; Jouk, Pierre-Simon; Julia, Sophie; Melki, Judith; Mur, Sébastien; Naudion, Sophie; Fabre-Teste, Jennifer; Busa, Tiffany; Stamm, Stephen; Lyonnet, Stanislas; Attie-Bitach, Tania; Kitzis, Alain; Gilbert-Dussardier, Brigitte; Bilan, Frédéric
2018-02-01
CHARGE syndrome is a rare genetic disorder mainly due to de novo and private truncating mutations of CHD7 gene. Here we report an intriguing hot spot of intronic mutations (c.5405-7G > A, c.5405-13G > A, c.5405-17G > A and c.5405-18C > A) located in CHD7 IVS25. Combining computational in silico analysis, experimental branch-point determination and in vitro minigene assays, our study explains this mutation hot spot by a particular genomic context, including the weakness of the IVS25 natural acceptor-site and an unconventional lariat sequence localized outside the common 40 bp upstream the acceptor splice site. For each of the mutations reported here, bioinformatic tools indicated a newly created 3' splice site, of which the existence was confirmed using pSpliceExpress, an easy-to-use and reliable splicing reporter tool. Our study emphasizes the idea that combining these two complementary approaches could increase the efficiency of routine molecular diagnosis.
Di Giacomo, Daniela; Gaildrat, Pascaline; Abuli, Anna; Abdat, Julie; Frébourg, Thierry; Tosi, Mario; Martins, Alexandra
2013-11-01
Exonic variants can alter pre-mRNA splicing either by changing splice sites or by modifying splicing regulatory elements. Often these effects are difficult to predict and are only detected by performing RNA analyses. Here, we analyzed, in a minigene assay, 26 variants identified in the exon 7 of BRCA2, a cancer predisposition gene. Our results revealed eight new exon skipping mutations in this exon: one directly altering the 5' splice site and seven affecting potential regulatory elements. This brings the number of splicing regulatory mutations detected in BRCA2 exon 7 to a total of 11, a remarkably high number considering the total number of variants reported in this exon (n = 36), all tested in our minigene assay. We then exploited this large set of splicing data to test the predictive value of splicing regulator hexamers' scores recently established by Ke et al. (). Comparisons of hexamer-based predictions with our experimental data revealed high sensitivity in detecting variants that increased exon skipping, an important feature for prescreening variants before RNA analysis. In conclusion, hexamer scores represent a promising tool for predicting the biological consequences of exonic variants and may have important applications for the interpretation of variants detected by high-throughput sequencing. © 2013 WILEY PERIODICALS, INC.
Mechanism for DNA transposons to generate introns on genomic scales
Huff, Jason T.; Zilberman, Daniel; Roy, Scott W.
2017-01-01
Discovered four decades ago, the existence of introns was one of the most unexpected findings in molecular biology1. Introns are sequences interrupting genes that must be removed as part of mRNA production. Genome sequencing projects have documented that most eukaryotic genes contain at least one and frequently many introns2,3. Comparison of these genomes reveals a history of long evolutionary periods with little intron gain punctuated by episodes of rapid, extensive gain2,3. However, no detailed mechanism for such episodic intron generation has been empirically supported on a sufficient scale, despite several proposals4–8. Here we show how short non-autonomous DNA transposons independently generated hundreds to thousands of introns in the prasinophyte Micromonas pusilla and the pelagophyte Aureococcus anophagefferens. Each transposon carries one splice site. The other splice site is co-opted from gene sequence duplicated upon transposon insertion, allowing perfect splicing out of RNA. The distributions of sequences that can be co-opted are biased with respect to codons, and phasing of transposon-generated introns is similarly biased. These transposons insert between preexisting nucleosomes, so that multiple nearby insertions generate nucleosome-sized intervening segments. Thus, transposon insertion and sequence co-option may explain the intron phase biases2 and prevalence of nucleosome-sized exons9 observed in eukaryotes. Overall, the two independent examples of proliferating elements illustrate a general DNA transposon mechanism plausibly accounting for episodes of rapid, extensive intron gain during eukaryotic evolution2,3. PMID:27760113
van der Woerd, Wendy L; Mulder, Johanna; Pagani, Franco; Beuers, Ulrich; Houwen, Roderick H J; van de Graaf, Stan F J
2015-04-01
ATP8B1 deficiency is a severe autosomal recessive liver disease resulting from mutations in the ATP8B1 gene characterized by a continuous phenotypical spectrum from intermittent (benign recurrent intrahepatic cholestasis; BRIC) to progressive familial intrahepatic cholestasis (PFIC). Current therapeutic options are insufficient, and elucidating the molecular consequences of mutations could lead to personalized mutation-specific therapies. We investigated the effect on pre-messenger RNA splicing of 14 ATP8B1 mutations at exon-intron boundaries using an in vitro minigene system. Eleven mutations, mostly associated with a PFIC phenotype, resulted in aberrant splicing and a complete absence of correctly spliced product. In contrast, three mutations led to partially correct splicing and were associated with a BRIC phenotype. These findings indicate an inverse correlation between the level of correctly spliced product and disease severity. Expression of modified U1 small nuclear RNAs (snRNA) complementary to the splice donor sites strongly improved or completely rescued splicing for several ATP8B1 mutations located at donor, as well as acceptor, splice sites. In one case, we also evaluated exon-specific U1 snRNAs that, by targeting nonconserved intronic sequences, might reduce possible off-target events. Although very effective in correcting exon skipping, they also induced retention of the short downstream intron. We systematically characterized the molecular consequences of 14 ATP8B1 mutations at exon-intron boundaries associated with ATP8B1 deficiency and found that the majority resulted in total exon skipping. The amount of correctly spliced product inversely correlated with disease severity. Compensatory modified U1 snRNAs, complementary to mutated donor splice sites, were able to improve exon definition very efficiently and could be a novel therapeutic strategy in ATP8B1 deficiency as well as other genetic diseases. © 2014 by the American Association for the Study of Liver Diseases.
Method of artificial DNA splicing by directed ligation (SDL).
Lebedenko, E N; Birikh, K R; Plutalov, O V; Berlin YuA
1991-01-01
An approach to directed genetic recombination in vitro has been devised, which allows for joining together, in a predetermined way, a series of DNA segments to give a precisely spliced polynucleotide sequence (DNA splicing by directed ligation, SDL). The approach makes use of amplification, by means of several polymerase chain reactions (PCR), of a chosen set of DNA segments. Primers for the amplifications contain recognition sites of the class IIS restriction endonucleases, which transform blunt ends of the amplification products into protruding ends of unique primary structures, the ends to be used for joining segments together being mutually complementary. Ligation of the mixture of the segments so synthesized gives the desired sequence in an unambiguous way. The suggested approach has been exemplified by the synthesis of a totally processed (intronless) gene encoding human mature interleukin-1 alpha. Images PMID:1662363
Rodríguez-Martín, Carlos; Cidre, Florencia; Fernández-Teijeiro, Ana; Gómez-Mariano, Gema; de la Vega, Leticia; Ramos, Patricia; Zaballos, Ángel; Monzón, Sara; Alonso, Javier
2016-05-01
Retinoblastoma (RB, MIM 180200) is the paradigm of hereditary cancer. Individuals harboring a constitutional mutation in one allele of the RB1 gene have a high predisposition to develop RB. Here, we present the first case of familial RB caused by a de novo insertion of a full-length long interspersed element-1 (LINE-1) into intron 14 of the RB1 gene that caused a highly heterogeneous splicing pattern of RB1 mRNA. LINE-1 insertion was inferred by mRNA studies and full-length sequenced by massive parallel sequencing. Some of the aberrant mRNAs were produced by noncanonical acceptor splice sites, a new finding that up to date has not been described to occur upon LINE-1 retrotransposition. Our results clearly show that RNA-based strategies have the potential to detect disease-causing transposon insertions. It also confirms that the incorporation of new genetic approaches, such as massive parallel sequencing, contributes to characterize at the sequence level these unique and exceptional genetic alterations.
RNA splicing. The human splicing code reveals new insights into the genetic determinants of disease.
Xiong, Hui Y; Alipanahi, Babak; Lee, Leo J; Bretschneider, Hannes; Merico, Daniele; Yuen, Ryan K C; Hua, Yimin; Gueroussov, Serge; Najafabadi, Hamed S; Hughes, Timothy R; Morris, Quaid; Barash, Yoseph; Krainer, Adrian R; Jojic, Nebojsa; Scherer, Stephen W; Blencowe, Benjamin J; Frey, Brendan J
2015-01-09
To facilitate precision medicine and whole-genome annotation, we developed a machine-learning technique that scores how strongly genetic variants affect RNA splicing, whose alteration contributes to many diseases. Analysis of more than 650,000 intronic and exonic variants revealed widespread patterns of mutation-driven aberrant splicing. Intronic disease mutations that are more than 30 nucleotides from any splice site alter splicing nine times as often as common variants, and missense exonic disease mutations that have the least impact on protein function are five times as likely as others to alter splicing. We detected tens of thousands of disease-causing mutations, including those involved in cancers and spinal muscular atrophy. Examination of intronic and exonic variants found using whole-genome sequencing of individuals with autism revealed misspliced genes with neurodevelopmental phenotypes. Our approach provides evidence for causal variants and should enable new discoveries in precision medicine. Copyright © 2015, American Association for the Advancement of Science.
Global regulation of alternative RNA splicing by the SR-rich protein RBM39.
Mai, Sanyue; Qu, Xiuhua; Li, Ping; Ma, Qingjun; Cao, Cheng; Liu, Xuan
2016-08-01
RBM39 is a serine/arginine-rich RNA-binding protein that is highly homologous to the splicing factor U2AF65. However, the role of RBM39 in alternative splicing is poorly understood. In this study, RBM39-mediated global alternative splicing was investigated using RNA-Seq and genome-wide RBM39-RNA interactions were mapped via cross-linking and immunoprecipitation coupled with deep sequencing (CLIP-Seq) in wild-type and RBM39-knockdown MCF-7 cells. RBM39 was involved in the up- or down-regulation of the transcript levels of various genes. Hundreds of alternative splicing events regulated by endogenous RBM39 were identified. The majority of these events were cassette exons. Genes containing RBM39-regulated alternative exons were found to be linked to G2/M transition, cellular response to DNA damage, adherens junctions and endocytosis. CLIP-Seq analysis showed that the binding site of RBM39 was mainly in proximity to 5' and 3' splicing sites. Considerable RBM39 binding to mRNAs encoding proteins involved in translation was observed. Of particular importance, ~20% of the alternative splicing events that were significantly regulated by RBM39 were similarly regulated by U2AF65. RBM39 is extensively involved in alternative splicing of RNA and helps regulate transcript levels. RBM39 may modulate alternative splicing similarly to U2AF65 by either directly binding to RNA or recruiting other splicing factors, such as U2AF65. The current study offers a genome-wide view of RBM39's regulatory function in alternative splicing. RBM39 may play important roles in multiple cellular processes by regulating both alternative splicing of RNA molecules and transcript levels. Copyright © 2016 Elsevier B.V. All rights reserved.
Postnatal Expression of V2 Vasopressin Receptor Splice Variants in the Rat Cerebellum
Vargas, Karina J.; Sarmiento, José M.; Ehrenfeld, Pamela; Añazco, Carolina C.; Villanueva, Carolina I.; Carmona, Pamela L.; Brenet, Marianne; Navarro, Javier; Müller-Esterl, Werner; Figueroa, Carlos D.; González, Carlos B.
2010-01-01
The V2 vasopressin receptor gene contains an alternative splice site in exon-3, which leads to the generation of two splice variants (V2a and V2b) first identified in the kidney. The open reading frame of the alternatively spliced V2b transcripten codes a truncated receptor, showing the same amino acid sequence as the canonical V2a receptor up to the 6th transmembrane segment, but displaying a distinct sequence to the corresponding 7th transmembrane segment and C-terminal domain relative to the V2a receptor. Here, we demonstrate the postnatal expression of V2a and V2b variants in the rat cerebellum. Most importantly, we showed by in situ hybridization and immunocytochemistry that both V2 splice variants were preferentially expressed in Purkinje cells, from early to late postnatal development. In addition, both variants were transiently expressed in the neuroblastic external granule cells and Bergmann fibers. These results indicate that the cellular distributions of both splice variants are developmentally regulated, and suggest that the transient expression of the V2 receptor is involved in the mechanisms of cerebellar cytodifferentiation by AVP. Finally, transfected CHO-K1 .expressing similar amounts of both V2 splice variants, as that found in the cerebellum, showed a significant reduction in the surface expression of V2a receptors, suggesting that the differential expression of the V2 splice variants regulate the vasopressin signaling in the cerebellum. PMID:19281786
Succession of splicing regulatory elements determines cryptic 5΄ss functionality
Brillen, Anna-Lena; Schöneweis, Katrin; Walotka, Lara; Hartmann, Linda; Müller, Lisa; Ptok, Johannes; Kaisers, Wolfgang; Poschmann, Gereon; Stühler, Kai; Buratti, Emanuele
2017-01-01
Abstract A critical step in exon definition is the recognition of a proper splice donor (5΄ss) by the 5’ end of U1 snRNA. In the selection of appropriate 5΄ss, cis-acting splicing regulatory elements (SREs) are indispensable. As a model for 5΄ss recognition, we investigated cryptic 5΄ss selection within the human fibrinogen Bβ-chain gene (FGB) exon 7, where we identified several exonic SREs that simultaneously acted on up- and downstream cryptic 5΄ss. In the FGB exon 7 model system, 5΄ss selection iteratively proceeded along an alternating sequence of U1 snRNA binding sites and interleaved SREs which in principle supported different 3’ exon ends. Like in a relay race, SREs either suppressed a potential 5΄ss and passed the splicing baton on or splicing actually occurred. From RNA-Seq data, we systematically selected 19 genes containing exons with silent U1 snRNA binding sites competing with nearby highly used 5΄ss. Extensive SRE analysis by different algorithms found authentic 5΄ss significantly more supported by SREs than silent U1 snRNA binding sites, indicating that our concept may permit generalization to a model for 5΄ss selection and 3’ exon end definition. PMID:28039323
Zhu, Fu-Yuan; Chen, Mo-Xian; Ye, Neng-Hui; Shi, Lu; Ma, Kai-Long; Yang, Jing-Fang; Cao, Yun-Ying; Zhang, Youjun; Yoshida, Takuya; Fernie, Alisdair R; Fan, Guang-Yi; Wen, Bo; Zhou, Ruo; Liu, Tie-Yuan; Fan, Tao; Gao, Bei; Zhang, Di; Hao, Ge-Fei; Xiao, Shi; Liu, Ying-Gao; Zhang, Jianhua
2017-08-01
In eukaryotes, mechanisms such as alternative splicing (AS) and alternative translation initiation (ATI) contribute to organismal protein diversity. Specifically, splicing factors play crucial roles in responses to environment and development cues; however, the underlying mechanisms are not well investigated in plants. Here, we report the parallel employment of short-read RNA sequencing, single molecule long-read sequencing and proteomic identification to unravel AS isoforms and previously unannotated proteins in response to abscisic acid (ABA) treatment. Combining the data from the two sequencing methods, approximately 83.4% of intron-containing genes were alternatively spliced. Two AS types, which are referred to as alternative first exon (AFE) and alternative last exon (ALE), were more abundant than intron retention (IR); however, by contrast to AS events detected under normal conditions, differentially expressed AS isoforms were more likely to be translated. ABA extensively affects the AS pattern, indicated by the increasing number of non-conventional splicing sites. This work also identified thousands of unannotated peptides and proteins by ATI based on mass spectrometry and a virtual peptide library deduced from both strands of coding regions within the Arabidopsis genome. The results enhance our understanding of AS and alternative translation mechanisms under normal conditions, and in response to ABA treatment. © 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.
RAP: RNA-Seq Analysis Pipeline, a new cloud-based NGS web application.
D'Antonio, Mattia; D'Onorio De Meo, Paolo; Pallocca, Matteo; Picardi, Ernesto; D'Erchia, Anna Maria; Calogero, Raffaele A; Castrignanò, Tiziana; Pesole, Graziano
2015-01-01
The study of RNA has been dramatically improved by the introduction of Next Generation Sequencing platforms allowing massive and cheap sequencing of selected RNA fractions, also providing information on strand orientation (RNA-Seq). The complexity of transcriptomes and of their regulative pathways make RNA-Seq one of most complex field of NGS applications, addressing several aspects of the expression process (e.g. identification and quantification of expressed genes and transcripts, alternative splicing and polyadenylation, fusion genes and trans-splicing, post-transcriptional events, etc.). In order to provide researchers with an effective and friendly resource for analyzing RNA-Seq data, we present here RAP (RNA-Seq Analysis Pipeline), a cloud computing web application implementing a complete but modular analysis workflow. This pipeline integrates both state-of-the-art bioinformatics tools for RNA-Seq analysis and in-house developed scripts to offer to the user a comprehensive strategy for data analysis. RAP is able to perform quality checks (adopting FastQC and NGS QC Toolkit), identify and quantify expressed genes and transcripts (with Tophat, Cufflinks and HTSeq), detect alternative splicing events (using SpliceTrap) and chimeric transcripts (with ChimeraScan). This pipeline is also able to identify splicing junctions and constitutive or alternative polyadenylation sites (implementing custom analysis modules) and call for statistically significant differences in genes and transcripts expression, splicing pattern and polyadenylation site usage (using Cuffdiff2 and DESeq). Through a user friendly web interface, the RAP workflow can be suitably customized by the user and it is automatically executed on our cloud computing environment. This strategy allows to access to bioinformatics tools and computational resources without specific bioinformatics and IT skills. RAP provides a set of tabular and graphical results that can be helpful to browse, filter and export analyzed data, according to the user needs.
CryoEM structure of the spliceosome immediately after branching
Galej, Wojciech P.; Wilkinson, Max E.; Fica, Sebastian M.; Oubridge, Chris; Newman, Andrew J.; Nagai, Kiyoshi
2016-01-01
Pre-mRNA splicing proceeds by two consecutive trans-esterification reactions via a lariat-intron intermediate. We present the 3.8Å cryoEM structure of the spliceosome immediately after lariat formation. The 5’-splice site is cleaved but remains close to the catalytic Mg2+ site in the U2/U6 snRNA triplex, and the 5’-phosphate of the intron nucleotide G(+1) is linked to the branch adenosine 2’OH. The 5’-exon is held between the Prp8 N-terminal and Linker domains, and base-pairs with U5 snRNA loop 1. Non-Watson-Crick interactions between the branch helix and 5’-splice site dock the branch adenosine into the active site, while intron nucleotides +3 to +6 base-pair with the U6 snRNA ACAGAGA sequence. Isy1 and the step one factors Yju2 and Cwc25 stabilise docking of the branch helix. The intron downstream of the branch site emerges between the Prp8 RT and Linker domains and extends towards Prp16 helicase, suggesting a plausible mechanism of remodelling before exon ligation. PMID:27459055
hnRNP L regulates differences in expression of mouse integrin alpha2beta1.
Cheli, Yann; Kunicki, Thomas J
2006-06-01
There is a 2-fold variation in platelet integrin alpha2beta1 levels among inbred mouse strains. Decreased alpha2beta1 in 4 strains carrying Itga2 haplotype 2 results from decreased affinity of heterogeneous ribonucleoprotein L (hnRNP L) for a 6 CA repeat sequence (CA6) within intron 1. Seven strains bearing haplotype 1 and a 21 CA repeat sequence at this position (CA21) express twice the level of platelet alpha2beta1 and exhibit an equivalent gain of platelet function in vitro. By UV crosslinking and immunoprecipitation, hnRNP L binds more avidly to CA21, relative to CA6. By cell-free, in vitro mRNA splicing, decreased binding of hnRNP L results in decreased splicing efficiency and an increased proportion of alternatively spliced product. The splicing enhancer activity of CA21 in vivo is abolished by prior treatment with hnRNP L-specific siRNA. Thus, decreased surface alpha2beta1 results from decreased Itga2 pre-mRNA splicing regulated by hnRNP L and depends on CA repeat length at a specific site in intron 1.
hnRNP L regulates differences in expression of mouse integrin α2β1
Cheli, Yann; Kunicki, Thomas J.
2006-01-01
There is a 2-fold variation in platelet integrin α2β1 levels among inbred mouse strains. Decreased α2β1 in 4 strains carrying Itga2 haplotype 2 results from decreased affinity of heterogeneous ribonucleoprotein L (hnRNP L) for a 6 CA repeat sequence (CA6) within intron 1. Seven strains bearing haplotype 1 and a 21 CA repeat sequence at this position (CA21) express twice the level of platelet α2β1 and exhibit an equivalent gain of platelet function in vitro. By UV crosslinking and immunoprecipitation, hnRNP L binds more avidly to CA21, relative to CA6. By cell-free, in vitro mRNA splicing, decreased binding of hnRNP L results in decreased splicing efficiency and an increased proportion of alternatively spliced product. The splicing enhancer activity of CA21 in vivo is abolished by prior treatment with hnRNP L–specific siRNA. Thus, decreased surface α2β1 results from decreased Itga2 pre-mRNA splicing regulated by hnRNP L and depends on CA repeat length at a specific site in intron 1. PMID:16455949
Novel variants in PAX6 gene caused congenital aniridia in two Chinese families.
Zhang, R; Linpeng, S; Wei, X; Li, H; Huang, Y; Guo, J; Wu, Q; Liang, D; Wu, L
2017-06-01
PurposeTo reveal the underlying genetic defect in two four-generation Chinese families with aniridia and explore the pathologic mechanism.MethodsFull ophthalmic examinations were performed in two families with aniridia. The PAX6 gene was directly sequenced in patients of two families, and the detected variants were screened in unaffected family members and two hundred unrelated healthy controls. Real-time quantitative PCR was used to explore pathologic mechanisms of the two variants.ResultsAniridia, cataract, and oscillatory nystagmus were observed in patients of the two families. In addition, we observed corneal opacity and microphthalmus in family 1, and strabismus, left ectopia lentis, microphthalmus, and microcornea in family 2. Sanger sequencing detected a novel 1-bp duplication (c.50dupA) in family 1 and a novel 2-bp splice site deletion (c.765+1_765+2delGT) in family 2. Sequencing of cDNA indicated skipping of exon 9 caused by the splice site deletion, being predicted to cause a premature stop codon, as well as the duplication. The PAX6 mRNA significantly lower in patients with aniridia than in unaffected family members in both families, suggesting that the duplication and splice site deletion caused nonsense-mediated mRNA decay.ConclusionsOur study identified two novel PAX6 variants in two families with aniridia and revealed the pathogenicity of the variants; this would expand the variant spectrum of PAX6 and help us better understand the molecular basis of aniridia, thus facilitating genetic counseling.
Wang, Dan; Liang, Shengyun; Zhang, Zhao; Zhao, Guoru; Hu, Yuan; Liang, Shengran; Zhang, Xipeng; Banerjee, Santasree
2017-03-28
Familial adenomatous polyposis (FAP) is an autosomal dominant precancerous condition, clinically characterized by the presence of multiple colorectal adenomas or polyps. Patients with FAP has a high risk of developing colorectal cancer (CRC) from these colorectal adenomatous polyps by the mean age of diagnosis at 40 years. Germline mutations of the APC gene cause familial adenomatous polyposis (FAP). Colectomy has recommended for the FAP patients with significant polyposis. Here, we present a clinical molecular study of a four generation Chinese family with FAP. Clinical diagnosis of FAP has been done according to the phenotype, family history and medical records. Patient's blood samples were collected and genomic DNA was extracted. In order to identify the pathogenic mutation underlying the disease phenotype targeted next-generation sequencing and confirmatory sanger sequencing has undertaken. Targeted next generation sequencing identified a novel heterozygous splice-acceptor site mutation [c.1744-1G>A] in intron 14 of APC gene, which is co-segregated with the FAP phenotypes in the proband and amongst all the affected family members. This mutation is not present in unaffected family members and in normal healthy controls of same ethnic origin. According to the LOVD database for Chinese colorectal cancer patients, in Chinese population, 60% of the previously reported APC gene mutations causes FAP, are missense mutations. This novel splice-acceptor site mutation causing FAP in this Chinese family expands the germline mutation spectrum of the APC gene in the Chinese population.
Spliced integrated retrotransposed element (SpIRE) formation in the human genome.
Larson, Peter A; Moldovan, John B; Jasti, Naveen; Kidd, Jeffrey M; Beck, Christine R; Moran, John V
2018-03-01
Human Long interspersed element-1 (L1) retrotransposons contain an internal RNA polymerase II promoter within their 5' untranslated region (UTR) and encode two proteins, (ORF1p and ORF2p) required for their mobilization (i.e., retrotransposition). The evolutionary success of L1 relies on the continuous retrotransposition of full-length L1 mRNAs. Previous studies identified functional splice donor (SD), splice acceptor (SA), and polyadenylation sequences in L1 mRNA and provided evidence that a small number of spliced L1 mRNAs retrotransposed in the human genome. Here, we demonstrate that the retrotransposition of intra-5'UTR or 5'UTR/ORF1 spliced L1 mRNAs leads to the generation of spliced integrated retrotransposed elements (SpIREs). We identified a new intra-5'UTR SpIRE that is ten times more abundant than previously identified SpIREs. Functional analyses demonstrated that both intra-5'UTR and 5'UTR/ORF1 SpIREs lack Cis-acting transcription factor binding sites and exhibit reduced promoter activity. The 5'UTR/ORF1 SpIREs also produce nonfunctional ORF1p variants. Finally, we demonstrate that sequence changes within the L1 5'UTR over evolutionary time, which permitted L1 to evade the repressive effects of a host protein, can lead to the generation of new L1 splicing events, which, upon retrotransposition, generates a new SpIRE subfamily. We conclude that splicing inhibits L1 retrotransposition, SpIREs generally represent evolutionary "dead-ends" in the L1 retrotransposition process, mutations within the L1 5'UTR alter L1 splicing dynamics, and that retrotransposition of the resultant spliced transcripts can generate interindividual genomic variation.
Spliced integrated retrotransposed element (SpIRE) formation in the human genome
Larson, Peter A.; Moldovan, John B.; Jasti, Naveen; Kidd, Jeffrey M.; Beck, Christine R.; Moran, John V.
2018-01-01
Human Long interspersed element-1 (L1) retrotransposons contain an internal RNA polymerase II promoter within their 5′ untranslated region (UTR) and encode two proteins, (ORF1p and ORF2p) required for their mobilization (i.e., retrotransposition). The evolutionary success of L1 relies on the continuous retrotransposition of full-length L1 mRNAs. Previous studies identified functional splice donor (SD), splice acceptor (SA), and polyadenylation sequences in L1 mRNA and provided evidence that a small number of spliced L1 mRNAs retrotransposed in the human genome. Here, we demonstrate that the retrotransposition of intra-5′UTR or 5′UTR/ORF1 spliced L1 mRNAs leads to the generation of spliced integrated retrotransposed elements (SpIREs). We identified a new intra-5′UTR SpIRE that is ten times more abundant than previously identified SpIREs. Functional analyses demonstrated that both intra-5′UTR and 5′UTR/ORF1 SpIREs lack Cis-acting transcription factor binding sites and exhibit reduced promoter activity. The 5′UTR/ORF1 SpIREs also produce nonfunctional ORF1p variants. Finally, we demonstrate that sequence changes within the L1 5′UTR over evolutionary time, which permitted L1 to evade the repressive effects of a host protein, can lead to the generation of new L1 splicing events, which, upon retrotransposition, generates a new SpIRE subfamily. We conclude that splicing inhibits L1 retrotransposition, SpIREs generally represent evolutionary “dead-ends” in the L1 retrotransposition process, mutations within the L1 5′UTR alter L1 splicing dynamics, and that retrotransposition of the resultant spliced transcripts can generate interindividual genomic variation. PMID:29505568
NMR studies of two spliced leader RNAs using isotope labeling
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lapham, J.; Crothers, D.M.
1994-12-01
Spliced leader RNAs are a class of RNA molecules (<200 nts) involved in the trans splicing of messenger RNA found in trypanosomes, nematodes, and other lower eukaryotes. The spliced leader RNA from the trypanosome Leptomonas Collosoma exists in two alternate structural forms with similar thermal stabilities. The 54 nucleotides on the 5{prime} end of the SL molecule is structurally independent from the 3{prime} half of the RNA, and displays the two structural forms. Furthermore, the favored of the two structures was shown to contain anomalous nuclease sensitivity and thermal stability features, which suggests that there may be tertiary interactions betweenmore » the splice site and other nucleotides in the 5{prime} end. Multidimensional NMR studies are underway to elucidate the structural elements present in the SL RNAs that give rise to their physical properties. Two spliced leader sequences have been studied. The first, the 54 nucleotides on the 5{prime} end of the L. Collosoma sequence, was selected because of earlier studies in our laboratory. The second sequence is the 5{prime} end of the trypanosome Crithidia Fasciculata, which was chosen because of its greater sequence homology to other SL sequences. Given the complexity of the NMR spectra for RNA molecules of this size, we have incorporated {sup 15}N/{sup 13}C-labeled nucleotides into the RNA. One of the techniques we have developed to simplify the spectra of these RNA molecules is isotope labeling of specific regions of the RNA. This has been especially helpful in assigning the secondary structure of molecules that may be able to adopt multiple conformations. Using this technique one can examine a part of the molecule without spectral interference from the unlabeled portion. We hope this approach will promote an avenue for studying the structure of larger RNAs in their native surroundings.« less
Fine-Scale Variation and Genetic Determinants of Alternative Splicing across Individuals
Coulombe-Huntington, Jasmin; Lam, Kevin C. L.; Dias, Christel; Majewski, Jacek
2009-01-01
Recently, thanks to the increasing throughput of new technologies, we have begun to explore the full extent of alternative pre–mRNA splicing (AS) in the human transcriptome. This is unveiling a vast layer of complexity in isoform-level expression differences between individuals. We used previously published splicing sensitive microarray data from lymphoblastoid cell lines to conduct an in-depth analysis on splicing efficiency of known and predicted exons. By combining publicly available AS annotation with a novel algorithm designed to search for AS, we show that many real AS events can be detected within the usually unexploited, speculative majority of the array and at significance levels much below standard multiple-testing thresholds, demonstrating that the extent of cis-regulated differential splicing between individuals is potentially far greater than previously reported. Specifically, many genes show subtle but significant genetically controlled differences in splice-site usage. PCR validation shows that 42 out of 58 (72%) candidate gene regions undergo detectable AS, amounting to the largest scale validation of isoform eQTLs to date. Targeted sequencing revealed a likely causative SNP in most validated cases. In all 17 incidences where a SNP affected a splice-site region, in silico splice-site strength modeling correctly predicted the direction of the micro-array and PCR results. In 13 other cases, we identified likely causative SNPs disrupting predicted splicing enhancers. Using Fst and REHH analysis, we uncovered significant evidence that 2 putative causative SNPs have undergone recent positive selection. We verified the effect of five SNPs using in vivo minigene assays. This study shows that splicing differences between individuals, including quantitative differences in isoform ratios, are frequent in human populations and that causative SNPs can be identified using in silico predictions. Several cases affected disease-relevant genes and it is likely some of these differences are involved in phenotypic diversity and susceptibility to complex diseases. PMID:20011102
Two Novel Variants Affecting CDKL5 Transcript Associated with Epileptic Encephalopathy.
Neupauerová, Jana; Štěrbová, Katalin; Vlčková, Markéta; Sebroňová, Věra; Maříková, Tat'ána; Krůtová, Marcela; David, Staněk; Kršek, Pavel; Žaliová, Markéta; Seeman, Pavel; Laššuthová, Petra
2017-10-01
Variants in the human X-linked cyclin-dependent kinase-like 5 (CDKL5) gene have been reported as being etiologically associated with early infantile epileptic encephalopathy type 2 (EIEE2). We report on two patients, a boy and a girl, with EIEE2 that present with early onset epilepsy, hypotonia, severe intellectual disability, and poor eye contact. Massively parallel sequencing (MPS) of a custom-designed gene panel for epilepsy and epileptic encephalopathy containing 112 epilepsy-related genes was performed. Sanger sequencing was used to confirm the novel variants. For confirmation of the functional consequence of an intronic CDKL5 variant in patient 2, an RNA study was done. DNA sequencing revealed de novo variants in CDKL5, a c.2578C>T (p. Gln860*) present in a hemizygous state in a 3-year-old boy, and a potential splice site variant c.463+5G>A in heterozygous state in a 5-year-old girl. Multiple in silico splicing algorithms predicted a highly reduced splice site score for c.463+5G>A. A subsequent mRNA study confirmed an aberrant shorter transcript lacking exon 7. Our data confirmed that variants in the CDKL5 are associated with EIEE2. There is credible evidence that the novel identified variants are pathogenic and, therefore, are likely the cause of the disease in the presented patients. In one of the patients a stop codon variant is predicted to produce a truncated protein, and in the other patient an intronic variant results in aberrant splicing.
Soundararajan, Ramani; Stearns, Timothy M.; Griswold, Anthony J.; Mehta, Arpit; Czachor, Alexander; Fukumoto, Jutaro; Lockey, Richard F.; King, Benjamin L.; Kolliputi, Narasaiah
2015-01-01
RNA editing is a post-transcriptional modification of RNA. The majority of these changes result from adenosine deaminase acting on RNA (ADARs) catalyzing the conversion of adenosine residues to inosine in double-stranded RNAs (dsRNAs). Massively parallel sequencing has enabled the identification of RNA editing sites in human transcriptomes. In this study, we sequenced DNA and RNA from human lungs and identified RNA editing sites with high confidence via a computational pipeline utilizing stringent analysis thresholds. We identified a total of 3,447 editing sites that overlapped in three human lung samples, and with 50% of these sites having canonical A-to-G base changes. Approximately 27% of the edited sites overlapped with Alu repeats, and showed A-to-G clustering (>3 clusters in 100 bp). The majority of edited sites mapped to either 3′ untranslated regions (UTRs) or introns close to splice sites; whereas, only few sites were in exons resulting in non-synonymous amino acid changes. Interestingly, we identified 652 A-to-G editing events in the 3′ UTR of 205 target genes that mapped to 932 potential miRNA target binding sites. Several of these miRNA edited sites were validated in silico. Additionally, we validated several A-to-G edited sites by Sanger sequencing. Altogether, our study suggests a role for RNA editing in miRNA-mediated gene regulation and splicing in human lungs. In this study, we have generated a RNA editome of human lung tissue that can be compared with other RNA editomes across different lung tissues to delineate a role for RNA editing in normal and diseased states. PMID:26486088
Soundararajan, Ramani; Stearns, Timothy M; Griswold, Anthony L; Mehta, Arpit; Czachor, Alexander; Fukumoto, Jutaro; Lockey, Richard F; King, Benjamin L; Kolliputi, Narasaiah
2015-11-03
RNA editing is a post-transcriptional modification of RNA. The majority of these changes result from adenosine deaminase acting on RNA (ADARs) catalyzing the conversion of adenosine residues to inosine in double-stranded RNAs (dsRNAs). Massively parallel sequencing has enabled the identification of RNA editing sites in human transcriptomes. In this study, we sequenced DNA and RNA from human lungs and identified RNA editing sites with high confidence via a computational pipeline utilizing stringent analysis thresholds. We identified a total of 3,447 editing sites that overlapped in three human lung samples, and with 50% of these sites having canonical A-to-G base changes. Approximately 27% of the edited sites overlapped with Alu repeats, and showed A-to-G clustering (>3 clusters in 100 bp). The majority of edited sites mapped to either 3' untranslated regions (UTRs) or introns close to splice sites; whereas, only few sites were in exons resulting in non-synonymous amino acid changes. Interestingly, we identified 652 A-to-G editing events in the 3' UTR of 205 target genes that mapped to 932 potential miRNA target binding sites. Several of these miRNA edited sites were validated in silico. Additionally, we validated several A-to-G edited sites by Sanger sequencing. Altogether, our study suggests a role for RNA editing in miRNA-mediated gene regulation and splicing in human lungs. In this study, we have generated a RNA editome of human lung tissue that can be compared with other RNA editomes across different lung tissues to delineate a role for RNA editing in normal and diseased states.
Li, Fang; Vensko, Steven P.; Belikoff, Esther J.; Scott, Maxwell J.
2013-01-01
Transformer (TRA) promotes female development in several dipteran species including the Australian sheep blowfly Lucilia cuprina, the Mediterranean fruit fly, housefly and Drosophila melanogaster. tra transcripts are sex-specifically spliced such that only the female form encodes full length functional protein. The presence of six predicted TRA/TRA2 binding sites in the sex-specific female intron of the L. cuprina gene suggested that tra splicing is auto-regulated as in medfly and housefly. With the aim of identifying conserved motifs that may play a role in tra sex-specific splicing, here we have isolated and characterized the tra gene from three additional blowfly species, L. sericata, Cochliomyia hominivorax and C. macellaria. The blowfly adult male and female transcripts differ in the choice of splice donor site in the first intron, with males using a site downstream of the site used in females. The tra genes all contain a single TRA/TRA2 site in the male exon and a cluster of four to five sites in the male intron. However, overall the sex-specific intron sequences are poorly conserved in closely related blowflies. The most conserved regions are around the exon/intron junctions, the 3′ end of the intron and near the cluster of TRA/TRA2 sites. We propose a model for sex specific regulation of tra splicing that incorporates the conserved features identified in this study. In L. sericata embryos, the male tra transcript was first detected at around the time of cellular blastoderm formation. RNAi experiments showed that tra is required for female development in L. sericata and C. macellaria. The isolation of the tra gene from the New World screwworm fly C. hominivorax, a major livestock pest, will facilitate the development of a “male-only” strain for genetic control programs. PMID:23409170
Gniadkowski, M; Hemmings-Mieszczak, M; Klahre, U; Liu, H X; Filipowicz, W
1996-02-15
Introns of nuclear pre-mRNAs in dicotyledonous plants, unlike introns in vertebrates or yeast, are distinctly rich in A+U nucleotides and this feature is essential for their processing. In order to define more precisely sequence elements important for intron recognition in plants, we investigated the effects of short insertions, either U-rich or A-rich, on splicing of synthetic introns in transfected protoplast of Nicotiana plumbaginifolia. It was found that insertions of U-rich (sequence UUUUUAU) but not A-rich (AUAAAAA) segments can activate splicing of a GC-rich synthetic infron, and that U-rich segments, or multimers thereof, can function irrespective of the site of insertion within the intron. Insertions of multiple U-rich segments, either at the same or different locations, generally had an additive, stimulatory effect on splicing. Mutational analysis showed that replacement of one or two U residues in the UUUUUAU sequence with A or C residues had only a small effect on splicing, but replacement with G residues was strongly inhibitory. Proteins that interact with fragments of natural and synthetic pre-mRNAs in vitro were identified in nuclear extracts of N.plumbaginifolia by UV cross- linking. The profile of cross-linked plant proteins was considerably less complex than that obtained with a HeLa cell nuclear extract. Two major cross-linkable plant proteins had apparent molecular mass of 50 and 54 kDa and showed affinity for oligouridilates present in synGC introns or for poly(U).
Gniadkowski, M; Hemmings-Mieszczak, M; Klahre, U; Liu, H X; Filipowicz, W
1996-01-01
Introns of nuclear pre-mRNAs in dicotyledonous plants, unlike introns in vertebrates or yeast, are distinctly rich in A+U nucleotides and this feature is essential for their processing. In order to define more precisely sequence elements important for intron recognition in plants, we investigated the effects of short insertions, either U-rich or A-rich, on splicing of synthetic introns in transfected protoplast of Nicotiana plumbaginifolia. It was found that insertions of U-rich (sequence UUUUUAU) but not A-rich (AUAAAAA) segments can activate splicing of a GC-rich synthetic infron, and that U-rich segments, or multimers thereof, can function irrespective of the site of insertion within the intron. Insertions of multiple U-rich segments, either at the same or different locations, generally had an additive, stimulatory effect on splicing. Mutational analysis showed that replacement of one or two U residues in the UUUUUAU sequence with A or C residues had only a small effect on splicing, but replacement with G residues was strongly inhibitory. Proteins that interact with fragments of natural and synthetic pre-mRNAs in vitro were identified in nuclear extracts of N.plumbaginifolia by UV cross- linking. The profile of cross-linked plant proteins was considerably less complex than that obtained with a HeLa cell nuclear extract. Two major cross-linkable plant proteins had apparent molecular mass of 50 and 54 kDa and showed affinity for oligouridilates present in synGC introns or for poly(U). PMID:8604302
Evaluating approaches to find exon chains based on long reads.
Kuosmanen, Anna; Norri, Tuukka; Mäkinen, Veli
2018-05-01
Transcript prediction can be modeled as a graph problem where exons are modeled as nodes and reads spanning two or more exons are modeled as exon chains. Pacific Biosciences third-generation sequencing technology produces significantly longer reads than earlier second-generation sequencing technologies, which gives valuable information about longer exon chains in a graph. However, with the high error rates of third-generation sequencing, aligning long reads correctly around the splice sites is a challenging task. Incorrect alignments lead to spurious nodes and arcs in the graph, which in turn lead to incorrect transcript predictions. We survey several approaches to find the exon chains corresponding to long reads in a splicing graph, and experimentally study the performance of these methods using simulated data to allow for sensitivity/precision analysis. Our experiments show that short reads from second-generation sequencing can be used to significantly improve exon chain correctness either by error-correcting the long reads before splicing graph creation, or by using them to create a splicing graph on which the long-read alignments are then projected. We also study the memory and time consumption of various modules, and show that accurate exon chains lead to significantly increased transcript prediction accuracy. The simulated data and in-house scripts used for this article are available at http://www.cs.helsinki.fi/group/gsa/exon-chains/exon-chains-bib.tar.bz2.
Preußer, Christian; Rossbach, Oliver; Hung, Lee-Hsueh; Li, Dan; Bindereif, Albrecht
2014-01-01
Trans-splicing in trypanosomes adds a 39-nucleotide mini-exon from the spliced leader (SL) RNA to the 5′ end of each protein-coding sequence. On the other hand, cis-splicing of the few intron-containing genes requires the U1 small nuclear ribonucleoprotein (snRNP) particle. To search for potential new functions of the U1 snRNP in Trypanosoma brucei, we applied genome-wide individual-nucleotide resolution crosslinking-immunoprecipitation (iCLIP), focusing on the U1 snRNP-specific proteins U1C and U1-70K. Surprisingly, U1C and U1-70K interact not only with the U1, but also with U6 and SL RNAs. In addition, mapping of crosslinks to the cis-spliced PAP [poly(A) polymerase] pre-mRNA indicate an active role of these proteins in 5′ splice site recognition. In sum, our results demonstrate that the iCLIP approach provides insight into stable and transient RNA–protein contacts within the spliceosomal network. We propose that the U1 snRNP may represent an evolutionary link between the cis- and trans-splicing machineries, playing a dual role in 5′ splice site recognition on the trans-spliceosomal SL RNP as well as on pre-mRNA cis-introns. PMID:24748659
Pettigrew, Christopher; Wayte, Nicola; Lovelock, Paul K; Tavtigian, Sean V; Chenevix-Trench, Georgia; Spurdle, Amanda B; Brown, Melissa A
2005-01-01
Introduction Aberrant pre-mRNA splicing can be more detrimental to the function of a gene than changes in the length or nature of the encoded amino acid sequence. Although predicting the effects of changes in consensus 5' and 3' splice sites near intron:exon boundaries is relatively straightforward, predicting the possible effects of changes in exonic splicing enhancers (ESEs) remains a challenge. Methods As an initial step toward determining which ESEs predicted by the web-based tool ESEfinder in the breast cancer susceptibility gene BRCA1 are likely to be functional, we have determined their evolutionary conservation and compared their location with known BRCA1 sequence variants. Results Using the default settings of ESEfinder, we initially detected 669 potential ESEs in the coding region of the BRCA1 gene. Increasing the threshold score reduced the total number to 464, while taking into consideration the proximity to splice donor and acceptor sites reduced the number to 211. Approximately 11% of these ESEs (23/211) either are identical at the nucleotide level in human, primates, mouse, cow, dog and opossum Brca1 (conserved) or are detectable by ESEfinder in the same position in the Brca1 sequence (shared). The frequency of conserved and shared predicted ESEs between human and mouse is higher in BRCA1 exons (2.8 per 100 nucleotides) than in introns (0.6 per 100 nucleotides). Of conserved or shared putative ESEs, 61% (14/23) were predicted to be affected by sequence variants reported in the Breast Cancer Information Core database. Applying the filters described above increased the colocalization of predicted ESEs with missense changes, in-frame deletions and unclassified variants predicted to be deleterious to protein function, whereas they decreased the colocalization with known polymorphisms or unclassified variants predicted to be neutral. Conclusion In this report we show that evolutionary conservation analysis may be used to improve the specificity of an ESE prediction tool. This is the first report on the prediction of the frequency and distribution of ESEs in the BRCA1 gene, and it is the first reported attempt to predict which ESEs are most likely to be functional and therefore which sequence variants in ESEs are most likely to be pathogenic. PMID:16280041
EUGENE'HOM: A generic similarity-based gene finder using multiple homologous sequences.
Foissac, Sylvain; Bardou, Philippe; Moisan, Annick; Cros, Marie-Josée; Schiex, Thomas
2003-07-01
EUGENE'HOM is a gene prediction software for eukaryotic organisms based on comparative analysis. EUGENE'HOM is able to take into account multiple homologous sequences from more or less closely related organisms. It integrates the results of TBLASTX analysis, splice site and start codon prediction and a robust coding/non-coding probabilistic model which allows EUGENE'HOM to handle sequences from a variety of organisms. The current target of EUGENE'HOM is plant sequences. The EUGENE'HOM web site is available at http://genopole.toulouse.inra.fr/bioinfo/eugene/EuGeneHom/cgi-bin/EuGeneHom.pl.
Non-exomic and synonymous variants in ABCA4 are an important cause of Stargardt disease
Braun, Terry A.; Mullins, Robert F.; Wagner, Alex H.; Andorf, Jeaneen L.; Johnston, Rebecca M.; Bakall, Benjamin B.; Deluca, Adam P.; Fishman, Gerald A.; Lam, Byron L.; Weleber, Richard G.; Cideciyan, Artur V.; Jacobson, Samuel G.; Sheffield, Val C.; Tucker, Budd A.; Stone, Edwin M.
2013-01-01
Mutations in ABCA4 cause Stargardt disease and other blinding autosomal recessive retinal disorders. However, sequencing of the complete coding sequence in patients with clinical features of Stargardt disease sometimes fails to detect one or both mutations. For example, among 208 individuals with clear clinical evidence of ABCA4 disease ascertained at a single institution, 28 had only one disease-causing allele identified in the exons and splice junctions of the primary retinal transcript of the gene. Haplotype analysis of these 28 probands revealed 3 haplotypes shared among ten families, suggesting that 18 of the 28 missing alleles were rare enough to be present only once in the cohort. We hypothesized that mutations near rare alternate splice junctions in ABCA4 might cause disease by increasing the probability of mis-splicing at these sites. Next-generation sequencing of RNA extracted from human donor eyes revealed more than a dozen alternate exons that are occasionally incorporated into the ABCA4 transcript in normal human retina. We sequenced the genomic DNA containing 15 of these minor exons in the 28 one-allele subjects and observed five instances of two different variations in the splice signals of exon 36.1 that were not present in normal individuals (P < 10−6). Analysis of RNA obtained from the keratinocytes of patients with these mutations revealed the predicted alternate transcript. This study illustrates the utility of RNA sequence analysis of human donor tissue and patient-derived cell lines to identify mutations that would be undetectable by exome sequencing. PMID:23918662
cDNA sequences and organization of IgM heavy chain genes in two holostean fish.
Wilson, M R; van Ravenstein, E; Miller, N W; Clem, L W; Middleton, D L; Warr, G W
1995-01-01
Immunoglobulin M heavy chain (mu) sequences of two holostean fish, the bowfin, Amia calva, and the longnose gar, Lepisosteus osseus, were amplified from spleen mRNA by RACE-PCR, cloned, and sequenced. Each mu chain showed the conserved four constant domain structure typical of a secreted mu chain. Southern blot analyses with specific heavy chain variable (VH) and constant (CH) region probes suggest that both fish possess an IgH locus that resembles that of the teleosts, amphibians, and mammals in its organization. The overall sequence similarity of gar and bowfin mu chains was 60% and 48% at the nucleotide and amino acid levels, respectively, while similarity to the mu chains of teleosts and elasmobranchs was lower. The bowfin mu chain possesses a distinctive proline-rich sequence at the C mu 1/C mu 2 boundary; a shorter proline-rich sequence is present at this position in the gar mu chain. Both gar and bowfin show, in their C mu 4 sequences, motifs that could serve as cryptic splice donor sites for the production of mRNA encoding the membrane-bound form of the mu chains, and the bowfin also shows a potential cryptic splice donor site in the C mu 3 exon.
Spliceman2: a computational web server that predicts defects in pre-mRNA splicing.
Cygan, Kamil Jan; Sanford, Clayton Hendrick; Fairbrother, William Guy
2017-09-15
Most pre-mRNA transcripts in eukaryotic cells must undergo splicing to remove introns and join exons, and splicing elements present a large mutational target for disease-causing mutations. Splicing elements are strongly position dependent with respect to the transcript annotations. In 2012, we presented Spliceman, an online tool that used positional dependence to predict how likely distant mutations around annotated splice sites were to disrupt splicing. Here, we present an improved version of the previous tool that will be more useful for predicting the likelihood of splicing mutations. We have added industry-standard input options (i.e. Spliceman now accepts variant call format files), which allow much larger inputs than previously available. The tool also can visualize the locations-within exons and introns-of sequence variants to be analyzed and the predicted effects on splicing of the pre-mRNA transcript. In addition, Spliceman2 integrates with RNAcompete motif libraries to provide a prediction of which trans -acting factors binding sites are disrupted/created and links out to the UCSC genome browser. In summary, the new features in Spliceman2 will allow scientists and physicians to better understand the effects of single nucleotide variations on splicing. Freely available on the web at http://fairbrother.biomed.brown.edu/spliceman2 . Website implemented in PHP framework-Laravel 5, PostgreSQL, Apache, and Perl, with all major browsers supported. william_fairbrother@brown.edu. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
Sequence Discrimination by Alternatively Spliced Isoforms of a DNA Binding Zinc Finger Domain
NASA Astrophysics Data System (ADS)
Gogos, Joseph A.; Hsu, Tien; Bolton, Jesse; Kafatos, Fotis C.
1992-09-01
Two major developmentally regulated isoforms of the Drosophila chorion transcription factor CF2 differ by an extra zinc finger within the DNA binding domain. The preferred DNA binding sites were determined and are distinguished by an internal duplication of TAT in the site recognized by the isoform with the extra finger. The results are consistent with modular interactions between zinc fingers and trinucleotides and also suggest rules for recognition of AT-rich DNA sites by zinc finger proteins. The results show how modular finger interactions with trinucleotides can be used, in conjunction with alternative splicing, to alter the binding specificity and increase the spectrum of sites recognized by a DNA binding domain. Thus, CF2 may potentially regulate distinct sets of target genes during development.
Yigit, Gökhan; Wieczorek, Dagmar; Bögershausen, Nina; Beleggia, Filippo; Möller-Hartmann, Claudia; Altmüller, Janine; Thiele, Holger; Nürnberg, Peter; Wollnik, Bernd
2016-03-01
Using whole-exome sequencing, we identified a homozygous acceptor splice-site mutation in intron 6 of the KATNB1 gene in a patient from a consanguineous Turkish family who presented with congenital microcephaly, lissencephaly, short stature, polysyndactyly, and dental abnormalities. cDNA analysis revealed complete loss of the natural acceptor splice-site resulting either in the usage of an alternative, exonic acceptor splice-site inducing a frame-shift and premature protein truncation or, to a minor extent, in complete skipping of exon 7. Both effects most likely lead to complete loss of KATNB1 function. Homozygous and compound heterozygous mutations in KATNB1 have very recently been described as a cause of microcephaly with brain malformations and seizures. We extend the KATNB1 associated phenotype by describing a syndrome characterized by primordial dwarfism, lissencephaly, polysyndactyly, and dental anomalies, which is caused by a homozygous truncating KATNB1 mutation. © 2015 Wiley Periodicals, Inc.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pham-Dinh, D.; Gaspera, D.B.; Dautigny, A.
1995-09-20
Myelin/oligodendrocyte glycoprotein (MOG), a special component of the central nervous system localization on the outermost lamellae of mature myelin, is a member of the immunoglobulin superfamily. We report here the organization of the human MOG gene, which spans approximately 17 kb, and the characterization of six MOG mRNA splicing variants. The intron/exon structure of the human MOG gene confirmed the splicing pattern, supporting the hypothesis that mRNA isoforms could arise by alternative splicing of a single gene. In addition to the eight exons coding for the major MOG isoform, the human MOG gene also contains 3` region, a previously unknownmore » alternatively spliced coding exon, VIA. Alternative utilization of two acceptor splicing sites for exon VIII could produce two different C-termini. The nucleotide sequences presented here may be a useful tool to study further possible involvement if the MOG gene in hereditary neurological disorders. 23 refs., 5 figs.« less
Global impact of RNA splicing on transcriptome remodeling in the heart.
Gao, Chen; Wang, Yibin
2012-08-01
In the eukaryotic transcriptome, both the numbers of genes and different RNA species produced by each gene contribute to the overall complexity. These RNA species are generated by the utilization of different transcriptional initiation or termination sites, or more commonly, from different messenger RNA (mRNA) splicing events. Among the 30,000+ genes in human genome, it is estimated that more than 95% of them can generate more than one gene product via alternative RNA splicing. The protein products generated from different RNA splicing variants can have different intracellular localization, activity, or tissue-distribution. Therefore, alternative RNA splicing is an important molecular process that contributes to the overall complexity of the genome and the functional specificity and diversity among different cell types. In this review, we will discuss current efforts to unravel the full complexity of the cardiac transcriptome using a deep-sequencing approach, and highlight the potential of this technology to uncover the global impact of RNA splicing on the transcriptome during development and diseases of the heart.
Rare splicing defects of FAS underly severe recessive autoimmune lymphoproliferative syndrome.
Agrebi, N; Ben-Mustapha, I; Matoussi, N; Dhouib, N; Ben-Ali, M; Mekki, N; Ben-Ahmed, M; Larguèche, B; Ben Becher, S; Béjaoui, M; Barbouche, M R
2017-10-01
Autoimmune lymphoproliferative syndrome (ALPS) is a prototypic disorder of impaired apoptosis characterized by autoimmune features and lymphoproliferation. Heterozygous germline or somatic FAS mutations associated with preserved protein expression have been described. Very rare cases of homozygous germline FAS mutations causing severe autosomal recessive form of ALPS with a complete defect of Fas expression have been reported. We report two unrelated patients from highly inbred North African population showing a severe ALPS phenotype and an undetectable Fas surface expression. Two novel homozygous mutations have been identified underlying rare splicing defects mechanisms. The first mutation breaks a branch point sequence and the second alters a regulatory exonic splicing site. These splicing defects induce the skipping of exon 6 encoding the transmembrane domain of CD95. Our findings highlight the requirement of tight regulation of FAS exon 6 splicing for balanced alternative splicing and illustrate the importance of such studies in highly consanguineous populations. Copyright © 2017 Elsevier Inc. All rights reserved.
A novel protein factor is required for use of distal alternative 5' splice sites in vitro.
Harper, J E; Manley, J L
1991-01-01
Adenovirus E1A pre-mRNA was used as a model to examine alternative 5' splice site selection during in vitro splicing reactions. Strong preference for the downstream 13S 5' splice site over the upstream 12S or 9S 5' splice sites was observed. However, the 12S 5' splice site was used efficiently when a mutant pre-mRNA lacking the 13S 5' splice site was processed, and 12S splicing from this substrate was not reduced by 13S splicing from a separate pre-mRNA, demonstrating that 13S splicing reduced 12S 5' splice site selection through a bona fide cis-competition. DEAE-cellulose chromatography of nuclear extract yielded two fractions with different splicing activities. The bound fraction contained all components required for efficient splicing of simple substrates but was unable to utilize alternative 5' splice sites. In contrast, the flow-through fraction, which by itself was inactive, contained an activity required for alternative splicing and was shown to stimulate 12S and 9S splicing, while reducing 13S splicing, when added to reactions carried out by the bound fraction. Furthermore, the activity, which we have called distal splicing factor (DSF), enhanced utilization of an upstream 5' splice site on a simian virus 40 early pre-mRNA, suggesting that the factor acts in a position-dependent, substrate-independent fashion. Several lines of evidence are presented suggesting that DSF is a non-small nuclear ribonucleoprotein protein. Finally, we describe a functional interaction between DSF and ASF, a protein that enhances use of downstream 5' splice sites. Images PMID:1658620
First report of HGD mutations in a Chinese with alkaptonuria.
Yang, Yong-jia; Guo, Ji-hong; Chen, Wei-jian; Zhao, Rui; Tang, Jin-song; Meng, Xiao-hua; Zhao, Liu; Tu, Ming; He, Xin-yu; Wu, Ling-qian; Zhu, Yi-min
2013-04-15
Alkaptonuria (AKU) is one of the first prototypic inborn errors in metabolism and the first human disease found to be transmitted via Mendelian autosomal recessive inheritance. It is caused by HGD mutations, which leads to a deficiency in homogentisate 1,2-dioxygenase (HGD) activity. To date, several HGD mutations have been identified as the cause of the prototypic disease across different ethnic populations worldwide. However, in Asia, the HGD mutation is very rarely reported. For the Chinese population, no literature on HGD mutation screening is available to date. In this paper, we describe two novel HGD mutations in a Chinese AKU family, the splicing mutation of IVS7+1G>C, a donor splice site of exon 7, and a missense mutation of F329C in exon 12. The predicted new splicing site of the mutated exon 7 sequence demonstrated a 303bp extension after the mutation site. The F329C mutation most probably disturbed the stability of the conformation of the two loops critical to the Fe(2+) active site of the HGD enzyme. Copyright © 2013 Elsevier B.V. All rights reserved.
qPMS9: An Efficient Algorithm for Quorum Planted Motif Search
NASA Astrophysics Data System (ADS)
Nicolae, Marius; Rajasekaran, Sanguthevar
2015-01-01
Discovering patterns in biological sequences is a crucial problem. For example, the identification of patterns in DNA sequences has resulted in the determination of open reading frames, identification of gene promoter elements, intron/exon splicing sites, and SH RNAs, location of RNA degradation signals, identification of alternative splicing sites, etc. In protein sequences, patterns have led to domain identification, location of protease cleavage sites, identification of signal peptides, protein interactions, determination of protein degradation elements, identification of protein trafficking elements, discovery of short functional motifs, etc. In this paper we focus on the identification of an important class of patterns, namely, motifs. We study the (l, d) motif search problem or Planted Motif Search (PMS). PMS receives as input n strings and two integers l and d. It returns all sequences M of length l that occur in each input string, where each occurrence differs from M in at most d positions. Another formulation is quorum PMS (qPMS), where the motif appears in at least q% of the strings. We introduce qPMS9, a parallel exact qPMS algorithm that offers significant runtime improvements on DNA and protein datasets. qPMS9 solves the challenging DNA (l, d)-instances (28, 12) and (30, 13). The source code is available at https://code.google.com/p/qpms9/.
Brown, Roger B; Madrid, Nathaniel J; Suzuki, Hideaki; Ness, Scott A
2017-01-01
RNA-sequencing (RNA-seq) has become the standard method for unbiased analysis of gene expression but also provides access to more complex transcriptome features, including alternative RNA splicing, RNA editing, and even detection of fusion transcripts formed through chromosomal translocations. However, differences in library methods can adversely affect the ability to recover these different types of transcriptome data. For example, some methods have bias for one end of transcripts or rely on low-efficiency steps that limit the complexity of the resulting library, making detection of rare transcripts less likely. We tested several commonly used methods of RNA-seq library preparation and found vast differences in the detection of advanced transcriptome features, such as alternatively spliced isoforms and RNA editing sites. By comparing several different protocols available for the Ion Proton sequencer and by utilizing detailed bioinformatics analysis tools, we were able to develop an optimized random primer based RNA-seq technique that is reliable at uncovering rare transcript isoforms and RNA editing features, as well as fusion reads from oncogenic chromosome rearrangements. The combination of optimized libraries and rapid Ion Proton sequencing provides a powerful platform for the transcriptome analysis of research and clinical samples.
Lan, Susan; Kamel, Wael; Punga, Tanel; Akusjärvi, Göran
2017-02-28
The adenovirus L4-22K protein both activates and suppresses transcription from the adenovirus major late promoter (MLP) by binding to DNA elements located downstream of the MLP transcriptional start site: the so-called DE element (positive) and the R1 region (negative). Here we show that L4-22K preferentially binds to the RNA form of the R1 region, both to the double-stranded RNA and the single-stranded RNA of the same polarity as the nascent MLP transcript. Further, L4-22K binds to a 5΄-CAAA-3΄ motif in the single-stranded RNA, which is identical to the sequence motif characterized for L4-22K DNA binding. L4-22K binding to single-stranded RNA results in an enhancement of U1 snRNA recruitment to the major late first leader 5΄ splice site. This increase in U1 snRNA binding results in a suppression of MLP transcription and a concurrent stimulation of major late first intron splicing. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Seven novel mutations at the 5,10-methylenetetrahydrofolate reductase locus
DOE Office of Scientific and Technical Information (OSTI.GOV)
Goyette, P.; Frosst, P.; Rosenblatt, D.S.
1994-09-01
5,10-methylenetetrahydrofolate reductase (MTHFR), a flavoprotein, catalyzes the conversion of 5,10-methylenetetrahydrofolate to 5-methyltetrahydrofolate, a cofactor for methionine synthase in the methylation of homocysteine to methionine. Severe MTHFR deficiency, which causes homocysteinemia, is an autosomal recessive disorder with variable clinical features; developmental delay, perinatal death, mental retardation and asymptomatic individuals have been observed. A milder deficiency has been reported in patients with cardiovascular disease. We have recently described the isolation of a cDNA for MTHFR and the identification of 2 mutations in patients with severe MTHFR deficiency. We report here the characterization of 7 additional mutations at this locus: 5 missense mutationsmore » and 2 splicing mutations. Mutation analysis was performed by SSCP on PCR products generated either from reverse transcription-PCR of patients` total fibroblast RNA or from PCR of patients` genomic DNA. The 5 missense mutations are as follows: 1 Arg to Cys substitution in a hydrophilic segment proposed to be the hinge region that connects the catalytic and regulatory domains, 2 different Arg to Cys substitutions in 2 patients whose enzymatic thermolability is responsive to FAD, 1 Thr to Met substitution affecting an evolutionarily-conserved residue and a Pro to Leu substitution. The 2 splicing mutations affect the 5{prime} splice site and the 3{prime} splice site of 2 introns, respectively. The 5{prime} splice site mutation generates a 57 bp in-frame deletion of the RNA through the utilization of a cryptic 5{prime} splice site within the coding sequence. The identification of 9 mutations at this locus has allowed us to make preliminary correlations between genotype and phenotype and to contribute to a structure:function analysis of the enzyme.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kaler, S.G.; Gahl, W.A.
1994-09-01
Menkes disease is an X linked recessive disorder of copper metabolism produced by abnormalities in a gene that encodes a copper transporting ATPase. The clinical spectrum of Menkes disease includes a range of neurological severity from the classical type to the occipital horn syndrome (OHS) in which slightly subnormal intelligence or signs of autonomic dysfunction are the only neurologic abnormalities. We previously documented a distinctive, less severe Menkes phenotype associated with a +3 intronic splice donor mutation at the 3{prime} end of the gene in which exon skipping occurred but some normally spliced message was also detectable. We now reportmore » a similar splicing mutation in a patient with a typical OHS phenotype an A to G transition at the 2 exonic position of a splice donor site in the middle of the Menkes coding sequence. Some normally sized transcripts are evident by RT-PCR of lymphoblast mRNA from this individual, as well as 2 truncated fragments generated by exon skipping and activation of a cryptic splice acceptor site, respectively. The predicted effect of the mutation on the gene product involves a serine to glycine substitution in a noncritical region of the Menkes ATPase from the patient`s normally sized message, and premature termination due to translational frameshift in both truncated transcripts. The mutation eliminates a Dde 1 restriction site in the gene which provided a method to rapidly screen other family members, and revealed that the patient`s mother is a non-carrier. The mutational base change was not present in 25 normal X chromosomes studied. Preliminary analysis of the Menkes locus in 5 other Menkes disease families indicates aberrant mRNA splicing in 2. Our findings confirm allelism at the Menkes locus, indicate that splice mutations are relatively common mutational event in Menkes disease, and suggest that splice mutations in which some normal splicing is preserved may underlie milder Menkes disease variants, including OHS.« less
Preedagasamzin, Sarinthip; Nualkaew, Tiwaporn; Pongrujikorn, Tanjitti; Jinawath, Natini; Kole, Ryszard; Fucharoen, Suthat; Jearawiriyapaisarn, Natee; Svasti, Saovaros
2018-04-30
Repair of a splicing defect of β-globin pre-mRNA harboring hemoglobin E (HbE) mutation was successfully accomplished in erythroid cells from patients with β-thalassemia/HbE disorder by a synthetic splice-switching oligonucleotide (SSO). However, its application is limited by short-term effectiveness and requirement of lifelong periodic administration of SSO, especially for chronic diseases like thalassemias. Here, we engineered lentiviral vectors that stably express U7 small nuclear RNA (U7 snRNA) carrying the splice-switching sequence of the SSO that restores correct splicing of β E -globin pre-mRNA and achieves a long-term therapeutic effect. Using a two-step tiling approach, we systematically screened U7 snRNAs carrying splice-switching SSO sequences targeted to the cryptic 5' splice site created by HbE mutation. We tested this approach and identified the most responsive element for mediating splicing correction in engineered U7 snRNAs in HeLa-β E cell model cell line. Remarkably, the U7 snRNA lentiviral vector (U7 βE4+1) targeted to this region effectively restored the correctly-spliced β E -globin mRNA for at least 5 months. Moreover, the effects of the U7 βE4+1 snRNA lentiviral vector were also evident as upregulation of the correctly-spliced β E -globin mRNA in erythroid progenitor cells from β-thalassemia/HbE patients treated with the vector, which led to improvements of pathologies in erythroid progenitor cells from thalassemia patients. These results suggest that the splicing correction of β E -globin pre-mRNA by the engineered U7 snRNA lentiviral vector provides a promising, long-term treatment for β-thalassemia/HbE. Copyright © 2018 Elsevier Inc. All rights reserved.
A de novo mosaic mutation of PHEX in a boy with hypophosphatemic rickets.
Weng, Chen; Chen, Jiao; Sun, Li; Zhou, Zhong-Wei; Feng, Xue; Sun, Jun-Hui; Lu, Ling-Ping; Yu, Ping; Qi, Ming
2016-03-01
X-linked dominant hypophosphatemic rickets (XLHR), is characterized mainly by renal phosphate wasting with hypophosphatemia, short stature and abnormal bone mineralization. PHEX, located at Xp22.1-p22.2, is the gene causing XLHR. We aim to characterize the pathogenesis of a Chinese boy who is apparently 'heterozygous' in PHEX gene. Direct sequencing showed two peaks: one was a wild-type 'G' and the other was one base substitution to 'A', though the patient was a male. TA clone assay clearly showed each sequences and the ratios. The mutation effect was predicted via bioinformatics and validated by exon-trapping assay. Real-time PCR was applied to determine the copy number of PHEX. TA clone assay showed the frequency of normal (G) to mutant allele (A) as 19:13. Normal karyotype and real-time PCR results indicate the normal copy number of PHEX. This splice site mutation leads to 4 bp of exon 18 skipping out causing frame shift p.Gly590Glufs*28 that ends up with a loss of active site and Zn(2+)-binding site of PHEX, which probably interfere with renal phosphate reabsorption and bone mineralization. In conclusion, mutation at conserved splice acceptor site resulted in aberrant splicing, ending up with a damaged protein product. This novel mutation is de novo in mosaic pattern that may be induced during early postzygotic period. Taking mosaic somatic mutation of PHEX into consideration is strongly suggested in genetic counseling and etiology research for XLHR.
Wang, Taotao; Wang, Huiyuan; Cai, Dawei; Gao, Yubang; Zhang, Hangxiao; Wang, Yongsheng; Lin, Chentao; Ma, Liuyin; Gu, Lianfeng
2017-08-01
Moso bamboo (Phyllostachys edulis) represents one of the fastest-spreading plants in the world, due in part to its well-developed rhizome system. However, the post-transcriptional mechanism for the development of the rhizome system in bamboo has not been comprehensively studied. We therefore used a combination of single-molecule long-read sequencing technology and polyadenylation site sequencing (PAS-seq) to re-annotate the bamboo genome, and identify genome-wide alternative splicing (AS) and alternative polyadenylation (APA) in the rhizome system. In total, 145 522 mapped full-length non-chimeric (FLNC) reads were analyzed, resulting in the correction of 2241 mis-annotated genes and the identification of 8091 previously unannotated loci. Notably, more than 42 280 distinct splicing isoforms were derived from 128 667 intron-containing full-length FLNC reads, including a large number of AS events associated with rhizome systems. In addition, we characterized 25 069 polyadenylation sites from 11 450 genes, 6311 of which have APA sites. Further analysis of intronic polyadenylation revealed that LTR/Gypsy and LTR/Copia were two major transposable elements within the intronic polyadenylation region. Furthermore, this study provided a quantitative atlas of poly(A) usage. Several hundred differential poly(A) sites in the rhizome-root system were identified. Taken together, these results suggest that post-transcriptional regulation may potentially have a vital role in the underground rhizome-root system. © 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.
Srivastava, Vaibhav; Srivastava, Manoj Kumar; Chibani, Kamel; Nilsson, Robert; Rouhier, Nicolas; Melzer, Michael; Wingsle, Gunnar
2009-01-01
Recent evidence has shown that alternative splicing (AS) is widely involved in the regulation of gene expression, substantially extending the diversity of numerous proteins. In this study, a subset of expressed sequence tags representing members of the reactive oxygen species gene network was selected from the PopulusDB database to investigate AS mechanisms in Populus. Examples of all known types of AS were detected, but intron retention was the most common. Interestingly, the closest Arabidopsis (Arabidopsis thaliana) homologs of half of the AS genes identified in Populus are not reportedly alternatively spliced. Two genes encoding the protein of most interest in our study (high-isoelectric-point superoxide dismutase [hipI-SOD]) have been found in black cottonwood (Populus trichocarpa), designated PthipI-SODC1 and PthipI-SODC2. Analysis of the expressed sequence tag libraries has indicated the presence of two transcripts of PthipI-SODC1 (hipI-SODC1b and hipI-SODC1s). Alignment of these sequences with the PthipI-SODC1 gene showed that hipI-SODC1b was 69 bp longer than hipI-SODC1s due to an AS event involving the use of an alternative donor splice site in the sixth intron. Transcript analysis showed that the splice variant hipI-SODC1b was differentially expressed, being clearly expressed in cambial and xylem, but not phloem, regions. In addition, immunolocalization and mass spectrometric data confirmed the presence of hipI-SOD proteins in vascular tissue. The functionalities of the spliced gene products were assessed by expressing recombinant hipI-SOD proteins and in vitro SOD activity assays. PMID:19176719
Srivastava, Vaibhav; Srivastava, Manoj Kumar; Chibani, Kamel; Nilsson, Robert; Rouhier, Nicolas; Melzer, Michael; Wingsle, Gunnar
2009-04-01
Recent evidence has shown that alternative splicing (AS) is widely involved in the regulation of gene expression, substantially extending the diversity of numerous proteins. In this study, a subset of expressed sequence tags representing members of the reactive oxygen species gene network was selected from the PopulusDB database to investigate AS mechanisms in Populus. Examples of all known types of AS were detected, but intron retention was the most common. Interestingly, the closest Arabidopsis (Arabidopsis thaliana) homologs of half of the AS genes identified in Populus are not reportedly alternatively spliced. Two genes encoding the protein of most interest in our study (high-isoelectric-point superoxide dismutase [hipI-SOD]) have been found in black cottonwood (Populus trichocarpa), designated PthipI-SODC1 and PthipI-SODC2. Analysis of the expressed sequence tag libraries has indicated the presence of two transcripts of PthipI-SODC1 (hipI-SODC1b and hipI-SODC1s). Alignment of these sequences with the PthipI-SODC1 gene showed that hipI-SODC1b was 69 bp longer than hipI-SODC1s due to an AS event involving the use of an alternative donor splice site in the sixth intron. Transcript analysis showed that the splice variant hipI-SODC1b was differentially expressed, being clearly expressed in cambial and xylem, but not phloem, regions. In addition, immunolocalization and mass spectrometric data confirmed the presence of hipI-SOD proteins in vascular tissue. The functionalities of the spliced gene products were assessed by expressing recombinant hipI-SOD proteins and in vitro SOD activity assays.
Widespread Use of Non-productive Alternative Splice Sites in Saccharomyces cerevisiae
Kawashima, Tadashi; Douglass, Stephen; Gabunilas, Jason; Pellegrini, Matteo; Chanfreau, Guillaume F.
2014-01-01
Saccharomyces cerevisiae has been used as a model system to investigate the mechanisms of pre-mRNA splicing but only a few examples of alternative splice site usage have been described in this organism. Using RNA-Seq analysis of nonsense-mediated mRNA decay (NMD) mutant strains, we show that many S. cerevisiae intron-containing genes exhibit usage of alternative splice sites, but many transcripts generated by splicing at these sites are non-functional because they introduce premature termination codons, leading to degradation by NMD. Analysis of splicing mutants combined with NMD inactivation revealed the role of specific splicing factors in governing the use of these alternative splice sites and identified novel functions for Prp17p in enhancing the use of branchpoint-proximal upstream 3′ splice sites and for Prp18p in suppressing the usage of a non-canonical AUG 3′-splice site in GCR1. The use of non-productive alternative splice sites can be increased in stress conditions in a promoter-dependent manner, contributing to the down-regulation of genes during stress. These results show that alternative splicing is frequent in S. cerevisiae but masked by RNA degradation and that the use of alternative splice sites in this organism is mostly aimed at controlling transcript levels rather than increasing proteome diversity. PMID:24722551
Jiang, Cong; Li, Yang; Li, Chaohui; Liu, Huiquan; Kang, Zhensheng; Xu, Jin-Rong
2016-01-01
PRP4 encodes the only kinase among the spliceosome components. Although it is an essential gene in the fission yeast and other eukaryotic organisms, the Fgprp4 mutant was viable in the wheat scab fungus Fusarium graminearum. Deletion of FgPRP4 did not block intron splicing but affected intron splicing efficiency in over 60% of the F. graminearum genes. The Fgprp4 mutant had severe growth defects and produced spontaneous suppressors that were recovered in growth rate. Suppressor mutations were identified in the PRP6, PRP31, BRR2, and PRP8 orthologs in nine suppressor strains by sequencing analysis with candidate tri-snRNP component genes. The Q86K mutation in FgMSL1 was identified by whole genome sequencing in suppressor mutant S3. Whereas two of the suppressor mutations in FgBrr2 and FgPrp8 were similar to those characterized in their orthologs in yeasts, suppressor mutations in Prp6 and Prp31 orthologs or FgMSL1 have not been reported. Interestingly, four and two suppressor mutations identified in FgPrp6 and FgPrp31, respectively, all are near the conserved Prp4-phosphorylation sites, suggesting that these mutations may have similar effects with phosphorylation by Prp4 kinase. In FgPrp31, the non-sense mutation at R464 resulted in the truncation of the C-terminal 130 aa region that contains all the conserved Prp4-phosphorylation sites. Deletion analysis showed that the N-terminal 310-aa rich in SR residues plays a critical role in the localization and functions of FgPrp4. We also conducted phosphoproteomics analysis with FgPrp4 and identified S289 as the phosphorylation site that is essential for its functions. These results indicated that FgPrp4 is critical for splicing efficiency but not essential for intron splicing, and FgPrp4 may regulate pre-mRNA splicing by phosphorylation of other components of the tri-snRNP although itself may be activated by phosphorylation at S289. PMID:27058959
Colwill, Karen; Wells, Clark D; Elder, Kelly; Goudreault, Marilyn; Hersi, Kadija; Kulkarni, Sarang; Hardy, W Rod; Pawson, Tony; Morin, Gregg B
2006-03-06
Recombinational systems have been developed to rapidly shuttle Open Reading Frames (ORFs) into multiple expression vectors in order to analyze the large number of cDNAs available in the post-genomic era. In the Creator system, an ORF introduced into a donor vector can be transferred with Cre recombinase to a library of acceptor vectors optimized for different applications. Usability of the Creator system is impacted by the ability to easily manipulate DNA, the number of acceptor vectors for downstream applications, and the level of protein expression from Creator vectors. To date, we have developed over 20 novel acceptor vectors that employ a variety of promoters and epitope tags commonly employed for proteomics applications and gene function analysis. We also made several enhancements to the donor vectors including addition of different multiple cloning sites to allow shuttling from pre-existing vectors and introduction of the lacZ alpha reporter gene to allow for selection. Importantly, in order to ameliorate any effects on protein expression of the loxP site between a 5' tag and ORF, we introduced a splicing event into our expression vectors. The message produced from the resulting 'Creator Splice' vector undergoes splicing in mammalian systems to remove the loxP site. Upon analysis of our Creator Splice constructs, we discovered that protein expression levels were also significantly increased. The development of new donor and acceptor vectors has increased versatility during the cloning process and made this system compatible with a wider variety of downstream applications. The modifications introduced in our Creator Splice system were designed to remove extraneous sequences due to recombination but also aided in downstream analysis by increasing protein expression levels. As a result, we can now employ epitope tags that are detected less efficiently and reduce our assay scale to allow for higher throughput. The Creator Splice system appears to be an extremely useful tool for proteomics.
Colwill, Karen; Wells, Clark D; Elder, Kelly; Goudreault, Marilyn; Hersi, Kadija; Kulkarni, Sarang; Hardy, W Rod; Pawson, Tony; Morin, Gregg B
2006-01-01
Background Recombinational systems have been developed to rapidly shuttle Open Reading Frames (ORFs) into multiple expression vectors in order to analyze the large number of cDNAs available in the post-genomic era. In the Creator system, an ORF introduced into a donor vector can be transferred with Cre recombinase to a library of acceptor vectors optimized for different applications. Usability of the Creator system is impacted by the ability to easily manipulate DNA, the number of acceptor vectors for downstream applications, and the level of protein expression from Creator vectors. Results To date, we have developed over 20 novel acceptor vectors that employ a variety of promoters and epitope tags commonly employed for proteomics applications and gene function analysis. We also made several enhancements to the donor vectors including addition of different multiple cloning sites to allow shuttling from pre-existing vectors and introduction of the lacZ alpha reporter gene to allow for selection. Importantly, in order to ameliorate any effects on protein expression of the loxP site between a 5' tag and ORF, we introduced a splicing event into our expression vectors. The message produced from the resulting 'Creator Splice' vector undergoes splicing in mammalian systems to remove the loxP site. Upon analysis of our Creator Splice constructs, we discovered that protein expression levels were also significantly increased. Conclusion The development of new donor and acceptor vectors has increased versatility during the cloning process and made this system compatible with a wider variety of downstream applications. The modifications introduced in our Creator Splice system were designed to remove extraneous sequences due to recombination but also aided in downstream analysis by increasing protein expression levels. As a result, we can now employ epitope tags that are detected less efficiently and reduce our assay scale to allow for higher throughput. The Creator Splice system appears to be an extremely useful tool for proteomics. PMID:16519801
Weak Negative and Positive Selection and the Drift Load at Splice Sites
Denisov, Stepan V.; Bazykin, Georgii A.; Sutormin, Roman; Favorov, Alexander V.; Mironov, Andrey A.; Gelfand, Mikhail S.; Kondrashov, Alexey S.
2014-01-01
Splice sites (SSs) are short sequences that are crucial for proper mRNA splicing in eukaryotic cells, and therefore can be expected to be shaped by strong selection. Nevertheless, in mammals and in other intron-rich organisms, many of the SSs often involve nonconsensus (Nc), rather than consensus (Cn), nucleotides, and beyond the two critical nucleotides, the SSs are not perfectly conserved between species. Here, we compare the SS sequences between primates, and between Drosophila fruit flies, to reveal the pattern of selection acting at SSs. Cn-to-Nc substitutions are less frequent, and Nc-to-Cn substitutions are more frequent, than neutrally expected, indicating, respectively, negative and positive selection. This selection is relatively weak (1 < |4Nes| < 4), and has a similar efficiency in primates and in Drosophila. Within some nucleotide positions, the positive selection in favor of Nc-to-Cn substitutions is weaker than the negative selection maintaining already established Cn nucleotides; this difference is due to site-specific negative selection favoring current Nc nucleotides. In general, however, the strength of negative selection protecting the Cn alleles is similar in magnitude to the strength of positive selection favoring replacement of Nc alleles, as expected under the simple nearly neutral turnover. In summary, although a fraction of the Nc nucleotides within SSs is maintained by selection, the abundance of deleterious nucleotides in this class suggests a substantial genome-wide drift load. PMID:24966225
EUGÈNE'HOM: a generic similarity-based gene finder using multiple homologous sequences
Foissac, Sylvain; Bardou, Philippe; Moisan, Annick; Cros, Marie-Josée; Schiex, Thomas
2003-01-01
EUGÈNE'HOM is a gene prediction software for eukaryotic organisms based on comparative analysis. EUGÈNE'HOM is able to take into account multiple homologous sequences from more or less closely related organisms. It integrates the results of TBLASTX analysis, splice site and start codon prediction and a robust coding/non-coding probabilistic model which allows EUGÈNE'HOM to handle sequences from a variety of organisms. The current target of EUGÈNE'HOM is plant sequences. The EUGÈNE'HOM web site is available at http://genopole.toulouse.inra.fr/bioinfo/eugene/EuGeneHom/cgi-bin/EuGeneHom.pl. PMID:12824408
Hot-spot KIF5A mutations cause familial ALS
Yilmaz, Rüstem; Müller, Kathrin; Grehl, Torsten; Petri, Susanne; Meyer, Thomas; Grosskreutz, Julian; Weydt, Patrick; Ruf, Wolfgang; Neuwirth, Christoph; Weber, Markus; Pinto, Susana; Claeys, Kristl G; Schrank, Berthold; Jordan, Berit; Knehr, Antje; Günther, Kornelia; Hübers, Annemarie; Zeller, Daniel; Kubisch, Christian; Jablonka, Sibylle; Klopstock, Thomas; de Carvalho, Mamede; Sperfeld, Anne; Borck, Guntram; Volk, Alexander E; Dorst, Johannes; Weis, Joachim; Otto, Markus; Schuster, Joachim; Del Tredici, Kelly; Braak, Heiko; Danzer, Karin M; Freischmidt, Axel; Meitinger, Thomas; Strom, Tim M; Ludolph, Albert C; Andersen, Peter M; Weishaupt, Jochen H; Weyen, Ute; Hermann, Andreas; Hagenacker, Tim; Koch, Jan Christoph; Lingor, Paul; Göricke, Bettina; Zierz, Stephan; Baum, Petra; Wolf, Joachim; Winkler, Andrea; Young, Peter; Bogdahn, Ulrich; Prudlo, Johannes; Kassubek, Jan
2018-01-01
Abstract Heterozygous missense mutations in the N-terminal motor or coiled-coil domains of the kinesin family member 5A (KIF5A) gene cause monogenic spastic paraplegia (HSP10) and Charcot-Marie-Tooth disease type 2 (CMT2). Moreover, heterozygous de novo frame-shift mutations in the C-terminal domain of KIF5A are associated with neonatal intractable myoclonus, a neurodevelopmental syndrome. These findings, together with the observation that many of the disease genes associated with amyotrophic lateral sclerosis disrupt cytoskeletal function and intracellular transport, led us to hypothesize that mutations in KIF5A are also a cause of amyotrophic lateral sclerosis. Using whole exome sequencing followed by rare variant analysis of 426 patients with familial amyotrophic lateral sclerosis and 6137 control subjects, we detected an enrichment of KIF5A splice-site mutations in amyotrophic lateral sclerosis (2/426 compared to 0/6137 in controls; P = 4.2 × 10−3), both located in a hot-spot in the C-terminus of the protein and predicted to affect splicing exon 27. We additionally show co-segregation with amyotrophic lateral sclerosis of two canonical splice-site mutations in two families. Investigation of lymphoblast cell lines from patients with KIF5A splice-site mutations revealed the loss of mutant RNA expression and suggested haploinsufficiency as the most probable underlying molecular mechanism. Furthermore, mRNA sequencing of a rare non-synonymous missense mutation (predicting p.Arg1007Gly) located in the C-terminus of the protein shortly upstream of the splice donor of exon 27 revealed defective KIF5A pre-mRNA splicing in respective patient-derived cell lines owing to abrogation of the donor site. Finally, the non-synonymous single nucleotide variant rs113247976 (minor allele frequency = 1.00% in controls, n = 6137), also located in the C-terminal region [p.(Pro986Leu) in exon 26], was significantly enriched in familial amyotrophic lateral sclerosis patients (minor allele frequency = 3.40%; P = 1.28 × 10−7). Our study demonstrates that mutations located specifically in a C-terminal hotspot of KIF5A can cause a classical amyotrophic lateral sclerosis phenotype, and underline the involvement of intracellular transport processes in amyotrophic lateral sclerosis pathogenesis. PMID:29342275
Hot-spot KIF5A mutations cause familial ALS.
Brenner, David; Yilmaz, Rüstem; Müller, Kathrin; Grehl, Torsten; Petri, Susanne; Meyer, Thomas; Grosskreutz, Julian; Weydt, Patrick; Ruf, Wolfgang; Neuwirth, Christoph; Weber, Markus; Pinto, Susana; Claeys, Kristl G; Schrank, Berthold; Jordan, Berit; Knehr, Antje; Günther, Kornelia; Hübers, Annemarie; Zeller, Daniel; Kubisch, Christian; Jablonka, Sibylle; Sendtner, Michael; Klopstock, Thomas; de Carvalho, Mamede; Sperfeld, Anne; Borck, Guntram; Volk, Alexander E; Dorst, Johannes; Weis, Joachim; Otto, Markus; Schuster, Joachim; Del Tredici, Kelly; Braak, Heiko; Danzer, Karin M; Freischmidt, Axel; Meitinger, Thomas; Strom, Tim M; Ludolph, Albert C; Andersen, Peter M; Weishaupt, Jochen H
2018-01-12
Heterozygous missense mutations in the N-terminal motor or coiled-coil domains of the kinesin family member 5A (KIF5A) gene cause monogenic spastic paraplegia (HSP10) and Charcot-Marie-Tooth disease type 2 (CMT2). Moreover, heterozygous de novo frame-shift mutations in the C-terminal domain of KIF5A are associated with neonatal intractable myoclonus, a neurodevelopmental syndrome. These findings, together with the observation that many of the disease genes associated with amyotrophic lateral sclerosis disrupt cytoskeletal function and intracellular transport, led us to hypothesize that mutations in KIF5A are also a cause of amyotrophic lateral sclerosis. Using whole exome sequencing followed by rare variant analysis of 426 patients with familial amyotrophic lateral sclerosis and 6137 control subjects, we detected an enrichment of KIF5A splice-site mutations in amyotrophic lateral sclerosis (2/426 compared to 0/6137 in controls; P = 4.2 × 10-3), both located in a hot-spot in the C-terminus of the protein and predicted to affect splicing exon 27. We additionally show co-segregation with amyotrophic lateral sclerosis of two canonical splice-site mutations in two families. Investigation of lymphoblast cell lines from patients with KIF5A splice-site mutations revealed the loss of mutant RNA expression and suggested haploinsufficiency as the most probable underlying molecular mechanism. Furthermore, mRNA sequencing of a rare non-synonymous missense mutation (predicting p.Arg1007Gly) located in the C-terminus of the protein shortly upstream of the splice donor of exon 27 revealed defective KIF5A pre-mRNA splicing in respective patient-derived cell lines owing to abrogation of the donor site. Finally, the non-synonymous single nucleotide variant rs113247976 (minor allele frequency = 1.00% in controls, n = 6137), also located in the C-terminal region [p.(Pro986Leu) in exon 26], was significantly enriched in familial amyotrophic lateral sclerosis patients (minor allele frequency = 3.40%; P = 1.28 × 10-7). Our study demonstrates that mutations located specifically in a C-terminal hotspot of KIF5A can cause a classical amyotrophic lateral sclerosis phenotype, and underline the involvement of intracellular transport processes in amyotrophic lateral sclerosis pathogenesis. © The Author(s) (2018). Published by Oxford University Press on behalf of the Guarantors of Brain.
Language study on Spliced Semigraph using Folding techniques
NASA Astrophysics Data System (ADS)
Thiagarajan, K.; Padmashree, J.
2018-04-01
In this paper, we proposed algorithm to identify cut vertices and cut edges for n-Cut Spliced Semigraph and splicing the n-Cut Spliced Semigraph using cut vertices else cut edges or combination of cut vertex and cut edge and applying sequence of folding to the spliced semigraph to obtain the semigraph quadruple η(S)=(2, 1, 1, 1). We observed that the splicing and folding using both cut vertices and cut edges is applicable only for n-Cut Spliced Semigraph where n > 2. Also, we transformed the spliced semigraph into tree structure and studied the language for the semigraph with n+2 vertices and n+1 semivertices using Depth First Edge Sequence algorithm and obtain the language structure with sequence of alphabet ‘a’ and ‘b’.
Piekielko-Witkowska, Agnieszka; Kedzierska, Hanna; Poplawski, Piotr; Wojcicka, Anna; Rybicka, Beata; Maksymowicz, Maria; Grajkowska, Wieslawa; Matyja, Ewa; Mandat, Tomasz; Bonicki, Wieslaw; Nauman, Pawel
2013-06-01
Pituitary tumors belong to the group of most common neoplasms of the sellar region. Iodothyronine deiodinase types 1 (DIO1) and 2 (DIO2) are enzymes contributing to the levels of locally synthesized T3, a hormone regulating key physiological processes in the pituitary, including its development, cellular proliferation, and hormone secretion. Previous studies revealed that the expression of deiodinases in pituitary tumors is variable and, moreover, there is no correlation between mRNA and protein products of the particular gene, suggesting the potential role of posttranscriptional regulatory mechanisms. In this work we hypothesized that one of such mechanisms could be the alternative splicing. Therefore, we analyzed expression and sequences of DIO1 and DIO2 splicing variants in 30 pituitary adenomas and 9 non-tumorous pituitary samples. DIO2 mRNA was expressed as only two mRNA isoforms. In contrast, nine splice variants of DIO1 were identified. Among them, five were devoid of exon 3. In silico sequence analysis of DIO1 revealed multiple putative binding sites for splicing factor SF2/ASF, of which the top-ranked sites were located in exon 3. Silencing of SF2/ASF in pituitary tumor GH3 cells resulted in change of ratio between DIO1 isoforms with or without exon 3, favoring the expression of variants without exon 3. The expression of SF2/ASF mRNA in pituitary tumors was increased when compared with non-neoplastic control samples. In conclusion, we provide a new mechanism of posttranscriptional regulation of DIO1 and show deregulation of DIO1 expression in pituitary adenoma, possibly resulting from disturbed expression of SF2/ASF. Copyright © 2013 Elsevier B.V. All rights reserved.
Characterization of a splicing mutation in group A xeroderma pigmentosum
DOE Office of Scientific and Technical Information (OSTI.GOV)
Satokata, Ichiro; Tanaka, Kiyoji; Miura, Naoyuki
1990-12-01
The molecular basis of group A xeroderma pigmentosum (WP) was investigated by comparison of the nucleotide sequences of multiple clones of the XP group A complementing gene (XPAC) from a patient with group A XP with that of a normal gene. The clones showed a G {r arrow} C substitution at the 3{prime} splice acceptor site of intron 3, which altered the obligatory AG acceptor dinucleotide to AC. Nucleotide sequencing of cDNAs amplified by the polymerase chain reaction revealed that this single base substitution abolishes the canonical 3{prime} splice site, thus creating two abnormally spliced mRNA forms. The larger formmore » is identical with normal mRNA except for a dinucleotide deletion at the 5{prime} end of exon 4. This deletion results in a frameshift with premature translation termination in exon 4. The smaller form has a deletion of the entire exon 3 and the dinucleotide at the 5{prime} end of exon 4. The result of a transfection study provided additional evidence that this single base substitution is the disease-causing mutation. This single base substitution creates a new cleavage site for the restriction nuclease AlwNI. Analysis of AlwNI restriction fragment length polymorphism showed a high frequency of this mutation in Japanese patients with group A XP: 16 of 21 unrelated Japanese patients were homozygous and 4 were heterozygous for this mutation. However, 11 Caucasians and 2 Blacks with group A XP did not have this mutant allele. The polymorphic AlwNI restriction fragments are concluded to be useful for diagnosis of group A XP in Japanese subjects, including prenatal cases and carriers.« less
Becker, Y; Asher, Y; Tabor, E; Davidson, I; Malkinson, M
1994-01-01
A DNA segment of the MDV-1 BamHI-D fragment was sequenced, and the open reading frames (ORFs) present in the 4556 nucleotide fragment were analyzed by computer programs. Computer analysis identified 19 putative ORFs in the sequence ranging from a coding capacity of 37 amino acids (aa) (ORF-1a) to 684aa (ORF-1). The special properties of four ORFs (1a, 1, 2, and 3) were investigated. Two adjacent ORFs, ORF-1a and ORF-1, were found by computer analysis to have the properties of two introns encoding a glycoprotein: ORF-1a encodes an aa sequence with the properties of a signal peptide, and ORF-1 encodes a polypeptide with a membrane anchor domain and putative N-glycosylation sites in the aa sequence. ORF-1a and ORF-1 were found to be transcribed in MDV-1-infected cells. Two RNA transcripts were detected: a precursor RNA and its spliced form. Both are transcribed from a promoter located 5' to ORF-1a, and splice donor and acceptor sites are used to splice the mRNA after cleavage of a 71-nucleotide sequence. This finding suggest that ORF-1a and ORF-1 are two introns of a new MDV-1 glycoprotein gene. The DNA sequence containing ORF-1 was transiently expressed in COS-1 cells, and the viral protein produced in these cells was found to react with anti-MDV serotype-1 Antigen B-specific monoclonal antibodies. These studies indicate that the protein encoded by ORF-1 has antigenic properties resembling Antigen B of MDV-1. A gene homologous to ORF-1 was detected in the genome of both MDV-2(SB1) and MDV-3(HVT), which serve as commercial vaccine strains. Two additional ORFs were noted in the 4556 nucleotide sequence: ORF-2, which encodes a 333 aa polypeptide initiating in the UL and terminating in the TRL prior to the putative origin of replication, and ORF-3, which encodes a 155 aa polypeptide that is partly homologous to the phosphoprotein pp38 encoded by the BamHI-H sequence. The 65 N-terminal aa of the two gene products are identical, both being derived from the nucleotide sequences in the TRL and IRL, respectively. Additional homologous aa sequences are the hydrophobic aa domain in the middle of both proteins. The functions of ORF-2, ORF-3, and additional ORFs are under study.
Munroe, Stephen H.; Morales, Christopher H.; Duyck, Tessa H.; Waters, Paul D.
2015-01-01
The α-thyroid hormone receptor gene (TRα) codes for two functionally distinct proteins: TRα1, the α-thyroid hormone receptor; and TRα2, a non-hormone-binding variant. The final exon of TRα2 mRNA overlaps the 3’ end of Rev-erbα mRNA, which encodes another nuclear receptor on the opposite strand of DNA. To understand the evolution of this antisense overlap, we sequenced these genes and mRNAs in the platypus Orthorhynchus anatinus. Despite its strong homology with other mammals, the platypus TRα/Rev-erbα locus lacks elements essential for expression of TRα2. Comparative analysis suggests that alternative splicing of TRα2 mRNA expression evolved in a stepwise fashion before the divergence of eutherian and marsupial mammals. A short G-rich element (G30) located downstream of the alternative 3’splice site of TRα2 mRNA and antisense to the 3’UTR of Rev-erbα plays an important role in regulating TRα2 splicing. G30 is tightly conserved in eutherian mammals, but is absent in marsupials and monotremes. Systematic deletions and substitutions within G30 have dramatically different effects on TRα2 splicing, leading to either its inhibition or its enhancement. Mutations that disrupt one or more clusters of G residues enhance splicing two- to three-fold. These results suggest the G30 sequence can adopt a highly structured conformation, possibly a G-quadruplex, and that it is part of a complex splicing regulatory element which exerts both positive and negative effects on TRα2 expression. Since mutations that strongly enhance splicing in vivo have no effect on splicing in vitro, it is likely that the regulatory role of G30 is mediated through linkage of transcription and splicing. PMID:26368571
Liu, H X; Goodall, G J; Kole, R; Filipowicz, W
1995-01-16
We have performed a systematic study of the effect of artificial hairpins on pre-mRNA splicing in protoplasts of a dicot plant, Nicotiana plumbaginifolia. Hairpins with a potential to form 18 or 24 bp stems strongly inhibit splicing when they sequester the 5' splice site or are placed in the middle of short introns. However, similar 24 bp hairpins sequestering the 3' splice site do not prevent this site from being used as an acceptor. Utilization of the stem-located 3' site requires that the base of the stem is separated from the upstream 5' splice site by a minimum of approximately 45 nucleotides and that another 'helper' 3' splice site is present downstream of the stem. The results indicate that the spliceosome or factors associated with it may have a potential to unfold secondary structure present in the downstream portion of the intron, prior to or at the step of the 3' splice site selection. The finding that the helper 3' site is required for utilization of the stem-located acceptor confirms and extends previous observations, obtained with HeLa cell in vitro splicing systems, indicating that the 3' splice site may be recognized at least twice during spliceosome assembly.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Oestberg, Sara, E-mail: sara.ostberg@imbim.uu.se; Toermaenen Persson, Heidi, E-mail: heidi.tormanen.persson@imbim.uu.se; Akusjaervi, Goeran, E-mail: goran.akusjarvi@imbim.uu.se
2012-11-25
The adenovirus L4-33K protein is a key regulator involved in the temporal shift from early to late pattern of mRNA expression from the adenovirus major late transcription unit. L4-33K is a virus-encoded alternative splicing factor, which enhances processing of 3 Prime splice sites with a weak sequence context. Here we show that L4-33K expressed from a plasmid is localized at the nuclear margin of uninfected cells. During an infection L4-33K is relocalized to the periphery of E2A-72K containing viral replication centers. We also show that serine 192 in the tiny RS repeat of the conserved carboxy-terminus of L4-33K, which ismore » critical for the splicing enhancer function of L4-33K, is necessary for the nuclear localization and redistribution of the protein to viral replication sites. Collectively, our results show a good correlation between the activity of L4-33K as a splicing enhancer protein and its localization to the periphery of viral replication centers.« less
Circular RNAs: Unexpected outputs of many protein-coding genes
Wilusz, Jeremy E.
2017-01-01
ABSTRACT Pre-mRNAs from thousands of eukaryotic genes can be non-canonically spliced to generate circular RNAs, some of which accumulate to higher levels than their associated linear mRNA. Recent work has revealed widespread mechanisms that dictate whether the spliceosome generates a linear or circular RNA. For most genes, circular RNA biogenesis via backsplicing is far less efficient than canonical splicing, but circular RNAs can accumulate due to their long half-lives. Backsplicing is often initiated when complementary sequences from different introns base pair and bring the intervening splice sites close together. This process is further regulated by the combinatorial action of RNA binding proteins, which allow circular RNAs to be expressed in unique patterns. Some genes do not require complementary sequences to generate RNA circles and instead take advantage of exon skipping events. It is still unclear what most mature circular RNAs do, but future investigations into their functions will be facilitated by recently described methods to modulate circular RNA levels. PMID:27571848
iCLIP Predicts the Dual Splicing Effects of TIA-RNA Interactions
Briese, Michael; Zarnack, Kathi; Luscombe, Nicholas M.; Rot, Gregor; Zupan, Blaž; Curk, Tomaž; Ule, Jernej
2010-01-01
The regulation of alternative splicing involves interactions between RNA-binding proteins and pre-mRNA positions close to the splice sites. T-cell intracellular antigen 1 (TIA1) and TIA1-like 1 (TIAL1) locally enhance exon inclusion by recruiting U1 snRNP to 5′ splice sites. However, effects of TIA proteins on splicing of distal exons have not yet been explored. We used UV-crosslinking and immunoprecipitation (iCLIP) to find that TIA1 and TIAL1 bind at the same positions on human RNAs. Binding downstream of 5′ splice sites was used to predict the effects of TIA proteins in enhancing inclusion of proximal exons and silencing inclusion of distal exons. The predictions were validated in an unbiased manner using splice-junction microarrays, RT-PCR, and minigene constructs, which showed that TIA proteins maintain splicing fidelity and regulate alternative splicing by binding exclusively downstream of 5′ splice sites. Surprisingly, TIA binding at 5′ splice sites silenced distal cassette and variable-length exons without binding in proximity to the regulated alternative 3′ splice sites. Using transcriptome-wide high-resolution mapping of TIA-RNA interactions we evaluated the distal splicing effects of TIA proteins. These data are consistent with a model where TIA proteins shorten the time available for definition of an alternative exon by enhancing recognition of the preceding 5′ splice site. Thus, our findings indicate that changes in splicing kinetics could mediate the distal regulation of alternative splicing. PMID:21048981
Laurie, Andrew D; Kyle, Campbell V
Type I hyperlipoproteinemia, manifesting as chylomicronemia and severe hypertriglyceridemia, is a rare autosomal recessive disorder usually caused by mutations in the lipoprotein lipase gene (LPL). We sought to determine whether mutations in LPL could explain the clinical indications of a patient presenting with pancreatitis and hypertriglyceridemia. Coding regions of LPL were amplified by polymerase chain reaction and analyzed by nucleotide sequencing. The LPL messenger RNA transcript was also analyzed to investigate whether alternative splicing was occurring. The patient was homozygous for the mutation c.767_768insTAAATATT in exon 5 of the LPL gene. This mutation is predicted to result in either a truncated nonfunctional LPL, or alternatively a new 5' donor splice site may be used, resulting in a full-length LPL with an in-frame deletion of 3 amino acids. Analysis of messenger RNA from the patient showed that the new splice site is used in vivo. Homozygosity for a mutation in the LPL gene was consistent with the clinical findings. Use of the new splice site created by the insertion mutation rescues an otherwise damaging frameshift mutation, resulting in expression of an almost full-length LPL that is predicted to be partially functional. The patient therefore has a less severe form of type I hyperlipoproteinemia than would be expected if she lacked any functional LPL. Copyright © 2017 National Lipid Association. Published by Elsevier Inc. All rights reserved.
Homologous SV40 RNA trans-splicing
Eul, Joachim; Patzel, Volker
2013-01-01
Simian Virus 40 (SV40) is a polyomavirus found in both monkeys and humans, which causes cancer in some animal models. In humans, SV40 has been reported to be associated with cancers but causality has not been proven yet. The transforming activity of SV40 is mainly due to its 94-kD large T antigen, which binds to the retinoblastoma (pRb) and p53 tumor suppressor proteins, and thereby perturbs their functions. Here we describe a 100 kD super T antigen harboring a duplication of the pRB binding domain that was associated with unusual high cell transformation activity and that was generated by a novel mechanism involving homologous RNA trans-splicing of SV40 early transcripts in transformed rodent cells. Enhanced trans-splice activity was observed in clones carrying a single point mutation in the large T antigen 5′ donor splice site (ss). This mutation impaired cis-splicing in favor of an alternative trans-splice reaction via a cryptic 5′ss within a second cis-spliced SV40 pre-mRNA molecule and enabled detectable gene expression. Next to the cryptic 5′ss we identified additional trans-splice helper functions, including putative dimerization domains and a splice enhancer sequence. Our findings suggest RNA trans-splicing as a SV40-intrinsic mechanism that supports the diversification of viral RNA and phenotypes. PMID:24178438
Expanding the action of duplex RNAs into the nucleus: redirecting alternative splicing
Liu, Jing; Hu, Jiaxin; Corey, David R.
2012-01-01
Double-stranded RNAs are powerful agents for silencing gene expression in the cytoplasm of mammalian cells. The potential for duplex RNAs to control expression in the nucleus has received less attention. Here, we investigate the ability of small RNAs to redirect splicing. We identify RNAs targeting an aberrant splice site that restore splicing and production of functional protein. RNAs can target sequences within exons or introns and affect the inclusion of exons within SMN2 and dystrophin, genes responsible for spinal muscular atrophy and Duchenne muscular dystrophy, respectively. Duplex RNAs recruit argonaute 2 (AGO2) to pre-mRNA transcripts and altered splicing requires AGO2 expression. AGO2 promotes transcript cleavage in the cytoplasm, but recruitment of AGO2 to pre-mRNAs does not reduce transcript levels, exposing a difference between cytoplasmic and nuclear pathways. Involvement of AGO2 in splicing, a classical nuclear process, reinforces the conclusion from studies of RNA-mediated transcriptional silencing that RNAi pathways can be adapted to function in the mammalian nucleus. These data provide a new strategy for controlling splicing and expand the reach of small RNAs within the nucleus of mammalian cells. PMID:21948593
Application of hidden Markov models to biological data mining: a case study
NASA Astrophysics Data System (ADS)
Yin, Michael M.; Wang, Jason T.
2000-04-01
In this paper we present an example of biological data mining: the detection of splicing junction acceptors in eukaryotic genes. Identification or prediction of transcribed sequences from within genomic DNA has been a major rate-limiting step in the pursuit of genes. Programs currently available are far from being powerful enough to elucidate the gene structure completely. Here we develop a hidden Markov model (HMM) to represent the degeneracy features of splicing junction acceptor sites in eukaryotic genes. The HMM system is fully trained using an expectation maximization (EM) algorithm and the system performance is evaluated using the 10-way cross- validation method. Experimental results show that our HMM system can correctly classify more than 94% of the candidate sequences (including true and false acceptor sites) into right categories. About 90% of the true acceptor sites and 96% of the false acceptor sites in the test data are classified correctly. These results are very promising considering that only the local information in DNA is used. The proposed model will be a very important component of an effective and accurate gene structure detection system currently being developed in our lab.
A role for exon sequences in alternative splicing of the human fibronectin gene.
Mardon, H J; Sebastio, G; Baralle, F E
1987-01-01
Exon EDIIIA of the fibronectin (Fn) gene is alternatively spliced via pathways which either skip or include the whole exon in the messenger RNA (mRNA). We have investigated the role of EDIIIA exon sequences in the human Fn gene in determining alternative splicing of this exon during transient expression of alpha globin/Fn minigene hybrids in HeLa cells. We demonstrate that a DNA sequence of 81bp within the central region of exon EDIIIA is required for alternative splicing during processing of the primary transcript to generate both EDIIIA+ and EDIIIA- mRNA's. Furthermore, alternative splicing of EDIIIA only occurs when this sequence is present in the correct orientation since when it is in antisense orientation splicing always occurs via exon-skipping generating EDIIIA- mRNA. Images PMID:3671064
Unusual molecular findings in Kindler syndrome.
Arita, K; Wessagowit, V; Inamadar, A C; Palit, A; Fassihi, H; Lai-Cheong, J E; Pourreyron, C; South, A P; McGrath, J A
2007-12-01
Kindler syndrome (KS) is a rare inherited skin disorder with blistering and poikiloderma as its main clinical features. It is caused by loss-of-function mutations in the C20orf42 (KIND1) gene which encodes kindlin-1, an actin cytoskeleton-focal contact-associated protein which is predominantly expressed in keratinocytes. We investigated the molecular basis of KS in a 16-year-old Indian boy who had additional clinical findings, including scleroatrophic changes of the hands and feet, pseudoainhum and early onset of squamous cell carcinoma on his foot. Immunostaining for kindlin-1 in the patient's skin was completely absent and sequencing of C20orf42 (KIND1) genomic DNA showed a homozygous splice-site mutation at the -6 position, IVS9-6T-->A. Amplification and sequencing of cDNA from the skin revealed aberrant splicing with either deletion of exon 10 or deletion of exons 9, 10 and 11, both of which involve loss of the pleckstrin homology domain of kindlin-1 that is thought to play a role in cytoskeletal attachment and integrin-mediated cell signalling. Pathogenic splice-site mutations at the -6 position are unusual and have rarely been reported for any genetic disorder. Collectively, these findings extend the spectrum of clinical and molecular abnormalities in this rare genodermatosis.
Survey of gene splicing algorithms based on reads.
Si, Xiuhua; Wang, Qian; Zhang, Lei; Wu, Ruo; Ma, Jiquan
2017-11-02
Gene splicing is the process of assembling a large number of unordered short sequence fragments to the original genome sequence as accurately as possible. Several popular splicing algorithms based on reads are reviewed in this article, including reference genome algorithms and de novo splicing algorithms (Greedy-extension, Overlap-Layout-Consensus graph, De Bruijn graph). We also discuss a new splicing method based on the MapReduce strategy and Hadoop. By comparing these algorithms, some conclusions are drawn and some suggestions on gene splicing research are made.
López-Cuadros, Itzia; García-Gasca, Alejandra; Gomez-Anduro, Gracia; Escobedo-Fregoso, Cristina; Llera-Herrera, Raúl A; Ibarra, Ana M
2018-08-20
The Pacific white shrimp Penaeus vannamei is the most cultured shrimp species around the world. Because females grow larger than males, the culture of 'only females' is of great interest, but knowledge on sex determination and differentiation is required for producing only females. In an effort to obtain information associated with reproduction in P. vannamei, transcriptomic data from female gonads was generated, and partial sequences of a transcript were identified as Sex-lethal (Sxl). Its characterization indicated that, differently from other penaeids in which this gene has been isolated, there are six isoforms of the Sxl transcript in P. vannamei (PvanSxl 1-6). These isoforms result from alternative splicing at three splice sites (SS1, SS2, SS3). The first splice-site is unique to P. vannamei, as it has not been reported for other Arthropod species; the second splice-site (SS2) is common among crustaceans, and the third splice-site (SS3) is also unique to P. vannamei and when spliced-out, it is always together with SS2. All isoforms are expressed during embryogenesis as well as gametogenesis of both genders. The two shorter isoforms, PvanSxl-5 and PvanSxl-6, which result from the splicing of SS2 and SS3, were found mostly expressed in adult testis, but PvanSxl-6 was also expressed in oocytes during gametogenesis. During oogenesis, the second largest isoform, PvanSxl-2, which splices-out only SS1, and PvanSxl-4 that splices-out SS1 and SS2 were highly expressed. These two isoforms were also highly expressed during embryonic development. In situ hybridization allowed pinpointing more specifically the cells where the PvanSxl transcripts were expressed. During embryogenesis, hybridization was observed from the one-cell stage embryo to late gastrula. In the female gonad in previtellogenesis, hybridization occurred in the nucleus of oocytes, whereas in secondary vitellogenesis the transcript also hybridized cytoplasmic granules and cortical crypts. Finally, in situ hybridization corroborated the expression of PvanSxl also in the male gonad during spermatogenesis, mostly occurring in the cytoplasm from spermatogonia and spermatocytes. Copyright © 2018 Elsevier B.V. All rights reserved.
ACTG: novel peptide mapping onto gene models.
Choi, Seunghyuk; Kim, Hyunwoo; Paek, Eunok
2017-04-15
In many proteogenomic applications, mapping peptide sequences onto genome sequences can be very useful, because it allows us to understand origins of the gene products. Existing software tools either take the genomic position of a peptide start site as an input or assume that the peptide sequence exactly matches the coding sequence of a given gene model. In case of novel peptides resulting from genomic variations, especially structural variations such as alternative splicing, these existing tools cannot be directly applied unless users supply information about the variant, either its genomic position or its transcription model. Mapping potentially novel peptides to genome sequences, while allowing certain genomic variations, requires introducing novel gene models when aligning peptide sequences to gene structures. We have developed a new tool called ACTG (Amino aCids To Genome), which maps peptides to genome, assuming all possible single exon skipping, junction variation allowing three edit distances from the original splice sites, exon extension and frame shift. In addition, it can also consider SNVs (single nucleotide variations) during mapping phase if a user provides the VCF (variant call format) file as an input. Available at http://prix.hanyang.ac.kr/ACTG/search.jsp . eunokpaek@hanyang.ac.kr. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
Citterio, Cintia E; Morales, Cecilia M; Bouhours-Nouet, Natacha; Machiavelli, Gloria A; Bueno, Elena; Gatelais, Frédérique; Coutant, Regis; González-Sarmiento, Rogelio; Rivolta, Carina M; Targovnik, Héctor M
2015-03-15
Several patients were identified with dyshormonogenesis caused by mutations in the thyroglobulin (TG) gene. These defects are inherited in an autosomal recessive manner and affected individuals are either homozygous or compound heterozygous for the mutations. The aim of the present study was to identify new TG mutations in a patient of Vietnamese origin affected by congenital hypothyroidism, goiter and low levels of serum TG. DNA sequencing identified the presence of compound heterozygous mutations in the TG gene: the maternal mutation consists of a novel c.745+1G>A (g.IVS6 + 1G>A), whereas the hypothetical paternal mutation consists of a novel c.7036+2T>A (g.IVS40 + 2T>A). The father was not available for segregation analysis. Ex-vivo splicing assays and subsequent RT-PCR analyses were performed on mRNA isolated from the eukaryotic-cells transfected with normal and mutant expression vectors. Minigene analysis of the c.745+1G>A mutant showed that the exon 6 is skipped during pre-mRNA splicing or partially included by use of a cryptic 5' splice site located to 55 nucleotides upstream of the authentic exon 6/intron 6 junction site. The functional analysis of c.7036+2T>A mutation showed a complete skipping of exon 40. The theoretical consequences of splice site mutations, predicted with the bioinformatics tool NNSplice, Fsplice, SPL, SPLM and MaxEntScan programs were investigated and evaluated in relation with the experimental evidence. These analyses predicted that both mutant alleles would result in the abolition of the authentic splice donor sites. The c.745+1G>A mutation originates two putative truncated proteins of 200 and 1142 amino acids, whereas c.7036+2T>A mutation results in a putative truncated protein of 2277 amino acids. In conclusion, we show that the c.745+1G>A mutation promotes the activation of a new cryptic donor splice site in the exon 6 of the TG gene. The functional consequences of these mutations could be structural changes in the protein molecule that alter the biosynthesis of thyroid hormones. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Zhang, Zijun; Xing, Yi
2017-09-19
Crosslinking or RNA immunoprecipitation followed by sequencing (CLIP-seq or RIP-seq) allows transcriptome-wide discovery of RNA regulatory sites. As CLIP-seq/RIP-seq reads are short, existing computational tools focus on uniquely mapped reads, while reads mapped to multiple loci are discarded. We present CLAM (CLIP-seq Analysis of Multi-mapped reads). CLAM uses an expectation-maximization algorithm to assign multi-mapped reads and calls peaks combining uniquely and multi-mapped reads. To demonstrate the utility of CLAM, we applied it to a wide range of public CLIP-seq/RIP-seq datasets involving numerous splicing factors, microRNAs and m6A RNA methylation. CLAM recovered a large number of novel RNA regulatory sites inaccessible by uniquely mapped reads. The functional significance of these sites was demonstrated by consensus motif patterns and association with alternative splicing (splicing factors), transcript abundance (AGO2) and mRNA half-life (m6A). CLAM provides a useful tool to discover novel protein-RNA interactions and RNA modification sites from CLIP-seq and RIP-seq data, and reveals the significant contribution of repetitive elements to the RNA regulatory landscape of the human transcriptome. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Characterization of a rare variant (c.2635-2A>G) of the MSH2 gene in a family with Lynch syndrome.
Cariola, Filomena; Disciglio, Vittoria; Valentini, Anna M; Lotesoriere, Claudio; Fasano, Candida; Forte, Giovanna; Russo, Luciana; Di Carlo, Antonio; Guglielmi, Floranna; Manghisi, Andrea; Lolli, Ivan; Caruso, Maria L; Simone, Cristiano
2018-04-01
Lynch syndrome is caused by germline mutations in one of the mismatch repair genes ( MLH1, MSH2, MSH6, and PMS2) or in the EPCAM gene. Lynch syndrome is defined on the basis of clinical, pathological, and genetic findings. Accordingly, the identification of predisposing genes allows for accurate risk assessment and tailored screening protocols. Here, we report a family case with three family members manifesting the Lynch syndrome phenotype, all of which harbor the rare variant c.2635-2A>G affecting the splice site consensus sequence of intron 15 of the MSH2 gene. This mutation was previously described only in one family with Lynch syndrome, in which mismatch repair protein expression in tumor tissues was not assessed. In this study, we report for the first time the molecular characterization of the MSH2 c.2635-2A>G variant through in silico prediction analysis, microsatellite instability, and mismatch repair protein expression experiments on tumor tissues of Lynch syndrome patients. The potential effect of the splice site variant was revealed by three splicing prediction bioinformatics tools, which suggested the generation of a new cryptic splicing site. The potential pathogenic role of this variant was also revealed by the presence of microsatellite instability and the absence of MSH2/MSH6 heterodimer protein expression in the tumor cells of cancer tissues of the affected family members. We provide compelling evidence in favor of the pathogenic role of the MSH2 variant c.2635-2A>G, which could induce an alteration of the canonical splice site and consequently an aberrant form of the protein product (MSH2).
Mullis, Primus E; Robinson, Iain C A F; Salemi, Souzan; Eblé, Andrée; Besson, Amélie; Vuissoz, Jean-Marc; Deladoey, Johnny; Simon, Dominique; Czernichow, Paul; Binder, Gerhard
2005-04-01
Four distinct familial types of isolated GH deficiency have been described so far, of which type II is the autosomal dominant inherited form. It is mainly caused by mutations within the first 6 bp of intervening sequence 3. However, other splice site and missense mutations have been reported. Based on in vitro experiments and transgenic animal data, there is strong evidence that there is a wide variability in phenotype in terms of the severity of GH deficiency. Therefore, we studied a total of 57 subjects belonging to 19 families suffering from different splice site as well as missense mutations within the GH-1 gene. The subjects presenting with a splice site mutation within the first 2 bp of intervening sequence 3 (5'IVS +1/+2 bp) leading to a skipping of exon 3 were found to be more likely to present in the follow-up with other pituitary hormone deficiencies. In addition, although the patients with missense mutations have previously been reported to be less affected, a number of patients presenting with the P89L missense GH form, showed some pituitary hormone impairment. The development of multiple hormonal deficiencies is not age dependent, and there is a clear variability in onset, severity, and progression, even within the same families. The message of clinical importance from these studies is that the pituitary endocrine status of all such patients should continue to be monitored closely over the years because further hormonal deficiencies may evolve with time.
A single splice site mutation in human-specific ARHGAP11B causes basal progenitor amplification
Florio, Marta; Namba, Takashi; Pääbo, Svante; Hiller, Michael; Huttner, Wieland B.
2016-01-01
The gene ARHGAP11B promotes basal progenitor amplification and is implicated in neocortex expansion. It arose on the human evolutionary lineage by partial duplication of ARHGAP11A, which encodes a Rho guanosine triphosphatase–activating protein (RhoGAP). However, a lack of 55 nucleotides in ARHGAP11B mRNA leads to loss of RhoGAP activity by GAP domain truncation and addition of a human-specific carboxy-terminal amino acid sequence. We show that these 55 nucleotides are deleted by mRNA splicing due to a single C→G substitution that creates a novel splice donor site. We reconstructed an ancestral ARHGAP11B complementary DNA without this substitution. Ancestral ARHGAP11B exhibits RhoGAP activity but has no ability to increase basal progenitors during neocortex development. Hence, a single nucleotide substitution underlies the specific properties of ARHGAP11B that likely contributed to the evolutionary expansion of the human neocortex. PMID:27957544
Nuzzo, F; Bulato, C; Nielsen, B I; Lee, K; Wielders, S J; Simioni, P; Key, N S; Castoldi, E
2015-03-01
Coagulation factor V (FV) deficiency is a rare autosomal recessive bleeding disorder. We investigated a patient with severe FV deficiency (FV:C < 3%) and moderate bleeding symptoms. Thrombin generation experiments showed residual FV expression in the patient's plasma, which was quantified as 0.7 ± 0.3% by a sensitive prothrombinase-based assay. F5 gene sequencing identified a novel missense mutation in exon 4 (c.578G>C, p.Cys193Ser), predicting the abolition of a conserved disulphide bridge, and an apparently synonymous variant in exon 8 (c.1281C>G). The observation that half of the patient's F5 mRNA lacked the last 18 nucleotides of exon 8 prompted us to re-evaluate the c.1281C>G variant for its possible effects on splicing. Bioinformatics sequence analysis predicted that this transversion would activate a cryptic donor splice site and abolish an exonic splicing enhancer. Characterization in a F5 minigene model confirmed that the c.1281C>G variant was responsible for the patient's splicing defect, which could be partially corrected by a mutation-specific morpholino antisense oligonucleotide. The aberrantly spliced F5 mRNA, whose stability was similar to that of the normal mRNA, encoded a putative FV mutant lacking amino acids 427-432. Expression in COS-1 cells indicated that the mutant protein is poorly secreted and not functional. In conclusion, the c.1281C>G mutation, which was predicted to be translationally silent and hence neutral, causes FV deficiency by impairing pre-mRNA splicing. This finding underscores the importance of cDNA analysis for the correct assessment of exonic mutations. © 2014 John Wiley & Sons Ltd.
Simon, Mariella T.; Ng, Bobby G.; Friederich, Marisa W.; Wang, Raymond Y.; Boyer, Monica; Kircher, Martin; Collard, Renata; Buckingham, Kati J.; Chang, Richard; Shendure, Jay; Nickerson, Deborah A.; Bamshad, Michael J.; Van Hove, Johan L.K.; Freeze, Hudson H.; Abdenur, Jose E.
2017-01-01
We report the clinical, biochemical, and molecular findings in two brothers with encephalopathy and multi-systemic disease. Abnormal transferrin glycoforms were suggestive of a type I congenital disorder of glycosylation (CDG). While exome sequencing was negative for CDG related candidate genes, the testing revealed compound heterozygous mutations in the mitochondrial elongation factor G gene (GFM1). One of the mutations had been reported previously while the second, novel variant was found deep in intron 6, activating a cryptic splice site. Functional studies demonstrated decreased GFM1 protein levels, suggested disrupted assembly of mitochondrial complexes III and V and decreased activities of mitochondrial complexes I and IV, all indicating combined OXPHOS deficiency. PMID:28216230
Efficient computation of optimal oligo-RNA binding.
Hodas, Nathan O; Aalberts, Daniel P
2004-01-01
We present an algorithm that calculates the optimal binding conformation and free energy of two RNA molecules, one or both oligomeric. This algorithm has applications to modeling DNA microarrays, RNA splice-site recognitions and other antisense problems. Although other recent algorithms perform the same calculation in time proportional to the sum of the lengths cubed, O((N1 + N2)3), our oligomer binding algorithm, called bindigo, scales as the product of the sequence lengths, O(N1*N2). The algorithm performs well in practice with the aid of a heuristic for large asymmetric loops. To demonstrate its speed and utility, we use bindigo to investigate the binding proclivities of U1 snRNA to mRNA donor splice sites.
Chiu, Yung-Tuen; Wong, John K L; Choi, Shing-Wan; Sze, Karen M F; Ho, Daniel W H; Chan, Lo-Kong; Lee, Joyce M F; Man, Kwan; Cherny, Stacey; Yang, Wan-Ling; Wong, Chun-Ming; Sham, Pak-Chung; Ng, Irene O L
2016-06-01
Hepatitis B virus (HBV) integration is common in HBV-associated hepatocellular carcinoma (HCC) and may play an important pathogenic role through the production of chimeric HBV-human transcripts. We aimed to screen the transcriptome for HBV integrations in HCCs. Transcriptome sequencing was performed on paired HBV-associated HCCs and corresponding non-tumorous liver tissues to identify viral-human chimeric sites. Validation was further performed in an expanded cohort of human HCCs. Here we report the discovery of a novel pre-mRNA splicing mechanism in generating HBV-human chimeric protein. This mechanism was exemplified by the formation of a recurrent HBV-cyclin A2 (CCNA2) chimeric transcript (A2S), as detected in 12.5% (6 of 48) of HCC patients, but in none of the 22 non-HCC HBV-associated cirrhotic liver samples examined. Upon the integration of HBV into the intron of the CCNA2 gene, the mammalian splicing machinery utilized the foreign splice sites at 282nt. and 458nt. of the HBV genome to generate a pseudo-exon, forming an in-frame chimeric fusion with CCNA2. The A2S chimeric protein gained a non-degradable property and promoted cell cycle progression, demonstrating its potential oncogenic functions. A pre-mRNA splicing mechanism is involved in the formation of HBV-human chimeric proteins. This represents a novel and possibly common mechanism underlying the formation of HBV-human chimeric transcripts from intronically integrated HBV genome with functional impact. HBV is involved in the mammalian pre-mRNA splicing machinery in the generation of potential tumorigenic HBV-human chimeras. This study also provided insight on the impact of intronic HBV integration with the gain of splice sites in the development of HBV-associated HCC. Copyright © 2016 European Association for the Study of the Liver. Published by Elsevier B.V. All rights reserved.
Comprehensive Characterization of Swine Cardiac Troponin T Proteoforms by Top-Down Mass Spectrometry
NASA Astrophysics Data System (ADS)
Lin, Ziqing; Guo, Fang; Gregorich, Zachery R.; Sun, Ruixiang; Zhang, Han; Hu, Yang; Shanmuganayagam, Dhanansayan; Ge, Ying
2018-04-01
Cardiac troponin T (cTnT) regulates the Ca2+-mediated interaction between myosin thick filaments and actin thin filaments during cardiac contraction and relaxation. cTnT is released into the blood following injury, and increased serum levels of the protein are used clinically as a biomarker for myocardial infarction. Moreover, mutations in cTnT are causative in a number of familial cardiomyopathies. With the increasing use of large animal (swine) model to recapitulate human diseases, it is essential to characterize species-dependent protein sequence variants, alternative RNA splicing, and post-translational modifications (PTMs), but challenges remain due to the incomplete database and lack of validation of the predicted splicing isoforms. Herein, we integrated top-down mass spectrometry (MS) with online liquid chromatography (LC) and immunoaffinity purification to comprehensively characterize miniature swine cTnT proteoforms, including those arising from alternative RNA splicing and PTMs. A total of seven alternative splicing isoforms of cTnT were identified by LC/MS from swine left ventricular tissue, with each isoform containing un-phosphorylated and mono-phosphorylated proteoforms. The phosphorylation site was localized to Ser1 for the mono-phosphorylated proteoforms of cTnT1, 3, 4, and 6 by online MS/MS combining collisionally activated dissociation (CAD) and electron transfer dissociation (ETD). Offline MS/MS on Fourier-transform ion cyclotron resonance (FT-ICR) mass spectrometer with CAD and electron capture dissociation (ECD) was then utilized to achieve deep sequencing of mono-phosphorylated cTnT1 (35.2 kDa) with a high sequence coverage of 87%. Taken together, this study demonstrated the unique advantage of top-down MS in the comprehensive characterization of protein alternative splicing isoforms together with PTMs. [Figure not available: see fulltext.
Takeda, Jun-ichi; Suzuki, Yutaka; Nakao, Mitsuteru; Barrero, Roberto A.; Koyanagi, Kanako O.; Jin, Lihua; Motono, Chie; Hata, Hiroko; Isogai, Takao; Nagai, Keiichi; Otsuki, Tetsuji; Kuryshev, Vladimir; Shionyu, Masafumi; Yura, Kei; Go, Mitiko; Thierry-Mieg, Jean; Thierry-Mieg, Danielle; Wiemann, Stefan; Nomura, Nobuo; Sugano, Sumio; Gojobori, Takashi; Imanishi, Tadashi
2006-01-01
We report the first genome-wide identification and characterization of alternative splicing in human gene transcripts based on analysis of the full-length cDNAs. Applying both manual and computational analyses for 56 419 completely sequenced and precisely annotated full-length cDNAs selected for the H-Invitational human transcriptome annotation meetings, we identified 6877 alternative splicing genes with 18 297 different alternative splicing variants. A total of 37 670 exons were involved in these alternative splicing events. The encoded protein sequences were affected in 6005 of the 6877 genes. Notably, alternative splicing affected protein motifs in 3015 genes, subcellular localizations in 2982 genes and transmembrane domains in 1348 genes. We also identified interesting patterns of alternative splicing, in which two distinct genes seemed to be bridged, nested or having overlapping protein coding sequences (CDSs) of different reading frames (multiple CDS). In these cases, completely unrelated proteins are encoded by a single locus. Genome-wide annotations of alternative splicing, relying on full-length cDNAs, should lay firm groundwork for exploring in detail the diversification of protein function, which is mediated by the fast expanding universe of alternative splicing variants. PMID:16914452
Novel mutations in LRP6 highlight the role of WNT signaling in tooth agenesis
Ludwig, Kerstin U.; Sullivan, Robert; van Rooij, Iris A.L.M.; Thonissen, Michelle; Swinnen, Steven; Phan, Milien; Conte, Federica; Ishorst, Nina; Gilissen, Christian; RoaFuentes, Laury; van de Vorst, Maartje; Henkes, Arjen; Steehouwer, Marloes; van Beusekom, Ellen; Bloemen, Marjon; Vankeirsbilck, Bruno; Bergé, Stefaan; Hens, Greet; Schoenaers, Joseph; Poorten, Vincent Vander; Roosenboom, Jasmien; Verdonck, An; Devriendt, Koen; Roeleveldt, Nel; Jhangiani, Shalini N.; Vissers, Lisenka E.L.M.; Lupski, James R.; de Ligt, Joep; Von den Hoff, Johannes W.; Pfundt, Rolph; Brunner, Han G.; Zhou, Huiqing; Dixon, Jill; Mangold, Elisabeth; van Bokhoven, Hans; Dixon, Michael J.; Kleefstra, Tjitske
2016-01-01
Purpose Here we aimed to identify a novel genetic cause of tooth agenesis (TA) and/or orofacial clefting (OFC) by combining whole exome sequencing (WES) and targeted re-sequencing in a large cohort of TA and OFC patients. Methods WES was performed in two unrelated patients, one with severe TA and OFC and another with severe TA only. After identifying deleterious mutations in a gene encoding the low density lipoprotein receptor-related protein 6 (LRP6), all its exons were re-sequenced with molecular inversion probes, in 67 patients with TA, 1,072 patients with OFC and in 706 controls. Results We identified a frameshift (c.4594delG, p.Cys1532fs) and a canonical splice site mutation (c.3398-2A>C, p.?) in LRP6 respectively in the patient with TA and OFC, and in the patient with severe TA only. The targeted re-sequencing showed significant enrichment of unique LRP6 variants in TA patients, but not in nonsyndromic OFC. From the 5 variants in patients with TA, 2 affect the canonical splice site and 3 were missense variants; all variants segregated with the dominant phenotype and in 1 case the missense mutation occurred de novo. Conclusion Mutations in LRP6 cause tooth agenesis in man. PMID:26963285
Congenital analbuminemia caused by a novel aberrant splicing in the albumin gene.
Caridi, Gianluca; Dagnino, Monica; Erdeve, Omer; Di Duca, Marco; Yildiz, Duran; Alan, Serdar; Atasay, Begum; Arsan, Saadet; Campagnoli, Monica; Galliano, Monica; Minchiotti, Lorenzo
2014-01-01
Congenital analbuminemia is a rare autosomal recessive disorder manifested by the presence of a very low amount of circulating serum albumin. It is an allelic heterogeneous defect, caused by variety of mutations within the albumin gene in homozygous or compound heterozygous state. Herein we report the clinical and molecular characterization of a new case of congenital analbuminemia diagnosed in a female newborn of consanguineous (first degree cousins) parents from Ankara, Turkey, who presented with a low albumin concentration (< 8 g/L) and severe clinical symptoms. The albumin gene of the index case was screened by single-strand conformation polymorphism, heteroduplex analysis, and direct DNA sequencing. The effect of the splicing mutation was evaluated by examining the cDNA obtained by reverse transcriptase - polymerase chain reaction (RT-PCR) from the albumin mRNA extracted from proband's leukocytes. DNA sequencing revealed that the proband is homozygous, and both parents are heterozygous, for a novel G>A transition at position c.1652+1, the first base of intron 12, which inactivates the strongly conserved GT dinucleotide at the 5' splice site consensus sequence of this intron. The splicing defect results in the complete skipping of the preceding exon (exon 12) and in a frame-shift within exon 13 with a premature stop codon after the translation of three mutant amino acid residues. Our results confirm the clinical diagnosis of congenital analbuminemia in the proband and the inheritance of the trait and contribute to shed light on the molecular genetics of analbuminemia.
A novel NHS mutation causes Nance-Horan Syndrome in a Chinese family.
Tian, Qi; Li, Yunping; Kousar, Rizwana; Guo, Hui; Peng, Fenglan; Zheng, Yu; Yang, Xiaohua; Long, Zhigao; Tian, Runyi; Xia, Kun; Lin, Haiying; Pan, Qian
2017-01-07
Nance-Horan Syndrome (NHS) (OMIM: 302350) is a rare X-linked developmental disorder characterized by bilateral congenital cataracts, with occasional dental anomalies, characteristic dysmorphic features, brachymetacarpia and mental retardation. Carrier females exhibit similar manifestations that are less severe than in affected males. Here, we report a four-generation Chinese family with multiple affected individuals presenting Nance-Horan Syndrome. Whole-exome sequencing combined with RT-PCR and Sanger sequencing was used to search for a genetic cause underlying the disease phenotype. Whole-exome sequencing identified in all affected individuals of the family a novel donor splicing site mutation (NM_198270: c.1045 + 2T > A) in intron 4 of the gene NHS, which maps to chromosome Xp22.13. The identified mutation results in an RNA processing defect causing a 416-nucleotide addition to exon 4 of the mRNA transcript, likely producing a truncated NHS protein. The donor splicing site mutation NM_198270: c.1045 + 2T > A of the NHS gene is the causative mutation in this Nance-Horan Syndrome family. This research broadens the spectrum of NHS gene mutations, contributing to our understanding of the molecular genetics of NHS.
Ezquerra-Inchausti, Maitane; Barandika, Olatz; Anasagasti, Ander; Irigoyen, Cristina; López de Munain, Adolfo; Ruiz-Ederra, Javier
2017-01-01
Retinitis pigmentosa is the most frequent group of inherited retinal dystrophies. It is highly heterogeneous, with more than 80 disease-causing genes 27 of which are known to cause autosomal dominant RP (adRP), having been identified. In this study a total of 29 index cases were ascertained based on a family tree compatible with adRP. A custom panel of 31 adRP genes was analysed by targeted next-generation sequencing using the Ion PGM platform in combination with Sanger sequencing. This allowed us to detect putative disease-causing mutations in 14 out of the 29 (48.28%) families analysed. Remarkably, around 38% of all adRP cases analysed showed mutations affecting the splicing process, mainly due to mutations in genes coding for spliceosome factors (SNRNP200 and PRPF8) but also due to splice-site mutations in RHO. Twelve of the 14 mutations found had been reported previously and two were novel mutations found in PRPF8 in two unrelated patients. In conclusion, our results will lead to more accurate genetic counselling and will contribute to a better characterisation of the disease. In addition, they may have a therapeutic impact in the future given the large number of studies currently underway based on targeted RNA splicing for therapeutic purposes. PMID:28045043
New discoveries of old SON: a link between RNA splicing and cancer.
Hickey, Christopher J; Kim, Jung-Hyun; Ahn, Eun-Young Erin
2014-02-01
The SON protein is a ubiquitously expressed DNA- and RNA-binding protein primarily localized to nuclear speckles. Although several early studies implicated SON in DNA-binding, tumorigenesis and apoptosis, functional significance of this protein had not been recognized until recent studies discovered SON as a novel RNA splicing co-factor. During constitutive RNA splicing, SON ensures efficient intron removal from the transcripts containing suboptimal splice sites. Importantly, SON-mediated splicing is required for proper processing of selective transcripts related to cell cycle, microtubules, centrosome maintenance, and genome stability. Moreover, SON regulates alternative splicing of RNAs from the genes involved in apoptosis and epigenetic modification. In addition to the role in RNA splicing, SON has an ability to suppress transcriptional activation at certain promoter/enhancer DNA sequences. Considering the multiple SON target genes which are directly involved in cell proliferation, genome stability and chromatin modifications, SON is an emerging player in gene regulation during cancer development and progression. Here, we summarize available information from several early studies on SON, and highlight recent discoveries describing molecular mechanisms of SON-mediated gene regulation. We propose that our future effort on better understanding of diverse SON functions would reveal novel targets for cancer therapy. © 2013 Wiley Periodicals, Inc.
Gaur, R K; Valcárcel, J; Green, M R
1995-01-01
Splicing of pre-mRNAs occurs via a lariat intermediate in which an intronic adenosine, embedded within a branch point sequence, forms a 2',5'-phosphodiester bond (RNA branch) with the 5' end of the intron. How the branch point is recognized and activated remains largely unknown. Using site-specific photochemical cross-linking, we have identified two proteins that specifically interact with the branch point during the splicing reaction. U2AF65, an essential splicing factor that binds to the adjacent polypyrimidine tract, crosslinks to the branch point at the earliest stage of spliceosome formation in an ATP-independent manner. A novel 28-kDa protein, which is a constituent of the mature spliceosome, contacts the branch point after the first catalytic step. Our results indicate that the branch point is sequentially recognized by distinct splicing factors in the course of the splicing reaction. Images FIGURE 1 FIGURE 2 FIGURE 3 FIGURE 4 FIGURE 5 FIGURE 6 FIGURE 7 FIGURE 8 FIGURE 9 PMID:7493318
Vigentini, Ileana; De Lorenzis, Gabriella; Picozzi, Claudia; Imazio, Serena; Merico, Annamaria; Galafassi, Silvia; Piškur, Jure; Foschino, Roberto
2012-06-15
In enology, "Brett" character refers to the wine spoilage caused by the yeast Dekkera/Brettanomyces bruxellensis and its production of volatile phenolic off-flavours. However, the spoilage potential of this yeast is strain-dependent. Therefore, a rapid and reliable recognition at the strain level is a key point to avoid serious economic losses. The present work provides an operative tool to assess the genetic intraspecific variation in this species through the use of introns as molecular targets. Firstly, the available partial D./B. bruxellensis genome sequence was investigated in order to build primers annealing to introns 5' splice site sequence (ISS). This analysis allowed the detection of a non-random vocabulary flanking the site and, exploiting this feature, the creation of specific probes for strain discrimination. Secondly, the separation of the intron splice site PCR fragments was obtained throughout the set up of a capillary electrophoresis protocol, giving a 94% repeatability threshold in our experimental conditions. The comparison of results obtained with ISS-PCR/CE versus the ones performed by mtDNA RFLP revealed that the former protocol is more discriminating and allowed a reliable identification at strain level. Actually sixty D./B. bruxellensis isolates were recognised as unique strains, showing a level of similarity below 79% and confirming the high genetic polymorphism existing within the species. Two main clusters were grouped at similarity levels of about 46% and 47%, respectively, showing a poor correlation with the geographic area of isolation. Moreover, from the evolutionary point of view, the proposed technique could determine the frequency of the genome rearrangements that can occur in D./B. bruxellesis populations. Copyright © 2012 Elsevier B.V. All rights reserved.
Yamaguchi, Junya; Sato, Yuri; Kita, Mizuho; Nomura, Sachio; Yamamoto, Noriko; Kato, Yo; Ishikawa, Yuichi; Arai, Masami
2015-10-01
Lynch syndrome is an autosomal dominantly inherited disease that is characterized by a predisposition to cancers, mainly colorectal cancer. Germline mutations of DNA mismatch repair genes such as MLH1, MSH2, MSH6 and PMS2 have been described in patients with Lynch syndrome. Here, we report deletion of 2 bp in the splice donor site of the MLH1 exon 6 (c.545+4_545+5delCA) in a 48-year-old Japanese woman with Lynch syndrome. RT-PCR direct sequencing analysis revealed that this mutation led to an increase in the level of an MLH1 transcript in which exon 6 was skipped, and may cause a frameshift (p.E153FfsX8). Therefore, this mutation appears to be pathogenic and is responsible for Lynch syndrome. Additionally, analysis of the patient's tumor cells indicated microsatellite instability high phenotype and loss of the MLH1 and PMS2 proteins. To our knowledge, this is a germline splice site mutation of MLH1 that has not been reported previously. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Mammalian splicing factor SF1 interacts with SURP domains of U2 snRNP-associated proteins
Crisci, Angela; Raleff, Flore; Bagdiul, Ivona; Raabe, Monika; Urlaub, Henning; Rain, Jean-Christophe; Krämer, Angela
2015-01-01
Splicing factor 1 (SF1) recognizes the branch point sequence (BPS) at the 3′ splice site during the formation of early complex E, thereby pre-bulging the BPS adenosine, thought to facilitate subsequent base-pairing of the U2 snRNA with the BPS. The 65-kDa subunit of U2 snRNP auxiliary factor (U2AF65) interacts with SF1 and was shown to recruit the U2 snRNP to the spliceosome. Co-immunoprecipitation experiments of SF1-interacting proteins from HeLa cell extracts shown here are consistent with the presence of SF1 in early splicing complexes. Surprisingly almost all U2 snRNP proteins were found associated with SF1. Yeast two-hybrid screens identified two SURP domain-containing U2 snRNP proteins as partners of SF1. A short, evolutionarily conserved region of SF1 interacts with the SURP domains, stressing their role in protein–protein interactions. A reduction of A complex formation in SF1-depleted extracts could be rescued with recombinant SF1 containing the SURP-interaction domain, but only partial rescue was observed with SF1 lacking this sequence. Thus, SF1 can initially recruit the U2 snRNP to the spliceosome during E complex formation, whereas U2AF65 may stabilize the association of the U2 snRNP with the spliceosome at later times. In addition, these findings may have implications for alternative splicing decisions. PMID:26420826
SplicePlot: a utility for visualizing splicing quantitative trait loci.
Wu, Eric; Nance, Tracy; Montgomery, Stephen B
2014-04-01
RNA sequencing has provided unprecedented resolution of alternative splicing and splicing quantitative trait loci (sQTL). However, there are few tools available for visualizing the genotype-dependent effects of splicing at a population level. SplicePlot is a simple command line utility that produces intuitive visualization of sQTLs and their effects. SplicePlot takes mapped RNA sequencing reads in BAM format and genotype data in VCF format as input and outputs publication-quality Sashimi plots, hive plots and structure plots, enabling better investigation and understanding of the role of genetics on alternative splicing and transcript structure. Source code and detailed documentation are available at http://montgomerylab.stanford.edu/spliceplot/index.html under Resources and at Github. SplicePlot is implemented in Python and is supported on Linux and Mac OS. A VirtualBox virtual machine running Ubuntu with SplicePlot already installed is also available.
Interplay between DMD Point Mutations and Splicing Signals in Dystrophinopathy Phenotypes
Juan-Mateu, Jonàs; González-Quereda, Lidia; Rodríguez, Maria José; Verdura, Edgard; Lázaro, Kira; Jou, Cristina; Nascimento, Andrés; Jiménez-Mallebrera, Cecilia; Colomer, Jaume; Monges, Soledad; Lubieniecki, Fabiana; Foncuberta, Maria Eugenia; Pascual-Pascual, Samuel Ignacio; Molano, Jesús; Baiget, Montserrat; Gallano, Pia
2013-01-01
DMD nonsense and frameshift mutations lead to severe Duchenne muscular dystrophy while in-frame mutations lead to milder Becker muscular dystrophy. Exceptions are found in 10% of cases and the production of alternatively spliced transcripts is considered a key modifier of disease severity. Several exonic mutations have been shown to induce exon-skipping, while splice site mutations result in exon-skipping or activation of cryptic splice sites. However, factors determining the splicing pathway are still unclear. Point mutations provide valuable information regarding the regulation of pre-mRNA splicing and elements defining exon identity in the DMD gene. Here we provide a comprehensive analysis of 98 point mutations related to clinical phenotype and their effect on muscle mRNA and dystrophin expression. Aberrant splicing was found in 27 mutations due to alteration of splice sites or splicing regulatory elements. Bioinformatics analysis was performed to test the ability of the available algorithms to predict consequences on mRNA and to investigate the major factors that determine the splicing pathway in mutations affecting splicing signals. Our findings suggest that the splicing pathway is highly dependent on the interplay between splice site strength and density of regulatory elements. PMID:23536893
FUCHS-towards full circular RNA characterization using RNAseq.
Metge, Franziska; Czaja-Hasse, Lisa F; Reinhardt, Richard; Dieterich, Chistoph
2017-01-01
Circular RNAs (circRNAs) belong to a recently re-discovered species of RNA that emerge during RNA maturation through a process called back-splicing. A downstream 5' splice site is linked to an upstream 3' splice site to form a circular transcript instead of a canonical linear transcript. Recent advances in next-generation sequencing (NGS) have brought circRNAs back into the focus of many scientists. Since then, several studies reported that circRNAs are differentially expressed across tissue types and developmental stages, implying that they are actively regulated and not merely a by-product of splicing. Though functional studies have shown that some circRNAs could act as miRNA-sponges, the function of most circRNAs remains unknown. To expand our understanding of possible roles of circular RNAs, we propose a new pipeline that could fully characterizes candidate circRNA structure from RNAseq data-FUCHS: FU ll CH aracterization of circular RNA using RNA- S equencing. Currently, most computational prediction pipelines use back-spliced reads to identify circular RNAs. FUCHS extends this concept by considering all RNA-seq information from long reads (typically >150 bp) to learn more about the exon coverage, the number of double break point fragments, the different circular isoforms arising from one host-gene, and the alternatively spliced exons within the same circRNA boundaries. This new knowledge will enable the user to carry out differential motif enrichment and miRNA seed analysis to determine potential regulators during circRNA biogenesis. FUCHS is an easy-to-use Python based pipeline that contributes a new aspect to the circRNA research.
Shozu, M; Zhao, Y; Bulun, S E; Simpson, E R
1998-04-01
The expression of aromatase is regulated in a tissue-specific fashion through alternative use of multiple promoter-specific first exons. To date, eight different first exons have been reported in human aromatase, namely I.1., I.2, I.3. I.4, I.5, PII, 2a, and 1f. Recently, we have found a new putative exon I in a RACE-generated library of THP-1 cells and have conducted studies to characterize this new exon I. We confirmed that the constructs containing -1552/+17 or less flanking sequence of this exon function as a promoter in THP-1 cells, JEG-3 cells and osteoblast-like cells obtained from a human fetus. Results of transfection assays using a series of deletion constructs and mutation constructs indicate that a 1-bp mismatch of the consensus TATA-like box (TTTAAT) and the consensus sequence of the initiator site, which is located 45 bp downstream of the putative TATA box, were functioning cooperatively as a core promoter. The putative transcription site was confirmed by the results of RT-PCR southern blot analysis. We examined the regulation and the expression of this exon, I.6, in several human cells and tissues by RT-PCR Southern blot analysis. THP-1 cells (mononuclear leukemic origin) and JEG-3 cells (choriocarcinoma origin) expressed exon I.6 in serum-free media. The level of expression was increased by serum and phorbol myristyl acetate (PMA) in both cell lines. Adipose stromal cells also expressed exon I.6 in the presence of PMA. In fetal osteoblasts, the expression of exon I.6 was increased most effectively by serum and less so by dexamethasone (DEX) + IL-1beta and DEX + IL-11, whereas induction by serum was suppressed by the addition of DEX. The level of expression was low in granulosa cells in culture and did not change with forskolin. On the other hand, dibutyryl cAMP suppressed PMA-stimulated expression of exon I.6 in THP-1 cells and adipose stromal cells. This result supports the hypothesis that the expression of exon I.6 is regulated mainly via an AP-1 binding site that is found upstream of the initiator site of the promoter region. Expression of exon I.6-specific transcripts was examined in several human tissues. Testis and bone obtained from normal adults expressed exon I.6. Testicular tumor and hepatic carcinoma expressed high levels of exon I.6, whereas granulosa cell tumor did not. Fetal liver and bone also showed a significant level of exon I.6 expression, but not so much as testicular tumor and hepatic tumor. Several splicing variants of exon I.6 were detected especially in THP-1 and JEG-3 cells, and to a lesser extent in primary cultures and tissue samples. These variants were identified as an unspliced form, a form spliced at the end of exon I.4, a form spliced at the end of exon I.3 (truncated) and a form spliced 220 bp downstream of the 3' end of exon I.6. The last variant revealed a new splicing site. Because most of the splicing variants contain the sequence specific for exon I.3, RT-PCR specific for exon I.3 can coamplify these splicing variants of exon I.6 transcripts. These results suggests that it is necessary to examine the expression of I.6 in tissues that are known to express exon I.3 such as breast adipose tissue, in which promoter usage of exon I of the aromatase gene switches from exon I.4 to I.3 in the course of malignant transformation.
Awan, Ali R; Manfredo, Amanda; Pleiss, Jeffrey A
2013-07-30
Alternative splicing is a potent regulator of gene expression that vastly increases proteomic diversity in multicellular eukaryotes and is associated with organismal complexity. Although alternative splicing is widespread in vertebrates, little is known about the evolutionary origins of this process, in part because of the absence of phylogenetically conserved events that cross major eukaryotic clades. Here we describe a lariat-sequencing approach, which offers high sensitivity for detecting splicing events, and its application to the unicellular fungus, Schizosaccharomyces pombe, an organism that shares many of the hallmarks of alternative splicing in mammalian systems but for which no previous examples of exon-skipping had been demonstrated. Over 200 previously unannotated splicing events were identified, including examples of regulated alternative splicing. Remarkably, an evolutionary analysis of four of the exons identified here as subject to skipping in S. pombe reveals high sequence conservation and perfect length conservation with their homologs in scores of plants, animals, and fungi. Moreover, alternative splicing of two of these exons have been documented in multiple vertebrate organisms, making these the first demonstrations of identical alternative-splicing patterns in species that are separated by over 1 billion y of evolution.
Li, Wencheng; You, Bei; Hoque, Mainul; Zheng, Dinghai; Luo, Wenting; Ji, Zhe; Park, Ji Yeon; Gunderson, Samuel I.; Kalsotra, Auinash; Manley, James L.; Tian, Bin
2015-01-01
Alternative cleavage and polyadenylation (APA) results in mRNA isoforms containing different 3’ untranslated regions (3’UTRs) and/or coding sequences. How core cleavage/polyadenylation (C/P) factors regulate APA is not well understood. Using siRNA knockdown coupled with deep sequencing, we found that several C/P factors can play significant roles in 3’UTR-APA. Whereas Pcf11 and Fip1 enhance usage of proximal poly(A) sites (pAs), CFI-25/68, PABPN1 and PABPC1 promote usage of distal pAs. Strong cis element biases were found for pAs regulated by CFI-25/68 or Fip1, and the distance between pAs plays an important role in APA regulation. In addition, intronic pAs are substantially regulated by splicing factors, with U1 mostly inhibiting C/P events in introns near the 5’ end of gene and U2 suppressing those in introns with features for efficient splicing. Furthermore, PABPN1 inhibits expression of transcripts with pAs near the transcription start site (TSS), a property possibly related to its role in RNA degradation. Finally, we found that groups of APA events regulated by C/P factors are also modulated in cell differentiation and development with distinct trends. Together, our results support an APA code where an APA event in a given cellular context is regulated by a number of parameters, including relative location to the TSS, splicing context, distance between competing pAs, surrounding cis elements and concentrations of core C/P factors. PMID:25906188
2014-01-01
Background Alternative splicing is an important process in higher eukaryotes that allows obtaining several transcripts from one gene. A specific case of alternative splicing is mutually exclusive splicing, in which exactly one exon out of a cluster of neighbouring exons is spliced into the mature transcript. Recently, a new algorithm for the prediction of these exons has been developed based on the preconditions that the exons of the cluster have similar lengths, sequence homology, and conserved splice sites, and that they are translated in the same reading frame. Description In this contribution we introduce Kassiopeia, a database and web application for the generation, storage, and presentation of genome-wide analyses of mutually exclusive exomes. Currently, Kassiopeia provides access to the mutually exclusive exomes of twelve Drosophila species, the thale cress Arabidopsis thaliana, the flatworm Caenorhabditis elegans, and human. Mutually exclusive spliced exons (MXEs) were predicted based on gene reconstructions from Scipio. Based on the standard prediction values, with which 83.5% of the annotated MXEs of Drosophila melanogaster were reconstructed, the exomes contain surprisingly more MXEs than previously supposed and identified. The user can search Kassiopeia using BLAST or browse the genes of each species optionally adjusting the parameters used for the prediction to reveal more divergent or only very similar exon candidates. Conclusions We developed a pipeline to predict MXEs in the genomes of several model organisms and a web interface, Kassiopeia, for their visualization. For each gene Kassiopeia provides a comprehensive gene structure scheme, the sequences and predicted secondary structures of the MXEs, and, if available, further evidence for MXE candidates from cDNA/EST data, predictions of MXEs in homologous genes of closely related species, and RNA secondary structure predictions. Kassiopeia can be accessed at http://www.motorprotein.de/kassiopeia. PMID:24507667
Co-evolution of SNF spliceosomal proteins with their RNA targets in trans-splicing nematodes.
Strange, Rex Meade; Russelburg, L Peyton; Delaney, Kimberly J
2016-08-01
Although the mechanism of pre-mRNA splicing has been well characterized, the evolution of spliceosomal proteins is poorly understood. The U1A/U2B″/SNF family (hereafter referred to as the SNF family) of RNA binding spliceosomal proteins participates in both the U1 and U2 small interacting nuclear ribonucleoproteins (snRNPs). The highly constrained nature of this system has inhibited an analysis of co-evolutionary trends between the proteins and their RNA binding targets. Here we report accelerated sequence evolution in the SNF protein family in Phylum Nematoda, which has allowed an analysis of protein:RNA co-evolution. In a comparison of SNF genes from ecdysozoan species, we found a correlation between trans-splicing species (nematodes) and increased phylogenetic branch lengths of the SNF protein family, with respect to their sister clade Arthropoda. In particular, we found that nematodes (~70-80 % of pre-mRNAs are trans-spliced) have experienced higher rates of SNF sequence evolution than arthropods (predominantly cis-spliced) at both the nucleotide and amino acid levels. Interestingly, this increased evolutionary rate correlates with the reliance on trans-splicing by nematodes, which would alter the role of the SNF family of spliceosomal proteins. We mapped amino acid substitutions to functionally important regions of the SNF protein, specifically to sites that are predicted to disrupt protein:RNA and protein:protein interactions. Finally, we investigated SNF's RNA targets: the U1 and U2 snRNAs. Both are more divergent in nematodes than arthropods, suggesting the RNAs have co-evolved with SNF in order to maintain the necessarily high affinity interaction that has been characterized in other species.
Expression of Kir7.1 and a Novel Kir7.1 Splice Variant in Native Human Retinal Pigment Epithelium
Yang, Dongli; Swaminathan, Anuradha; Zhang, Xiaoming; Hughes, Bret A.
2009-01-01
Previous studies on bovine retinal pigment epithelium (RPE) established that Kir7.1 channels compose this epithelium’s large apical membrane K+ conductance. The purpose of this study was to determine whether Kir7.1 and potential Kir7.1 splice variants are expressed in native adult human RPE and, if so, to determine their function and how they are generated. RT-PCR analysis indicated that human RPE expresses full-length Kir7.1 and a novel Kir7.1 splice variant, designated Kir7.1S. Analysis of the human Kir7.1 gene (KCNJ13) organization revealed that it contains 3 exons, 2 introns, and a novel alternative 5′ splice site in exon 2. In human RPE, the alternative usage of two competing 5′ splice sites in exon 2 gives rise to transcripts encoding full-length Kir7.1 and Kir7.1S, which is predicted to encode a truncated protein. Real-time PCR indicated that Kir7.1 transcript is nearly as abundant as GAPDH mRNA in human RPE whereas Kir7.1S transcript expression is 4-fold lower. Western blot analysis showed that the splice variant is translated in Xenopus oocytes injected with Kir7.1S cRNA and revealed the expression of full-length Kir7.1 but not Kir7.1S in adult human RPE. Co-expression of Kir7.1 with Kir7.1S in Xenopus oocytes had no effect on either the kinetics or amplitude of Kir7.1 currents. This study confirms the expression of Kir7.1 in human RPE, identifies a Kir7.1 splice variant resulting in predicted changes in protein sequence, and indicates that there no functional interaction between this splice variant and full-length Kir7.1. PMID:18035352
Muddukrishna, Bhavana; Jackson, Christopher A; Yu, Michael C
2017-06-01
Protein arginine methylation occurs on spliceosomal components and spliceosome-associated proteins, but how this modification contributes to their function in pre-mRNA splicing remains sparse. Here we provide evidence that protein arginine methylation of the yeast SR-/hnRNP-like protein Npl3 plays a role in facilitating efficient splicing of the SUS1 intron that harbors a non-consensus 5' splice site and branch site. In yeast cells lacking the major protein arginine methyltransferase HMT1, we observed a change in the co-transcriptional recruitment of the U1 snRNP subunit Snp1 and Npl3 to pre-mRNAs harboring both consensus (ECM33 and ASC1) and non-consensus (SUS1) 5' splice site and branch site. Using an Npl3 mutant that phenocopies wild-type Npl3 when expressed in Δhmt1 cells, we showed that the arginine methylation of Npl3 is responsible for this. Examination of pre-mRNA splicing efficiency in these mutants reveals the requirement of Npl3 methylation for the efficient splicing of SUS1 intron 1, but not of ECM33 or ASC1. Changing the 5' splice site and branch site in SUS1 intron 1 to the consensus form restored splicing efficiency in an Hmt1-independent manner. Results from biochemical studies show that methylation of Npl3 promotes its optimal association with the U1 snRNP through its association with the U1 snRNP subunit Mud1. Based on these data, we propose a model in which Hmt1, via arginine methylation of Npl3, facilitates U1 snRNP engagement with the pre-mRNA to promote usage of non-consensus splice sites by the splicing machinery. Published by Elsevier B.V.
hnRNP L controls HPV16 RNA polyadenylation and splicing in an Akt kinase-dependent manner
Kajitani, Naoko; Glahder, Jacob; Wu, Chengjun; Yu, Haoran; Nilsson, Kersti
2017-01-01
Abstract Inhibition of the Akt kinase activates HPV16 late gene expression by reducing HPV16 early polyadenylation and by activating HPV16 late L1 mRNA splicing. We identified ‘hot spots’ for RNA binding proteins at the early polyA signal and at splice sites on HPV16 late mRNAs. We observed that hnRNP L was associated with sequences at all HPV16 late splice sites and at the early polyA signal. Akt kinase inhibition resulted in hnRNP L dephosphorylation and reduced association of hnRNP L with HPV16 mRNAs. This was accompanied by an increased binding of U2AF65 and Sam68 to HPV16 mRNAs. Furthermore, siRNA knock-down of hnRNP L or Akt induced HPV16 gene expression. Treatment of HPV16 immortalized keratinocytes with Akt kinase inhibitor reduced hnRNP L binding to HPV16 mRNAs and induced HPV16 L1 mRNA production. Finally, deletion of the hnRNP L binding sites in HPV16 subgenomic expression plasmids resulted in activation of HPV16 late gene expression. In conclusion, the Akt kinase inhibits HPV16 late gene expression at the level of RNA processing by controlling the RNA-binding protein hnRNP L. We speculate that Akt kinase activity upholds an intracellular milieu that favours HPV16 early gene expression and suppresses HPV16 late gene expression. PMID:28934469
Fukao, T; Yamaguchi, S; Wakazono, A; Orii, T; Hoganson, G; Hashimoto, T
1994-01-01
We identified a novel exonic mutation which causes exon skipping in the mitochondrial acetoacetyl-CoA thiolase (T2) gene from a girl with T2 deficiency (GK07). GK07 is a compound heterozygote; the maternal allele has a novel G to T transversion at position 1136 causing Gly379 to Val substitution (G379V) of the T2 precursor. In case of in vivo expression analysis, cells transfected with this mutant cDNA showed no evidence of restored T2 activity. The paternal allele was associated with exon 8 skipping at the cDNA level. At the gene level, a C to T transition causing Gln272 to termination codon (Q272STOP) was identified within exon 8, 13 bp from the 5' splice site of intron 8 in the paternal allele. The mRNA with Q272STOP could not be detected in GK07 fibroblasts, presumably because pre-mRNA with Q272STOP was unstable because of the premature termination. In vivo splicing experiments revealed that the exonic mutation caused partial skipping of exon 8. This substitution was thought to alter the secondary structure of T2 pre-mRNA around exon 8 and thus impede normal splicing. The role of exon sequences in the splicing mechanism is indicated by the exon skipping which occurred with an exonic mutation. Images PMID:7907600
Congenital analbuminemia caused by a novel aberrant splicing in the albumin gene
Caridi, Gianluca; Dagnino, Monica; Erdeve, Omer; Di Duca, Marco; Yildiz, Duran; Alan, Serdar; Atasay, Begum; Arsan, Saadet; Campagnoli, Monica; Galliano, Monica; Minchiotti, Lorenzo
2014-01-01
Introduction: Congenital analbuminemia is a rare autosomal recessive disorder manifested by the presence of a very low amount of circulating serum albumin. It is an allelic heterogeneous defect, caused by variety of mutations within the albumin gene in homozygous or compound heterozygous state. Herein we report the clinical and molecular characterization of a new case of congenital analbuminemia diagnosed in a female newborn of consanguineous (first degree cousins) parents from Ankara, Turkey, who presented with a low albumin concentration (< 8 g/L) and severe clinical symptoms. Materials and methods: The albumin gene of the index case was screened by single-strand conformation polymorphism, heteroduplex analysis, and direct DNA sequencing. The effect of the splicing mutation was evaluated by examining the cDNA obtained by reverse transcriptase - polymerase chain reaction (RT-PCR) from the albumin mRNA extracted from proband’s leukocytes. Results: DNA sequencing revealed that the proband is homozygous, and both parents are heterozygous, for a novel G>A transition at position c.1652+1, the first base of intron 12, which inactivates the strongly conserved GT dinucleotide at the 5′ splice site consensus sequence of this intron. The splicing defect results in the complete skipping of the preceding exon (exon 12) and in a frame-shift within exon 13 with a premature stop codon after the translation of three mutant amino acid residues. Conclusions: Our results confirm the clinical diagnosis of congenital analbuminemia in the proband and the inheritance of the trait and contribute to shed light on the molecular genetics of analbuminemia. PMID:24627724
Cloning and characterization of the gene encoding IMP dehydrogenase from Arabidopsis thaliana.
Collart, F R; Osipiuk, J; Trent, J; Olsen, G J; Huberman, E
1996-10-03
We have cloned and characterized the gene encoding inosine monophosphate dehydrogenase (IMPDH) from Arabidopsis thaliana (At). The transcription unit of the At gene spans approximately 1900 bp and specifies a protein of 503 amino acids with a calculated relative molecular mass (M(r)) of 54,190. The gene is comprised of a minimum of four introns and five exons with all donor and acceptor splice sequences conforming to previously proposed consensus sequences. The deduced IMPDH amino-acid sequence from At shows a remarkable similarity to other eukaryotic IMPDH sequences, with a 48% identity to human Type II enzyme. Allowing for conservative substitutions, the enzyme is 69% similar to human Type II IMPDH. The putative active-site sequence of At IMPDH conforms to the IMP dehydrogenase/guanosine monophosphate reductase motif and contains an essential active-site cysteine residue.
Ben Rebeh, Imen; Morinière, Madeleine; Ayadi, Leila; Benzina, Zeineb; Charfedine, Ilhem; Feki, Jamel; Ayadi, Hammadi; Ghorbel, Abdelmonem; Baklouti, Faouzi; Masmoudi, Saber
2010-09-30
Recessive mutations of the myosin VIIA (MYO7A) gene are reported to be responsible for both a deaf-blindness syndrome (Usher type 1B [USH1B] and atypical Usher syndrome) and nonsyndromic hearing loss (HL; Deafness, Neurosensory, Autosomal Recessive 2 [DFNB2]). The existence of DFNB2 is controversial, and often there is no relationship between the type and location of the MYO7A mutations corresponding to the USH1B and DFNB2 phenotype. We investigated the molecular determinant of a mild form of retinopathy in association with a subtle splicing modulation of MYO7A mRNA. Affected members underwent detailed audiologic and ocular characterization. DNA samples from family members were genotyped with polymorphic microsatellite markers. Sequencing of MYO7A was performed. Endogenous lymphoid RNA analysis and a splicing minigene assay were used to study the effect of the c.1935G>A mutation. Funduscopy showed mild retinitis pigmentosa in adults with HL. Microsatellite analysis showed linkage to markers in the region on chromosome 11q13.5. Sequencing of MYO7A revealed a mutation in the last nucleotide of exon 16 (c.1935G>A), which corresponds to a substitution of a methionine to an isoleucine residue at amino acid 645 of the myosin VIIA. However, structural prediction of the molecular model of myosin VIIA shows that this amino acid replacement induces only minor structural changes in the immediate environment of the mutation and thus does not alter the overall native structure. We found that, although predominantly included in mature mRNA, exon 16 is in fact alternatively spliced in control cells and that the mutation at the very last position is associated with a switch toward a predominant exclusion of that exon. This observation was further supported using a splicing minigene transfection assay; the c.1935G>A mutation was found to trigger a partial impairment of the adjacent donor splice site, suggesting that the unique change at the last position of the exon is responsible for the enhanced exon exclusion in this family. This study shows how an exonic mutation that weakens the 5' splice site enhances a minor alternative splicing without abolishing a complete exclusion of the exon and therefore causes a less severe retinitis pigmentosa than the USH1B-associated alleles. It would be interesting to examine a possible correlation between intrafamilial phenotypic variability and the subtle variation in exon 16 inclusion, probably related to genetic background specificities.
Factors influencing alternative splice site utilization in vivo.
Fu, X Y; Manley, J L
1987-01-01
To study factors that influence the choice of alternative pre-mRNA splicing pathways, we introduced plasmids expressing either wild-type or mutated simian virus 40 (SV40) early regions into tissue culture cells and then measured the quantities of small-t and large-T RNAs produced. One important element controlling splice site selection was found to be the size of the intron removed in the production of small-t mRNA; expansion of this intron (from 66 to 77 or more nucleotides) resulted in a substantial increase in the amount of small-t mRNA produced relative to large-T mRNA. This suggests that in the normal course of SV40 early pre-mRNA processing, large-T splicing is at a competitive advantage relative to small-t splicing because of the small size of the latter intron. Several additional features of the pre-mRNA that can influence splice site selection were also identified by analyzing the effects of mutations containing splice site duplications. These include the strengths of competing 5' splice sites and the relative positions of splice sites in the pre-mRNA. Finally, we showed that the ratio of small-t to large-T mRNA was 10 to 15-fold greater in human 293 cells than in HeLa cells or other mammalian cell types. These results suggest the existence of cell-specific trans-acting factors that can dramatically alter the pattern of splice site selection in a pre-mRNA. Images PMID:3029566
Identification of an Intronic Splicing Enhancer Essential for the Inclusion of FGFR2 Exon IIIc*S⃞
Seth, Puneet; Miller, Heather B.; Lasda, Erika L.; Pearson, James L.; Garcia-Blanco, Mariano A.
2008-01-01
The ligand specificity of fibroblast growth factor receptor 2 (FGFR2) is determined by the alternative splicing of exons 8 (IIIb) or 9 (IIIc). Exon IIIb is included in epithelial cells, whereas exon IIIc is included in mesenchymal cells. Although a number of cis elements and trans factors have been identified that play a role in exon IIIb inclusion in epithelium, little is known about the activation of exon IIIc in mesenchyme. We report here the identification of a splicing enhancer required for IIIc inclusion. This 24-nucleotide (nt) downstream intronic splicing enhancer (DISE) is located within intron 9 immediately downstream of exon IIIc. DISE was able to activate the inclusion of heterologous exons rat FGFR2 IIIb and human β-globin exon 2 in cell lines from different tissues and species and also in HeLa cell nuclear extracts in vitro. DISE was capable of replacing the intronic activator sequence 1 (IAS1), a known IIIb splicing enhancer and vice versa. This fact, together with the requirement for DISE to be close to the 5′-splice site and the ability of DISE to promote binding of U1 snRNP, suggested that IAS1 and DISE belong to the same class of cis-acting elements. PMID:18256031
Brett, Maggie; McPherson, John; Zang, Zhi Jiang; Lai, Angeline; Tan, Ee-Shien; Ng, Ivy; Ong, Lai-Choo; Cham, Breana; Tan, Patrick; Rozen, Steve; Tan, Ene-Choo
2014-01-01
Developmental delay and/or intellectual disability (DD/ID) affects 1–3% of all children. At least half of these are thought to have a genetic etiology. Recent studies have shown that massively parallel sequencing (MPS) using a targeted gene panel is particularly suited for diagnostic testing for genetically heterogeneous conditions. We report on our experiences with using massively parallel sequencing of a targeted gene panel of 355 genes for investigating the genetic etiology of eight patients with a wide range of phenotypes including DD/ID, congenital anomalies and/or autism spectrum disorder. Targeted sequence enrichment was performed using the Agilent SureSelect Target Enrichment Kit and sequenced on the Illumina HiSeq2000 using paired-end reads. For all eight patients, 81–84% of the targeted regions achieved read depths of at least 20×, with average read depths overlapping targets ranging from 322× to 798×. Causative variants were successfully identified in two of the eight patients: a nonsense mutation in the ATRX gene and a canonical splice site mutation in the L1CAM gene. In a third patient, a canonical splice site variant in the USP9X gene could likely explain all or some of her clinical phenotypes. These results confirm the value of targeted MPS for investigating DD/ID in children for diagnostic purposes. However, targeted gene MPS was less likely to provide a genetic diagnosis for children whose phenotype includes autism. PMID:24690944
Molecular evaluation of five cardiac genes in Doberman Pinschers with dilated cardiomyopathy.
Meurs, Kathryn M; Hendrix, Kristina P; Norgard, Michelle M
2008-08-01
To sequence the exonic and splice site regions of 5 cardiac genes associated with the human form of familial dilated cardiomyopathy (DCM) in Doberman Pinschers with DCM and to identify a causative mutation. 5 unrelated Doberman Pinschers with DCM and 2 unaffected Labrador Retrievers (control dogs). Exonic and splice site regions of the 5 genes encoding the cardiac proteins troponin C, lamin A/C, cysteine- and glycine-rich protein 3, cardiac troponin T, and the beta-myosin heavy chain were sequenced. Sequences were compared for nucleotide changes between affected dogs and the published canine sequences and 2 control dogs. Base pair changes were considered to be causative for DCM if they were present in an affected dog but not in the control dogs or published sequences and if they involved a conserved amino acid and changed that amino acid to a different polarity, acid-base status, or structure. A causative mutation for DCM in Doberman Pinschers was not identified, although single nucleotide polymorphisms were detected in some dogs in the cysteine- and glycine-rich protein 3, beta-myosin heavy chain, and troponin T genes. Mutations in 5 of the cardiac genes associated with the development of DCM in humans did not appear to be causative for DCM in Doberman Pinschers. Continued evaluation of additional candidate genes or a focused approach with an association analysis is warranted to elucidate the molecular cause of this important cardiac disease in Doberman Pinschers.
Kuroyanagi, Hidehito; Watanabe, Yohei; Suzuki, Yutaka; Hagiwara, Masatoshi
2013-01-01
A large fraction of protein-coding genes in metazoans undergo alternative pre-mRNA splicing in tissue- or cell-type-specific manners. Recent genome-wide approaches have identified many putative-binding sites for some of tissue-specific trans-acting splicing regulators. However, the mechanisms of splicing regulation in vivo remain largely unknown. To elucidate the modes of splicing regulation by the neuron-specific CELF family RNA-binding protein UNC-75 in Caenorhabditis elegans, we performed deep sequencing of poly(A)+ RNAs from the unc-75(+)- and unc-75-mutant worms and identified more than 20 cassette and mutually exclusive exons repressed or activated by UNC-75. Motif searches revealed that (G/U)UGUUGUG stretches are enriched in the upstream and downstream introns of the UNC-75-repressed and -activated exons, respectively. Recombinant UNC-75 protein specifically binds to RNA fragments carrying the (G/U)UGUUGUG stretches in vitro. Bi-chromatic fluorescence alternative splicing reporters revealed that the UNC-75-target exons are regulated in tissue-specific and (G/U)UGUUGUG element-dependent manners in vivo. The unc-75 mutation affected the splicing reporter expression specifically in the nervous system. These results indicate that UNC-75 regulates alternative splicing of its target exons in neuron-specific and position-dependent manners through the (G/U)UGUUGUG elements in C. elegans. This study thus reveals the repertoire of target events for the CELF family in the living organism. PMID:23416545
Zhang, Yanju; Lameijer, Eric-Wubbo; 't Hoen, Peter A C; Ning, Zemin; Slagboom, P Eline; Ye, Kai
2012-02-15
RNA-seq is a powerful technology for the study of transcriptome profiles that uses deep-sequencing technologies. Moreover, it may be used for cellular phenotyping and help establishing the etiology of diseases characterized by abnormal splicing patterns. In RNA-Seq, the exact nature of splicing events is buried in the reads that span exon-exon boundaries. The accurate and efficient mapping of these reads to the reference genome is a major challenge. We developed PASSion, a pattern growth algorithm-based pipeline for splice site detection in paired-end RNA-Seq reads. Comparing the performance of PASSion to three existing RNA-Seq analysis pipelines, TopHat, MapSplice and HMMSplicer, revealed that PASSion is competitive with these packages. Moreover, the performance of PASSion is not affected by read length and coverage. It performs better than the other three approaches when detecting junctions in highly abundant transcripts. PASSion has the ability to detect junctions that do not have known splicing motifs, which cannot be found by the other tools. Of the two public RNA-Seq datasets, PASSion predicted ≈ 137,000 and 173,000 splicing events, of which on average 82 are known junctions annotated in the Ensembl transcript database and 18% are novel. In addition, our package can discover differential and shared splicing patterns among multiple samples. The code and utilities can be freely downloaded from https://trac.nbic.nl/passion and ftp://ftp.sanger.ac.uk/pub/zn1/passion.
Hamid, Fursham M; Makeyev, Eugene V
2014-11-01
Alternative splicing (AS) provides a potent mechanism for increasing protein diversity and modulating gene expression levels. How alternate splice sites are selected by the splicing machinery and how AS is integrated into gene regulation networks remain important questions of eukaryotic biology. Here we report that polypyrimidine tract-binding protein 1 (Ptbp1/PTB/hnRNP-I) controls alternate 5' and 3' splice site (5'ss and 3'ss) usage in a large set of mammalian transcripts. A top scoring event identified by our analysis was the choice between competing upstream and downstream 5'ss (u5'ss and d5'ss) in the exon 18 of the Hps1 gene. Hps1 is essential for proper biogenesis of lysosome-related organelles and loss of its function leads to a disease called type 1 Hermansky-Pudlak Syndrome (HPS). We show that Ptbp1 promotes preferential utilization of the u5'ss giving rise to stable mRNAs encoding a full-length Hps1 protein, whereas bias towards d5'ss triggered by Ptbp1 down-regulation generates transcripts susceptible to nonsense-mediated decay (NMD). We further demonstrate that Ptbp1 binds to pyrimidine-rich sequences between the u5'ss and d5'ss and activates the former site rather than repressing the latter. Consistent with this mechanism, u5'ss is intrinsically weaker than d5'ss, with a similar tendency observed for other genes with Ptbp1-induced u5'ss bias. Interestingly, the brain-enriched Ptbp1 paralog Ptbp2/nPTB/brPTB stimulated the u5'ss utilization but with a considerably lower efficiency than Ptbp1. This may account for the tight correlation between Hps1 with Ptbp1 expression levels observed across mammalian tissues. More generally, these data expand our understanding of AS regulation and uncover a post-transcriptional strategy ensuring co-expression of a subordinate gene with its master regulator through an AS-NMD tracking mechanism.
Mutations in DSTYK and dominant urinary tract malformations.
Sanna-Cherchi, Simone; Sampogna, Rosemary V; Papeta, Natalia; Burgess, Katelyn E; Nees, Shannon N; Perry, Brittany J; Choi, Murim; Bodria, Monica; Liu, Yan; Weng, Patricia L; Lozanovski, Vladimir J; Verbitsky, Miguel; Lugani, Francesca; Sterken, Roel; Paragas, Neal; Caridi, Gianluca; Carrea, Alba; Dagnino, Monica; Materna-Kiryluk, Anna; Santamaria, Giuseppe; Murtas, Corrado; Ristoska-Bojkovska, Nadica; Izzi, Claudia; Kacak, Nilgun; Bianco, Beatrice; Giberti, Stefania; Gigante, Maddalena; Piaggio, Giorgio; Gesualdo, Loreto; Vukic, Durdica Kosuljandic; Vukojevic, Katarina; Saraga-Babic, Mirna; Saraga, Marijan; Gucev, Zoran; Allegri, Landino; Latos-Bielenska, Anna; Casu, Domenica; State, Matthew; Scolari, Francesco; Ravazzolo, Roberto; Kiryluk, Krzysztof; Al-Awqati, Qais; D'Agati, Vivette D; Drummond, Iain A; Tasic, Velibor; Lifton, Richard P; Ghiggeri, Gian Marco; Gharavi, Ali G
2013-08-15
Congenital abnormalities of the kidney and the urinary tract are the most common cause of pediatric kidney failure. These disorders are highly heterogeneous, and the etiologic factors are poorly understood. We performed genomewide linkage analysis and whole-exome sequencing in a family with an autosomal dominant form of congenital abnormalities of the kidney or urinary tract (seven affected family members). We also performed a sequence analysis in 311 unrelated patients, as well as histologic and functional studies. Linkage analysis identified five regions of the genome that were shared among all affected family members. Exome sequencing identified a single, rare, deleterious variant within these linkage intervals, a heterozygous splice-site mutation in the dual serine-threonine and tyrosine protein kinase gene (DSTYK). This variant, which resulted in aberrant splicing of messenger RNA, was present in all affected family members. Additional, independent DSTYK mutations, including nonsense and splice-site mutations, were detected in 7 of 311 unrelated patients. DSTYK is highly expressed in the maturing epithelia of all major organs, localizing to cell membranes. Knockdown in zebrafish resulted in developmental defects in multiple organs, which suggested loss of fibroblast growth factor (FGF) signaling. Consistent with this finding is the observation that DSTYK colocalizes with FGF receptors in the ureteric bud and metanephric mesenchyme. DSTYK knockdown in human embryonic kidney cells inhibited FGF-stimulated phosphorylation of extracellular-signal-regulated kinase (ERK), the principal signal downstream of receptor tyrosine kinases. We detected independent DSTYK mutations in 2.3% of patients with congenital abnormalities of the kidney or urinary tract, a finding that suggests that DSTYK is a major determinant of human urinary tract development, downstream of FGF signaling. (Funded by the National Institutes of Health and others.).
Mutations in DSTYK and Dominant Urinary Tract Malformations
Sanna-Cherchi, Simone; Nees, Shannon N.; Perry, Brittany J.; Choi, Murim; Bodria, Monica; Liu, Yan; Weng, Patricia L.; Lozanovski, Vladimir J.; Verbitsky, Miguel; Lugani, Francesca; Sterken, Roel; Paragas, Neal; Caridi, Gianluca; Carrea, Alba; Dagnino, Monica; Materna-Kiryluk, Anna; Santamaria, Giuseppe; Murtas, Corrado; Ristoska-Bojkovska, Nadica; Izzi, Claudia; Kacak, Nilgun; Bianco, Beatrice; Giberti, Stefania; Gigante, Maddalena; Piaggio, Giorgio; Gesualdo, Loreto; Vukic, Durdica Kosuljandic; Vukojevic, Katarina; Saraga-Babic, Mirna; Saraga, Marijan; Gucev, Zoran; Allegri, Landino; Latos-Bielenska, Anna; Casu, Domenica; State, Matthew; Scolari, Francesco; Ravazzolo, Roberto; Kiryluk, Krzysztof; Al-Awqati, Qais; D'Agati, Vivette D.; Drummond, Iain A.; Tasic, Velibor; Lifton, Richard P.; Ghiggeri, Gian Marco; Gharavi, Ali G.
2013-01-01
BACKGROUND Congenital abnormalities of the kidney and the urinary tract are the most common cause of pediatric kidney failure. These disorders are highly heterogeneous, and the etiologic factors are poorly understood. METHODS We performed genomewide linkage analysis and whole-exome sequencing in a family with an autosomal dominant form of congenital abnormalities of the kidney or urinary tract (seven affected family members). We also performed a sequence analysis in 311 unrelated patients, as well as histologic and functional studies. RESULTS Linkage analysis identified five regions of the genome that were shared among all affected family members. Exome sequencing identified a single, rare, deleterious variant within these linkage intervals, a heterozygous splice-site mutation in the dual serine–threonine and tyrosine protein kinase gene (DSTYK). This variant, which resulted in aberrant splicing of messenger RNA, was present in all affected family members. Additional, independent DSTYK mutations, including nonsense and splice-site mutations, were detected in 7 of 311 unrelated patients. DSTYK is highly expressed in the maturing epithelia of all major organs, localizing to cell membranes. Knockdown in zebrafish resulted in developmental defects in multiple organs, which suggested loss of fibroblast growth factor (FGF) signaling. Consistent with this finding is the observation that DSTYK colocalizes with FGF receptors in the ureteric bud and metanephric mesenchyme. DSTYK knockdown in human embryonic kidney cells inhibited FGF-stimulated phosphorylation of extracellular-signal-regulated kinase (ERK), the principal signal downstream of receptor tyrosine kinases. CONCLUSIONS We detected independent DSTYK mutations in 2.3% of patients with congenital abnormalities of the kidney or urinary tract, a finding that suggests that DSTYK is a major determinant of human urinary tract development, downstream of FGF signaling. (Funded by the National Institutes of Health and others.) PMID:23862974
Suzuki, Takashi; Brown, Judy J.; Swift, Larry L.
2016-01-01
Microsomal triglyceride transfer protein (MTP) is essential for the assembly of triglyceride-rich apolipoprotein B-containing lipoproteins. Previous studies in our laboratory identified a novel splice variant of MTP in mice that we named MTP-B. MTP-B has a unique first exon (1B) located 2.7 kB upstream of the first exon (1A) for canonical MTP (MTP-A). The two mature isoforms, though nearly identical in sequence and function, have different tissue expression patterns. In this study we report the identification of a second MTP splice variant (MTP-C), which contains both exons 1B and 1A. MTP-C is expressed in all the tissues we tested. In cells transfected with MTP-C, protein expression was less than 15% of that found when the cells were transfected with MTP-A or MTP-B. In silico analysis of the 5’-UTR of MTP-C revealed seven ATGs upstream of the start site for MTP-A, which is the only viable start site in frame with the main coding sequence. One of those ATGs was located in the 5’-UTR for MTP-A. We generated reporter constructs in which the 5’-UTRs of MTP-A or MTP-C were inserted between an SV40 promoter and the coding sequence of the luciferase gene and transfected these constructs into HEK 293 cells. Luciferase activity was significantly reduced by the MTP-C 5’-UTR, but not by the MTP-A 5’-UTR. We conclude that alternative splicing plays a key role in regulating MTP expression by introducing unique 5’-UTRs, which contain elements that alter translation efficiency, enabling the cell to optimize MTP levels and activity. PMID:26771188
Bhattarai, Sunil; Aly, Ahmed; Garcia, Kristy; Ruiz, Diandra; Pontarelli, Fabrizio; Dharap, Ashutosh
2018-06-03
Gene expression in cerebral ischemia has been a subject of intense investigations for several years. Studies utilizing probe-based high-throughput methodologies such as microarrays have contributed significantly to our existing knowledge but lacked the capacity to dissect the transcriptome in detail. Genome-wide RNA-sequencing (RNA-seq) enables comprehensive examinations of transcriptomes for attributes such as strandedness, alternative splicing, alternative transcription start/stop sites, and sequence composition, thus providing a very detailed account of gene expression. Leveraging this capability, we conducted an in-depth, genome-wide evaluation of the protein-coding transcriptome of the adult mouse cortex after transient focal ischemia at 6, 12, or 24 h of reperfusion using RNA-seq. We identified a total of 1007 transcripts at 6 h, 1878 transcripts at 12 h, and 1618 transcripts at 24 h of reperfusion that were significantly altered as compared to sham controls. With isoform-level resolution, we identified 23 splice variants arising from 23 genes that were novel mRNA isoforms. For a subset of genes, we detected reperfusion time-point-dependent splice isoform switching, indicating an expression and/or functional switch for these genes. Finally, for 286 genes across all three reperfusion time-points, we discovered multiple, distinct, simultaneously expressed and differentially altered isoforms per gene that were generated via alternative transcription start/stop sites. Of these, 165 isoforms derived from 109 genes were novel mRNAs. Together, our data unravel the protein-coding transcriptome of the cerebral cortex at an unprecedented depth to provide several new insights into the flexibility and complexity of stroke-related gene transcription and transcript organization.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ris-Stalpers, C.; Verleun-Mooijman, M.C.T.; Blaeij, T.J.P. de
1994-04-01
The analysis of the androgen receptor (AR) gene, mRNA, and protein in a subject with X-linked Reifenstein syndrome (partial androgen insensitivity) is reported. The presence of two mature AR transcripts in genital skin fibroblasts of the patient is established, and, by reverse transcriptase-PCR and RNase transcription analysis, the wild-type transcript and a transcript in which exon 3 sequences are absent without disruption of the translational reading frame are identified. Sequencing and hybridization analysis show a deletion of >6 kb in intron 2 of the human AR gene, starting 18 bp upstream of exon 3. The deletion includes the putative branch-pointmore » sequence (BPS) but not the acceptor splice site on the intron 2/exon 3 boundary. The deletion of the putative intron 2 BPS results in 90% inhibition of wild-type splicing. The mutant transcript encodes an AR protein lacking the second zinc finger of the DNA-binding domain. Western/immunoblotting analysis is used to show that the mutant AR protein is expressed in genital skin fibroblasts of the patient. The residual 10% wild-type transcript can be the result of the use of a cryptic BPS located 63 bp upstream of the intron 2/exon 3 boundary of the mutant AR gene. The mutated AR protein has no transcription-activating potential and does not influence the transactivating properties of the wild-type AR, as tested in cotransfection studies. It is concluded that the partial androgen-insensitivity syndrome of this patient is the consequence of the limited amount of wild-type AR protein expressed in androgen target cells, resulting from the deletion of the intron 2 putative BPS. 42 refs., 6 figs., 1 tab.« less
Margaglione, M; Santacroce, R; Colaizzo, D; Seripa, D; Vecchione, G; Lupone, M R; De Lucia, D; Fortina, P; Grandone, E; Perricone, C; Di Minno, G
2000-10-01
Congenital afibrinogenemia is a rare autosomal recessive disorder characterized by a hemorrhagic diathesis of variable severity. Although more than 100 families with this disorder have been described, genetic defects have been characterized in few cases. An investigation of a young propositus, offspring of a consanguineous marriage, with undetectable levels of functional and quantitative fibrinogen, was conducted. Sequence analysis of the fibrinogen genes showed a homozygous G-to-A mutation at the fifth nucleotide (nt 2395) of the third intervening sequence (IVS) of the gamma-chain gene. Her first-degree relatives, who had approximately half the normal fibrinogen values and showed concordance between functional and immunologic levels, were heterozygtes. The G-to-A change predicts the disappearance of a donor splice site. After transfection with a construct, containing either the wild-type or the mutated sequence, cells with the mutant construct showed an aberrant messenger RNA (mRNA), consistent with skipping of exon 3, but not the expected mRNA. Sequencing of the abnormal mRNA showed the complete absence of exon 3. Skipping of exon 3 predicts the deletion of amino acid sequence from residue 16 to residue 75 and shifting of reading frame at amino acid 76 with a premature stop codon within exon 4 at position 77. Thus, the truncated gamma-chain gene product would not interact with other chains to form the mature fibrinogen molecule. The current findings show that mutations within highly conserved IVS regions of fibrinogen genes could affect the efficiency of normal splicing, giving rise to congenital afibrinogenemia.
Horiuchi, Keiko; Perez-Cerezales, Serafín; Papasaikas, Panagiotis; Ramos-Ibeas, Priscila; López-Cardona, Angela Patricia; Laguna-Barraza, Ricardo; Fonseca Balvís, Noelia; Pericuesta, Eva; Fernández-González, Raul; Planells, Benjamín; Viera, Alberto; Suja, Jose Angel; Ross, Pablo Juan; Alén, Francisco; Orio, Laura; Rodriguez de Fonseca, Fernando; Pintado, Belén; Valcárcel, Juan; Gutiérrez-Adán, Alfonso
2018-04-03
The U2AF35-like ZRSR1 has been implicated in the recognition of 3' splice site during spliceosome assembly, but ZRSR1 knockout mice do not show abnormal phenotypes. To analyze ZRSR1 function and its precise role in RNA splicing, we generated ZRSR1 mutant mice containing truncating mutations within its RNA-recognition motif. Homozygous mutant mice exhibited severe defects in erythrocytes, muscle stretch, and spermatogenesis, along with germ cell sloughing and apoptosis, ultimately leading to azoospermia and male sterility. Testis RNA sequencing (RNA-seq) analyses revealed increased intron retention of both U2- and U12-type introns, including U12-type intron events in genes with key functions in spermatogenesis and spermatid development. Affected U2 introns were commonly found flanking U12 introns, suggesting functional cross-talk between the two spliceosomes. The splicing and tissue defects observed in mutant mice attributed to ZRSR1 loss of function suggest a physiological role for this factor in U12 intron splicing. Copyright © 2018 The Author(s). Published by Elsevier Inc. All rights reserved.
On splice site prediction using weight array models: a comparison of smoothing techniques
NASA Astrophysics Data System (ADS)
Taher, Leila; Meinicke, Peter; Morgenstern, Burkhard
2007-11-01
In most eukaryotic genes, protein-coding exons are separated by non-coding introns which are removed from the primary transcript by a process called "splicing". The positions where introns are cut and exons are spliced together are called "splice sites". Thus, computational prediction of splice sites is crucial for gene finding in eukaryotes. Weight array models are a powerful probabilistic approach to splice site detection. Parameters for these models are usually derived from m-tuple frequencies in trusted training data and subsequently smoothed to avoid zero probabilities. In this study we compare three different ways of parameter estimation for m-tuple frequencies, namely (a) non-smoothed probability estimation, (b) standard pseudo counts and (c) a Gaussian smoothing procedure that we recently developed.
Genome-wide mapping of alternative splicing in Arabidopsis thaliana
Filichkin, Sergei A.; Priest, Henry D.; Givan, Scott A.; Shen, Rongkun; Bryant, Douglas W.; Fox, Samuel E.; Wong, Weng-Keen; Mockler, Todd C.
2010-01-01
Alternative splicing can enhance transcriptome plasticity and proteome diversity. In plants, alternative splicing can be manifested at different developmental stages, and is frequently associated with specific tissue types or environmental conditions such as abiotic stress. We mapped the Arabidopsis transcriptome at single-base resolution using the Illumina platform for ultrahigh-throughput RNA sequencing (RNA-seq). Deep transcriptome sequencing confirmed a majority of annotated introns and identified thousands of novel alternatively spliced mRNA isoforms. Our analysis suggests that at least ∼42% of intron-containing genes in Arabidopsis are alternatively spliced; this is significantly higher than previous estimates based on cDNA/expressed sequence tag sequencing. Random validation confirmed that novel splice isoforms empirically predicted by RNA-seq can be detected in vivo. Novel introns detected by RNA-seq were substantially enriched in nonconsensus terminal dinucleotide splice signals. Alternative isoforms with premature termination codons (PTCs) comprised the majority of alternatively spliced transcripts. Using an example of an essential circadian clock gene, we show that intron retention can generate relatively abundant PTC+ isoforms and that this specific event is highly conserved among diverse plant species. Alternatively spliced PTC+ isoforms can be potentially targeted for degradation by the nonsense mediated mRNA decay (NMD) surveillance machinery or regulate the level of functional transcripts by the mechanism of regulated unproductive splicing and translation (RUST). We demonstrate that the relative ratios of the PTC+ and reference isoforms for several key regulatory genes can be considerably shifted under abiotic stress treatments. Taken together, our results suggest that like in animals, NMD and RUST may be widespread in plants and may play important roles in regulating gene expression. PMID:19858364
Mutations of RNA splicing factors in hematological malignancies.
Shukla, Girish C; Singh, Jagjit
2017-11-28
Systematic large-scale cancer genomic studies have produced numerous significant findings. These studies have not only revealed new cancer-promoting genes, but they also have identified cancer-promoting functions of previously known "housekeeping" genes. These studies have identified numerous mutations in genes which play a fundamental role in nuclear precursor mRNA splicing. Somatic mutations and copy number variation in many of the splicing factors which participate in the formation of multiple spliceosomal complexes appear to play a role in many cancers and in particular in myelodysplastic syndromes (MDS). Mutated proteins seem to interfere with the recognition of the authentic splice sites (SS) leading to utilization of suboptimal alternative splicing sites generating aberrantly spliced mRNA isoforms. This short review is focusing on the function of the splice factors involved in the formation of splicing complexes and potential mechanisms which affect usage of the authentic splice site recognition. Copyright © 2017 Elsevier B.V. All rights reserved.
Ma, Long; Tan, Zhiping; Teng, Yanling; Hoersch, Sebastian; Horvitz, H. Robert
2011-01-01
The in vivo analysis of the roles of splicing factors in regulating alternative splicing in animals remains a challenge. Using a microarray-based screen, we identified a Caenorhabditis elegans gene, tos-1, that exhibited three of the four major types of alternative splicing: intron retention, exon skipping, and, in the presence of U2AF large subunit mutations, the use of alternative 3′ splice sites. Mutations in the splicing factors U2AF large subunit and SF1/BBP altered the splicing of tos-1. 3′ splice sites of the retained intron or before the skipped exon regulate the splicing pattern of tos-1. Our study provides in vivo evidence that intron retention and exon skipping can be regulated largely by the identities of 3′ splice sites. PMID:22033331
HSA: a heuristic splice alignment tool.
Bu, Jingde; Chi, Xuebin; Jin, Zhong
2013-01-01
RNA-Seq methodology is a revolutionary transcriptomics sequencing technology, which is the representative of Next generation Sequencing (NGS). With the high throughput sequencing of RNA-Seq, we can acquire much more information like differential expression and novel splice variants from deep sequence analysis and data mining. But the short read length brings a great challenge to alignment, especially when the reads span two or more exons. A two steps heuristic splice alignment tool is generated in this investigation. First, map raw reads to reference with unspliced aligner--BWA; second, split initial unmapped reads into three equal short reads (seeds), align each seed to the reference, filter hits, search possible split position of read and extend hits to a complete match. Compare with other splice alignment tools like SOAPsplice and Tophat2, HSA has a better performance in call rate and efficiency, but its results do not as accurate as the other software to some extent. HSA is an effective spliced aligner of RNA-Seq reads mapping, which is available at https://github.com/vlcc/HSA.
PRP5: a helicase-like protein required for mRNA splicing in yeast.
Dalbadie-McFarland, G; Abelson, J
1990-01-01
A 96-kDa protein predicted by the DNA sequence of the Saccharomyces cerevisiae PRP5 gene contains a domain that bears a striking resemblance to a family of RNA helicases characterized by the conserved amino acid sequence Asp-Glu-Ala-Asp (D-E-A-D). Previous work indicated that the product of the PRP5 gene is required for splicing and that spliceosome assembly does not occur in its absence. However, its precise role in splicing and the nature of its biochemical activity remained unknown. To examine the role of PRP5 in splicing, we cloned the gene by complementation of a temperature-sensitive mutation and determined its DNA sequence. We discuss here the possible roles for an RNA helicase in splicing and for the activity of the PRP5 protein. Images PMID:2349233
Schultz, Kris Ann; Harris, Anne; Messinger, Yoav; Sencer, Susan; Baldinger, Shari; Dehner, Louis P.; Hill, D. Ashley
2015-01-01
Germline DICER1 mutations have been described in individuals with pleuropulmonary blastoma (PPB), ovarian Sertoli-Leydig cell tumor (SLCT), sarcomas, multinodular goiter, thyroid carcinoma, cystic nephroma and other neoplastic conditions. Early results from the International Ovarian and Testicular Stromal Tumor Registry show germline DICER1 mutations in 48% of girls and women with SLCT. In this report, a young woman presented with ovarian undifferentiated sarcoma. Four years later, she presented with SLCT. She was successfully treated for both malignancies. Sequence results showed a germline intronic mutation in DICER1. This mutation results in an exact duplication of the six bases at the splice site at the intron 23 and exon 24 junction. Predicted improper splicing leads to inclusion of 10 bases of intronic sequence, frameshift and premature truncation of the protein disrupting the RNase IIIb domain. A second individual with SLCT was found to have an identical germline mutation. In each of the ovarian tumors, an additional somatic mutation in the RNase IIIb domain of DICER1 was found. In rare patients, germline intronic mutations in DICER1 that are predicted to cause incorrect splicing can also contribute to the pathogenesis of SLCT. PMID:26289771
An RNAi-enhanced Logic Circuit for Cancer Specific Detection and Destruction
2010-07-01
Bcl-2 family: mBax (Mus musculus), hBax ( Homo sapiens ), and its mutant hBax-S184A [4]. A plasmid containing the tested gene was transfected into HEK...the far-red fluorescent protein mKate to express the Gata3 mStaple. Intron- feature sequences – donor site, branch point, poly- pyrimidine tract, and...intron-exon junction. Among the donor and acceptor sequences found in literature our intron features were chosen according SplicePort [5], an
Quirin, Christina; Rohmer, Stanimira; Fernández-Ulibarri, Inés; Behr, Michael; Hesse, Andrea; Engelhardt, Sarah; Erbs, Philippe; Enk, Alexander H.
2011-01-01
Abstract Key challenges facing cancer therapy are the development of tumor-specific drugs and potent multimodal regimens. Oncolytic adenoviruses possess the potential to realize both aims by restricting virus replication to tumors and inserting therapeutic genes into the virus genome, respectively. A major effort in this regard is to express transgenes in a tumor-specific manner without affecting virus replication. Using both luciferase as a sensitive reporter and genetic prodrug activation, we show that promoter control of E1A facilitates highly selective expression of transgenes inserted into the late transcription unit. This, however, required multistep optimization of late transgene expression. Transgene insertion via internal ribosome entry site (IRES), splice acceptor (SA), or viral 2A sequences resulted in replication-dependent expression. Unexpectedly, analyses in appropriate substrates and with matching control viruses revealed that IRES and SA, but not 2A, facilitated indirect transgene targeting via tyrosinase promoter control of E1A. Transgene expression via SA was more selective (up to 1,500-fold) but less effective than via IRES. Notably, we also revealed transgene-dependent interference with splicing. Hence, the prodrug convertase FCU1 (a cytosine deaminase–uracil phosphoribosyltransferase fusion protein) was expressed only after optimizing the sequence surrounding the SA site and mutating a cryptic splice site within the transgene. The resulting tyrosinase promoter-regulated and FCU1-encoding adenovirus combined effective oncolysis with targeted prodrug activation therapy of melanoma. Thus, prodrug activation showed potent bystander killing and increased cytotoxicity of the virus up to 10-fold. We conclude that armed oncolytic viruses can be improved substantially by comparing and optimizing strategies for targeted transgene expression, thereby implementing selective and multimodal cancer therapies. PMID:20939692
Information theory-based analysis of CYP2C19, CYP2D6 and CYP3A5 splicing mutations.
Rogan, Peter K; Svojanovsky, Stan; Leeder, J Steven
2003-04-01
Several mutations are known or suspected to affect mRNA splicing of CYP2C19, CYP2D6 and CYP3A5 genes; however, little experimental evidence exists to support these conclusions. The present study applies mathematical models that measure changes in information content of splice sites in these genes to demonstrate the relationship between the predicted phenotypes of these variants to the corresponding genotypes. Based on information analysis, the CYP2C19*2 variant activates a new cryptic site 40 nucleotides downstream of the natural splice site. CYP2C19*7 abolishes splicing at the exon 5 donor site. The CYP2D6*4 allele similarly inactivates splicing at the acceptor site of exon 4 and activates a new cryptic site one nucleotide downstream of the natural acceptor. CYP2D6*11 inactivates the acceptor site of exon 2. The CYP3A5*3 allele activates a new cryptic site 236 nucleotides upstream of the exon 4 natural acceptor site. CYP3A5*5 inactivates the exon 5 donor site and CYP3A5*6 strengthens a site upstream of the natural donor site, resulting in skipping of exon 7. Other previously described missense and nonsense mutations at terminal codons of exons in these genes affected splicing. CYP2D6*8 and CYP2D6*14 both decrease the strength of the exon 3 donor site, producing transcripts lacking this exon. The results of information analysis are consistent with the poor metabolizer phenotypes observed in patients with these mutations, and illustrate the potential value of these mathematical models to quantitatively evaluate the functional consequences of new mutations suspected of altering mRNA splicing.
Context-dependent control of alternative splicing by RNA-binding proteins
Fu, Xiang-Dong; Ares, Manuel
2015-01-01
Sequence-specific RNA-binding proteins (RBPs) bind to pre-mRNA to control alternative splicing, but it is not yet possible to read the ‘splicing code’ that dictates splicing regulation on the basis of genome sequence. Each alternative splicing event is controlled by multiple RBPs, the combined action of which creates a distribution of alternatively spliced products in a given cell type. As each cell type expresses a distinct array of RBPs, the interpretation of regulatory information on a given RNA target is exceedingly dependent on the cell type. RBPs also control each other’s functions at many levels, including by mutual modulation of their binding activities on specific regulatory RNA elements. In this Review, we describe some of the emerging rules that govern the highly context-dependent and combinatorial nature of alternative splicing regulation. PMID:25112293
Circular RNA Expression: Its Potential Regulation and Function.
Salzman, Julia
2016-05-01
In 2012, a new feature of eukaryotic gene expression emerged: ubiquitous expression of circular RNA (circRNA) from genes traditionally thought to express messenger or linear noncoding (nc)RNA only. CircRNAs are covalently closed, circular RNA molecules that typically comprise exonic sequences and are spliced at canonical splice sites. This feature of gene expression was first recognized in humans and mouse, but it quickly emerged that it was common across essentially all eukaryotes studied by molecular biologists. CircRNA abundance, and even which alternatively spliced circRNA isoforms are expressed, varies by cell type and can exceed the abundance of the traditional linear mRNA or ncRNA transcript. CircRNAs are enriched in the brain and increase in abundance during fetal development. Together, these features raise fundamental questions regarding the regulation of circRNA in cis and in trans, and its function. Copyright © 2016. Published by Elsevier Ltd.
NASA Astrophysics Data System (ADS)
Mudaber, M. H.; Yusof, Y.; Mohamad, M. S.
2017-09-01
Predicting the existence of restriction enzymes sequences on the recombinant DNA fragments, after accomplishing the manipulating reaction, via mathematical approach is considered as a convenient way in terms of DNA recombination. In terms of mathematics, for this characteristic of the recombinant DNA strands, which involve the recognition sites of restriction enzymes, is called persistent and permanent. Normally differentiating the persistency and permanency of two stages recombinant DNA strands using wet-lab experiment is expensive and time-consuming due to running the experiment at two stages as well as adding more restriction enzymes on the reaction. Therefore, in this research, by using Yusof-Goode (Y-G) model the difference between persistent and permanent splicing language of some two stages is investigated. Two theorems were provided, which show the persistency and non-permanency of two stages DNA splicing language.
Li, Jie; Xu, Peiwen; Huang, Sexin; Gao, Ming; Zou, Yang; Kang, Ranran; Gao, Yuan
2017-04-10
To identify potential mutation of PHEX gene in two patients from a family affected with X-linked hypophosphatemia (XLH). PCR and Sanger sequencing were performed on blood samples from the patients and 100 healthy controls. Reverse transcription-PCR (RT-PCR) was used to determine the mRNA expression in patient samples. A splicing site mutation, IVS21+2T>G, was found in the PHEX gene in both patients but not among the 100 healthy controls. RT-PCR confirmed that exon 21 of the PHEX gene was deleted. The novel splicing mutation IVS21+2T>G of the PHEX gene probably underlies the XLH in this pedigree. At the mRNA level, the mutation has led to removal of exon 21 and shift of the open reading frame (p.Val691fsx), resulting in premature termination of protein translation.
Gamonet, Clémentine; Bole-Richard, Elodie; Delherme, Aurélia; Aubin, François; Toussirot, Eric; Garnache-Ottou, Francine; Godet, Yann; Ysebaert, Loïc; Tournilhac, Olivier; Caroline, Dartigeas; Larosa, Fabrice; Deconinck, Eric; Saas, Philippe; Borg, Christophe; Deschamps, Marina; Ferrand, Christophe
2015-01-01
CD20 is a B cell lineage-specific marker expressed by normal and leukemic B cells and targeted by several antibody immunotherapies. We have previously shown that the protein from a CD20 mRNA splice variant (D393-CD20) is expressed at various levels in leukemic B cells or lymphoma B cells but not in resting, sorted B cells from the peripheral blood of healthy donors. Western blot (WB) analysis of B malignancy primary samples showed additional CD20 signals. Deep molecular PCR analysis revealed four new sequences corresponding to in-frame CD20 splice variants (D657-CD20, D618-CD20, D480-CD20, and D177-CD20) matching the length of WB signals. We demonstrated that the cell spliceosome machinery can process ex vivo D480-, D657-, and D618-CD20 transcript variants by involving canonical sites associated with cryptic splice sites. Results of specific and quantitative RT-PCR assays showed that these CD20 splice variants are differentially expressed in B malignancies. Moreover, Epstein-Barr virus (EBV) transformation modified the CD20 splicing profile and mainly increased the D393-CD20 variant transcripts. Finally, investigation of three cohorts of chronic lymphocytic leukemia (CLL) patients showed that the total CD20 splice variant expression was higher in a stage B and C sample collection compared to routinely collected CLL samples or relapsed refractory stage A, B, or C CLL. The involvement of these newly discovered alternative CD20 transcript variants in EBV transformation makes them interesting molecular indicators, as does their association with oncogenesis rather than non-oncogenic B cell diseases, differential expression in B cell malignancies, and correlation with CLL stage and some predictive CLL markers. This potential should be investigated in further studies.
Pen, Anja E; Nyegaard, Mette; Fang, Mingyan; Jiang, Hui; Christensen, Rikke; Mølgaard, Henning; Andersen, Henning; Ulhøi, Benedicte Parm; Østergaard, John R; Væth, Signe; Sommerlund, Mette; de Brouwer, Arjan P M; Zhang, Xiuqing; Jensen, Uffe B
2015-04-01
We describe a Danish family with an, until recently, unknown X-linked disease with muscular dystrophy (MD), facial dysmorphology and pulmonary artery hypoplasia. One patient died suddenly before age 20 and another was resuscitated from cardiac arrest at the age of 28. Linkage analysis pointed to a region of 25 Mb from 123.6 Mb to 148.4 Mb on chromosome X containing over 100 genes. Exome sequencing identified a single nucleotide splice site mutation c.502-2A > T, which is located 5' to exon 6 in the gene encoding four and a half LIM domain 1 (FHL1) protein. FHL1 expresses three main splice variants, known as FHL1A, FHL1B and FHL1C. In healthy individuals, FHL1A is the predominant splice variant and is mainly found in skeletal and cardiac muscle. The FHL1 transcript profiles from two affected individuals were investigated in skin fibroblasts with quantitative real-time PCR. This demonstrated loss of isoform A and B, and an almost 200-fold overexpression of isoform C confirming that lack of FHL1A and overexpression of FHL1C results in an extended phenotype of EDMD as recently shown by Tiffin et al. [2013]. Copyright © 2015 Elsevier Masson SAS. All rights reserved.
Successful COG8 and PDF overlap is mediated by alterations in splicing and polyadenylation signals.
Pereira-Castro, Isabel; Quental, Rita; da Costa, Luís T; Amorim, António; Azevedo, Luisa
2012-02-01
Although gene-free areas compose the great majority of eukaryotic genomes, a significant fraction of genes overlaps, i.e., unique nucleotide sequences are part of more than one transcription unit. In this work, the evolutionary history and origin of a same-strand gene overlap is dissected through the analysis of COG8 (component of oligomeric Golgi complex 8) and PDF (peptide deformylase). Comparative genomic surveys reveal that the relative locations of these two genes have been changing over the last 445 million years from distinct chromosomal locations in fish to overlapping in rodents and primates, indicating that the overlap between these genes precedes their divergence. The overlap between the two genes was initiated by the gain of a novel splice donor site between the COG8 stop codon and PDF initiation codon. Splicing is accomplished by the use of the PDF acceptor, leading COG8 to share the 3'end with PDF. In primates, loss of the ancestral polyadenylation signal for COG8 makes the overlap between COG8 and PDF mandatory, while in mouse and rat concurrent overlapping and non-overlapping Cog8 transcripts exist. Altogether, we demonstrate that the origin, evolution and preservation of the COG8/PDF same-strand overlap follow similar mechanistic steps as those documented for antisense overlaps where gain and/or loss of splice sites and polyadenylation signals seems to drive the process.
Hong, Yoonki; Kim, Woo Jin; Bang, Chi Young; Lee, Jae Cheol; Oh, Yeon-Mok
2016-04-01
Lung cancer is the most common cause of cancer related death. Alterations in gene sequence, structure, and expression have an important role in the pathogenesis of lung cancer. Fusion genes and alternative splicing of cancer-related genes have the potential to be oncogenic. In the current study, we performed RNA-sequencing (RNA-seq) to investigate potential fusion genes and alternative splicing in non-small cell lung cancer. RNA was isolated from lung tissues obtained from 86 subjects with lung cancer. The RNA samples from lung cancer and normal tissues were processed with RNA-seq using the HiSeq 2000 system. Fusion genes were evaluated using Defuse and ChimeraScan. Candidate fusion transcripts were validated by Sanger sequencing. Alternative splicing was analyzed using multivariate analysis of transcript sequencing and validated using quantitative real time polymerase chain reaction. RNA-seq data identified oncogenic fusion genes EML4-ALK and SLC34A2-ROS1 in three of 86 normal-cancer paired samples. Nine distinct fusion transcripts were selected using DeFuse and ChimeraScan; of which, four fusion transcripts were validated by Sanger sequencing. In 33 squamous cell carcinoma, 29 tumor specific skipped exon events and six mutually exclusive exon events were identified. ITGB4 and PYCR1 were top genes that showed significant tumor specific splice variants. In conclusion, RNA-seq data identified novel potential fusion transcripts and splice variants. Further evaluation of their functional significance in the pathogenesis of lung cancer is required.
Yu, T; Wang, X; Ding, Q; Fu, Q; Dai, J; Lu, Y; Xi, X; Wang, H
2009-11-01
Factor VII deficiency which transmitted as an autosomal recessive disorder is a rare haemorrhagic condition. The aim of this study was to identify the molecular genetic defect and determine its functional consequences in a Chinese pedigree with FVII deficiency. The proband was diagnosed as inherited coagulation FVII deficiency by reduced plasma levels of FVII activity (4.4%) and antigen (38.5%). All nine exons and their flanking sequence of F7 gene were amplified by polymerase chain reaction (PCR) for the proband and the PCR products were directly sequenced. The compound heterozygous mutations of F7 (NM_000131.3) c.572-1G>A and F7 (NM_000131.3) c.1165T>G; p.Cys389Gly were identified in the proband's F7 gene. To investigate the splicing patterns associated with F7 c.572-1G>A, ectopic transcripts in leucocytes of the proband were analyzed. F7 minigenes, spanning from intron 4 to intron 7 and carrying either an A or a G at position -1 of intron 5, were constructed and transiently transfected into human embryonic kidney (HEK) 293T cells, followed by RT-PCR analysis. The aberrant transcripts from the F7 c.572-1G>A mutant allele were not detected by ectopic transcription study. Sequencing of the RT-PCR products from the mutant transfectant demonstrated the production of an erroneously spliced mRNA with exon 6 skipping, whereas a normal splicing occurred in the wide type transfectant. The aberrant mRNA produced from the F7 c.572-1G>A mutant allele is responsible for the factor VII deficiency in this pedigree.
Guerriero, Gea; Spadiut, Oliver; Kerschbamer, Christine; Giorno, Filomena; Baric, Sanja; Ezcurra, Inés
2016-01-01
Cellulose synthase (CesA) genes constitute a complex multigene family with six major phylogenetic clades in angiosperms. The recently sequenced genome of domestic apple, Malus×domestica, was mined for CesA genes, by blasting full-length cellulose synthase protein (CESA) sequences annotated in the apple genome against protein databases from the plant models Arabidopsis thaliana and Populus trichocarpa. Thirteen genes belonging to the six angiosperm CesA clades and coding for proteins with conserved residues typical of processive glycosyltransferases from family 2 were detected. Based on their phylogenetic relationship to Arabidopsis CESAs, as well as expression patterns, a nomenclature is proposed to facilitate further studies. Examination of their genomic organization revealed that MdCesA8-A is closely linked and co-oriented with WDR53, a gene coding for a WD40 repeat protein. The WDR53 and CesA8 genes display conserved collinearity in dicots and are partially co-expressed in the apple xylem. Interestingly, the presence of a bicistronic WDR53–CesA8A transcript was detected in phytoplasma-infected phloem tissues of apple. The bicistronic transcript contains a spliced intergenic sequence that is predicted to fold into hairpin structures typical of internal ribosome entry sites, suggesting its potential cap-independent translation. Surprisingly, the CesA8A cistron is alternatively spliced and lacks the zinc-binding domain. The possible roles of WDR53 and the alternatively spliced CESA8 variant during cellulose biosynthesis in M.×domestica are discussed. PMID:23048131
Berkers, Celia R.; de Jong, Annemieke; Schuurman, Karianne G.; Linnemann, Carsten; Meiring, Hugo D.; Janssen, Lennert; Neefjes, Jacques J.; Schumacher, Ton N. M.; Rodenko, Boris
2015-01-01
Peptide splicing, in which two distant parts of a protein are excised and then ligated to form a novel peptide, can generate unique MHC class I–restricted responses. Because these peptides are not genetically encoded and the rules behind proteasomal splicing are unknown, it is difficult to predict these spliced Ags. In the current study, small libraries of short peptides were used to identify amino acid sequences that affect the efficiency of this transpeptidation process. We observed that splicing does not occur at random, neither in terms of the amino acid sequences nor through random splicing of peptides from different sources. In contrast, splicing followed distinct rules that we deduced and validated both in vitro and in cells. Peptide ligation was quantified using a model peptide and demonstrated to occur with up to 30% ligation efficiency in vitro, provided that optimal structural requirements for ligation were met by both ligating partners. In addition, many splicing products could be formed from a single protein. Our splicing rules will facilitate prediction and detection of new spliced Ags to expand the peptidome presented by MHC class I Ags. PMID:26401003
Spliced DNA Sequences in the Paramecium Germline: Their Properties and Evolutionary Potential
Catania, Francesco; McGrath, Casey L.; Doak, Thomas G.; Lynch, Michael
2013-01-01
Despite playing a crucial role in germline-soma differentiation, the evolutionary significance of developmentally regulated genome rearrangements (DRGRs) has received scant attention. An example of DRGR is DNA splicing, a process that removes segments of DNA interrupting genic and/or intergenic sequences. Perhaps, best known for shaping immune-system genes in vertebrates, DNA splicing plays a central role in the life of ciliated protozoa, where thousands of germline DNA segments are eliminated after sexual reproduction to regenerate a functional somatic genome. Here, we identify and chronicle the properties of 5,286 sequences that putatively undergo DNA splicing (i.e., internal eliminated sequences [IESs]) across the genomes of three closely related species of the ciliate Paramecium (P. tetraurelia, P. biaurelia, and P. sexaurelia). The study reveals that these putative IESs share several physical characteristics. Although our results are consistent with excision events being largely conserved between species, episodes of differential IES retention/excision occur, may have a recent origin, and frequently involve coding regions. Our findings indicate interconversion between somatic—often coding—DNA sequences and noncoding IESs, and provide insights into the role of DNA splicing in creating potentially functional genetic innovation. PMID:23737328
The intron 1 of HPV 16 has a suboptimal branch point at a guanosine.
De la Rosa-Rios, Marco Antonio; Martínez-Salazar, Martha; Martínez-Garcia, Martha; González-Bonilla, César; Villegas-Sepúlveda, Nicolás
2006-06-01
The branch point sequence (BPS) of intron 1 of the HPV-16 was determined via RT-PCR in a cell free system, using lariat intermediates obtained by in vitro splicing reactions. We used synthetic E6/E7 transcripts and HeLa nuclear protein extracts to obtain the splicing intermediates. Then, a divergent oligonucleotide primer set, pairing on the lariat RNA that encompassed the 2'-5' phosphodiester bond formed between the 5' end of the intron and the BPS, was used for cDNA synthesis and PCR amplification. Subsequent RT-PCR assays revealed four splicing intermediates, made up of a major intermediary corresponding to the BPS and four cryptic branched sequences. Only intermediates bound at the 5' end of the intron are probably the authentic branch point sequence, and all of them branch at guanosine 328 instead of the typical adenosine. Unusually, the BPS of intron 1 of HPV-16 is a suboptimal sequence (AGUGAGU) that differs from the eukaryotic consensus BPS, which correlates with the splicing profile observed for early transcripts of HPV-16 in tumors and tumor derived cell lines. The implications of this unusual branch point sequence for splicing of the HPV-16 pre-mRNA are discussed.
Splicing of designer exons informs a biophysical model for exon definition
Arias, Mauricio A.; Chasin, Lawrence A.
2015-01-01
Pre-mRNA molecules in humans contain mostly short internal exons flanked by longer introns. To explain the removal of such introns, exon recognition instead of intron recognition has been proposed. We studied this exon definition using designer exons (DEs) made up of three prototype modules of our own design: an exonic splicing enhancer (ESE), an exonic splicing silencer (ESS), and a Reference Sequence (R) predicted to be neither. Each DE was examined as the central exon in a three-exon minigene. DEs made of R modules showed a sharp size dependence, with exons shorter than 14 nt and longer than 174 nt splicing poorly. Changing the strengths of the splice sites improved longer exon splicing but worsened shorter exon splicing, effectively displacing the curve to the right. For the ESE we found, unexpectedly, that its enhancement efficiency was independent of its position within the exon. For the ESS we found a step-wise positional increase in its effects; it was most effective at the 3′ end of the exon. To apply these results quantitatively, we developed a biophysical model for exon definition of internal exons undergoing cotranscriptional splicing. This model features commitment to inclusion before the downstream exon is synthesized and competition between skipping and inclusion fates afterward. Collision of both exon ends to form an exon definition complex was incorporated to account for the effect of size; ESE/ESS effects were modeled on the basis of stabilization/destabilization. This model accurately predicted the outcome of independent experiments on more complex DEs that combined ESEs and ESSs. PMID:25492963
Huang, J M; Wang, Z Y; Ju, Z H; Wang, C F; Li, Q L; Sun, T; Hou, Q L; Hang, S Q; Hou, M H; Zhong, J F
2011-12-21
Bovine lactoferrin (bLF) is a member of the transferrin family; it plays an important role in the innate immune response. We identified novel splice variants of the bLF gene in mastitis-infected and healthy cows. Reverse transcription-polymerase chain reaction (RT-PCR) and clone sequencing analysis were used to screen the splice variants of the bLF gene in the mammary gland, spleen and liver tissues. One main transcript corresponding to the bLF reference sequence was found in three tissues in both healthy and mastitis-infected cows. Quantitative real-time PCR analysis showed that the expression levels of the LF gene's main transcript were not significantly different in tissues from healthy versus mastitis-infected cows. However, the new splice variant, LF-AS2, which has the exon-skipping alternative splicing pattern, was only identified in mammary glands infected with Staphylococcus aureus. Sequencing analysis showed that the new splice variant was 251 bp in length, including exon 1, part of exon 2, part of exon 16, and exon 17. We conclude that bLF may play a role in resistance to mastitis through alternative splicing mechanisms.
Soukarieh, Omar; Gaildrat, Pascaline; Hamieh, Mohamad; Drouet, Aurélie; Baert-Desurmont, Stéphanie; Frébourg, Thierry; Tosi, Mario; Martins, Alexandra
2016-01-01
The identification of a causal mutation is essential for molecular diagnosis and clinical management of many genetic disorders. However, even if next-generation exome sequencing has greatly improved the detection of nucleotide changes, the biological interpretation of most exonic variants remains challenging. Moreover, particular attention is typically given to protein-coding changes often neglecting the potential impact of exonic variants on RNA splicing. Here, we used the exon 10 of MLH1, a gene implicated in hereditary cancer, as a model system to assess the prevalence of RNA splicing mutations among all single-nucleotide variants identified in a given exon. We performed comprehensive minigene assays and analyzed patient’s RNA when available. Our study revealed a staggering number of splicing mutations in MLH1 exon 10 (77% of the 22 analyzed variants), including mutations directly affecting splice sites and, particularly, mutations altering potential splicing regulatory elements (ESRs). We then used this thoroughly characterized dataset, together with experimental data derived from previous studies on BRCA1, BRCA2, CFTR and NF1, to evaluate the predictive power of 3 in silico approaches recently described as promising tools for pinpointing ESR-mutations. Our results indicate that ΔtESRseq and ΔHZEI-based approaches not only discriminate which variants affect splicing, but also predict the direction and severity of the induced splicing defects. In contrast, the ΔΨ-based approach did not show a compelling predictive power. Our data indicates that exonic splicing mutations are more prevalent than currently appreciated and that they can now be predicted by using bioinformatics methods. These findings have implications for all genetically-caused diseases. PMID:26761715
RNA Splicing in a New Rhabdovirus from Culex Mosquitoes▿†
Kuwata, Ryusei; Isawa, Haruhiko; Hoshino, Keita; Tsuda, Yoshio; Yanase, Tohru; Sasaki, Toshinori; Kobayashi, Mutsuo; Sawabe, Kyoko
2011-01-01
Among members of the order Mononegavirales, RNA splicing events have been found only in the family Bornaviridae. Here, we report that a new rhabdovirus isolated from the mosquito Culex tritaeniorhynchus replicates in the nuclei of infected cells and requires RNA splicing for viral mRNA maturation. The virus, designated Culex tritaeniorhynchus rhabdovirus (CTRV), shares a similar genome organization with other rhabdoviruses, except for the presence of a putative intron in the coding region for the L protein. Molecular phylogenetic studies indicated that CTRV belongs to the family Rhabdoviridae, but it is yet to be assigned a genus. Electron microscopic analysis revealed that the CTRV virion is extremely elongated, unlike virions of rhabdoviruses, which are generally bullet shaped. Northern hybridization confirmed that a large transcript (approximately 6,500 nucleotides [nt]) from the CTRV L gene was present in the infected cells. Strand-specific reverse transcription-PCR (RT-PCR) analyses identified the intron-exon boundaries and the 76-nt intron sequence, which contains the typical motif for eukaryotic spliceosomal intron-splice donor/acceptor sites (GU-AG), a predicted branch point, and a polypyrimidine tract. In situ hybridization exhibited that viral RNAs are primarily localized in the nucleus of infected cells, indicating that CTRV replicates in the nucleus and is allowed to utilize the host's nuclear splicing machinery. This is the first report of RNA splicing among the members of the family Rhabdoviridae. PMID:21507977
RNA splicing in a new rhabdovirus from Culex mosquitoes.
Kuwata, Ryusei; Isawa, Haruhiko; Hoshino, Keita; Tsuda, Yoshio; Yanase, Tohru; Sasaki, Toshinori; Kobayashi, Mutsuo; Sawabe, Kyoko
2011-07-01
Among members of the order Mononegavirales, RNA splicing events have been found only in the family Bornaviridae. Here, we report that a new rhabdovirus isolated from the mosquito Culex tritaeniorhynchus replicates in the nuclei of infected cells and requires RNA splicing for viral mRNA maturation. The virus, designated Culex tritaeniorhynchus rhabdovirus (CTRV), shares a similar genome organization with other rhabdoviruses, except for the presence of a putative intron in the coding region for the L protein. Molecular phylogenetic studies indicated that CTRV belongs to the family Rhabdoviridae, but it is yet to be assigned a genus. Electron microscopic analysis revealed that the CTRV virion is extremely elongated, unlike virions of rhabdoviruses, which are generally bullet shaped. Northern hybridization confirmed that a large transcript (approximately 6,500 nucleotides [nt]) from the CTRV L gene was present in the infected cells. Strand-specific reverse transcription-PCR (RT-PCR) analyses identified the intron-exon boundaries and the 76-nt intron sequence, which contains the typical motif for eukaryotic spliceosomal intron-splice donor/acceptor sites (GU-AG), a predicted branch point, and a polypyrimidine tract. In situ hybridization exhibited that viral RNAs are primarily localized in the nucleus of infected cells, indicating that CTRV replicates in the nucleus and is allowed to utilize the host's nuclear splicing machinery. This is the first report of RNA splicing among the members of the family Rhabdoviridae.
Generation and Analysis of the Expressed Sequence Tags from the Mycelium of Ganoderma lucidum
Huang, Yen-Hua; Wu, Hung-Yi; Wu, Keh-Ming; Liu, Tze-Tze; Liou, Ruey-Fen; Tsai, Shih-Feng; Shiao, Ming-Shi; Ho, Low-Tone; Tzean, Shean-Shong; Yang, Ueng-Cheng
2013-01-01
Ganoderma lucidum (G. lucidum) is a medicinal mushroom renowned in East Asia for its potential biological effects. To enable a systematic exploration of the genes associated with the various phenotypes of the fungus, the genome consortium of G. lucidum has carried out an expressed sequence tag (EST) sequencing project. Using a Sanger sequencing based approach, 47,285 ESTs were obtained from in vitro cultures of G. lucidum mycelium of various durations. These ESTs were further clustered and merged into 7,774 non-redundant expressed loci. The features of these expressed contigs were explored in terms of over-representation, alternative splicing, and natural antisense transcripts. Our results provide an invaluable information resource for exploring the G. lucidum transcriptome and its regulation. Many cases of the genes over-represented in fast-growing dikaryotic mycelium are closely related to growth, such as cell wall and bioactive compound synthesis. In addition, the EST-genome alignments containing putative cassette exons and retained introns were manually curated and then used to make inferences about the predominating splice-site recognition mechanism of G. lucidum. Moreover, a number of putative antisense transcripts have been pinpointed, from which we noticed that two cases are likely to reveal hitherto undiscovered biological pathways. To allow users to access the data and the initial analysis of the results of this project, a dedicated web site has been created at http://csb2.ym.edu.tw/est/. PMID:23658685
van den Berg, L; Kwant, L; Hestand, M S; van Oost, B A; Leegwater, P A J
2005-01-01
Aggressive behavior is the most frequently encountered behavioral problem in dogs. Abnormalities in brain serotonin metabolism have been described in aggressive dogs. We studied canine serotonergic genes to investigate genetic factors underlying canine aggression. Here, we describe the characterization of three genes of the canine serotonergic system: the serotonin receptor 1A and 2A gene (htr1A and htr2A) and the serotonin transporter gene (slc6A4). We isolated canine bacterial artificial chromosome clones containing these genes and designed oligonucleotides for genomic sequencing of coding regions and intron-exon boundaries. Golden retrievers were analyzed for DNA sequence variations. We found two nonsynonymous single nucleotide polymorphisms (SNPs) in the coding sequence of htr1A; one SNP close to a splice site in htr2A; and two SNPs in slc6A4, one in the coding sequence and one close to a splice site. In addition, we identified a polymorphic microsatellite marker for each gene. Htr1A is a strong candidate for involvement in the domestication of the dog. We genotyped the htr1A SNPs in 41 dogs of seven breeds with diverse behavioral characteristics. At least three SNP haplotypes were found. Our results do not support involvement of the gene in domestication.
Crystal Structure of the Extracellular Cholinesterase-Like Domain from Neuroligin-2
DOE Office of Scientific and Technical Information (OSTI.GOV)
Koehnke,J.; Jin, X.; Budreck, E.
Neuroligins (NLs) are catalytically inactive members of a family of cholinesterase-like transmembrane proteins that mediate cell adhesion at neuronal synapses. Postsynaptic neuroligins engage in Ca2+-dependent transsynaptic interactions via their extracellular cholinesterase domain with presynaptic neurexins (NRXs). These interactions may be regulated by two short splice insertions (termed A and B) in the NL cholinesterase domain. Here, we present the 3.3- Angstroms crystal structure of the ectodomain from NL2 containing splice insertion A (NL2A). The overall structure of NL2A resembles that of cholinesterases, but several structural features are unique to the NL proteins. First, structural elements surrounding the esterase active-site regionmore » differ significantly between active esterases and NL2A. On the opposite surface of the NL2A molecule, the positions of the A and B splice insertions identify a candidate NRX interaction site of the NL protein. Finally, sequence comparisons of NL isoforms allow for mapping the location of residues of previously identified mutations in NL3 and NL4 found in patients with autism spectrum disorders. Overall, the NL2 structure promises to provide a valuable model for dissecting NL isoform- and synapse-specific functions.« less
Crystal structure of the extracellular cholinesterase-like domain from neuroligin-2
Koehnke, Jesko; Jin, Xiangshu; Budreck, Elaine C.; Posy, Shoshana; Scheiffele, Peter; Honig, Barry; Shapiro, Lawrence
2008-01-01
Neuroligins (NLs) are catalytically inactive members of a family of cholinesterase-like transmembrane proteins that mediate cell adhesion at neuronal synapses. Postsynaptic neuroligins engage in Ca2+-dependent transsynaptic interactions via their extracellular cholinesterase domain with presynaptic neurexins (NRXs). These interactions may be regulated by two short splice insertions (termed A and B) in the NL cholinesterase domain. Here, we present the 3.3-Å crystal structure of the ectodomain from NL2 containing splice insertion A (NL2A). The overall structure of NL2A resembles that of cholinesterases, but several structural features are unique to the NL proteins. First, structural elements surrounding the esterase active-site region differ significantly between active esterases and NL2A. On the opposite surface of the NL2A molecule, the positions of the A and B splice insertions identify a candidate NRX interaction site of the NL protein. Finally, sequence comparisons of NL isoforms allow for mapping the location of residues of previously identified mutations in NL3 and NL4 found in patients with autism spectrum disorders. Overall, the NL2 structure promises to provide a valuable model for dissecting NL isoform- and synapse-specific functions. PMID:18250328
Genomic structure and expression of STM2, the chromosome 1 familial Alzheimer disease gene.
Levy-Lahad, E; Poorkaj, P; Wang, K; Fu, Y H; Oshima, J; Mulligan, J; Schellenberg, G D
1996-06-01
Mutations in the gene STM2 result in autosomal dominant familial Alzheimer disease. To screen for mutations and to identify regulatory elements for this gene, the genomic DNA sequence and intron-exon structure were determined. Twelve exons including 10 coding exons were identified in a genomic region spanning 23,737 bp. The first 2 exons encode the 5'-untranslated region. Expression analysis of STM2 indicates that two transcripts of 2.4 and 2.8 kb are found in skeletal muscle, pancreas, and heart. In addition, a splice variant of the 2.4-kb transcript was identified that is the result of the use of an alternative splice acceptor site located in exon 10. The use of this site results in a transcript lacking a single glutamate. The promotor for this gene and the alternatively spliced exons leading to the 2.8-kb form of the gene remain to be identified. Expression of STM2 was high in skeletal muscle and pancreas, with comparatively low levels observed in brain. This expression pattern is intriguing since in Alzheimer disease, pathology and degeneration are observed only in the central nervous system.
Tokuhiro, Keizo; Miyagawa, Yasushi; Yamada, Shuichi; Hirose, Mika; Ohta, Hiroshi; Nishimune, Yoshitake; Tanaka, Hiromitsu
2007-03-01
Haspin is a unique protein kinase expressed predominantly in haploid male germ cells. The genomic structure of haspin (Gsg2) has revealed it to be intronless, and the entire transcription unit is in an intron of the integrin alphaE (Itgae) gene. Transcription occurs from a bidirectional promoter that also generates an alternatively spliced integrin alphaE-derived mRNA (Aed). In mice, the testis-specific alternative splicing of Aed is expressed bidirectionally downstream from the Gsg2 transcription initiation site, and a segment consisting of 26 bp transcribes both genomic DNA strands between Gsg2 and the Aed transcription initiation sites. To investigate the mechanisms for this unique gene regulation, we cloned and characterized the Gsg2 promoter region. The 193-bp genomic fragment from the 5' end of the Gsg2 and Aed genes, fused with EGFP and DsRed genes, drove the expression of both proteins in haploid germ cells of transgenic mice. This promoter element contained only a GC-rich sequence, and not the previously reported DNA sequences known to bind various transcription factors--with the exception of E2F1, TCFAP2A1 (AP2), and SP1. Here, we show that the 193-bp DNA sequence is sufficient for the specific, bidirectional, and synchronous expression in germ cells in the testis. We also demonstrate the existence of germ cell nuclear factors specifically bound to the promoter sequence. This activity may be regulated by binding to the promoter sequence with germ cell-specific nuclear complex(es) without regulation via DNA methylation.
Tran, Trung T; Bollineni, Ravi C; Strozynski, Margarita; Koehler, Christian J; Thiede, Bernd
2017-07-07
Alternative splicing is a mechanism in eukaryotes by which different forms of mRNAs are generated from the same gene. Identification of alternative splice variants requires the identification of peptides specific for alternative splice forms. For this purpose, we generated a human database that contains only unique tryptic peptides specific for alternative splice forms from Swiss-Prot entries. Using this database allows an easy access to splice variant-specific peptide sequences that match to MS data. Furthermore, we combined this database without alternative splice variant-1-specific peptides with human Swiss-Prot. This combined database can be used as a general database for searching of LC-MS data. LC-MS data derived from in-solution digests of two different cell lines (LNCaP, HeLa) and phosphoproteomics studies were analyzed using these two databases. Several nonalternative splice variant-1-specific peptides were found in both cell lines, and some of them seemed to be cell-line-specific. Control and apoptotic phosphoproteomes from Jurkat T cells revealed several nonalternative splice variant-1-specific peptides, and some of them showed clear quantitative differences between the two states.
Fenn, Joe; Boursnell, Mike; Hitti, Rebekkah J; Jenkins, Christopher A; Terry, Rebecca L; Priestnall, Simon L; Kenny, Patrick J; Mellersh, Cathryn S; Forman, Oliver P
2016-08-26
Cerebellar cortical degeneration (CCD) is an increasingly recognised neurodegenerative disease process affecting many dog breeds. Typical presentation consists of a progressive cerebellar ataxia, with a variable age at onset and rate of progression between different breeds. Cerebellar histopathological findings typically consist of primary Purkinje neuronal degeneration and loss, with variable secondary depletion of the granular and molecular cell layers. Causative genes have been identified associated with CCD in several breeds, allowing screening for selective breeding to reduce the prevalence of these conditions. There have been no previous reports of CCD in Hungarian Vizslas. Two full-sibling Hungarian Vizsla puppies from a litter of nine presented with a history of progressive ataxia, starting around three months of age. Clinical signs included marked hypermetric and dysmetric ataxia, truncal sway, intention tremors and absent menace responses, with positional horizontal nystagmus in one dog. Routine diagnostic investigations were unremarkable, and magnetic resonance imaging performed in one dog revealed mild craniodorsal cerebellar sulci widening, supportive of cerebellar atrophy. Owners of both dogs elected for euthanasia shortly after the onset of signs. Histopathological examination revealed primary Purkinje neuron loss consistent with CCD. Whole genome sequencing was used to successfully identify a disease-associated splice donor site variant in the sorting nexin 14 gene (SNX14) as a strong causative candidate. An altered SNX14 splicing pattern for a CCD case was demonstrated by RNA analysis, and no SNX14 protein could be detected in CCD case cerebellum by western blotting. SNX14 is involved in maintaining normal neuronal excitability and synaptic transmission, and a mutation has recently been found to cause autosomal recessive cerebellar ataxia and intellectual disability syndrome in humans. Genetic screening of 133 unaffected Hungarian Vizslas revealed the presence of three heterozygotes, supporting the presence of carriers in the wider population. This is the first report of CCD in Hungarian Vizsla dogs and identifies a highly associated splice donor site mutation in SNX14, with an autosomal recessive mode of inheritance suspected.
Smith, Lindsay D.; Dickinson, Rachel L.; Lucas, Christian M.; Cousins, Alex; Malygin, Alexey A.; Weldon, Carika; Perrett, Andrew J.; Bottrill, Andrew R.; Searle, Mark S.; Burley, Glenn A.; Eperon, Ian C.
2014-01-01
Summary The use of oligonucleotides to activate the splicing of selected exons is limited by a poor understanding of the mechanisms affected. A targeted bifunctional oligonucleotide enhancer of splicing (TOES) anneals to SMN2 exon 7 and carries an exonic splicing enhancer (ESE) sequence. We show that it stimulates splicing specifically of intron 6 in the presence of repressing sequences in intron 7. Complementarity to the 5′ end of exon 7 increases U2AF65 binding, but the ESE sequence is required for efficient recruitment of U2 snRNP. The ESE forms at least three coexisting discrete states: a quadruplex, a complex containing only hnRNP F/H, and a complex enriched in the activator SRSF1. Neither hnRNP H nor quadruplex formation contributes to ESE activity. The results suggest that splicing limited by weak signals can be rescued by rapid exchange of TOES oligonucleotides in various complexes and raise the possibility that SR proteins associate transiently with ESEs. PMID:25263560
King, Benjamin L.; Shi, Ling Fang; Kao, Peter; Clusin, William T.
2015-01-01
Elasmobranchs detect small potentials using excitable cells of the ampulla of Lorenzini which have calcium-activated K+ channels, first described in l974. A distinctive feature of the outward current in voltage clamped ampullae is its apparent insensitivity to voltage. The sequence of a BK channel α isoform expressed in the ampulla of the skate was characterized. A signal peptide is present at the beginning of the gene. When compared to human isoform 1 (the canonical sequence), the largest difference was absence of a 59 amino acid region from the S8-S9 intracellular linker that contains the strex regulatory domain. The ampulla isoform was also compared with the isoform predicted˜ in late skate embryos where strex was also absent. The BK voltage sensors were conserved in both skate isoforms. Differences between the skate and human BK channel included alternative splicing. Alternative splicing occurs at seven previously defined sites that are characteristic for BK channels in general and hair cells in particular. Skate BK sequences were highly similar to the Australian ghost shark and several other vertebrate species. Based on alignment of known BK sequences with the skate genome and transcriptome, there are at least two isoforms of Kcnma1α expressed in the skate. One of the β subunits (β4), which is known to decrease voltage sensitivity, was also identified in the skate genome and transcriptome and in the ampulla. These studies advance our knowledge of BK channels and suggest further studies in the ampulla and other excitable tissues. PMID:26687710
Oliveira, Jorge; Negrão, Luís; Fineza, Isabel; Taipa, Ricardo; Melo-Pires, Manuel; Fortuna, Ana Maria; Gonçalves, Ana Rita; Froufe, Hugo; Egas, Conceição; Santos, Rosário; Sousa, Mário
2015-06-01
Muscular dystrophies (MDs) are a group of hereditary muscle disorders that include two particularly heterogeneous subgroups: limb-girdle MD and congenital MD, linked to 52 different genes (seven common to both subgroups). Massive parallel sequencing technology may avoid the usual stepwise gene-by-gene analysis. We report the whole-exome sequencing (WES) analysis of a patient with childhood-onset progressive MD, also presenting mental retardation and dilated cardiomyopathy. Conventional sequencing had excluded eight candidate genes. WES of the trio (patient and parents) was performed using the ion proton sequencing system. Data analysis resorted to filtering steps using the GEMINI software revealed a novel silent variant in the choline kinase beta (CHKB) gene. Inspection of sequence alignments ultimately identified the causal variant (CHKB:c.1031+3G>C). This splice site mutation was confirmed using Sanger sequencing and its effect was further evaluated with gene expression analysis. On reassessment of the muscle biopsy, typical abnormal mitochondrial oxidative changes were observed. Mutations in CHKB have been shown to cause phosphatidylcholine deficiency in myofibers, causing a rare form of CMD (only 21 patients reported). Notwithstanding interpretative difficulties that need to be overcome before the integration of WES in the diagnostic workflow, this work corroborates its utility in solving cases from highly heterogeneous groups of diseases, in which conventional diagnostic approaches fail to provide a definitive diagnosis.
SpliceDisease database: linking RNA splicing and disease.
Wang, Juan; Zhang, Jie; Li, Kaibo; Zhao, Wei; Cui, Qinghua
2012-01-01
RNA splicing is an important aspect of gene regulation in many organisms. Splicing of RNA is regulated by complicated mechanisms involving numerous RNA-binding proteins and the intricate network of interactions among them. Mutations in cis-acting splicing elements or its regulatory proteins have been shown to be involved in human diseases. Defects in pre-mRNA splicing process have emerged as a common disease-causing mechanism. Therefore, a database integrating RNA splicing and disease associations would be helpful for understanding not only the RNA splicing but also its contribution to disease. In SpliceDisease database, we manually curated 2337 splicing mutation disease entries involving 303 genes and 370 diseases, which have been supported experimentally in 898 publications. The SpliceDisease database provides information including the change of the nucleotide in the sequence, the location of the mutation on the gene, the reference Pubmed ID and detailed description for the relationship among gene mutations, splicing defects and diseases. We standardized the names of the diseases and genes and provided links for these genes to NCBI and UCSC genome browser for further annotation and genomic sequences. For the location of the mutation, we give direct links of the entry to the respective position/region in the genome browser. The users can freely browse, search and download the data in SpliceDisease at http://cmbi.bjmu.edu.cn/sdisease.
Identification and analysis of pig chimeric mRNAs using RNA sequencing data
2012-01-01
Background Gene fusion is ubiquitous over the course of evolution. It is expected to increase the diversity and complexity of transcriptomes and proteomes through chimeric sequence segments or altered regulation. However, chimeric mRNAs in pigs remain unclear. Here we identified some chimeric mRNAs in pigs and analyzed the expression of them across individuals and breeds using RNA-sequencing data. Results The present study identified 669 putative chimeric mRNAs in pigs, of which 251 chimeric candidates were detected in a set of RNA-sequencing data. The 618 candidates had clear trans-splicing sites, 537 of which obeyed the canonical GU-AG splice rule. Only two putative pig chimera variants whose fusion junction was overlapped with that of a known human chimeric mRNA were found. A set of unique chimeric events were considered middle variances in the expression across individuals and breeds, and revealed non-significant variance between sexes. Furthermore, the genomic region of the 5′ partner gene shares a similar DNA sequence with that of the 3′ partner gene for 458 putative chimeric mRNAs. The 81 of those shared DNA sequences significantly matched the known DNA-binding motifs in the JASPAR CORE database. Four DNA motifs shared in parental genomic regions had significant similarity with known human CTCF binding sites. Conclusions The present study provided detailed information on some pig chimeric mRNAs. We proposed a model that trans-acting factors, such as CTCF, induced the spatial organisation of parental genes to the same transcriptional factory so that parental genes were coordinatively transcribed to give birth to chimeric mRNAs. PMID:22925561
NASA Astrophysics Data System (ADS)
Zhang, Chunxi; Zhang, Zuchen; Song, Jingming; Wu, Chunxiao; Song, Ningfang
2015-03-01
A splicing parameter optimization method to increase the tensile strength of splicing joint between photonic crystal fiber (PCF) and conventional fiber is demonstrated. Based on the splicing recipes provided by splicer or fiber manufacturers, the optimal values of some major splicing parameters are obtained in sequence, and a conspicuous improvement in the mechanical strength of splicing joints between PCFs and conventional fibers is validated through experiments.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sampaio, S.O.; Mei, C.; Butcher, E.C.
The mucosal addressin cell adhesion molecule-1 (MAdCAM-1) is expressed selectively at venular sites of lymphocyte extravasation into mucosal lymphoid tissues and lamina propria, where it directs local lymphocyte trafficking. MAdCAM-1 is a multifunctional type I transmembrane adhesion molecule comprising two distal Ig domains involved in {alpha}4{beta}7 integrin binding, a mucin-like region able to display L-selectin-binding carbohydrates, and a membrane-proximal Ig domain homologous to IgA. We show in this work that the MAdCAM-1 gene is located on chromosome 10 and contains five exons. The signal peptide and each one of the three Ig domains are encoded by a distinct exon, whereasmore » the transmembrane, cytoplasmic tail, and 3{prime}-untranslated region of MAdCAM-1 are combined on a single exon. The mucin-like region and the third Ig domain are encoded together on exon 4. An alternatively spliced MAdCAM-1 mRNA is identified that lacks the mucin/IgA-homologous exon 4-encoded sequences. This short variant of MAdCAM-1 may be specialized to support {alpha}4{beta}7-dependent adhesion strengthening, independent of carbohydrate-presenting function. Sequences 5{prime} of the transcription start site include tandem nuclear factor-KB sites; AP-1, AP-2, and signal peptide-1 binding sites; and an estrogen response element. Our findings reinforce the correspondence between the multidomain structure and versatile functions of this vascular addressin, and suggest an additional level of regulation of carbohydrate-presenting capability, and thus of its importance in lectin-mediated vs. {alpha}4{beta}7-dependent adhesive events in lymphocyte trafficking. 46 refs., 6 figs., 1 tab.« less
Alternative Splicing of a Novel Inducible Exon Diversifies the CASK Guanylate Kinase Domain
Dembowski, Jill A.; An, Ping; Scoulos-Hanson, Maritsa; Yeo, Gene; Han, Joonhee; Fu, Xiang-Dong; Grabowski, Paula J.
2012-01-01
Alternative pre-mRNA splicing has a major impact on cellular functions and development with the potential to fine-tune cellular localization, posttranslational modification, interaction properties, and expression levels of cognate proteins. The plasticity of regulation sets the stage for cells to adjust the relative levels of spliced mRNA isoforms in response to stress or stimulation. As part of an exon profiling analysis of mouse cortical neurons stimulated with high KCl to induce membrane depolarization, we detected a previously unrecognized exon (E24a) of the CASK gene, which encodes for a conserved peptide insertion in the guanylate kinase interaction domain. Comparative sequence analysis shows that E24a appeared selectively in mammalian CASK genes as part of a >3,000 base pair intron insertion. We demonstrate that a combination of a naturally defective 5′ splice site and negative regulation by several splicing factors, including SC35 (SRSF2) and ASF/SF2 (SRSF1), drives E24a skipping in most cell types. However, this negative regulation is countered with an observed increase in E24a inclusion after neuronal stimulation and NMDA receptor signaling. Taken together, E24a is typically a skipped exon, which awakens during neuronal stimulation with the potential to diversify the protein interaction properties of the CASK polypeptide. PMID:23008758
Regulation of Alternative Splicing in Vivo by Overexpression of Antagonistic Splicing Factors
NASA Astrophysics Data System (ADS)
Caceres, Javier F.; Stamm, Stefan; Helfman, David M.; Krainer, Adrian R.
1994-09-01
The opposing effects of SF2/ASF and heterogeneous nuclear ribonucleoprotein (hnRNP) A1 influence alternative splicing in vitro. SF2/ASF or hnRNP A1 complementary DNAs were transiently overexpressed in HeLa cells, and the effect on alternative splicing of several cotransfected reporter genes was measured. Increased expression of SF2/ASF activated proximal 5' splice sites, promoted inclusion of a neuron-specific exon, and prevented abnormal exon skipping. Increased expression of hnRNP A1 activated distal 5' splice sites. Therefore, variations in the intracellular levels of antagonistic splicing factors influence different modes of alternative splicing in vivo and may be a natural mechanism for tissue-specific or developmental regulation of gene expression.
Weirather, Jason L.; Afshar, Pegah Tootoonchi; Clark, Tyson A.; Tseng, Elizabeth; Powers, Linda S.; Underwood, Jason G.; Zabner, Joseph; Korlach, Jonas; Wong, Wing Hung; Au, Kin Fai
2015-01-01
We developed an innovative hybrid sequencing approach, IDP-fusion, to detect fusion genes, determine fusion sites and identify and quantify fusion isoforms. IDP-fusion is the first method to study gene fusion events by integrating Third Generation Sequencing long reads and Second Generation Sequencing short reads. We applied IDP-fusion to PacBio data and Illumina data from the MCF-7 breast cancer cells. Compared with the existing tools, IDP-fusion detects fusion genes at higher precision and a very low false positive rate. The results show that IDP-fusion will be useful for unraveling the complexity of multiple fusion splices and fusion isoforms within tumorigenesis-relevant fusion genes. PMID:26040699
Emerick, Mark C; Stein, Rebecca; Kunze, Robin; McNulty, Megan M; Regan, Melissa R; Hanck, Dorothy A; Agnew, William S
2006-08-01
We describe the regulated transcriptome of CACNA1G, a human gene for T-type Ca(v)3.1 calcium channels that is subject to extensive alternative RNA splicing. Fifteen sites of transcript variation include 2 alternative 5'-UTR promoter sites, 2 alternative 3'-UTR polyadenylation sites, and 11 sites of alternative splicing within the open reading frame. A survey of 1580 fetal and adult human brain full-length complementary DNAs reveals a family of 30 distinct transcripts, including multiple functional forms that vary in expression with development. Statistical analyses of fetal and adult transcript populations reveal patterns of linkages among intramolecular splice site configurations that change dramatically with development. A shift from nearly independent, biased splicing in fetal transcripts to strongly concerted splicing in adult transcripts suggests progressive activation of multiple "programs" of splicing regulation that reorganize molecular structures in differentiating cells. Patch-clamp studies of nine selected variants help relate splicing regulation to permutations of the gating parameters most likely to modify T-channel physiology in expressing neurons. Gating behavior reflects combinatorial interactions between variable domains so that molecular phenotype depends on ensembles of coselected domains, consistent with the observed emergence of concerted splicing during development. We conclude that the structural gene and networks of splicing regulatory factors define an integrated system for the phenotypic variation of Ca(v)3.1 biophysics during nervous system development. Copyright 2006 Wiley-Liss, Inc.
Wang, Haoran; Wang, Mingxiu; Cheng, Qiang
2018-03-08
Detection of complex splice sites (SSs) and polyadenylation sites (PASs) of eukaryotic genes is essential for the elucidation of gene regulatory mechanisms. Transcriptome-wide studies using high-throughput sequencing (HTS) have revealed prevalent alternative splicing (AS) and alternative polyadenylation (APA) in plants. However, small-scale and high-depth HTS aimed at detecting genes or gene families are very few and limited. We explored a convenient and flexible method for profiling SSs and PASs, which combines rapid amplification of 3'-cDNA ends (3'-RACE) and HTS. Fourteen NAC (NAM, ATAF1/2, CUC2) transcription factor genes of Populus trichocarpa were analyzed by 3'-RACE-seq. Based on experimental reproducibility, boundary sequence analysis and reverse transcription PCR (RT-PCR) verification, only canonical SSs were considered to be authentic. Based on stringent criteria, candidate PASs without any internal priming features were chosen as authentic PASs and assumed to be PAS-rich markers. Thirty-four novel canonical SSs, six intronic/internal exons and thirty 3'-UTR PAS-rich markers were revealed by 3'-RACE-seq. Using 3'-RACE and real-time PCR, we confirmed that three APA transcripts ending in/around PAS-rich markers were differentially regulated in response to plant hormones. Our results indicate that 3'-RACE-seq is a robust and cost-effective method to discover SSs and label active regions subjected to APA for genes or gene families. The method is suitable for small-scale AS and APA research in the initial stage.
Defective control of pre–messenger RNA splicing in human disease
Shkreta, Lulzim
2016-01-01
Examples of associations between human disease and defects in pre–messenger RNA splicing/alternative splicing are accumulating. Although many alterations are caused by mutations in splicing signals or regulatory sequence elements, recent studies have noted the disruptive impact of mutated generic spliceosome components and splicing regulatory proteins. This review highlights recent progress in our understanding of how the altered splicing function of RNA-binding proteins contributes to myelodysplastic syndromes, cancer, and neuropathologies. PMID:26728853
Manananggal - a novel viewer for alternative splicing events.
Barann, Matthias; Zimmer, Ralf; Birzele, Fabian
2017-02-21
Alternative splicing is an important cellular mechanism that can be analyzed by RNA sequencing. However, identification of splicing events in an automated fashion is error-prone. Thus, further validation is required to select reliable instances of alternative splicing events (ASEs). There are only few tools specifically designed for interactive inspection of ASEs and available visualization approaches can be significantly improved. Here, we present Manananggal, an application specifically designed for the identification of splicing events in next generation sequencing data. Manananggal includes a web application for visual inspection and a command line tool that allows for ASE detection. We compare the sashimi plots available in the IGV Viewer, the DEXSeq splicing plots and SpliceSeq to the Manananggal interface and discuss the advantages and drawbacks of these tools. We show that sashimi plots (such as those used by the IGV Viewer and SpliceSeq) offer a practical solution for simple ASEs, but also indicate short-comings for highly complex genes. Manananggal is an interactive web application that offers functions specifically tailored to the identification of alternative splicing events that other tools are lacking. The ability to select a subset of isoforms allows an easier interpretation of complex alternative splicing events. In contrast to SpliceSeq and the DEXSeq splicing plot, Manananggal does not obscure the gene structure by showing full transcript models that makes it easier to determine which isoforms are expressed and which are not.
Mubiru, James N; Yang, Alice S; Olsen, Christian; Nayak, Sudhir; Livi, Carolina B; Dick, Edward J; Owston, Michael; Garcia-Forey, Magdalena; Shade, Robert E; Rogers, Jeffrey
2014-01-01
The function of prostate-specific antigen (PSA) is to liquefy the semen coagulum so that the released sperm can fuse with the ovum. Fifteen spliced variants of the PSA gene have been reported in humans, but little is known about alternative splicing in nonhuman primates. Positive selection has been reported in sex- and reproductive-related genes from sea urchins to Drosophila to humans; however, there are few studies of adaptive evolution of the PSA gene. Here, using polymerase chain reaction (PCR) product cloning and sequencing, we study PSA transcript variant heterogeneity in the prostates of chimpanzees (Pan troglodytes), cynomolgus monkeys (Macaca fascicularis), baboons (Papio hamadryas anubis), and African green monkeys (Chlorocebus aethiops). Six PSA variants were identified in the chimpanzee prostate, but only two variants were found in cynomolgus monkeys, baboons, and African green monkeys. In the chimpanzee the full-length transcript is expressed at the same magnitude as the transcripts that retain intron 3. We have found previously unidentified splice variants of the PSA gene, some of which might be linked to disease conditions. Selection on the PSA gene was studied in 11 primate species by computational methods using the sequences reported here for African green monkey, cynomolgus monkey, baboon, and chimpanzee and other sequences available in public databases. A codon-based analysis (dN/dS) of the PSA gene identified potential adaptive evolution at five residue sites (Arg45, Lys70, Gln144, Pro189, and Thr203).
Hamid, Fursham M.; Makeyev, Eugene V.
2014-01-01
Alternative splicing (AS) provides a potent mechanism for increasing protein diversity and modulating gene expression levels. How alternate splice sites are selected by the splicing machinery and how AS is integrated into gene regulation networks remain important questions of eukaryotic biology. Here we report that polypyrimidine tract-binding protein 1 (Ptbp1/PTB/hnRNP-I) controls alternate 5′ and 3′ splice site (5′ss and 3′ss) usage in a large set of mammalian transcripts. A top scoring event identified by our analysis was the choice between competing upstream and downstream 5′ss (u5′ss and d5′ss) in the exon 18 of the Hps1 gene. Hps1 is essential for proper biogenesis of lysosome-related organelles and loss of its function leads to a disease called type 1 Hermansky-Pudlak Syndrome (HPS). We show that Ptbp1 promotes preferential utilization of the u5′ss giving rise to stable mRNAs encoding a full-length Hps1 protein, whereas bias towards d5′ss triggered by Ptbp1 down-regulation generates transcripts susceptible to nonsense-mediated decay (NMD). We further demonstrate that Ptbp1 binds to pyrimidine-rich sequences between the u5′ss and d5′ss and activates the former site rather than repressing the latter. Consistent with this mechanism, u5′ss is intrinsically weaker than d5′ss, with a similar tendency observed for other genes with Ptbp1-induced u5′ss bias. Interestingly, the brain-enriched Ptbp1 paralog Ptbp2/nPTB/brPTB stimulated the u5′ss utilization but with a considerably lower efficiency than Ptbp1. This may account for the tight correlation between Hps1 with Ptbp1 expression levels observed across mammalian tissues. More generally, these data expand our understanding of AS regulation and uncover a post-transcriptional strategy ensuring co-expression of a subordinate gene with its master regulator through an AS-NMD tracking mechanism. PMID:25375251
Larson, Amy; Fair, Benjamin Jung; Pleiss, Jeffrey A
2016-06-01
Pre-mRNA splicing is an essential component of eukaryotic gene expression and is highly conserved from unicellular yeasts to humans. Here, we present the development and implementation of a sequencing-based reverse genetic screen designed to identify nonessential genes that impact pre-mRNA splicing in the fission yeast Schizosaccharomyces pombe, an organism that shares many of the complex features of splicing in higher eukaryotes. Using a custom-designed barcoding scheme, we simultaneously queried ∼3000 mutant strains for their impact on the splicing efficiency of two endogenous pre-mRNAs. A total of 61 nonessential genes were identified whose deletions resulted in defects in pre-mRNA splicing; enriched among these were factors encoding known or predicted components of the spliceosome. Included among the candidates identified here are genes with well-characterized roles in other RNA-processing pathways, including heterochromatic silencing and 3' end processing. Splicing-sensitive microarrays confirm broad splicing defects for many of these factors, revealing novel functional connections between these pathways. Copyright © 2016 Larson et al.
Larson, Amy; Fair, Benjamin Jung; Pleiss, Jeffrey A.
2016-01-01
Pre-mRNA splicing is an essential component of eukaryotic gene expression and is highly conserved from unicellular yeasts to humans. Here, we present the development and implementation of a sequencing-based reverse genetic screen designed to identify nonessential genes that impact pre-mRNA splicing in the fission yeast Schizosaccharomyces pombe, an organism that shares many of the complex features of splicing in higher eukaryotes. Using a custom-designed barcoding scheme, we simultaneously queried ∼3000 mutant strains for their impact on the splicing efficiency of two endogenous pre-mRNAs. A total of 61 nonessential genes were identified whose deletions resulted in defects in pre-mRNA splicing; enriched among these were factors encoding known or predicted components of the spliceosome. Included among the candidates identified here are genes with well-characterized roles in other RNA-processing pathways, including heterochromatic silencing and 3ʹ end processing. Splicing-sensitive microarrays confirm broad splicing defects for many of these factors, revealing novel functional connections between these pathways. PMID:27172183
Li, Niu; Song, Aiyun; Ding, Lixia; Zhu, Hua; Li, Guoqiang; Miao, Yan; Wang, Jian; Li, Benshang; Chen, Jing
2018-07-01
Fanconi anemia (FA) is a rare autosomal recessive or X-linked disorder with highly variable clinical manifestations and an incidence of ∼1 to 5 in 1 million births. To date, 15 bona fide FA genes have been reported to be responsible for the known FA complementation groups and the FANCA gene accounts for almost 60%. In the present study, we report a special Chinese family, which has 2 children with classic FA characteristics. Via 2-step analysis of the whole-exome sequencing data and verification using multiplex ligation-dependent probe amplification test, one child was found to have a novel compound heterozygous mutation of a splicing variant (c.1471-1G>A) and a large intragenic deletion (exons 23-30 del) of the FANCA gene. The other child had the same splicing variant and another novel large deletion (exons 1-18 del) in the FANCA gene. Clone sequencing showed the c.1471-1G>A variant generate an altered transcript with 1 cryptic splice site in intron 15, resulting in a premature termination codon (p.Val490HisfsX6). This study not only shows the complexity of FA molecular diagnosis via comprehensively studying the FA pathogenic genes and the mutational spectrum, but also has significant reference value for the future molecular diagnosis of FA.
Systematic Analysis of Splice-Site-Creating Mutations in Cancer
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jayasinghe, Reyka G.; Cao, Song; Gao, Qingsong
For the past decade, cancer genomic studies have focused on mutations leading to splice-site disruption, overlooking those having splice-creating potential. Here, we applied a bioinformatic tool, MiSplice, for the large-scale discovery of splice-site-creating mutations (SCMs) across 8,656 TCGA tumors. We report 1,964 originally mis-annotated mutations having clear evidence of creating alternative splice junctions. TP53 and GATA3 have 26 and 18 SCMs, respectively, and ATRX has 5 from lower-grade gliomas. Mutations in 11 genes, including PARP1, BRCA1, and BAP1, were experimentally validated for splice-site-creating function. Notably, we found that neoantigens induced by SCMs are likely several folds more immunogenic compared tomore » missense mutations, exemplified by the recurrent GATA3 SCM. Further, high expression of PD-1 and PD-L1 was observed in tumors with SCMs, suggesting candidates for immune blockade therapy. Finally, our work highlights the importance of integrating DNA and RNA data for understanding the functional and the clinical implications of mutations in human diseases.« less
Systematic Analysis of Splice-Site-Creating Mutations in Cancer.
Jayasinghe, Reyka G; Cao, Song; Gao, Qingsong; Wendl, Michael C; Vo, Nam Sy; Reynolds, Sheila M; Zhao, Yanyan; Climente-González, Héctor; Chai, Shengjie; Wang, Fang; Varghese, Rajees; Huang, Mo; Liang, Wen-Wei; Wyczalkowski, Matthew A; Sengupta, Sohini; Li, Zhi; Payne, Samuel H; Fenyö, David; Miner, Jeffrey H; Walter, Matthew J; Vincent, Benjamin; Eyras, Eduardo; Chen, Ken; Shmulevich, Ilya; Chen, Feng; Ding, Li
2018-04-03
For the past decade, cancer genomic studies have focused on mutations leading to splice-site disruption, overlooking those having splice-creating potential. Here, we applied a bioinformatic tool, MiSplice, for the large-scale discovery of splice-site-creating mutations (SCMs) across 8,656 TCGA tumors. We report 1,964 originally mis-annotated mutations having clear evidence of creating alternative splice junctions. TP53 and GATA3 have 26 and 18 SCMs, respectively, and ATRX has 5 from lower-grade gliomas. Mutations in 11 genes, including PARP1, BRCA1, and BAP1, were experimentally validated for splice-site-creating function. Notably, we found that neoantigens induced by SCMs are likely several folds more immunogenic compared to missense mutations, exemplified by the recurrent GATA3 SCM. Further, high expression of PD-1 and PD-L1 was observed in tumors with SCMs, suggesting candidates for immune blockade therapy. Our work highlights the importance of integrating DNA and RNA data for understanding the functional and the clinical implications of mutations in human diseases. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Systematic Analysis of Splice-Site-Creating Mutations in Cancer
Jayasinghe, Reyka G.; Cao, Song; Gao, Qingsong; ...
2018-04-05
For the past decade, cancer genomic studies have focused on mutations leading to splice-site disruption, overlooking those having splice-creating potential. Here, we applied a bioinformatic tool, MiSplice, for the large-scale discovery of splice-site-creating mutations (SCMs) across 8,656 TCGA tumors. We report 1,964 originally mis-annotated mutations having clear evidence of creating alternative splice junctions. TP53 and GATA3 have 26 and 18 SCMs, respectively, and ATRX has 5 from lower-grade gliomas. Mutations in 11 genes, including PARP1, BRCA1, and BAP1, were experimentally validated for splice-site-creating function. Notably, we found that neoantigens induced by SCMs are likely several folds more immunogenic compared tomore » missense mutations, exemplified by the recurrent GATA3 SCM. Further, high expression of PD-1 and PD-L1 was observed in tumors with SCMs, suggesting candidates for immune blockade therapy. Finally, our work highlights the importance of integrating DNA and RNA data for understanding the functional and the clinical implications of mutations in human diseases.« less
Zhang, Yanju; Lameijer, Eric-Wubbo; 't Hoen, Peter A. C.; Ning, Zemin; Slagboom, P. Eline; Ye, Kai
2012-01-01
Motivation: RNA-seq is a powerful technology for the study of transcriptome profiles that uses deep-sequencing technologies. Moreover, it may be used for cellular phenotyping and help establishing the etiology of diseases characterized by abnormal splicing patterns. In RNA-Seq, the exact nature of splicing events is buried in the reads that span exon–exon boundaries. The accurate and efficient mapping of these reads to the reference genome is a major challenge. Results: We developed PASSion, a pattern growth algorithm-based pipeline for splice site detection in paired-end RNA-Seq reads. Comparing the performance of PASSion to three existing RNA-Seq analysis pipelines, TopHat, MapSplice and HMMSplicer, revealed that PASSion is competitive with these packages. Moreover, the performance of PASSion is not affected by read length and coverage. It performs better than the other three approaches when detecting junctions in highly abundant transcripts. PASSion has the ability to detect junctions that do not have known splicing motifs, which cannot be found by the other tools. Of the two public RNA-Seq datasets, PASSion predicted ∼ 137 000 and 173 000 splicing events, of which on average 82 are known junctions annotated in the Ensembl transcript database and 18% are novel. In addition, our package can discover differential and shared splicing patterns among multiple samples. Availability: The code and utilities can be freely downloaded from https://trac.nbic.nl/passion and ftp://ftp.sanger.ac.uk/pub/zn1/passion Contact: y.zhang@lumc.nl; k.ye@lumc.nl Supplementary information: Supplementary data are available at Bioinformatics online. PMID:22219203
Yoshimoto, Rei; Kaida, Daisuke; Furuno, Masaaki; Burroughs, A. Maxwell; Noma, Shohei; Suzuki, Harukazu; Kawamura, Yumi; Hayashizaki, Yoshihide; Mayeda, Akila; Yoshida, Minoru
2017-01-01
Spliceostatin A (SSA) is a methyl ketal derivative of FR901464, a potent antitumor compound isolated from a culture broth of Pseudomonas sp. no. 2663. These compounds selectively bind to the essential spliceosome component SF3b, a subcomplex of the U2 snRNP, to inhibit pre-mRNA splicing. However, the mechanism of SSA's antitumor activity is unknown. It is noteworthy that SSA causes accumulation of a truncated form of the CDK inhibitor protein p27 translated from CDKN1B pre-mRNA, which is involved in SSA-induced cell-cycle arrest. However, it is still unclear whether pre-mRNAs are uniformly exported from the nucleus following SSA treatment. We performed RNA-seq analysis on nuclear and cytoplasmic fractions of SSA-treated cells. Our statistical analyses showed that intron retention is the major consequence of SSA treatment, and a small number of intron-containing pre-mRNAs leak into the cytoplasm. Using a series of reporter plasmids to investigate the roles of intronic sequences in the pre-mRNA leakage, we showed that the strength of the 5′ splice site affects pre-mRNA leakage. Additionally, we found that the level of pre-mRNA leakage is related to transcript length. These results suggest that the strength of the 5′ splice site and the length of the transcripts are determinants of the pre-mRNA leakage induced by SF3b inhibitors. PMID:27754875
A conserved intronic U1 snRNP-binding sequence promotes trans-splicing in Drosophila
Gao, Jun-Li; Fan, Yu-Jie; Wang, Xiu-Ye; Zhang, Yu; Pu, Jia; Li, Liang; Shao, Wei; Zhan, Shuai; Hao, Jianjiang
2015-01-01
Unlike typical cis-splicing, trans-splicing joins exons from two separate transcripts to produce chimeric mRNA and has been detected in most eukaryotes. Trans-splicing in trypanosomes and nematodes has been characterized as a spliced leader RNA-facilitated reaction; in contrast, its mechanism in higher eukaryotes remains unclear. Here we investigate mod(mdg4), a classic trans-spliced gene in Drosophila, and report that two critical RNA sequences in the middle of the last 5′ intron, TSA and TSB, promote trans-splicing of mod(mdg4). In TSA, a 13-nucleotide (nt) core motif is conserved across Drosophila species and is essential and sufficient for trans-splicing, which binds U1 small nuclear RNP (snRNP) through strong base-pairing with U1 snRNA. In TSB, a conserved secondary structure acts as an enhancer. Deletions of TSA and TSB using the CRISPR/Cas9 system result in developmental defects in flies. Although it is not clear how the 5′ intron finds the 3′ introns, compensatory changes in U1 snRNA rescue trans-splicing of TSA mutants, demonstrating that U1 recruitment is critical to promote trans-splicing in vivo. Furthermore, TSA core-like motifs are found in many other trans-spliced Drosophila genes, including lola. These findings represent a novel mechanism of trans-splicing, in which RNA motifs in the 5′ intron are sufficient to bring separate transcripts into close proximity to promote trans-splicing. PMID:25838544
Splicing fidelity: DEAD/H-box ATPases as molecular clocks.
Koodathingal, Prakash; Staley, Jonathan P
2013-07-01
The spliceosome discriminates against suboptimal substrates, both during assembly and catalysis, thereby enhancing specificity during pre-mRNA splicing. Central to such fidelity mechanisms are a conserved subset of the DEAD- and DEAH-box ATPases, which belong to a superfamily of proteins that mediate RNP rearrangements in almost all RNA-dependent processes in the cell. Through an investigation of the mechanisms contributing to the specificity of 5' splice site cleavage, two related reports, one from our lab and the other from the Cheng lab, have provided insights into fidelity mechanisms utilized by the spliceosome. In our work, we found evidence for a kinetic proofreading mechanism in splicing in which the DEAH-box ATPase Prp16 discriminates against substrates undergoing slow 5' splice site cleavage. Additionally, our study revealed that discriminated substrates are discarded through a general spliceosome disassembly pathway, mediated by another DEAH-box ATPase Prp43. In their work, Tseng et al. described the underlying molecular events through which Prp16 discriminates against a splicing substrate during 5' splice site cleavage. Here, we present a synthesis of these two studies and, additionally, provide the first biochemical evidence for discrimination of a suboptimal splicing substrate just prior to 5' splice site cleavage. Together, these findings support a general mechanism for a ubiquitous superfamily of ATPases in enhancing specificity during RNA-dependent processes in the cell.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Miller, S.I.; Wirth, D.F.
1988-06-01
The 5' ends of Leishmania mRNAs contain an identical 35-nucleotide sequence termed the spliced leader (SL) or 5' mini-exon. The SL sequence is at the 5' end of an 85-nucleotide primary transcript that contains a consensus eucaryotic 5' intron-exon splice junction immediately 3' to the SL. The SL is added to protein-coding genes immediately 3' to a consensus eucaryotic 3' intron-exon splice junction. The authors' previous work demonstrated possible intermediates in discontinuous mRNA processing that contain the 50 nucleotides of the SL primary transcript 3' to the SL, the SL intron sequence (SLIS). These RNAs have a 5' terminus atmore » the splice junction of the SL and the SLIS. The authors examined a Leishmania nuclear extract for these RNAs in ribonucleoprotein (RNP) particles. Density centrifugation analysis showed that the SL RNA is predominately in RNP complexes at 60S, while the SLIS-containing RNAs are in complexes at 40S. They also demonstrated that the SLIS can be released from polyadenylated RNA by incubation with a HeLa cell extract containing debranching enzymatic activity. These data suggested that Leishmania enriettii mRNAs are assembled by bimolecular or trans splicing as has been recently demonstrated for Trypanosoma brucei. Furthermore, they determined the partial sequence of the Leishmania U2 equivalent RNA and demonstrated that it cosediments with the SL RNA at 60S in a nuclear extract. These RNP particles may be analogous to so-called spliceosomes that have been demonstrated in other systems.« less
NASA Astrophysics Data System (ADS)
Pollastro, Pasquale; Rampone, Salvatore
The aim of this work is to describe a cleaning procedure of GenBank data, producing material to train and to assess the prediction accuracy of computational approaches for gene characterization. A procedure (GenBank2HS3D) has been defined, producing a dataset (HS3D - Homo Sapiens Splice Sites Dataset) of Homo Sapiens Splice regions extracted from GenBank (Rel.123 at this time). It selects, from the complete GenBank Primate Division, entries of Human Nuclear DNA according with several assessed criteria; then it extracts exons and introns from these entries (actually 4523 + 3802). Donor and acceptor sites are then extracted as windows of 140 nucleotides around each splice site (3799 + 3799). After discarding windows not including canonical GT-AG junctions (65 + 74), including insufficient data (not enough material for a 140 nucleotide window) (686 + 589), including not AGCT bases (29 + 30), and redundant (218 + 226), the remaining windows (2796 + 2880) are reported in the dataset. Finally, windows of false splice sites are selected by searching canonical GT-AG pairs in not splicing positions (271 937 + 332 296). The false sites in a range +/- 60 from a true splice site are marked as proximal. HS3D, release 1.2 at this time, is available at the Web server of the University of Sannio: http://www.sci.unisannio.it/docenti/rampone/.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hagiwara, Yoko; Nishio, Hisahide; Kitoh, Yoshihiko
1994-01-01
The mutations in one-third of Duchenne and Becker muscular dystrophy patients remain unknown, as they do not involve gross rearrangements of the dystrophin gene. The authors now report a defect in the splicing of precursor mRNA (pre-mRNA), resulting from a maternally inherited mutation of the dystrophin gene in a patient with Becker muscular dystrophy. This defect results from a G-to-T transversion at the terminal nucleotide of exon 13, within the 5[prime] splice site of intron 13, and causes complete skipping of exon 13 during processing of dystrophin pre-mRNA. The predicted polypeptide encoded by the aberrant mRNA is a truncated dystrophinmore » lacking 40 amino acids from the amino-proximal end of the rod domain. This is the first report of an intraexon point mutation that completely inactivates a 5[prime] splice donor site in dystrophin pre-mRNA. Analysis of the genomic context of the G[sup [minus]1]-to-T mutation at the 5[prime] splice site supports the exon-definition model of pre-mRNA splicing and contributes to the understanding of splice-site selection. 48 refs., 5 figs.« less
King, Benjamin L; Shi, Ling Fang; Kao, Peter; Clusin, William T
2016-03-01
Elasmobranchs detect small potentials using excitable cells of the ampulla of Lorenzini which have calcium-activated K(+) channels, first described in 1974. A distinctive feature of the outward current in voltage clamped ampullae is its apparent insensitivity to voltage. The sequence of a BK channel α isoform expressed in the ampulla of the skate was characterized. A signal peptide is present at the beginning of the gene. When compared to human isoform 1 (the canonical sequence), the largest difference was absence of a 59 amino acid region from the S8-S9 intra-cellular linker that contains the strex regulatory domain. The ampulla isoform was also compared with the isoform predicted in late skate embryos where strex was also absent. The BK voltage sensors were conserved in both skate isoforms. Differences between the skate and human BK channel included alternative splicing. Alternative splicing occurs at seven previously defined sites that are characteristic for BK channels in general and hair cells in particular. Skate BK sequences were highly similar to the Australian ghost shark and several other vertebrate species. Based on alignment of known BK sequences with the skate genome and transcriptome, there are at least two isoforms of Kcnma1α expressed in the skate. One of the β subunits (β4), which is known to decrease voltage sensitivity, was also identified in the skate genome and transcriptome and in the ampulla. These studies advance our knowledge of BK channels and suggest further studies in the ampulla and other excitable tissues. Copyright © 2015 Elsevier B.V. All rights reserved.
Seim, Inge; Jeffery, Penny L; Thomas, Patrick B; Walpole, Carina M; Maugham, Michelle; Fung, Jenny N T; Yap, Pei-Yi; O'Keeffe, Angela J; Lai, John; Whiteside, Eliza J; Herington, Adrian C; Chopin, Lisa K
2016-06-01
The peptide hormone ghrelin is a potent orexigen produced predominantly in the stomach. It has a number of other biological actions, including roles in appetite stimulation, energy balance, the stimulation of growth hormone release and the regulation of cell proliferation. Recently, several ghrelin gene splice variants have been described. Here, we attempted to identify conserved alternative splicing of the ghrelin gene by cross-species sequence comparisons. We identified a novel human exon 2-deleted variant and provide preliminary evidence that this splice variant and in1-ghrelin encode a C-terminally truncated form of the ghrelin peptide, termed minighrelin. These variants are expressed in humans and mice, demonstrating conservation of alternative splicing spanning 90 million years. Minighrelin appears to have similar actions to full-length ghrelin, as treatment with exogenous minighrelin peptide stimulates appetite and feeding in mice. Forced expression of the exon 2-deleted preproghrelin variant mirrors the effect of the canonical preproghrelin, stimulating cell proliferation and migration in the PC3 prostate cancer cell line. This is the first study to characterise an exon 2-deleted preproghrelin variant and to demonstrate sequence conservation of ghrelin gene-derived splice variants that encode a truncated ghrelin peptide. This adds further impetus for studies into the alternative splicing of the ghrelin gene and the function of novel ghrelin peptides in vertebrates.
Lisboa, Bianca Cristina Garcia; Machado, Tamara da Rocha; Pimenta, Daniel Carvalho; Han, Sang Won
2007-02-01
Human cytidine deaminase (HCD) catalyzes the deamination of cytidine or deoxycytidine to uridine or deoxyuridine, respectively. The genomic sequence of HCD is formed by 31 kb with 4 exons and several alternative splicing signals, but an alternative form of HCD has yet to be reported. Here we describe the cloning and characterization of a small form of HCD, HSCD, and it is likely to be a product of alternative splicing of HCD. The alignment of DNA sequences shows that the HSCD matches HCD in 2 parts, except for a deletion of 170 bp. Based on the HCD genome organization, exons 1 and 4 should be joined and all sequences of introns and exons 2 and 3 should be deleted by splicing. This alternative splicing shifted the translation of the reading frame from the point of splicing. The estimated molecular mass is 9.8 kDa, and this value was confirmed by Western blot and mass spectroscopy after expressing the gene fused with glutathionine-S-transferase in the pGEX vector. The deletion and shift of the reading frame caused a loss of HCD activity, which was confirmed by enzyme assay and also with NIH3T3 cells modified to express HSCD and challenged against cytosine arabinoside. In this work we describe the identification and characterization of HSCD, which is the product of alternative splicing of the HCD gene.
SEQassembly: A Practical Tools Program for Coding Sequences Splicing
NASA Astrophysics Data System (ADS)
Lee, Hongbin; Yang, Hang; Fu, Lei; Qin, Long; Li, Huili; He, Feng; Wang, Bo; Wu, Xiaoming
CDS (Coding Sequences) is a portion of mRNA sequences, which are composed by a number of exon sequence segments. The construction of CDS sequence is important for profound genetic analysis such as genotyping. A program in MATLAB environment is presented, which can process batch of samples sequences into code segments under the guide of reference exon models, and splice these code segments of same sample source into CDS according to the exon order in queue file. This program is useful in transcriptional polymorphism detection and gene function study.
Cytochrome C oxydase deficiency: SURF1 gene investigation in patients with Leigh syndrome.
Maalej, Marwa; Kammoun, Thouraya; Alila-Fersi, Olfa; Kharrat, Marwa; Ammar, Marwa; Felhi, Rahma; Mkaouar-Rebai, Emna; Keskes, Leila; Hachicha, Mongia; Fakhfakh, Faiza
2018-03-18
Leigh syndrome (LS) is a rare progressive neurodegenerative disorder occurring in infancy. The most common clinical signs reported in LS are growth retardation, optic atrophy, ataxia, psychomotor retardation, dystonia, hypotonia, seizures and respiratory disorders. The paper reported a manifestation of 3 Tunisian patients presented with LS syndrome. The aim of this study is the MT[HYPHEN]ATP6 and SURF1 gene screening in Tunisian patients affected with classical Leigh syndrome and the computational investigation of the effect of detected mutations on its structure and functions by clinical and bioinformatics analyses. After clinical investigations, three Tunisian patients were tested for mutations in both MT-ATP6 and SURF1 genes by direct sequencing followed by in silico analyses to predict the effects of sequence variation. The result of mutational analysis revealed the absence of mitochondrial mutations in MT-ATP6 gene and the presence of a known homozygous splice site mutation c.516-517delAG in sibling patients added to the presence of a novel double het mutations in LS patient (c.752-18 A > C/c. c.751 + 16G > A). In silico analyses of theses intronic variations showed that it could alters splicing processes as well as SURF1 protein translation. Leigh syndrome (LS) is a rare progressive neurodegenerative disorder occurring in infancy. The most common clinical signs reported in LS are growth retardation, optic atrophy, ataxia, psychomotor retardation, dystonia, hypotonia, seizures and respiratory disorders. The paper reported a manifestation of 3 Tunisian patients presented with LS syndrome. The aim of this study is MT-ATP6 and SURF1 genes screening in Tunisian patients affected with classical Leigh syndrome and the computational investigation of the effect of detected mutations on its structure and functions. After clinical investigations, three Tunisian patients were tested for mutations in both MT-ATP6 and SURF1 genes by direct sequencing followed by in silico analysis to predict the effects of sequence variation. The result of mutational analysis revealed the absence of mitochondrial mutations in MT-ATP6 gene and the presence of a known homozygous splice site mutation c.516-517delAG in sibling patients added to the presence of a novel double het mutations in LS patient (c.752-18 A>C/ c.751+16G>A). In silico analysis of theses intronic vaiations showed that it could alters splicing processes as well as SURF1 protein translation. Copyright © 2018 Elsevier Inc. All rights reserved.
Jyotsana, Nidhi; Heuser, Michael
2018-02-01
Mutations in genes associated with splicing have been found in hematologic malignancies, but also in solid cancers. Aberrant cancer specific RNA splicing either results from mutations or misexpression of the spliceosome genes directly, or from mutations in splice sites of oncogenes or tumor suppressors. Areas covered: In this review, we present molecular targets of aberrant splicing in various malignancies, information on existing and emerging therapeutics against such targets, and strategies for future drug development. Expert opinion: Alternative splicing is an important mechanism that controls gene expression, and hence pharmacologic and genetic control of aberrant alternative RNA splicing has been proposed as a potential therapy in cancer. To identify and validate aberrant RNA splicing patterns as therapeutic targets we need to (1) characterize the most common genetic aberrations of the spliceosome and of splice sites, (2) understand the dysregulated downstream pathways and (3) exploit in-vivo disease models of aberrant splicing. Antisense oligonucleotides show promising activity, but will benefit from improved delivery tools. Inhibitors of mutated splicing factors require improved specificity, as alternative and aberrant splicing are often intertwined like two sides of the same coin. In summary, targeting aberrant splicing is an early but emerging field in cancer treatment.
The GRK4 subfamily of G protein-coupled receptor kinases. Alternative splicing, gene organization, and sequence conservation.
Premont RT, Macrae AD, Aparicio SA, Kendall HE, Welch JE, Lefkowitz RJ.
Department of Medicine, Howard Hughes Medical Institute, Duke Univer...
Jamroz, E; Paprocka, J; Sokół, M; Popowska, E; Ciara, E
2013-01-01
Ornithine transcarbamylase (OTC) deficiency, an X-linked, semidominant disorder, is the most common inherited de-fect in ureagenesis, resulting in hyperammonaemia type II. The OTC gene, localised on chromosome X, has been mapp-ed to band Xp21.1, proximate to the Duchenne muscular dystrophy (DMD) gene. More than 350 different mutations, including missense, nonsense, splice-site changes, small de-letions or insertions and gross deletions, have been describ-ed so far. Almost all mutations in consensus splicing sites confer a neonatal phenotype. Most mutations in the OTC gene are 'private' and are distributed throughout the gene with a paucity of mutation in the sequence encoding the leader peptide (exon 1 and beginning of exon 2) and in exon 7. They have familial origin or occur de novo. Even with sequencing of the entire reading frame and exon/intron boundaries, only about 80% of the mutations are detected in patients with proven OTC deficiency. The remainder probably occur within the introns or in regulatory domains. The authors present a 4-year-old boy with the unreported missense mutation c.802A>G. The nucleotide transition leads to amino acid substitution Met to Val at codon 268 of the OTC protein.
Ousterout, David G; Kabadi, Ami M; Thakore, Pratiksha I; Perez-Pinera, Pablo; Brown, Matthew T; Majoros, William H; Reddy, Timothy E; Gersbach, Charles A
2015-01-01
Duchenne muscular dystrophy (DMD) is caused by genetic mutations that result in the absence of dystrophin protein expression. Oligonucleotide-induced exon skipping can restore the dystrophin reading frame and protein production. However, this requires continuous drug administration and may not generate complete skipping of the targeted exon. In this study, we apply genome editing with zinc finger nucleases (ZFNs) to permanently remove essential splicing sequences in exon 51 of the dystrophin gene and thereby exclude exon 51 from the resulting dystrophin transcript. This approach can restore the dystrophin reading frame in ~13% of DMD patient mutations. Transfection of two ZFNs targeted to sites flanking the exon 51 splice acceptor into DMD patient myoblasts led to deletion of this genomic sequence. A clonal population was isolated with this deletion and following differentiation we confirmed loss of exon 51 from the dystrophin mRNA transcript and restoration of dystrophin protein expression. Furthermore, transplantation of corrected cells into immunodeficient mice resulted in human dystrophin expression localized to the sarcolemmal membrane. Finally, we quantified ZFN toxicity in human cells and mutagenesis at predicted off-target sites. This study demonstrates a powerful method to restore the dystrophin reading frame and protein expression by permanently deleting exons. PMID:25492562
Control of alternative splicing by forskolin through hnRNP K during neuronal differentiation.
Cao, Wenguang; Razanau, Aleh; Feng, Dairong; Lobo, Vincent G; Xie, Jiuyong
2012-09-01
The molecular basis of cell signal-regulated alternative splicing at the 3' splice site remains largely unknown. We isolated a protein kinase A-responsive ribonucleic acid (RNA) element from a 3' splice site of the synaptosomal-associated protein 25 (Snap25) gene for forskolin-inhibited splicing during neuronal differentiation of rat pheochromocytoma PC12 cells. The element binds specifically to heterogeneous nuclear ribonucleo protein (hnRNP) K in a phosphatase-sensitive way, which directly competes with the U2 auxiliary factor U2AF65, an essential component of early spliceosomes. Transcripts with similarly localized hnRNP K target motifs upstream of alternative exons are enriched in genes often associated with neurological diseases. We show that such motifs upstream of the Runx1 exon 6 also bind hnRNP K, and importantly, hnRNP K is required for forskolin-induced repression of the exon. Interestingly, this exon encodes the peptide domain that determines the switch of the transcriptional repressor/activator activity of Runx1, a change known to be critical in specifying neuron lineages. Consistent with an important role of the target genes in neurons, knocking down hnRNP K severely disrupts forskolin-induced neurite growth. Thus, through hnRNP K, the neuronal differentiation stimulus forskolin targets a critical 3' splice site component of the splicing machinery to control alternative splicing of crucial genes. This also provides a regulated direct competitor of U2AF65 for cell signal control of 3' splice site usage.
Behlouli, Asma; Bonnet, Crystel; Abdi, Samia; Hasbellaoui, Mokhtar; Boudjenah, Farid; Hardelin, Jean-Pierre; Louha, Malek; Makrelouf, Mohamed; Ammar-Khodja, Fatima; Zenati, Akila; Petit, Christine
2016-08-01
Congenital deafness is certainly one of the most common monogenic diseases in humans, but it is also one of the most genetically heterogeneous, which makes molecular diagnosis challenging in most cases. Whole-exome sequencing in two out of three Algerian siblings affected by recessively-inherited, moderate to severe sensorineural deafness allowed us to identify a novel splice donor site mutation (c.5272+1G > A) in the gene encoding α-tectorin, a major component of the cochlear tectorial membrane. The mutation was present at the homozygous state in the three affected siblings, and at the heterozygous state in their unaffected, consanguineous parents. To our knowledge, this is the first reported TECTA mutation leading to the DFNB21 form of hearing impairment among Maghrebian individuals suffering from congenital hearing impairment, which further illustrates the diversity of the genes involved in congenital deafness in the Maghreb. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Splicing-Related Features of Introns Serve to Propel Evolution
Luo, Yuping; Li, Chun; Gong, Xi; Wang, Yanlu; Zhang, Kunshan; Cui, Yaru; Sun, Yi Eve; Li, Siguang
2013-01-01
The role of spliceosomal intronic structures played in evolution has only begun to be elucidated. Comparative genomic analyses of fungal snoRNA sequences, which are often contained within introns and/or exons, revealed that about one-third of snoRNA-associated introns in three major snoRNA gene clusters manifested polymorphisms, likely resulting from intron loss and gain events during fungi evolution. Genomic deletions can clearly be observed as one mechanism underlying intron and exon loss, as well as generation of complex introns where several introns lie in juxtaposition without intercalating exons. Strikingly, by tracking conserved snoRNAs in introns, we found that some introns had moved from one position to another by excision from donor sites and insertion into target sties elsewhere in the genome without needing transposon structures. This study revealed the origin of many newly gained introns. Moreover, our analyses suggested that intron-containing sequences were more prone to sustainable structural changes than DNA sequences without introns due to intron's ability to jump within the genome via unknown mechanisms. We propose that splicing-related structural features of introns serve as an additional motor to propel evolution. PMID:23516505
Thermodynamic Modeling of Donor Splice Site Recognition in pre-mRNA
NASA Astrophysics Data System (ADS)
Aalberts, Daniel P.; Garland, Jeffrey A.
2004-03-01
When eukaryotic genes are edited by the spliceosome, the first step in intron recognition is the binding of a U1 snRNA with the donor (5') splice site. We model this interaction thermodynamically to identify splice sites. Applied to a set of 65 annotated genes, our Finding with Binding method achieves a significant separation between real and false sites. Analyzing binding patterns allows us to discard a large number of decoy sites. Our results improve statistics-based methods for donor site recognition, demonstrating the promise of physical modeling to find functional elements in the genome.
Thermodynamic modeling of donor splice site recognition in pre-mRNA
NASA Astrophysics Data System (ADS)
Garland, Jeffrey A.; Aalberts, Daniel P.
2004-04-01
When eukaryotic genes are edited by the spliceosome, the first step in intron recognition is the binding of a U1 small nuclear RNA with the donor ( 5' ) splice site. We model this interaction thermodynamically to identify splice sites. Applied to a set of 65 annotated genes, our “finding with binding” method achieves a significant separation between real and false sites. Analyzing binding patterns allows us to discard a large number of decoy sites. Our results improve statistics-based methods for donor site recognition, demonstrating the promise of physical modeling to find functional elements in the genome.
Millard, T P; Ashton, G H S; Kondeatis, E; Vaughan, R W; Hughes, G R V; Khamashta, M A; Hawk, J L M; McGregor, J M; McGrath, J A
2002-02-01
The Ro 60 kDa protein (Ro60 or SSA2) is the major component of the Ro ribonucleoprotein (Ro RNP) complex, to which an immune response is a specific feature of several autoimmune diseases. The genomic organization and any sequence variation within the DNA encoding Ro60 are unknown. To characterize the Ro60 gene structure and to assess whether any sequence alterations might be associated with serum anti-Ro antibody in subacute cutaneous lupus erythematosus (SCLE), thus potentially providing new insight into disease pathogenesis. The cDNA sequence for Ro60 was obtained from the NCBI database and used for a BLAST search for a clone containing the entire genomic sequence. The intron-exon borders were confirmed by designing intronic primer pairs to flank each exon, which were then used to amplify genomic DNA for automated sequencing from 36 caucasian patients with SCLE (anti-Ro positive) and 49 with discoid LE (DLE, anti-Ro negative), in addition to 36 healthy caucasian controls. Heteroduplex analysis of polymerase chain reaction (PCR) products from patients and controls spanning all Ro60 exons (1-8) revealed a common bandshift in the PCR products spanning exon 7. Sequencing of the corresponding PCR products demonstrated an A > G substitution at nucleotide position 1318-7, within the consensus acceptor splice site of exon 7 (GenBank XM001901). The allele frequencies were major allele A (0.71) and minor allele G (0.29) in 72 control chromosomes, with no significant differences found between SCLE patients, DLE patients and controls. The genomic organization of the DNA encoding the Ro60 protein is described, including a common polymorphism within the consensus acceptor splice site of exon 7. Our delineation of a strategy for the genomic amplification of Ro60 forms a basis for further examination of the pathological functions of the Ro RNP in autoimmune disease.
Li, Xiaoze; Johansson, Cecilia; Cardoso Palacios, Carlos; Mossberg, Anki; Dhanjal, Soniya; Bergvall, Monika; Schwartz, Stefan
2013-01-01
The most commonly used 3′-splice site on the human papillomavirus type 16 (HPV-16) genome named SA3358 is used to produce HPV-16 early mRNAs encoding E4, E5, E6 and E7, and late mRNAs encoding L1 and L2. We have previously shown that SA3358 is suboptimal and is totally dependent on a downstream splicing enhancer containingmultiple potential ASF/SF2 binding sites. Here weshow that only one of the predicted ASF/SF2 sites accounts for the majority of the enhancer activity. We demonstrate that single nucleotide substitutions in this predicted ASF/SF2 site impair enhancer function and that this correlates with less efficient binding to ASF/SF2 in vitro. We provide evidence that HPV-16 mRNAs that arespliced to SA3358 interact with ASF/SF2 in living cells. In addition,mutational inactivation of the ASF/SF2 site weakened the enhancer at SA3358 in episomal forms of the HPV-16 genome, indicating that the enhancer is active in the context of the full HPV-16 genome.This resulted in induction of HPV-16 late gene expression as a result of competition from late splice site SA5639. Furthermore, inactivation of the ASF/SF2 site of the SA3358 splicing enhancer reduced the ability of E6- and E7-encoding HPV-16 plasmids to increase the life span of primary keratinocytes in vitro, demonstrating arequirement for an intact splicing enhancer of SA3358 forefficient production of the E6 and E7 mRNAs. These results link the strength of the HPV-16 SA3358 splicing enhancer to expression of E6 and E7 and to the pathogenic properties of HPV-16. PMID:24039800
On the path to genetic novelties: insights from programmed DNA elimination and RNA splicing.
Catania, Francesco; Schmitz, Jürgen
2015-01-01
Understanding how genetic novelties arise is a central goal of evolutionary biology. To this end, programmed DNA elimination and RNA splicing deserve special consideration. While programmed DNA elimination reshapes genomes by eliminating chromatin during organismal development, RNA splicing rearranges genetic messages by removing intronic regions during transcription. Small RNAs help to mediate this class of sequence reorganization, which is not error-free. It is this imperfection that makes programmed DNA elimination and RNA splicing excellent candidates for generating evolutionary novelties. Leveraging a number of these two processes' mechanistic and evolutionary properties, which have been uncovered over the past years, we present recently proposed models and empirical evidence for how splicing can shape the structure of protein-coding genes in eukaryotes. We also chronicle a number of intriguing similarities between the processes of programmed DNA elimination and RNA splicing, and highlight the role that the variation in the population-genetic environment may play in shaping their target sequences. © 2015 Wiley Periodicals, Inc.
The RNA Splicing Response to DNA Damage.
Shkreta, Lulzim; Chabot, Benoit
2015-10-29
The number of factors known to participate in the DNA damage response (DDR) has expanded considerably in recent years to include splicing and alternative splicing factors. While the binding of splicing proteins and ribonucleoprotein complexes to nascent transcripts prevents genomic instability by deterring the formation of RNA/DNA duplexes, splicing factors are also recruited to, or removed from, sites of DNA damage. The first steps of the DDR promote the post-translational modification of splicing factors to affect their localization and activity, while more downstream DDR events alter their expression. Although descriptions of molecular mechanisms remain limited, an emerging trend is that DNA damage disrupts the coupling of constitutive and alternative splicing with the transcription of genes involved in DNA repair, cell-cycle control and apoptosis. A better understanding of how changes in splice site selection are integrated into the DDR may provide new avenues to combat cancer and delay aging.
The RNA Splicing Response to DNA Damage
Shkreta, Lulzim; Chabot, Benoit
2015-01-01
The number of factors known to participate in the DNA damage response (DDR) has expanded considerably in recent years to include splicing and alternative splicing factors. While the binding of splicing proteins and ribonucleoprotein complexes to nascent transcripts prevents genomic instability by deterring the formation of RNA/DNA duplexes, splicing factors are also recruited to, or removed from, sites of DNA damage. The first steps of the DDR promote the post-translational modification of splicing factors to affect their localization and activity, while more downstream DDR events alter their expression. Although descriptions of molecular mechanisms remain limited, an emerging trend is that DNA damage disrupts the coupling of constitutive and alternative splicing with the transcription of genes involved in DNA repair, cell-cycle control and apoptosis. A better understanding of how changes in splice site selection are integrated into the DDR may provide new avenues to combat cancer and delay aging. PMID:26529031
QUANTIFYING ALTERNATIVE SPLICING FROM PAIRED-END RNA-SEQUENCING DATA.
Rossell, David; Stephan-Otto Attolini, Camille; Kroiss, Manuel; Stöcker, Almond
2014-03-01
RNA-sequencing has revolutionized biomedical research and, in particular, our ability to study gene alternative splicing. The problem has important implications for human health, as alternative splicing may be involved in malfunctions at the cellular level and multiple diseases. However, the high-dimensional nature of the data and the existence of experimental biases pose serious data analysis challenges. We find that the standard data summaries used to study alternative splicing are severely limited, as they ignore a substantial amount of valuable information. Current data analysis methods are based on such summaries and are hence sub-optimal. Further, they have limited flexibility in accounting for technical biases. We propose novel data summaries and a Bayesian modeling framework that overcome these limitations and determine biases in a non-parametric, highly flexible manner. These summaries adapt naturally to the rapid improvements in sequencing technology. We provide efficient point estimates and uncertainty assessments. The approach allows to study alternative splicing patterns for individual samples and can also be the basis for downstream analyses. We found a several fold improvement in estimation mean square error compared popular approaches in simulations, and substantially higher consistency between replicates in experimental data. Our findings indicate the need for adjusting the routine summarization and analysis of alternative splicing RNA-seq studies. We provide a software implementation in the R package casper.
Evaluation of IFITM3 rs12252 Association With Severe Pediatric Influenza Infection.
Randolph, Adrienne G; Yip, Wai-Ki; Allen, Emma Kaitlynn; Rosenberger, Carrie M; Agan, Anna A; Ash, Stephanie A; Zhang, Yu; Bhangale, Tushar R; Finkelstein, David; Cvijanovich, Natalie Z; Mourani, Peter M; Hall, Mark W; Su, Helen C; Thomas, Paul G
2017-07-01
Interferon-induced transmembrane protein 3 (IFITM3) restricts endocytic fusion of influenza virus. IFITM3 rs12252_C, a putative alternate splice site, has been associated with influenza severity in adults. IFITM3 has not been evaluated in pediatric influenza. The Pediatric Influenza (PICFLU) study enrolled children with suspected influenza infection across 38 pediatric intensive care units during November 2008 to April 2016. IFITM3 was sequenced in patients and parents were genotyped for specific variants for family-based association testing. rs12252 was genotyped in 54 African-American pediatric outpatients with influenza (FLU09), included in the population-based comparisons with 1000 genomes. Splice site analysis of rs12252_C was performed using PICFLU and FLU09 patient RNA. In PICFLU, 358 children had influenza infection. We identified 22 rs12252_C homozygotes in 185 white non-Hispanic children. rs12252_C was not associated with influenza infection in population or family-based analyses. We did not identify the Δ21 IFITM3 isoform in RNAseq data. The rs12252 genotype was not associated with IFITM3 expression levels, nor with critical illness severity. No novel rare IFITM3 functional variants were identified. rs12252 was not associated with susceptibility to influenza-related critical illness in children or with critical illness severity. Our data also do not support it being a splice site. © The Author 2017. Published by Oxford University Press for the Infectious Diseases Society of America. All rights reserved. For permissions, e-mail journals.permissions@oup.com.
Mutations in the Promoter Region of the Aldolase B Gene that cause Hereditary Fructose Intolerance
Coffee, Erin M.; Tolan, Dean R.
2010-01-01
SUMMARY Hereditary fructose intolerance (HFI) is a potentially fatal inherited metabolic disease caused by a deficiency of aldolase B activity in the liver and kidney. Over 40 disease-causing mutations are known in the protein-coding region of ALDOB. Mutations upstream of the protein-coding portion of ALDOB are reported here for the first time. DNA sequence analysis of 61 HFI patients revealed single base mutations in the promoter, intronic enhancer, and the first exon, which is entirely untranslated. One mutation, g.–132G>A, is located within the promoter at an evolutionarily conserved nucleotide within a transcription factor-binding site. A second mutation, IVS1+1G>C, is at the donor splice site of the first exon. In vitro electrophoretic mobility shift assays show a decrease in nuclear extract-protein binding at the g.–132G>A mutant site. The promoter mutation results in decreased transcription using luciferase reporter plasmids. Analysis of cDNA from cells transfected with plasmids harboring the IVS1+1G>C mutation results in aberrant splicing leading to complete retention of the first intron (~ 5 kb). The IVS1+1G>C splicing mutation results in loss of luciferase activity from a reporter plasmid. These novel mutations in ALDOB represent 2% of alleles in American HFI patients, with IVS1+1G>C representing a significantly higher allele frequency (6%) among HFI patients of Hispanic and African-American ethnicity. PMID:20882353
Meurs, Kathryn M; Lahmers, Sunshine; Keene, Bruce W; White, Stephen N; Oyama, Mark A; Mauceli, Evan; Lindblad-Toh, Kerstin
2012-08-01
Familial dilated cardiomyopathy is a primary myocardial disease that can result in the development of congestive heart failure and sudden cardiac death. Spontaneous animal models of familial dilated cardiomyopathy exist and the Doberman pinscher dog is one of the most commonly reported canine breeds. The objective of this study was to evaluate familial dilated cardiomyopathy in the Doberman pinscher dog using a genome-wide association study for a genetic alteration(s) associated with the development of this disease in this canine model. Genome-wide association analysis identified an area of statistical significance on canine chromosome 14 (p(raw) = 9.999e-05 corrected for genome-wide significance), fine-mapping of additional SNPs flanking this region localized a signal to 23,774,190-23,781,919 (p = 0.001) and DNA sequencing identified a 16-base pair deletion in the 5' donor splice site of intron 10 of the pyruvate dehydrogenase kinase 4 gene in affected dogs (p < 0.0001). Electron microscopy of myocardium from affected dogs demonstrated disorganization of the Z line, mild to moderate T tubule and sarcoplasmic reticulum dilation, marked pleomorphic mitochondrial alterations with megamitochondria, scattered mitochondria with whorling and vacuolization and mild aggregates of lipofuscin granules. In conclusion, we report the identification of a splice site deletion in the PDK4 gene that is associated with the development of familial dilated cardiomyopathy in the Doberman pinscher dog.
New mutations in the NHS gene in Nance-Horan Syndrome families from the Netherlands.
Florijn, Ralph J; Loves, Willem; Maillette de Buy Wenniger-Prick, Liesbeth J J M; Mannens, Marcel M A M; Tijmes, Nel; Brooks, Simon P; Hardcastle, Alison J; Bergen, Arthur A B
2006-09-01
Mutations in the NHS gene cause Nance-Horan Syndrome (NHS), a rare X-chromosomal recessive disorder with variable features, including congenital cataract, microphthalmia, a peculiar form of the ear and dental anomalies. We investigated the NHS gene in four additional families with NHS from the Netherlands, by dHPLC and direct sequencing. We identified an unique mutation in each family. Three out of these four mutations were not reported before. We report here the first splice site sequence alteration mutation and three protein truncating mutations. Our results suggest that X-linked cataract and NHS are allelic disorders.
Identification and cloning of a gamma 3 subunit splice variant of the human GABA(A) receptor.
Poulsen, C F; Christjansen, K N; Hastrup, S; Hartvig, L
2000-05-31
cDNA sequences encoding two forms of the GABA(A) gamma 3 receptor subunit were cloned from human hippocampus. The nucleotide sequences differ by the absence (gamma 3S) or presence (gamma 3L) of 18 bp located in the presumed intracellular loop between transmembrane region (TM) III and IV. The extra 18 bp in the gamma 3L subunit generates a consensus site for phosphorylation by protein kinase C (PKC). Analysis of human genomic DNA encoding the gamma 3 subunit reveals that the 18 bp insert is contiguous with the upstream proximal exon.
Branchpoint selection in the splicing of U12-dependent introns in vitro.
McConnell, Timothy S; Cho, Soo-Jin; Frilander, Mikko J; Steitz, Joan A
2002-05-01
In metazoans, splicing of introns from pre-mRNAs can occur by two pathways: the major U2-dependent or the minor U12-dependent pathways. Whereas the U2-dependent pathway has been well characterized, much about the U12-dependent pathway remains to be discovered. Most of the information regarding U12-type introns has come from in vitro studies of a very few known introns of this class. To expand our understanding of U12-type splicing, especially to test the hypothesis that the simple base-pairing mechanism between the intron and U12 snRNA defines the branchpoint of U12-dependent introns, additional in vitro splicing substrates were created from three putative U12-type introns: the third intron of the Xenopus RPL1 a gene (XRP), the sixth intron of the Xenopus TFIIS.oA gene (XTF), and the first intron of the human Sm E gene (SME). In vitro splicing in HeLa nuclear extract confirmed U12-dependent splicing of each of these introns. Surprisingly, branchpoint mapping of the XRP splicing intermediate shows use of the upstream rather than the downstream of two consecutive adenosines within the branchpoint sequence (BPS), contrary to the prediction based on alignment with the sixth intron of human P120, a U12-dependent intron whose branch site was previously determined. Also, in the SME intron, the position of the branchpoint A residue within the region base paired with U12 differs from that in P120 and XTF. Analysis of these three additional introns therefore rules out simple models for branchpoint selection by the U12-type spliceosome.
Branchpoint selection in the splicing of U12-dependent introns in vitro.
McConnell, Timothy S; Cho, Soo-Jin; Frilander, Mikko J; Steitz, Joan A
2002-01-01
In metazoans, splicing of introns from pre-mRNAs can occur by two pathways: the major U2-dependent or the minor U12-dependent pathways. Whereas the U2-dependent pathway has been well characterized, much about the U12-dependent pathway remains to be discovered. Most of the information regarding U12-type introns has come from in vitro studies of a very few known introns of this class. To expand our understanding of U12-type splicing, especially to test the hypothesis that the simple base-pairing mechanism between the intron and U12 snRNA defines the branchpoint of U12-dependent introns, additional in vitro splicing substrates were created from three putative U12-type introns: the third intron of the Xenopus RPL1 a gene (XRP), the sixth intron of the Xenopus TFIIS.oA gene (XTF), and the first intron of the human Sm E gene (SME). In vitro splicing in HeLa nuclear extract confirmed U12-dependent splicing of each of these introns. Surprisingly, branchpoint mapping of the XRP splicing intermediate shows use of the upstream rather than the downstream of two consecutive adenosines within the branchpoint sequence (BPS), contrary to the prediction based on alignment with the sixth intron of human P120, a U12-dependent intron whose branch site was previously determined. Also, in the SME intron, the position of the branchpoint A residue within the region base paired with U12 differs from that in P120 and XTF. Analysis of these three additional introns therefore rules out simple models for branchpoint selection by the U12-type spliceosome. PMID:12022225
High-throughput sequencing methods to study neuronal RNA-protein interactions.
Ule, Jernej
2009-12-01
UV-cross-linking and RNase protection, combined with high-throughput sequencing, have provided global maps of RNA sites bound by individual proteins or ribosomes. Using a stringent purification protocol, UV-CLIP (UV-cross-linking and immunoprecipitation) was able to identify intronic and exonic sites bound by splicing regulators in mouse brain tissue. Ribosome profiling has been used to quantify ribosome density on budding yeast mRNAs under different environmental conditions. Post-transcriptional regulation in neurons requires high spatial and temporal precision, as is evident from the role of localized translational control in synaptic plasticity. It remains to be seen if the high-throughput methods can be applied quantitatively to study the dynamics of RNP (ribonucleoprotein) remodelling in specific neuronal populations during the neurodegenerative process. It is certain, however, that applications of new biochemical techniques followed by high-throughput sequencing will continue to provide important insights into the mechanisms of neuronal post-transcriptional regulation.
Bauer, William J.; Heath, Jason; Jenkins, Jermaine L.; Kielkopf, Clara L.
2012-01-01
T-cell intracellular antigen-1 (TIA-1) regulates developmental and stress-responsive pathways through distinct activities at the levels of alternative pre-mRNA splicing and mRNA translation. The TIA-1 polypeptide contains three RNA recognition motifs (RRMs). The central RRM2 and C-terminal RRM3 associate with cellular mRNAs. The N-terminal RRM1 enhances interactions of a C-terminal Q-rich domain of TIA-1 with the U1-C splicing factor, despite linear separation of the domains in the TIA-1 sequence. Given the expanded functional repertoire of the RRM family, it was unknown whether TIA-1 RRM1 contributes to RNA binding as well as documented protein interactions. To address this question, we used isothermal titration calorimetry and small-angle X-ray scattering (SAXS) to dissect the roles of the TIA-1 RRMs in RNA recognition. Notably, the fas RNA exhibited two binding sites with indistinguishable affinities for TIA-1. Analyses of TIA-1 variants established that RRM1 was dispensable for binding AU-rich fas sites, yet all three RRMs were required to bind a polyU RNA with high affinity. SAXS analyses demonstrated a `V' shape for a TIA-1 construct comprising the three RRMs, and revealed that its dimensions became more compact in the RNA-bound state. The sequence-selective involvement of TIA-1 RRM1 in RNA recognition suggests a possible role for RNA sequences in regulating the distinct functions of TIA-1. Further implications for U1-C recruitment by the adjacent TIA-1 binding sites of the fas pre-mRNA and the bent TIA-1 shape, which organizes the N- and C-termini on the same side of the protein, are discussed. PMID:22154808
Dutta, Debargh; Gunasekera, Devi; Ragni, Margaret V; Pratt, Kathleen P
2016-12-27
The most frequent mutations resulting in hemophilia A are an intron 22 or intron 1 gene inversion, which together cause ∼50% of severe hemophilia A cases. We report a simple and accurate RNA-based assay to detect these mutations in patients and heterozygous carriers. The assays do not require specialized equipment or expensive reagents; therefore, they may provide useful and economic protocols that could be standardized for central laboratory testing. RNA is purified from a blood sample, and reverse transcription nested polymerase chain reaction (RT-NPCR) reactions amplify DNA fragments with the F8 sequence spanning the exon 22 to 23 splice site (intron 22 inversion test) or the exon 1 to 2 splice site (intron 1 inversion test). These sequences will be amplified only from F8 RNA without an intron 22 or intron 1 inversion mutation, respectively. Additional RT-NPCR reactions are then carried out to amplify the inverted sequences extending from F8 exon 19 to the first in-frame stop codon within intron 22 or a chimeric transcript containing F8 exon 1 and the VBP1 gene. These latter 2 products are produced only by individuals with an intron 22 or intron 1 inversion mutation, respectively. The intron 22 inversion mutations may be further classified (eg, as type 1 or type 2, reflecting the specific homologous recombination sites) by the standard DNA-based "inverse-shifting" PCR assay if desired. Efficient Bcl I and T4 DNA ligase enzymes that cleave and ligate DNA in minutes were used, which is a substantial improvement over previous protocols that required overnight incubations. These protocols can accurately detect F8 inversion mutations via same-day testing of patient samples.
2013-01-01
Background The production of multiple transcript isoforms from one gene is a major source of transcriptome complexity. RNA-Seq experiments, in which transcripts are converted to cDNA and sequenced, allow the resolution and quantification of alternative transcript isoforms. However, methods to analyze splicing are underdeveloped and errors resulting in incorrect splicing calls occur in every experiment. Results We used RNA-Seq data to develop sequencing and aligner error models. By applying these error models to known input from simulations, we found that errors result from false alignment to minor splice motifs and antisense stands, shifted junction positions, paralog joining, and repeat induced gaps. By using a series of quantitative and qualitative filters, we eliminated diagnosed errors in the simulation, and applied this to RNA-Seq data from Drosophila melanogaster heads. We used high-confidence junction detections to specifically interrogate local splicing differences between transcripts. This method out-performed commonly used RNA-seq methods to identify known alternative splicing events in the Drosophila sex determination pathway. We describe a flexible software package to perform these tasks called Splicing Analysis Kit (Spanki), available at http://www.cbcb.umd.edu/software/spanki. Conclusions Splice-junction centric analysis of RNA-Seq data provides advantages in specificity for detection of alternative splicing. Our software provides tools to better understand error profiles in RNA-Seq data and improve inference from this new technology. The splice-junction centric approach that this software enables will provide more accurate estimates of differentially regulated splicing than current tools. PMID:24209455
Sturgill, David; Malone, John H; Sun, Xia; Smith, Harold E; Rabinow, Leonard; Samson, Marie-Laure; Oliver, Brian
2013-11-09
The production of multiple transcript isoforms from one gene is a major source of transcriptome complexity. RNA-Seq experiments, in which transcripts are converted to cDNA and sequenced, allow the resolution and quantification of alternative transcript isoforms. However, methods to analyze splicing are underdeveloped and errors resulting in incorrect splicing calls occur in every experiment. We used RNA-Seq data to develop sequencing and aligner error models. By applying these error models to known input from simulations, we found that errors result from false alignment to minor splice motifs and antisense stands, shifted junction positions, paralog joining, and repeat induced gaps. By using a series of quantitative and qualitative filters, we eliminated diagnosed errors in the simulation, and applied this to RNA-Seq data from Drosophila melanogaster heads. We used high-confidence junction detections to specifically interrogate local splicing differences between transcripts. This method out-performed commonly used RNA-seq methods to identify known alternative splicing events in the Drosophila sex determination pathway. We describe a flexible software package to perform these tasks called Splicing Analysis Kit (Spanki), available at http://www.cbcb.umd.edu/software/spanki. Splice-junction centric analysis of RNA-Seq data provides advantages in specificity for detection of alternative splicing. Our software provides tools to better understand error profiles in RNA-Seq data and improve inference from this new technology. The splice-junction centric approach that this software enables will provide more accurate estimates of differentially regulated splicing than current tools.
Rösel-Hillgärtner, Tanja Dorothe; Hung, Lee-Hsueh; Khrameeva, Ekaterina; Le Querrec, Patrick; Gelfand, Mikhail S.; Bindereif, Albrecht
2013-01-01
The U1 small nuclear ribonucleoprotein (snRNP)-specific U1C protein participates in 5′ splice site recognition and regulation of pre-mRNA splicing. Based on an RNA-Seq analysis in HeLa cells after U1C knockdown, we found a conserved, intra-U1 snRNP cross-regulation that links U1C and U1-70K expression through alternative splicing and U1 snRNP assembly. To investigate the underlying regulatory mechanism, we combined mutational minigene analysis, in vivo splice-site blocking by antisense morpholinos, and in vitro binding experiments. Alternative splicing of U1-70K pre-mRNA creates the normal (exons 7–8) and a non-productive mRNA isoform, whose balance is determined by U1C protein levels. The non-productive isoform is generated through a U1C-dependent alternative 3′ splice site, which requires an adjacent cluster of regulatory 5′ splice sites and binding of intact U1 snRNPs. As a result of nonsense-mediated decay (NMD) of the non-productive isoform, U1-70K mRNA and protein levels are down-regulated, and U1C incorporation into the U1 snRNP is impaired. U1-70K/U1C-deficient particles are assembled, shifting the alternative splicing balance back towards productive U1-70K splicing, and restoring assembly of intact U1 snRNPs. Taken together, we established a novel feedback regulation that controls U1-70K/U1C homeostasis and ensures correct U1 snRNP assembly and function. PMID:24146627
Griffith, M; Mwenifumbo, J C; Cheung, P Y; Paul, J E; Pugh, T J; Tang, M J; Chittaranjan, S; Morin, R D; Asano, J K; Ally, A A; Miao, L; Lee, A; Chan, S Y; Taylor, G; Severson, T; Hou, Y-C; Griffith, O L; Cheng, G S W; Novik, K; Moore, R; Luk, M; Owen, D; Brown, C J; Morin, G B; Gill, S; Tai, I T; Marra, M A
2013-04-01
The drug fluorouracil (5-FU) is a widely used antimetabolite chemotherapy in the treatment of colorectal cancer. The gene uridine monophosphate synthetase (UMPS) is thought to be primarily responsible for conversion of 5-FU to active anticancer metabolites in tumor cells. Mutation or aberrant expression of UMPS may contribute to 5-FU resistance during treatment. We undertook a characterization of UMPS mRNA isoform expression and sequence variation in 5-FU-resistant cell lines and drug-naive or -exposed primary and metastatic tumors. We observed reciprocal differential expression of two UMPS isoforms in a colorectal cancer cell line with acquired 5-FU resistance relative to the 5-FU-sensitive cell line from which it was derived. A novel isoform arising as a consequence of exon skipping was increased in abundance in resistant cells. The underlying mechanism responsible for this shift in isoform expression was determined to be a heterozygous splice site mutation acquired in the resistant cell line. We developed sequencing and expression assays to specifically detect alternative UMPS isoforms and used these to determine that UMPS was recurrently disrupted by mutations and aberrant splicing in additional 5-FU-resistant colorectal cancer cell lines and colorectal tumors. The observed mutations, aberrant splicing and downregulation of UMPS represent novel mechanisms for acquired 5-FU resistance in colorectal cancer.
Unusual Phenotypic Features in a Patient with a Novel Splice Mutation in the GHRHR Gene
Hilal, Latifa; Hajaji, Yassir; Vie-Luton, Marie-Pierre; Ajaltouni, Zeina; Benazzouz, Bouchra; Chana, Maha; Chraïbi, Adelmajid; Kadiri, Abdelkrim; Amselem, Serge; Sobrier, Marie-Laure
2008-01-01
Isolated growth hormone deficiency (IGHD) may be of genetic origin. One of the few genes involved in that condition encodes the growth hormone releasing hormone receptor (GHRHR) that, through its ligand (GHRH), plays a pivotal role in the GH synthesis and secretion by the pituitary. Our objective is to describe the phenotype of two siblings born to a consanguineous union presenting with short stature (IGHD) and Magnetic Resonance Imaging (MRI) abnormalities, and to identify the molecular basis of this condition. Our main outcome measures were clinical and endocrinological investigations, MRI of the pituitary region, study of the GHRHR gene sequence and transcripts. In both patients, the severe growth retardation (−5SD) was combined with anterior pituitary hypoplasia. In addition to these classical phenotypic features for IGHD, one of the patients had a Chiari I malformation, an arachnoid cyst, and a dysmorphic anterior pituitary. A homozygous sequence variation in the consensus donor splice site of intron 1 (IVS1 + 2T > G) of the GHRHR gene was identified in both patients. Using in vitro transcription assay, we showed that this mutation results in abnormal splicing of GHRHR transcripts. In this report, which broadens the phenotype associated with GHRHR defects, we discuss the possible role of the GHRHR in the proper development of extrapituitary structures, through a mechanism that could be direct or secondary to severe GH deficiency. PMID:18297129
Modelling reveals kinetic advantages of co-transcriptional splicing.
Aitken, Stuart; Alexander, Ross D; Beggs, Jean D
2011-10-01
Messenger RNA splicing is an essential and complex process for the removal of intron sequences. Whereas the composition of the splicing machinery is mostly known, the kinetics of splicing, the catalytic activity of splicing factors and the interdependency of transcription, splicing and mRNA 3' end formation are less well understood. We propose a stochastic model of splicing kinetics that explains data obtained from high-resolution kinetic analyses of transcription, splicing and 3' end formation during induction of an intron-containing reporter gene in budding yeast. Modelling reveals co-transcriptional splicing to be the most probable and most efficient splicing pathway for the reporter transcripts, due in part to a positive feedback mechanism for co-transcriptional second step splicing. Model comparison is used to assess the alternative representations of reactions. Modelling also indicates the functional coupling of transcription and splicing, because both the rate of initiation of transcription and the probability that step one of splicing occurs co-transcriptionally are reduced, when the second step of splicing is abolished in a mutant reporter.
Splice Site Mutations in the ATP7A Gene
Møller, Lisbeth Birk
2011-01-01
Menkes disease (MD) is caused by mutations in the ATP7A gene. We describe 33 novel splice site mutations detected in patients with MD or the milder phenotypic form, Occipital Horn Syndrome. We review these 33 mutations together with 28 previously published splice site mutations. We investigate 12 mutations for their effect on the mRNA transcript in vivo. Transcriptional data from another 16 mutations were collected from the literature. The theoretical consequences of splice site mutations, predicted with the bioinformatics tool Human Splice Finder, were investigated and evaluated in relation to in vivo results. Ninety-six percent of the mutations identified in 45 patients with classical MD were predicted to have a significant effect on splicing, which concurs with the absence of any detectable wild-type transcript in all 19 patients investigated in vivo. Sixty-seven percent of the mutations identified in 12 patients with milder phenotypes were predicted to have no significant effect on splicing, which concurs with the presence of wild-type transcript in 7 out of 9 patients investigated in vivo. Both the in silico predictions and the in vivo results support the hypothesis previously suggested by us and others, that the presence of some wild-type transcript is correlated to a milder phenotype. PMID:21494555
Control of alternative splicing by forskolin through hnRNP K during neuronal differentiation
Cao, Wenguang; Razanau, Aleh; Feng, Dairong; Lobo, Vincent G.; Xie, Jiuyong
2012-01-01
The molecular basis of cell signal-regulated alternative splicing at the 3′ splice site remains largely unknown. We isolated a protein kinase A-responsive ribonucleic acid (RNA) element from a 3′ splice site of the synaptosomal-associated protein 25 (Snap25) gene for forskolin-inhibited splicing during neuronal differentiation of rat pheochromocytoma PC12 cells. The element binds specifically to heterogeneous nuclear ribonucleo protein (hnRNP) K in a phosphatase-sensitive way, which directly competes with the U2 auxiliary factor U2AF65, an essential component of early spliceosomes. Transcripts with similarly localized hnRNP K target motifs upstream of alternative exons are enriched in genes often associated with neurological diseases. We show that such motifs upstream of the Runx1 exon 6 also bind hnRNP K, and importantly, hnRNP K is required for forskolin-induced repression of the exon. Interestingly, this exon encodes the peptide domain that determines the switch of the transcriptional repressor/activator activity of Runx1, a change known to be critical in specifying neuron lineages. Consistent with an important role of the target genes in neurons, knocking down hnRNP K severely disrupts forskolin-induced neurite growth. Thus, through hnRNP K, the neuronal differentiation stimulus forskolin targets a critical 3′ splice site component of the splicing machinery to control alternative splicing of crucial genes. This also provides a regulated direct competitor of U2AF65 for cell signal control of 3′ splice site usage. PMID:22684629
Harrison, Neale; Kalbfleisch, Andreas; Connolly, Bernadette; Pettitt, Jonathan; Müller, Berndt
2010-08-01
Spliced-leader (SL) trans-splicing has been found in all molecularly characterized nematode species to date, and it is likely to be a nematode synapomorphy. Most information regarding SL trans-splicing has come from the study of nematodes from a single monophyletic group, the Rhabditida, all of which employ SL RNAs that are identical to, or variants of, the SL1 RNA first characterized in Caenorhabditis elegans. In contrast, the more distantly related Trichinella spiralis, belonging to the subclass Dorylaimia, utilizes a distinct set of SL RNAs that display considerable sequence diversity. To investigate whether this is true of other members of the Dorylaimia, we have characterized SL RNAs from Prionchulus punctatus. Surprisingly, this revealed the presence of a set of SLs that show clear sequence similarity to the SL2 family of spliced leaders, which have previously only been found within the rhabditine group (which includes C. elegans). Expression of one of the P. punctatus SL RNAs in C. elegans reveals that it can compete specifically with the endogenous C. elegans SL2 spliced leaders, being spliced to the pre-mRNAs derived from downstream genes in operons, but does not compete with the SL1 spliced leaders. This discovery raises the possibility that SL2-like spliced leaders were present in the last common ancestor of the nematode phylum.
SL1 RNA gene recovery from Enterobius vermicularis ancient DNA in pre-Columbian human coprolites.
Iñiguez, Alena Mayo; Reinhard, Karl; Carvalho Gonçalves, Marcelo Luiz; Ferreira, Luiz Fernando; Araújo, Adauto; Paulo Vicente, Ana Carolina
2006-11-01
Enterobius vermicularis, pinworm, is one of the most common helminths worldwide, infecting nearly a billion people at all socio-economic levels. In prehistoric populations the paleoparasitological findings show a pinworm homogeneous distribution among hunter-gatherers in North America, intensified with the advent of agriculture. This same increase also occurred in the transition from nomad hunter-gatherers to sedentary farmers in South America, although E. vermicularis infection encompasses only the ancient Andean peoples, with no record among the pre-Colombian populations in the South American lowlands. However, the outline of pinworm paleoepidemiology has been supported by microscopic finding of eggs recovered from coprolites. Since molecular techniques are precise and sensitive in detecting pathogen ancient DNA (aDNA), and also could provide insights into the parasite evolutionary history, in this work we have performed a molecular paleoparasitological study of E. vermicularis. aDNA was recovered and pinworm 5S rRNA spacer sequences were determined from pre-Columbian coprolites (4110 BC-AD 900) from four different North and South American archaeological sites. The sequence analysis confirmed E. vermicularis identity and revealed a similarity among ancient and modern sequences. Moreover, polymorphisms were identified at the relative positions 160, 173 and 180, in independent coprolite samples from Tulán, San Pedro de Atacama, Chile (1080-950 BC). We also verified the presence of peculiarities (Splicing leader (SL1) RNA sequence, spliced donor site, the Sm antigen biding site, and RNA secondary structure) which characterise the SL1 RNA gene. The analysis shows that the SL1 RNA gene of contemporary pinworms was present in pre-Columbian E. vermicularis by 6110 years ago. We were successful in detecting E. vermicularis aDNA even in coprolites without direct microscopic evidence of the eggs, improving the diagnosis of helminth infections in the past and further pinworm paleoepidemiological studies.
DOE Office of Scientific and Technical Information (OSTI.GOV)
He, Guo-Shun; Grabowski, G.A.
1992-10-01
Gaucher disease is the most frequent lysosomal storage disease and the most prevalent Jewish genetic disease. About 30 identified missense mutations are causal to the defective activity of acid [beta]-glucosidase in this disease. cDNAs were characterized from a moderately affected 9-year-old Ashkenazi Jewish Gaucher disease type 1 patient whose 80-years-old, enzyme-deficient, 1226G (Asn[sup 370][yields]Ser [N370S]) homozygous grandfather was nearly asymptomatic. Sequence analyses revealed four populations of cDNAs with either the 1226G mutation, an exact exon 2 ([Delta] EX2) deletion, a deletion of exon 2 and the first 115 bp of exon 3 ([Delta] EX2-3), or a completely normal sequence. Aboutmore » 50% of the cDNAs were the [Delta] EX2, the [Delta] EX2-3, and the normal cDNAs, in a ratio of 6:3:1. Specific amplification and characterization of exon 2 and 5[prime] and 3[prime] intronic flanking sequences from the structural gene demonstrated clones with either the normal sequence or with a G[sup +1][yields]A[sup +1] transition at the exon 2/intron 2 boundary. This mutation destroyed the splice donor consensus site (U1 binding site) for mRNA processing. This transition also was present at the corresponding exon/intron boundary of the highly homologous pseudogene. This new mutation, termed [open quotes]IVS2 G[sup +1],[close quotes] is the first in the Ashkenazi Jewish population. The occurrence of this [open quotes]pseudogene[close quotes]-type mutation in the structural gene indicates the role of acid [beta]-glucosidase pseudogene and structural gene rearrangements in the pathogenesis of this disease. 33 refs., 8 figs., 1 tab.« less
Novel MSH2 splice-site mutation in a young patient with Lynch syndrome
Liccardo, Raffaella; De Rosa, Marina; Izzo, Paola; Duraturo, Francesca
2018-01-01
Lynch Syndrome (LS) is associated with germline mutations in one of the mismatch repair (MMR) genes, including MutL homolog 1 (MLH1), MutS homolog 2 (MSH2), MSH6, PMS1 homolog 2, mismatch repair system component (PMS2), MLH3 and MSH3. The mutations identified in MMR genes are point mutations or large rearrangements. The point mutations are certainly pathogenetic whether they determine formation of truncated protein. The mutations that arise in splice sites are classified as ‘likely pathogenic’ variants. In the present study, a novel splicing mutation was identified, (named c.212-1g>a), in the MSH2 gene. This novel mutation in the consensus splice site of MSH2 exon 2 leads to the loss of the canonical splice site, without skipping in-frame of exon 2; also with the formation of 2 aberrant transcripts, due to the activation of novel splice sites in exon 2. This mutation was identified in a young patient who developed colon cancer at the age of 26 years and their belongs to family that met the ‘Revised Amsterdam Criteria’. The present study provided insight into the molecular mechanism determining the pathogenicity of this novel MSH2 mutation and it reaffirms the importance of genetic testing in LS. PMID:29568967
Kawaguchi, Risa; Kiryu, Hisanori
2016-05-06
RNA secondary structure around splice sites is known to assist normal splicing by promoting spliceosome recognition. However, analyzing the structural properties of entire intronic regions or pre-mRNA sequences has been difficult hitherto, owing to serious experimental and computational limitations, such as low read coverage and numerical problems. Our novel software, "ParasoR", is designed to run on a computer cluster and enables the exact computation of various structural features of long RNA sequences under the constraint of maximal base-pairing distance. ParasoR divides dynamic programming (DP) matrices into smaller pieces, such that each piece can be computed by a separate computer node without losing the connectivity information between the pieces. ParasoR directly computes the ratios of DP variables to avoid the reduction of numerical precision caused by the cancellation of a large number of Boltzmann factors. The structural preferences of mRNAs computed by ParasoR shows a high concordance with those determined by high-throughput sequencing analyses. Using ParasoR, we investigated the global structural preferences of transcribed regions in the human genome. A genome-wide folding simulation indicated that transcribed regions are significantly more structural than intergenic regions after removing repeat sequences and k-mer frequency bias. In particular, we observed a highly significant preference for base pairing over entire intronic regions as compared to their antisense sequences, as well as to intergenic regions. A comparison between pre-mRNAs and mRNAs showed that coding regions become more accessible after splicing, indicating constraints for translational efficiency. Such changes are correlated with gene expression levels, as well as GC content, and are enriched among genes associated with cytoskeleton and kinase functions. We have shown that ParasoR is very useful for analyzing the structural properties of long RNA sequences such as mRNAs, pre-mRNAs, and long non-coding RNAs whose lengths can be more than a million bases in the human genome. In our analyses, transcribed regions including introns are indicated to be subject to various types of structural constraints that cannot be explained from simple sequence composition biases. ParasoR is freely available at https://github.com/carushi/ParasoR .
Melangath, Geetha; Sen, Titash; Kumar, Rakesh; Bawa, Pushpinder; Srinivasan, Subha; Vijayraghavan, Usha
2017-01-01
Budding yeast spliceosomal factors ScSlu7 and ScPrp18 interact and mediate intron 3'ss choice during second step pre-mRNA splicing. The fission yeast genome with abundant multi-intronic transcripts, degenerate splice signals and SR proteins is an apt unicellular fungal model to deduce roles for core spliceosomal factors in alternative splice-site choice, intron retention and to study the cellular implications of regulated splicing. From our custom microarray data we deduce a stringent reproducible subset of S. pombe alternative events. We examined the role of factors SpSlu7 or SpPrp18 for these splice events and investigated the relationship to growth phase and stress. Wild-type log and stationary phase cells showed ats1+ exon 3 skipped and intron 3 retained transcripts. Interestingly the non-consensus 5'ss in ats1+ intron 3 caused SpSlu7 and SpPrp18 dependent intron retention. We validated the use of an alternative 5'ss in dtd1+ intron 1 and of an upstream alternative 3'ss in DUF3074 intron 1. The dtd1+ intron 1 non-canonical 5'ss yielded an alternative mRNA whose levels increased in stationary phase. Utilization of dtd1+ intron 1 sub-optimal 5' ss required functional SpPrp18 and SpSlu7 while compromise in SpSlu7 function alone hampered the selection of the DUF3074 intron 1 non canonical 3'ss. We analysed the relative abundance of these splice isoforms during mild thermal, oxidative and heavy metal stress and found stress-specific splice patterns for ats1+ and DUF3074 intron 1 some of which were SpSlu7 and SpPrp18 dependent. By studying ats1+ splice isoforms during compromised transcription elongation rates in wild-type, spslu7-2 and spprp18-5 mutant cells we found dynamic and intron context-specific effects in splice-site choice. Our work thus shows the combinatorial effects of splice site strength, core splicing factor functions and transcription elongation kinetics to dictate alternative splice patterns which in turn serve as an additional recourse of gene regulation in fission yeast.
Kumar, Rakesh; Bawa, Pushpinder; Srinivasan, Subha
2017-01-01
Budding yeast spliceosomal factors ScSlu7 and ScPrp18 interact and mediate intron 3’ss choice during second step pre-mRNA splicing. The fission yeast genome with abundant multi-intronic transcripts, degenerate splice signals and SR proteins is an apt unicellular fungal model to deduce roles for core spliceosomal factors in alternative splice-site choice, intron retention and to study the cellular implications of regulated splicing. From our custom microarray data we deduce a stringent reproducible subset of S. pombe alternative events. We examined the role of factors SpSlu7 or SpPrp18 for these splice events and investigated the relationship to growth phase and stress. Wild-type log and stationary phase cells showed ats1+ exon 3 skipped and intron 3 retained transcripts. Interestingly the non-consensus 5’ss in ats1+ intron 3 caused SpSlu7 and SpPrp18 dependent intron retention. We validated the use of an alternative 5’ss in dtd1+ intron 1 and of an upstream alternative 3’ss in DUF3074 intron 1. The dtd1+ intron 1 non-canonical 5’ss yielded an alternative mRNA whose levels increased in stationary phase. Utilization of dtd1+ intron 1 sub-optimal 5’ ss required functional SpPrp18 and SpSlu7 while compromise in SpSlu7 function alone hampered the selection of the DUF3074 intron 1 non canonical 3’ss. We analysed the relative abundance of these splice isoforms during mild thermal, oxidative and heavy metal stress and found stress-specific splice patterns for ats1+ and DUF3074 intron 1 some of which were SpSlu7 and SpPrp18 dependent. By studying ats1+ splice isoforms during compromised transcription elongation rates in wild-type, spslu7-2 and spprp18-5 mutant cells we found dynamic and intron context-specific effects in splice-site choice. Our work thus shows the combinatorial effects of splice site strength, core splicing factor functions and transcription elongation kinetics to dictate alternative splice patterns which in turn serve as an additional recourse of gene regulation in fission yeast. PMID:29236736
Ohrt, Thomas; Odenwälder, Peter; Dannenberg, Julia; Prior, Mira; Warkocki, Zbigniew; Schmitzová, Jana; Karaduman, Ramazan; Gregor, Ingo; Enderlein, Jörg; Fabrizio, Patrizia; Lührmann, Reinhard
2013-01-01
Step 2 catalysis of pre-mRNA splicing entails the excision of the intron and ligation of the 5′ and 3′ exons. The tasks of the splicing factors Prp16, Slu7, Prp18, and Prp22 in the formation of the step 2 active site of the spliceosome and in exon ligation, and the timing of their recruitment, remain poorly understood. Using a purified yeast in vitro splicing system, we show that only the DEAH-box ATPase Prp16 is required for formation of a functional step 2 active site and for exon ligation. Efficient docking of the 3′ splice site (3′SS) to the active site requires only Slu7/Prp18 but not Prp22. Spliceosome remodeling by Prp16 appears to be subtle as only the step 1 factor Cwc25 is dissociated prior to step 2 catalysis, with its release dependent on docking of the 3′SS to the active site and Prp16 action. We show by fluorescence cross-correlation spectroscopy that Slu7/Prp18 and Prp16 bind early to distinct, low-affinity binding sites on the step-1-activated B* spliceosome, which are subsequently converted into high-affinity sites. Our results shed new light on the factor requirements for step 2 catalysis and the dynamics of step 1 and 2 factors during the catalytic steps of splicing. PMID:23685439
Qian, Xiaoxiao; Matthews, Laura; Lightman, Stafford; Ray, David; Norman, Michael
2015-01-01
Alternative splicing events from tandem donor sites result in mRNA variants coding for additional amino acids in the DNA binding domain of both the glucocorticoid (GR) and mineralocorticoid (MR) receptors. We now show that expression of both splice variants is extensively conserved in mammalian species, providing strong evidence for their functional significance. An exception to the conservation of the MR tandem splice site (an A at position +5 of the MR+12 donor site in the mouse) was predicted to decrease U1 small nuclear RNA binding. In accord with this prediction, we were unable to detect the MR+12 variant in this species. The one exception to the conservation of the GR tandem splice site, an A at position +3 of the platypus GRγ donor site that was predicted to enhance binding of U1 snRNA, was unexpectedly associated with decreased expression of the variant from the endogenous gene as well as a minigene. An intronic pyrimidine motif present in both GR and MR genes was found to be critical for usage of the downstream donor site, and overexpression of TIA1/TIAL1 RNA binding proteins, which are known to bind such motifs, led to a marked increase in the proportion of GRγ and MR+12. These results provide striking evidence for conservation of a complex splicing mechanism that involves processes other than stochastic spliceosome binding and identify a mechanism that would allow regulation of variant expression. PMID:19819975
MutPred Splice: machine learning-based prediction of exonic variants that disrupt splicing
2014-01-01
We have developed a novel machine-learning approach, MutPred Splice, for the identification of coding region substitutions that disrupt pre-mRNA splicing. Applying MutPred Splice to human disease-causing exonic mutations suggests that 16% of mutations causing inherited disease and 10 to 14% of somatic mutations in cancer may disrupt pre-mRNA splicing. For inherited disease, the main mechanism responsible for the splicing defect is splice site loss, whereas for cancer the predominant mechanism of splicing disruption is predicted to be exon skipping via loss of exonic splicing enhancers or gain of exonic splicing silencer elements. MutPred Splice is available at http://mutdb.org/mutpredsplice. PMID:24451234
Quaking and PTB control overlapping splicing regulatory networks during muscle cell differentiation
Hall, Megan P.; Nagel, Roland J.; Fagg, W. Samuel; Shiue, Lily; Cline, Melissa S.; Perriman, Rhonda J.; Donohue, John Paul; Ares, Manuel
2013-01-01
Alternative splicing contributes to muscle development, but a complete set of muscle-splicing factors and their combinatorial interactions are unknown. Previous work identified ACUAA (“STAR” motif) as an enriched intron sequence near muscle-specific alternative exons such as Capzb exon 9. Mass spectrometry of myoblast proteins selected by the Capzb exon 9 intron via RNA affinity chromatography identifies Quaking (QK), a protein known to regulate mRNA function through ACUAA motifs in 3′ UTRs. We find that QK promotes inclusion of Capzb exon 9 in opposition to repression by polypyrimidine tract-binding protein (PTB). QK depletion alters inclusion of 406 cassette exons whose adjacent intron sequences are also enriched in ACUAA motifs. During differentiation of myoblasts to myotubes, QK levels increase two- to threefold, suggesting a mechanism for QK-responsive exon regulation. Combined analysis of the PTB- and QK-splicing regulatory networks during myogenesis suggests that 39% of regulated exons are under the control of one or both of these splicing factors. This work provides the first evidence that QK is a global regulator of splicing during muscle development in vertebrates and shows how overlapping splicing regulatory networks contribute to gene expression programs during differentiation. PMID:23525800
Bertke, Andrea S; Patel, Amita; Imai, Yumi; Apakupakul, Kathleen; Margolis, Todd P; Krause, Philip R
2009-10-01
Herpes simplex virus 1 (HSV-1) and HSV-2 cause similar acute infections but differ in their abilities to reactivate from trigeminal and lumbosacral dorsal root ganglia. During latency, HSV-1 and HSV-2 also preferentially express their latency-associated transcripts (LATs) in different sensory neuronal subtypes that are positive for A5 and KH10 markers, respectively. Chimeric virus studies showed that LAT region sequences influence both of these viral species-specific phenotypes. To further map the LAT region sequences responsible for these phenotypes, we constructed the chimeric virus HSV2-LAT-E1, in which exon 1 (from the LAT TATA to the intron splice site) was replaced by the corresponding sequence from HSV-1 LAT. In intravaginally infected guinea pigs, HSV2-LAT-E1 reactivated inefficiently relative to the efficiency of its rescuant and wild-type HSV-2, but it yielded similar levels of viral DNA, LAT, and ICP0 during acute and latent infection. HSV2-LAT-E1 preferentially expressed the LAT in A5+ neurons (as does HSV-1), while the chimeric viruses HSV2-LAT-P1 (LAT promoter swap) and HSV2-LAT-S1 (LAT sequence swap downstream of the promoter) exhibited neuron subtype-specific latent LAT expression phenotypes more similar to that of HSV-2 than that of HSV-1. Rescuant viruses displayed the wild-type HSV-2 phenotypes of efficient reactivation in the guinea pig genital model and a tendency to express LAT in KH10+ neurons. The region that is critical for HSV species-specific differences in latency and reactivation thus lies between the LAT TATA and the intron splice site, and minor differences in the 5' ends of chimeric sequences in HSV2-LAT-E1 and HSV2-LAT-S1 point to sequences immediately downstream of the LAT TATA.
Bertke, Andrea S.; Patel, Amita; Imai, Yumi; Apakupakul, Kathleen; Margolis, Todd P.; Krause, Philip R.
2009-01-01
Herpes simplex virus 1 (HSV-1) and HSV-2 cause similar acute infections but differ in their abilities to reactivate from trigeminal and lumbosacral dorsal root ganglia. During latency, HSV-1 and HSV-2 also preferentially express their latency-associated transcripts (LATs) in different sensory neuronal subtypes that are positive for A5 and KH10 markers, respectively. Chimeric virus studies showed that LAT region sequences influence both of these viral species-specific phenotypes. To further map the LAT region sequences responsible for these phenotypes, we constructed the chimeric virus HSV2-LAT-E1, in which exon 1 (from the LAT TATA to the intron splice site) was replaced by the corresponding sequence from HSV-1 LAT. In intravaginally infected guinea pigs, HSV2-LAT-E1 reactivated inefficiently relative to the efficiency of its rescuant and wild-type HSV-2, but it yielded similar levels of viral DNA, LAT, and ICP0 during acute and latent infection. HSV2-LAT-E1 preferentially expressed the LAT in A5+ neurons (as does HSV-1), while the chimeric viruses HSV2-LAT-P1 (LAT promoter swap) and HSV2-LAT-S1 (LAT sequence swap downstream of the promoter) exhibited neuron subtype-specific latent LAT expression phenotypes more similar to that of HSV-2 than that of HSV-1. Rescuant viruses displayed the wild-type HSV-2 phenotypes of efficient reactivation in the guinea pig genital model and a tendency to express LAT in KH10+ neurons. The region that is critical for HSV species-specific differences in latency and reactivation thus lies between the LAT TATA and the intron splice site, and minor differences in the 5′ ends of chimeric sequences in HSV2-LAT-E1 and HSV2-LAT-S1 point to sequences immediately downstream of the LAT TATA. PMID:19641003
Maghami, Fatemeh; Tabei, Seyed Mohammad Bagher; Moravej, Hossein; Dastsooz, Hassan; Modarresi, Farzaneh; Silawi, Mohammad; Faghihi, Mohammad Ali
2018-05-25
Osteogenesis imperfecta (OI) is a group of connective tissue disorder caused by mutations of genes involved in the production of collagen and its supporting proteins. Although the majority of reported OI variants are in COL1A1 and COL1A2 genes, recent reports have shown problems in other non-collagenous genes involved in the post translational modifications, folding and transport, transcription and proliferation of osteoblasts, bone mineralization, and cell signaling. Up to now, 17 types of OI have been reported in which types I to IV are the most frequent cases with autosomal dominant pattern of inheritance. Here we report an 8- year- old boy with OI who has had multiple fractures since birth and now he is wheelchair-dependent. To identify genetic cause of OI in our patient, whole exome sequencing (WES) was carried out and it revealed a novel deleterious homozygote splice acceptor site mutation (c.1257-2A > G, IVS7-2A > G) in FKBP10 gene in the patient. Then, the identified mutation was confirmed using Sanger sequencing in the proband as homozygous and in his parents as heterozygous, indicating its autosomal recessive pattern of inheritance. In addition, we performed RT-PCR on RNA transcripts originated from skin fibroblast of the proband to analyze the functional effect of the mutation on splicing pattern of FKBP10 gene and it showed skipping of the exon 8 of this gene. Moreover, Real-Time PCR was carried out to quantify the expression level of FKBP10 in the proband and his family members in which it revealed nearly the full decrease in the level of FKBP10 expression in the proband and around 75% decrease in its level in the carriers of the mutation, strongly suggesting the pathogenicity of the mutation. Our study identified, for the first time, a private pathogenic splice site mutation in FKBP10 gene and further prove the involvement of this gene in the rare cases of autosomal recessive OI type XI with distinguished clinical manifestations.
Splicing-related genes are alternatively spliced upon changes in ambient temperatures in plants
Bucher, Johan; Lammers, Michiel; Busscher-Lange, Jacqueline; Bonnema, Guusje; Rodenburg, Nicole; Proveniers, Marcel C. G.; Angenent, Gerco C.
2017-01-01
Plants adjust their development and architecture to small variations in ambient temperature. In a time in which temperatures are rising world-wide, the mechanism by which plants are able to sense temperature fluctuations and adapt to it, is becoming of special interest. By performing RNA-sequencing on two Arabidopsis accession and one Brassica species exposed to temperature alterations, we showed that alternative splicing is an important mechanism in ambient temperature sensing and adaptation. We found that amongst the differentially alternatively spliced genes, splicing related genes are enriched, suggesting that the splicing machinery itself is targeted for alternative splicing when temperature changes. Moreover, we showed that many different components of the splicing machinery are targeted for ambient temperature regulated alternative splicing. Mutant analysis of a splicing related gene that was differentially spliced in two of the genotypes showed an altered flowering time response to different temperatures. We propose a two-step mechanism where temperature directly influences alternative splicing of the splicing machinery genes, followed by a second step where the altered splicing machinery affects splicing of downstream genes involved in the adaptation to altered temperatures. PMID:28257507
Tanaka, Arisa; Aoki, Fugaku; Suzuki, Masataka G
2018-05-26
The transformer (tra) gene, which is a female-determining master gene in the housefly Musca domestica, acts as a memory device for sex determination via its auto-regulatory function, i.e., through the contribution of the TRA protein to female-specific splicing of its own pre-mRNA. The TRA protein contains 4 small domains that are specifically conserved among TRA proteins (domains 1-4). Domain 2, also named TRA-CAM domain, is the most conserved, but its function remains unknown. To examine whether these domains are involved in the auto-regulatory function, we performed in vitro splicing assays using a tra minigene containing a partial genomic sequence of the M. domestica tra (Mdtra) gene. Co-transfection of the Mdtra minigene and an MdTRA protein expression vector into cultured insect cells strongly induced female-specific splicing of the minigene. A series of deletion mutation analyses demonstrated that these domains act complementarily to induce female-specific splicing. Domain 1 and the TRA-CAM domain were necessary for the female-specific splicing when the MdTRA protein lacked both domains 3 and 4. In this situation, mutation of the well-conserved 3 amino acids (GEG) in the TRA-CAM domain significantly reduced the female-specific splicing activity of MdTRA. GST-pull down analyses demonstrated that the MdTRA protein specifically enriched on the male-specific exonic region (exon 2b), which contains the putative TRA/TRA-2 binding sites, and that the GEG mutation disrupts this enrichment. Since the MdTRA protein interacts with its own pre-mRNA through TRA-2, our findings suggest that the conserved amino acid residues in the TRA-CAM domain may be crucial for the interaction between MdTRA and TRA-2, enhancing MdTRA recruitment on its pre-mRNA to induce female-specific splicing of tra in the housefly. © 2018 S. Karger AG, Basel.
Theory on the Coupled Stochastic Dynamics of Transcription and Splice-Site Recognition
Murugan, Rajamanickam; Kreiman, Gabriel
2012-01-01
Eukaryotic genes are typically split into exons that need to be spliced together to form the mature mRNA. The splicing process depends on the dynamics and interactions among transcription by the RNA polymerase II complex (RNAPII) and the spliceosomal complex consisting of multiple small nuclear ribonucleo proteins (snRNPs). Here we propose a biophysically plausible initial theory of splicing that aims to explain the effects of the stochastic dynamics of snRNPs on the splicing patterns of eukaryotic genes. We consider two different ways to model the dynamics of snRNPs: pure three-dimensional diffusion and a combination of three- and one-dimensional diffusion along the emerging pre-mRNA. Our theoretical analysis shows that there exists an optimum position of the splice sites on the growing pre-mRNA at which the time required for snRNPs to find the 5′ donor site is minimized. The minimization of the overall search time is achieved mainly via the increase in non-specific interactions between the snRNPs and the growing pre-mRNA. The theory further predicts that there exists an optimum transcript length that maximizes the probabilities for exons to interact with the snRNPs. We evaluate these theoretical predictions by considering human and mouse exon microarray data as well as RNAseq data from multiple different tissues. We observe that there is a broad optimum position of splice sites on the growing pre-mRNA and an optimum transcript length, which are roughly consistent with the theoretical predictions. The theoretical and experimental analyses suggest that there is a strong interaction between the dynamics of RNAPII and the stochastic nature of snRNP search for 5′ donor splicing sites. PMID:23133354
Weirather, Jason L; Afshar, Pegah Tootoonchi; Clark, Tyson A; Tseng, Elizabeth; Powers, Linda S; Underwood, Jason G; Zabner, Joseph; Korlach, Jonas; Wong, Wing Hung; Au, Kin Fai
2015-10-15
We developed an innovative hybrid sequencing approach, IDP-fusion, to detect fusion genes, determine fusion sites and identify and quantify fusion isoforms. IDP-fusion is the first method to study gene fusion events by integrating Third Generation Sequencing long reads and Second Generation Sequencing short reads. We applied IDP-fusion to PacBio data and Illumina data from the MCF-7 breast cancer cells. Compared with the existing tools, IDP-fusion detects fusion genes at higher precision and a very low false positive rate. The results show that IDP-fusion will be useful for unraveling the complexity of multiple fusion splices and fusion isoforms within tumorigenesis-relevant fusion genes. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Chen, Neng; Tranebjærg, Lisbeth; Rendtorff, Nanna Dahl; Schrijver, Iris
2011-01-01
Pendred syndrome and DFNB4 (autosomal recessive nonsyndromic congenital deafness, locus 4) are associated with autosomal recessive congenital sensorineural hearing loss and mutations in the SLC26A4 gene. Extensive allelic heterogeneity, however, necessitates analysis of all exons and splice sites to identify mutations for individual patients. Although Sanger sequencing is the gold standard for mutation detection, screening methods supplemented with targeted sequencing can provide a cost-effective alternative. One such method, denaturing high-performance liquid chromatography, was developed for clinical mutation detection in SLC26A4. However, this method inherently cannot distinguish homozygous changes from wild-type sequences. High-resolution melting (HRM), on the other hand, can detect heterozygous and homozygous changes cost-effectively, without any post-PCR modifications. We developed a closed-tube HRM mutation detection method specific for SLC26A4 that can be used in the clinical diagnostic setting. Twenty-eight primer pairs were designed to cover all 21 SLC26A4 exons and splice junction sequences. Using the resulting amplicons, initial HRM analysis detected all 45 variants previously identified by sequencing. Subsequently, a 384-well plate format was designed for up to three patient samples per run. Blinded HRM testing on these plates of patient samples collected over 1 year in a clinical diagnostic laboratory accurately detected all variants identified by sequencing. In conclusion, HRM with targeted sequencing is a reliable, simple, and cost-effective method for SLC26A4 mutation screening and detection. PMID:21704276
Tissue-selective restriction of RNA editing of CaV1.3 by splicing factor SRSF9.
Huang, Hua; Kapeli, Katannya; Jin, Wenhao; Wong, Yuk Peng; Arumugam, Thiruma Valavan; Koh, Joanne Huifen; Srimasorn, Sumitra; Mallilankaraman, Karthik; Chua, John Jia En; Yeo, Gene W; Soong, Tuck Wah
2018-05-04
Adenosine DeAminases acting on RNA (ADAR) catalyzes adenosine-to-inosine (A-to-I) conversion within RNA duplex structures. While A-to-I editing is often dynamically regulated in a spatial-temporal manner, the mechanisms underlying its tissue-selective restriction remain elusive. We have previously reported that transcripts of voltage-gated calcium channel CaV1.3 are subject to brain-selective A-to-I RNA editing by ADAR2. Here, we show that editing of CaV1.3 mRNA is dependent on a 40 bp RNA duplex formed between exon 41 and an evolutionarily conserved editing site complementary sequence (ECS) located within the preceding intron. Heterologous expression of a mouse minigene that contained the ECS, intermediate intronic sequence and exon 41 with ADAR2 yielded robust editing. Interestingly, editing of CaV1.3 was potently inhibited by serine/arginine-rich splicing factor 9 (SRSF9). Mechanistically, the inhibitory effect of SRSF9 required direct RNA interaction. Selective down-regulation of SRSF9 in neurons provides a basis for the neuron-specific editing of CaV1.3 transcripts.
NASA Astrophysics Data System (ADS)
Xing, Pengwei; Su, Ran; Guo, Fei; Wei, Leyi
2017-04-01
N6-methyladenosine (m6A) refers to methylation of the adenosine nucleotide acid at the nitrogen-6 position. It plays an important role in a series of biological processes, such as splicing events, mRNA exporting, nascent mRNA synthesis, nuclear translocation and translation process. Numerous experiments have been done to successfully characterize m6A sites within sequences since high-resolution mapping of m6A sites was established. However, as the explosive growth of genomic sequences, using experimental methods to identify m6A sites are time-consuming and expensive. Thus, it is highly desirable to develop fast and accurate computational identification methods. In this study, we propose a sequence-based predictor called RAM-NPPS for identifying m6A sites within RNA sequences, in which we present a novel feature representation algorithm based on multi-interval nucleotide pair position specificity, and use support vector machine classifier to construct the prediction model. Comparison results show that our proposed method outperforms the state-of-the-art predictors on three benchmark datasets across the three species, indicating the effectiveness and robustness of our method. Moreover, an online webserver implementing the proposed predictor has been established at http://server.malab.cn/RAM-NPPS/. It is anticipated to be a useful prediction tool to assist biologists to reveal the mechanisms of m6A site functions.
Shahzad, Mohsin; Yousaf, Sairah; Waryah, Yar M; Gul, Hadia; Kausar, Tasleem; Tariq, Nabeela; Mahmood, Umair; Ali, Muhammad; Khan, Muzammil A; Waryah, Ali M; Shaikh, Rehan S; Riazuddin, Saima; Ahmed, Zubair M
2017-03-07
Nonsyndromic oculocutaneous Albinism (nsOCA) is clinically characterized by the loss of pigmentation in the skin, hair, and iris. OCA is amongst the most common causes of vision impairment in children. To date, pathogenic variants in six genes have been identified in individuals with nsOCA. Here, we determined the identities, frequencies, and clinical consequences of OCA alleles in 94 previously unreported Pakistani families. Combination of Sanger and Exome sequencing revealed 38 alleles, including 22 novel variants, segregating with nsOCA phenotype in 80 families. Variants of TYR and OCA2 genes were the most common cause of nsOCA, occurring in 43 and 30 families, respectively. Twenty-two novel variants include nine missense, four splice site, two non-sense, one insertion and six gross deletions. In vitro studies revealed retention of OCA proteins harboring novel missense alleles in the endoplasmic reticulum (ER) of transfected cells. Exon-trapping assays with constructs containing splice site alleles revealed errors in splicing. As eight alleles account for approximately 56% (95% CI: 46.52-65.24%) of nsOCA cases, primarily enrolled from Punjab province of Pakistan, hierarchical strategies for variant detection would be feasible and cost-efficient genetic tests for OCA in families with similar origin. Thus, we developed Tetra-primer ARMS assays for rapid, reliable, reproducible and economical screening of most of these common alleles.
Conservation/Mutation in the Splice Sites of Mitochondrial Solute Carrier Genes of Vertebrates.
Calvello, Rosa; Panaro, Maria A; Salvatore, Rosaria; Mitolo, Vincenzo; Cianciulli, Antonia
2016-10-01
The "canonical" introns begin by the dinucleotide GT and end by the dinucleotide AG. GT, together with a few downstream nucleotides, and AG, with a few of the immediately preceding nucleotides, are thought to be the strongest splicing signals (5'ss and 3'ss, respectively). We examined the composition of the intronic initial and terminal hexanucleotides of the mitochondrial solute carrier genes (SLC25A's) of zebrafish, chicken, mouse, and human. These genes are orthologous and we selected the transcripts in which the arrangement of exons and introns was superimposable in the species considered. Both 5'ss and 3'ss were highly polymorphic, with 104 and 126 different configurations, respectively, in our sample. In the line of evolution from zebrafish to chicken, as well as in that from zebrafish to mammals, the average nucleotide conservation in the four variable nucleotides was about 50 % at 5' and 40 % at 3'. In the divergent evolution of mouse and human, the conservation was about 80 % at 5' and 70 % at 3'. Despite these changes, the splicing signals remain strong enough to operate at the same site. At both 5' and 3', the frequency of a nucleotide at a given position in the zebrafish sequence is positively correlated with its conservation in chicken and mammals, suggesting that selection continued to operate in birds and mammals along similar lines.
Ramsden, Richard; Arms, Luther; Davis, Trisha N; Muller, Eric G D
2011-06-27
Inteins are proteins that catalyze their own removal from within larger precursor proteins. In the process they splice the flanking protein sequences, termed the N-and C-terminal exteins. Large inteins frequently have a homing endonuclease that is involved in maintaining the intein in the host. Splicing and nuclease activity are independent and distinct domains in the folded structure. We show here that other biochemical activities can be incorporated into an intein in place of the endonuclease without affecting splicing and that these activities can provide genetic selection for the intein. We have coupled such a genetically marked intein with GFP as the N-terminal extein to create a cassette to introduce GFP within the interior of a targeted protein. The Pch PRP8 mini-intein of Penicillium chrysogenum was modified to include: 1) aminoglycoside phosphotransferase; 2) imidazoleglycerol-phosphate dehydratase, His5 from S. pombe ; 3) hygromycin B phosphotransferase; and 4) the transcriptional activator LexA-VP16. The proteins were inserted at the site of the lost endonuclease. When expressed in E. coli, all of the modified inteins spliced at high efficiency. Splicing efficiency was also greater than 96% when expressed from a plasmid in S. cerevisiae. In addition the inteins conferred either G418 or hygromycin resistance, or histidine or leucine prototropy, depending on the inserted marker and the yeast genetic background. DNA encoding the marked inteins coupled to GFP as the N-terminal extein was PCR amplified with ends homologous to an internal site in the yeast calmodulin gene CMD1. The DNA was transformed into yeast and integrants obtained by direct selection for the intein's marker. The His5-marked intein yielded a fully functional calmodulin that was tagged with GFP within its central linker. Inteins continue to show their flexibility as tools in molecular biology. The Pch PRP8 intein can successfully tolerate a variety of genetic markers and still retain high splicing efficiency. We have shown that a genetically marked intein can be used to insert GFP in one-step within a target protein in vivo.
Detection of alternative splice variants at the proteome level in Aspergillus flavus.
Chang, Kung-Yen; Georgianna, D Ryan; Heber, Steffen; Payne, Gary A; Muddiman, David C
2010-03-05
Identification of proteins from proteolytic peptides or intact proteins plays an essential role in proteomics. Researchers use search engines to match the acquired peptide sequences to the target proteins. However, search engines depend on protein databases to provide candidates for consideration. Alternative splicing (AS), the mechanism where the exon of pre-mRNAs can be spliced and rearranged to generate distinct mRNA and therefore protein variants, enable higher eukaryotic organisms, with only a limited number of genes, to have the requisite complexity and diversity at the proteome level. Multiple alternative isoforms from one gene often share common segments of sequences. However, many protein databases only include a limited number of isoforms to keep minimal redundancy. As a result, the database search might not identify a target protein even with high quality tandem MS data and accurate intact precursor ion mass. We computationally predicted an exhaustive list of putative isoforms of Aspergillus flavus proteins from 20 371 expressed sequence tags to investigate whether an alternative splicing protein database can assign a greater proportion of mass spectrometry data. The newly constructed AS database provided 9807 new alternatively spliced variants in addition to 12 832 previously annotated proteins. The searches of the existing tandem MS spectra data set using the AS database identified 29 new proteins encoded by 26 genes. Nine fungal genes appeared to have multiple protein isoforms. In addition to the discovery of splice variants, AS database also showed potential to improve genome annotation. In summary, the introduction of an alternative splicing database helps identify more proteins and unveils more information about a proteome.
Inhibition of Human Immunodeficiency Virus Replication by Antisense Oligodeoxynucleotides
NASA Astrophysics Data System (ADS)
Goodchild, John; Agrawal, Sudhir; Civeira, Maria P.; Sarin, Prem S.; Sun, Daisy; Zamecnik, Paul C.
1988-08-01
Twenty different target sites within human immunodeficiency virus (HIV) RNA were selected for studies of inhibition of HIV replication by antisense oligonucleotides. Target sites were selected based on their potential capacity to block recognition functions during viral replication. Antisense oligomers complementary to sites within or near the sequence repeated at the ends of retrovirus RNA (R region) and to certain splice sites were most effective. The effect of antisense oligomer length on inhibiting virus replication was also investigated, and preliminary toxicity studies in mice show that these compounds are toxic only at high levels. The results indicate potential usefulness for these oligomers in the treatment of patients with acquired immunodeficiency syndrome (AIDS) and AIDS-related complex either alone or in combination with other drugs.
An indicator gene to demonstrate intracellular transposition of defective retroviruses.
Heidmann, T; Heidmann, O; Nicolas, J F
1988-01-01
An indicator gene for detection and quantitation of RNA-mediated transposition was constructed (neoRT). It was inserted into a Moloney murine leukemia provirus (Mo-MLV) deleted for the envelope gene to test for intracellular transposition of defective retroviruses [Mo-MLV(neo)RT]. NeoRT contains the selectable neo gene (which confers resistance to the drug G418), inactivated by a polyadenylylation sequence inserted between the neo promotor and coding sequence. The polyadenylylation sequence is flanked (on the antisense strand of the DNA) by a donor and an acceptor splice site so as to be removed upon passage of the provirus through an RNA intermediate. 3T3 cells transfected with the defective Mo-MLV(neo)RT provirus are sensitive to G418. After trans-complementation with Mo-MLV, viral transcripts confer resistance to G418 upon infection of test cells. In the resistant cells, the polyadenylylation sequence has been removed, as a result in most cases of precise splicing of the intronic domain. Retrotransposition of the defective Mo-MLV(neo)RT provirus was demonstrated by submitting transfected G418-sensitive clones to selection. Between 1 and 10 G418-resistant clones were obtained per 10(7) cells. Several possess additional copies, with evidence for precise removal of the intronic domain. By using target test cells in coculture experiments, extracellular intermediates of retrotransposition could not be detected. Images PMID:2832848
Schwarze, Ulrike; Hata, Ryu-Ichiro; McKusick, Victor A.; Shinkai, Hiroshi; Hoyme, H. Eugene; Pyeritz, Reed E.; Byers, Peter H.
2004-01-01
Splice site mutations in the COL1A2 gene of type I collagen can give rise to forms of Ehlers-Danlos syndrome (EDS) because of partial or complete skipping of exon 6, as well as to mild, moderate, or lethal forms of osteogenesis imperfecta as a consequence of skipping of other exons. We identified three unrelated individuals with a rare recessively inherited form of EDS (characterized by joint hypermobility, skin hyperextensibility, and cardiac valvular defects); in two of them, COL1A2 messenger RNA (mRNA) instability results from compound heterozygosity for splice site mutations in the COL1A2 gene, and, in the third, it results from homozygosity for a nonsense codon. The splice site mutations led to use of cryptic splice donor sites, creation of a downstream premature termination codon, and extremely unstable mRNA. In the wild-type allele, the two introns (IVS11 and IVS24) in which these mutations occurred were usually spliced slowly in relation to their respective immediate upstream introns. In the mutant alleles, the upstream intron was removed, so that exon skipping could not occur. In the context of the mutation in IVS24, computer-generated folding of a short stretch of mRNA surrounding the mutation site demonstrated realignment of the relationships between the donor and acceptor sites that could facilitate use of a cryptic donor site. These findings suggest that the order of intron removal is an important variable in prediction of mutation outcome at splice sites and that folding of the nascent mRNA could be one element that contributes to determination of order of splicing. The complete absence of proα2(I) chains has the surprising effect of producing cardiac valvular disease without bone involvement. PMID:15077201
Schwarze, Ulrike; Hata, Ryu-Ichiro; McKusick, Victor A; Shinkai, Hiroshi; Hoyme, H Eugene; Pyeritz, Reed E; Byers, Peter H
2004-05-01
Splice site mutations in the COL1A2 gene of type I collagen can give rise to forms of Ehlers-Danlos syndrome (EDS) because of partial or complete skipping of exon 6, as well as to mild, moderate, or lethal forms of osteogenesis imperfecta as a consequence of skipping of other exons. We identified three unrelated individuals with a rare recessively inherited form of EDS (characterized by joint hypermobility, skin hyperextensibility, and cardiac valvular defects); in two of them, COL1A2 messenger RNA (mRNA) instability results from compound heterozygosity for splice site mutations in the COL1A2 gene, and, in the third, it results from homozygosity for a nonsense codon. The splice site mutations led to use of cryptic splice donor sites, creation of a downstream premature termination codon, and extremely unstable mRNA. In the wild-type allele, the two introns (IVS11 and IVS24) in which these mutations occurred were usually spliced slowly in relation to their respective immediate upstream introns. In the mutant alleles, the upstream intron was removed, so that exon skipping could not occur. In the context of the mutation in IVS24, computer-generated folding of a short stretch of mRNA surrounding the mutation site demonstrated realignment of the relationships between the donor and acceptor sites that could facilitate use of a cryptic donor site. These findings suggest that the order of intron removal is an important variable in prediction of mutation outcome at splice sites and that folding of the nascent mRNA could be one element that contributes to determination of order of splicing. The complete absence of pro alpha 2(I) chains has the surprising effect of producing cardiac valvular disease without bone involvement.
Kralovicova, Jana; Knut, Marcin; Cross, Nicholas C. P.; Vorechovsky, Igor
2015-01-01
The auxiliary factor of U2 small nuclear RNA (U2AF) is a heterodimer consisting of 65- and 35-kD proteins that bind the polypyrimidine tract (PPT) and AG dinucleotides at the 3′ splice site (3′ss). The gene encoding U2AF35 (U2AF1) is alternatively spliced, giving rise to two isoforms U2AF35a and U2AF35b. Here, we knocked down U2AF35 and each isoform and characterized transcriptomes of HEK293 cells with varying U2AF35/U2AF65 and U2AF35a/b ratios. Depletion of both isoforms preferentially modified alternative RNA processing events without widespread failure to recognize 3′ss or constitutive exons. Over a third of differentially used exons were terminal, resulting largely from the use of known alternative polyadenylation (APA) sites. Intronic APA sites activated in depleted cultures were mostly proximal whereas tandem 3′UTR APA was biased toward distal sites. Exons upregulated in depleted cells were preceded by longer AG exclusion zones and PPTs than downregulated or control exons and were largely activated by PUF60 and repressed by CAPERα. The U2AF(35) repression and activation was associated with a significant interchange in the average probabilities to form single-stranded RNA in the optimal PPT and branch site locations and sequences further upstream. Although most differentially used exons were responsive to both U2AF subunits and their inclusion correlated with U2AF levels, a small number of transcripts exhibited distinct responses to U2AF35a and U2AF35b, supporting the existence of isoform-specific interactions. These results provide new insights into function of U2AF and U2AF35 in alternative RNA processing. PMID:25779042
Spliced synthetic genes as internal controls in RNA sequencing experiments.
Hardwick, Simon A; Chen, Wendy Y; Wong, Ted; Deveson, Ira W; Blackburn, James; Andersen, Stacey B; Nielsen, Lars K; Mattick, John S; Mercer, Tim R
2016-09-01
RNA sequencing (RNA-seq) can be used to assemble spliced isoforms, quantify expressed genes and provide a global profile of the transcriptome. However, the size and diversity of the transcriptome, the wide dynamic range in gene expression and inherent technical biases confound RNA-seq analysis. We have developed a set of spike-in RNA standards, termed 'sequins' (sequencing spike-ins), that represent full-length spliced mRNA isoforms. Sequins have an entirely artificial sequence with no homology to natural reference genomes, but they align to gene loci encoded on an artificial in silico chromosome. The combination of multiple sequins across a range of concentrations emulates alternative splicing and differential gene expression, and it provides scaling factors for normalization between samples. We demonstrate the use of sequins in RNA-seq experiments to measure sample-specific biases and determine the limits of reliable transcript assembly and quantification in accompanying human RNA samples. In addition, we have designed a complementary set of sequins that represent fusion genes arising from rearrangements of the in silico chromosome to aid in cancer diagnosis. RNA sequins provide a qualitative and quantitative reference with which to navigate the complexity of the human transcriptome.
Cancer-Associated Perturbations in Alternative Pre-messenger RNA Splicing.
Shkreta, Lulzim; Bell, Brendan; Revil, Timothée; Venables, Julian P; Prinos, Panagiotis; Elela, Sherif Abou; Chabot, Benoit
2013-01-01
For most of our 25,000 genes, the removal of introns by pre-messenger RNA (pre-mRNA) splicing represents an essential step toward the production of functional messenger RNAs (mRNAs). Alternative splicing of a single pre-mRNA results in the production of different mRNAs. Although complex organisms use alternative splicing to expand protein function and phenotypic diversity, patterns of alternative splicing are often altered in cancer cells. Alternative splicing contributes to tumorigenesis by producing splice isoforms that can stimulate cell proliferation and cell migration or induce resistance to apoptosis and anticancer agents. Cancer-specific changes in splicing profiles can occur through mutations that are affecting splice sites and splicing control elements, and also by alterations in the expression of proteins that control splicing decisions. Recent progress in global approaches that interrogate splicing diversity should help to obtain specific splicing signatures for cancer types. The development of innovative approaches for annotating and reprogramming splicing events will more fully establish the essential contribution of alternative splicing to the biology of cancer and will hopefully provide novel targets and anticancer strategies. Metazoan genes are usually made up of several exons interrupted by introns. The introns are removed from the pre-mRNA by RNA splicing. In conjunction with other maturation steps, such as capping and polyadenylation, the spliced mRNA is then transported to the cytoplasm to be translated into a functional protein. The basic mechanism of splicing requires accurate recognition of each extremity of each intron by the spliceosome. Introns are identified by the binding of U1 snRNP to the 5' splice site and the U2AF65/U2AF35 complex to the 3' splice site. Following these interactions, other proteins and snRNPs are recruited to generate the complete spliceosomal complex needed to excise the intron. While many introns are constitutively removed by the spliceosome, other splice junctions are not used systematically, generating the phenomenon of alternative splicing. Alternative splicing is therefore the process by which a single species of pre-mRNA can be matured to produce different mRNA molecules (Fig. 1). Depending on the number and types of alternative splicing events, a pre-mRNA can generate from two to several thousands different mRNAs leading to the production of a corresponding number of proteins. It is now believed that the expression of at least 70 % of human genes is subjected to alternative splicing, implying an enormous contribution to proteomic diversity, and by extension, to the development and the evolution of complex animals. Defects in splicing have been associated with human diseases (Caceres and Kornblihtt, Trends Genet 18(4):186-93, 2002, Cartegni et al., Nat Rev Genet 3(4):285-98, 2002, Pagani and Baralle, Nat Rev Genet 5(5):389-96, 2004), including cancer (Brinkman, Clin Biochem 37(7):584-94, 2004, Venables, Bioessays 28(4):378-86, 2006, Srebrow and Kornblihtt, J Cell Sci 119(Pt 13):2635-2641, 2006, Revil et al., Bull Cancer 93(9):909-919, 2006, Venables, Transworld Res Network, 2006, Pajares et al., Lancet Oncol 8(4):349-57, 2007, Skotheim and Nees, Int J Biochem Cell Biol 39:1432-1449, 2007). Numerous studies have now confirmed the existence of specific differences in the alternative splicing profiles between normal and cancer tissues. Although there are a few cases where specific mutations are the primary cause for these changes, global alterations in alternative splicing in cancer cells may be primarily derived from changes in the expression of RNA-binding proteins that control splice site selection. Overall, these cancer-specific differences in alternative splicing offer an immense potential to improve the diagnosis and the prognosis of cancer. This review will focus on the functional impact of cancer-associated alternative splicing variants, the molecular determinants that alter the splicing decisions in cancer cells, and future therapeutic strategies.
Lemoine, E; Merceron, D; Sallantin, J; Nguifo, E M
1999-01-01
This paper describes a new approach to problem solving by splitting up problem component parts between software and hardware. Our main idea arises from the combination of two previously published works. The first one proposed a conceptual environment of concept modelling in which the machine and the human expert interact. The second one reported an algorithm based on reconfigurable hardware system which outperforms any kind of previously published genetic data base scanning hardware or algorithms. Here we show how efficient the interaction between the machine and the expert is when the concept modelling is based on reconfigurable hardware system. Their cooperation is thus achieved with an real time interaction speed. The designed system has been partially applied to the recognition of primate splice junctions sites in genetic sequences.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ponthier, Julie L.; Schluepen, Christina; Chen, Weiguo
Activation of protein 4.1R exon 16 (E16) inclusion during erythropoiesis represents a physiologically important splicing switch that increases 4.1R affinity for spectrin and actin. Previous studies showed that negative regulation of E16 splicing is mediated by the binding of hnRNP A/B proteins to silencer elements in the exon and that downregulation of hnRNP A/B proteins in erythroblasts leads to activation of E16 inclusion. This paper demonstrates that positive regulation of E16 splicing can be mediated by Fox-2 or Fox-1, two closely related splicing factors that possess identical RNA recognition motifs. SELEX experiments with human Fox-1 revealed highly selective binding tomore » the hexamer UGCAUG. Both Fox-1 and Fox-2 were able to bind the conserved UGCAUG elements in the proximal intron downstream of E16, and both could activate E16 splicing in HeLa cell co-transfection assays in a UGCAUG-dependent manner. Conversely, knockdown of Fox-2 expression, achieved with two different siRNA sequences resulted in decreased E16 splicing. Moreover, immunoblot experiments demonstrate mouse erythroblasts express Fox-2, but not Fox-1. These findings suggest that Fox-2 is a physiological activator of E16 splicing in differentiating erythroid cells in vivo. Recent experiments show that UGCAUG is present in the proximal intron sequence of many tissue-specific alternative exons, and we propose that the Fox family of splicing enhancers plays an important role in alternative splicing switches during differentiation in metazoan organisms.« less
Pan, Ling; Pasternak, David A; Xu, Jin; Xu, Mingming; Lu, Zhigang; Pasternak, Gavril W; Pan, Ying-Xian
2017-01-01
The sigma1 receptor acts as a chaperone at the endoplasmic reticulum, associates with multiple proteins in various cellular systems, and involves in a number of diseases, such as addiction, pain, cancer and psychiatric disorders. The sigma1 receptor is encoded by the single copy SIGMAR1 gene. The current study identifies five alternatively spliced variants of the mouse sigma1 receptor gene using a polymerase chain reaction cloning approach. All the splice variants are generated by exon skipping or alternative 3' or 5' splicing, producing the truncated sigma1 receptor. Similar alternative splicing has been observed in the human SIGMAR1 gene based on the molecular cloning or genome sequence prediction, suggesting conservation of alternative splicing of SIGMAR1 gene. Using quantitative polymerase chain reactions, we demonstrate differential expression of several splice variants in mouse tissues and brain regions. When expressed in HEK293 cells, all the splice variants fail to bind sigma ligands, implicating that each truncated region in these splice variants is important for ligand binding. However, co-immunoprecipitation (Co-IP) study in HEK293 cells co-transfected with tagged constructs reveals that all the splice variants maintain their ability to physically associate with a mu opioid receptor (mMOR-1), providing useful information to correlate the motifs/sequences necessary for their physical association. Furthermore, a competition Co-IP study showed that all the variants can disrupt in a dose-dependent manner the dimerization of the original sigma1 receptor with mMOR-1, suggesting a potential dominant negative function and providing significant insights into their function.
PASTA: splice junction identification from RNA-Sequencing data
2013-01-01
Background Next generation transcriptome sequencing (RNA-Seq) is emerging as a powerful experimental tool for the study of alternative splicing and its regulation, but requires ad-hoc analysis methods and tools. PASTA (Patterned Alignments for Splicing and Transcriptome Analysis) is a splice junction detection algorithm specifically designed for RNA-Seq data, relying on a highly accurate alignment strategy and on a combination of heuristic and statistical methods to identify exon-intron junctions with high accuracy. Results Comparisons against TopHat and other splice junction prediction software on real and simulated datasets show that PASTA exhibits high specificity and sensitivity, especially at lower coverage levels. Moreover, PASTA is highly configurable and flexible, and can therefore be applied in a wide range of analysis scenarios: it is able to handle both single-end and paired-end reads, it does not rely on the presence of canonical splicing signals, and it uses organism-specific regression models to accurately identify junctions. Conclusions PASTA is a highly efficient and sensitive tool to identify splicing junctions from RNA-Seq data. Compared to similar programs, it has the ability to identify a higher number of real splicing junctions, and provides highly annotated output files containing detailed information about their location and characteristics. Accurate junction data in turn facilitates the reconstruction of the splicing isoforms and the analysis of their expression levels, which will be performed by the remaining modules of the PASTA pipeline, still under development. Use of PASTA can therefore enable the large-scale investigation of transcription and alternative splicing. PMID:23557086
A deep intronic mutation in the SLC12A3 gene leads to Gitelman syndrome.
Nozu, Kandai; Iijima, Kazumoto; Nozu, Yoshimi; Ikegami, Ei; Imai, Takehide; Fu, Xue Jun; Kaito, Hiroshi; Nakanishi, Koichi; Yoshikawa, Norishige; Matsuo, Masafumi
2009-11-01
Many mutations have been detected in the SLC12A3 gene of Gitelman syndrome (GS, OMIM 263800) patients. In previous studies, only one mutant allele was detected in approximately 20 to 41% of patients with GS; however, the exact reason for the nonidentification has not been established. In this study, we used RT-PCR using mRNA to investigate for the first time transcript abnormalities caused by deep intronic mutation. Direct sequencing analysis of leukocyte DNA identified one base insertion in exon 6 (c.818_819insG), but no mutation was detected in another allele. We analyzed RNA extracted from leukocytes and urine sediments and detected unknown sequence containing 238bp between exons 13 and 14. The genomic DNA analysis of intron 13 revealed a single-base substitution (c.1670-191C>T) that creates a new donor splice site within the intron resulting in the inclusion of a novel cryptic exon in mRNA. This is the first report of creation of a splice site by a deep intronic single-nucleotide change in GS and the first report to detect the onset mechanism in a patient with GS and missing mutation in one allele. This molecular onset mechanism may partly explain the poor success rate of mutation detection in both alleles of patients with GS.
Intronic splicing mutations in PTCH1 cause Gorlin syndrome.
Bholah, Zaynab; Smith, Miriam J; Byers, Helen J; Miles, Emma K; Evans, D Gareth; Newman, William G
2014-09-01
Gorlin syndrome is an autosomal dominant disorder characterized by multiple early-onset basal cell carcinoma, odontogenic keratocysts and skeletal abnormalities. It is caused by heterozygous mutations in the tumour suppressor PTCH1. Routine clinical genetic testing, by Sanger sequencing and multiplex ligation-dependent probe amplification (MLPA) to confirm a clinical diagnosis of Gorlin syndrome, identifies a mutation in 60-90 % of cases. We undertook RNA analysis on lymphocytes from ten individuals diagnosed with Gorlin syndrome, but without known PTCH1 mutations by exonic sequencing or MLPA. Two altered PTCH1 transcripts were identified. Genomic DNA sequence analysis identified an intron 7 mutation c.1068-10T>A, which created a strong cryptic splice acceptor site, leading to an intronic insertion of eight bases; this is predicted to create a frameshift p.(His358Alafs*12). Secondly, a deep intronic mutation c.2561-2057A>G caused an inframe insertion of 78 intronic bases in the cDNA transcript, leading to a premature stop codon p.(Gly854fs*3). The mutations are predicted to cause loss of function of PTCH1, consistent with its tumour suppressor function. The findings indicate the importance of RNA analysis to detect intronic mutations in PTCH1 not identified by routine screening techniques.
Comparative Analyses of DNA Methylation and Sequence Evolution Using Nasonia Genomes
Park, Jungsun; Peng, Zuogang; Zeng, Jia; Elango, Navin; Park, Taesung; Wheeler, Dave; Werren, John H.; Yi, Soojin V.
2011-01-01
The functional and evolutionary significance of DNA methylation in insect genomes remains to be resolved. Nasonia is well situated for comparative analyses of DNA methylation and genome evolution, since the genomes of a moderately distant outgroup species as well as closely related sibling species are available. Using direct sequencing of bisulfite-converted DNA, we uncovered a substantial level of DNA methylation in 17 of 18 Nasonia vitripennis genes and a strong correlation between methylation level and CpG depletion. Notably, in the sex-determining locus transformer, the exon that is alternatively spliced between the sexes is heavily methylated in both males and females, whereas other exons are only sparsely methylated. Orthologous genes of the honeybee and Nasonia show highly similar relative levels of CpG depletion, despite ∼190 My divergence. Densely and sparsely methylated genes in these species also exhibit similar functional enrichments. We found that the degree of CpG depletion is negatively correlated with substitution rates between closely related Nasonia species for synonymous, nonsynonymous, and intron sites. This suggests that mutation rates increase with decreasing levels of germ line methylation. Thus, DNA methylation is prevalent in the Nasonia genome, may participate in regulatory processes such as sex determination and alternative splicing, and is correlated with several aspects of genome and sequence evolution. PMID:21693438
Revealing the transcriptomic complexity of switchgrass by PacBio long-read sequencing.
Zuo, Chunman; Blow, Matthew; Sreedasyam, Avinash; Kuo, Rita C; Ramamoorthy, Govindarajan Kunde; Torres-Jerez, Ivone; Li, Guifen; Wang, Mei; Dilworth, David; Barry, Kerrie; Udvardi, Michael; Schmutz, Jeremy; Tang, Yuhong; Xu, Ying
2018-01-01
Switchgrass ( Panicum virgatum L.) is an important bioenergy crop widely used for lignocellulosic research. While extensive transcriptomic analyses have been conducted on this species using short read-based sequencing techniques, very little has been reliably derived regarding alternatively spliced (AS) transcripts. We present an analysis of transcriptomes of six switchgrass tissue types pooled together, sequenced using Pacific Biosciences (PacBio) single-molecular long-read technology. Our analysis identified 105,419 unique transcripts covering 43,570 known genes and 8795 previously unknown genes. 45,168 are novel transcripts of known genes. A total of 60,096 AS transcripts are identified, 45,628 being novel. We have also predicted 1549 transcripts of genes involved in cell wall construction and remodeling, 639 being novel transcripts of known cell wall genes. Most of the predicted transcripts are validated against Illumina-based short reads. Specifically, 96% of the splice junction sites in all the unique transcripts are validated by at least five Illumina reads. Comparisons between genes derived from our identified transcripts and the current genome annotation revealed that among the gene set predicted by both analyses, 16,640 have different exon-intron structures. Overall, substantial amount of new information is derived from the PacBio RNA data regarding both the transcriptome and the genome of switchgrass.
SMITten by the Speed of Splicing.
Johnson, Tracy L; Ares, Manuel
2016-04-07
Splicing occurs co-transcriptionally, but relative rates of splicing and transcription that might reveal mechanisms of their coordinated control have remained mysterious. Now, Carrillo Oesterreich et al. show that the fastest introns are gone nearly as soon as the 3' splice site is transcribed and that introns have distinct splicing kinetics with respect to polymerase progression along the gene. Copyright © 2016 Elsevier Inc. All rights reserved.
Lemahieu, V; Gastier, J M; Francke, U
1999-01-01
Wiskott-Aldrich syndrome (WAS) is an X-linked recessive immunodeficiency characterized by thrombocytopenia, eczema, and recurrent infections, and caused by mutations in the WAS protein (WASP) gene. WASP contains several functional domains through which it interacts with proteins involved in intracellular signaling and regulation of the actin cytoskeleton. In this report, 17 WASP gene mutations were identified, 12 of which are novel. DNA of affected males and obligate carriers was PCR amplified and analyzed by SSCA, heteroduplex analysis, and direct sequencing. The effects of the mutations at the mRNA and protein level were ascertained by RT-PCR and Western blot analyses. All missense mutations were located in exons 1-4. Most of the nonsense, frameshift and splice site mutations were found in exons 6-11. Mutations that alter splice sites led to the synthesis of several types of mRNAs, a fraction of which represented the normally spliced product. The presence of normally spliced transcripts was correlated with a milder phenotype. When one such case was studied by Western blotting, reduced amounts of normal-size WASP were present. In other cases as well, a correlation was found between the amount of normal or mutant WASP present and the phenotypes of the affected individuals. No protein was detected in two individuals with severe WAS. Reduced levels of a normal-size WASP with a missense mutation were seen in two individuals with XLT. It is concluded that mutation analysis at the DNA level is not sufficient for predicting clinical course. Studies at the transcript and protein level are needed for a better assessment.
ANXA11 mutations prevail in Chinese ALS patients with and without cognitive dementia.
Zhang, Kang; Liu, Qing; Liu, Keqiang; Shen, Dongchao; Tai, Hongfei; Shu, Shi; Ding, Qingyun; Fu, Hanhui; Liu, Shuangwu; Wang, Zhili; Li, Xiaoguang; Liu, Mingsheng; Zhang, Xue; Cui, Liying
2018-06-01
To investigate the genetic contribution of ANXA11 , a gene associated with amyotrophic lateral sclerosis (ALS), in Chinese ALS patients with and without cognitive dementia. Sequencing all the coding exons of ANXA11 and intron-exon boundaries in 18 familial amyotrophic lateral sclerosis (FALS), 353 unrelated sporadic amyotrophic lateral sclerosis (SALS), and 12 Chinese patients with ALS-frontotemporal lobar dementia (ALS-FTD). The transcripts in peripheral blood generated from a splicing mutation were examined by reverse transcriptase PCR. We identified 6 nonsynonymous heterozygous mutations (5 novel and 1 recurrent), 1 splice site mutation, and 1 deletion of 10 amino acids (not accounted in the mutant frequency) in 11 unrelated patients, accounting for a mutant frequency of 5.6% (1/18) in FALS, 2.3% (8/353) in SALS, and 8.3% (1/12) in ALS-FTD. The deletion of 10 amino acids was detected in 1 clinically undetermined male with an ALS family history who had atrophy in hand muscles and myotonic discharges revealed by EMG. The novel p. P36R mutation was identified in 1 FALS index, 1 patient with SALS, and 1 ALS-FTD. The splicing mutation (c.174-2A>G) caused in-frame skipping of the entire exon 6. The rest missense mutations including p.D40G, p.V128M, p.S229R, p.R302C and p.G491R were found in 6 unrelated patients with SALS. The ANXA11 gene is one of the most frequently mutated genes in Chinese patients with SALS. A canonical splice site mutation leading to skipping of the entire exon 6 further supports the loss-of-function mechanism. In addition, the study findings further expand the ANXA11 phenotype, first highlighting its pathogenic role in ALS-FTD.
Regulation of alternative splicing in Drosophila by 56 RNA binding proteins
Brooks, Angela N.; Duff, Michael O.; May, Gemma; ...
2015-08-20
Alternative splicing is regulated by RNA binding proteins (RBPs) that recognize pre-mRNA sequence elements and activate or repress adjacent exons. Here, we used RNA interference and RNA-seq to identify splicing events regulated by 56 Drosophila proteins, some previously unknown to regulate splicing. Nearly all proteins affected alternative first exons, suggesting that RBPs play important roles in first exon choice. Half of the splicing events were regulated by multiple proteins, demonstrating extensive combinatorial regulation. We observed that SR and hnRNP proteins tend to act coordinately with each other, not antagonistically. We also identified a cross-regulatory network where splicing regulators affected themore » splicing of pre-mRNAs encoding other splicing regulators. In conclusion, this large-scale study substantially enhances our understanding of recent models of splicing regulation and provides a resource of thousands of exons that are regulated by 56 diverse RBPs.« less
Mutations in KIAA0753 cause Joubert syndrome associated with growth hormone deficiency
Stephen, Joshi; Vilboux, Thierry; Mian, Luhe; Kuptanon, Chulaluck; Sinclair, Courtney M.; Yildirimli, Deniz; Maynard, Dawn M.; Bryant, Joy; Fischer, Roxanne; Vemulapalli, Meghana; Mullikin, James C.; Huizing, Marjan; Gahl, William A.
2017-01-01
Joubert syndrome and related disorders (JSRD) are a heterogeneous group of ciliopathies defined based on the mid-hindbrain abnormalities that result in the characteristic “molar tooth sign” on brain imaging. The core clinical findings of JSRD are hypotonia, developmental delay, abnormal eye movements and breathing abnormalities. To date, more than 30 JSRD genes that encode proteins important for structure and/or function of cilia have been identified. Here, we present 2 siblings with Joubert syndrome associated with growth hormone deficiency. Whole exome sequencing of the family identified compound heterozygous mutations in KIAA0753, i.e., a missense mutation (p.Arg257Gly) and an intronic mutation (c.2359-1G>C). The intronic mutation alters normal splicing by activating a cryptic acceptor splice site in exon 16. The novel acceptor site skips nine nucleotides, deleting three amino acids from the protein coding frame. KIAA0753 (OFIP) is a centrosome and pericentriolar satellite protein, previously not known to cause Joubert syndrome. We present comprehensive clinical descriptions of the Joubert syndrome patients as well as the cellular phenotype of defective ciliogenesis in the patients’ fibroblasts. PMID:28220259
Mutations in KIAA0753 cause Joubert syndrome associated with growth hormone deficiency.
Stephen, Joshi; Vilboux, Thierry; Mian, Luhe; Kuptanon, Chulaluck; Sinclair, Courtney M; Yildirimli, Deniz; Maynard, Dawn M; Bryant, Joy; Fischer, Roxanne; Vemulapalli, Meghana; Mullikin, James C; Huizing, Marjan; Gahl, William A; Malicdan, May Christine V; Gunay-Aygun, Meral
2017-04-01
Joubert syndrome and related disorders (JSRD) are a heterogeneous group of ciliopathies defined based on the mid-hindbrain abnormalities that result in the characteristic "molar tooth sign" on brain imaging. The core clinical findings of JSRD are hypotonia, developmental delay, abnormal eye movements and breathing abnormalities. To date, more than 30 JSRD genes that encode proteins important for structure and/or function of cilia have been identified. Here, we present 2 siblings with Joubert syndrome associated with growth hormone deficiency. Whole exome sequencing of the family identified compound heterozygous mutations in KIAA0753, i.e., a missense mutation (p.Arg257Gly) and an intronic mutation (c.2359-1G>C). The intronic mutation alters normal splicing by activating a cryptic acceptor splice site in exon 16. The novel acceptor site skips nine nucleotides, deleting three amino acids from the protein coding frame. KIAA0753 (OFIP) is a centrosome and pericentriolar satellite protein, previously not known to cause Joubert syndrome. We present comprehensive clinical descriptions of the Joubert syndrome patients as well as the cellular phenotype of defective ciliogenesis in the patients' fibroblasts.
Singh, Smriti; Narayanan, Sathiya Pandi; Biswas, Kajal; Gupta, Amit; Ahuja, Neha; Yadav, Sandhya; Panday, Rajendra Kumar; Samaiya, Atul; Sharan, Shyam K.
2017-01-01
Aberrant alternative splicing and epigenetic changes are both associated with various cancers, but epigenetic regulation of alternative splicing in cancer is largely unknown. Here we report that the intragenic DNA methylation-mediated binding of Brother of Regulator of Imprinted Sites (BORIS) at the alternative exon of Pyruvate Kinase (PKM) is associated with cancer-specific splicing that promotes the Warburg effect and breast cancer progression. Interestingly, the inhibition of DNA methylation, BORIS depletion, or CRISPR/Cas9-mediated deletion of the BORIS binding site leads to a splicing switch from cancer-specific PKM2 to normal PKM1 isoform. This results in the reversal of the Warburg effect and the inhibition of breast cancer cell growth, which may serve as a useful approach to inhibit the growth of breast cancer cells. Importantly, our results show that in addition to PKM splicing, BORIS also regulates the alternative splicing of several genes in a DNA methylation-dependent manner. Our findings highlight the role of intragenic DNA methylation and DNA binding protein BORIS in cancer-specific splicing and its role in tumorigenesis. PMID:29073069
Phylogenetic Analysis of Nuclear-Encoded RNA Maturases
Malik, Sunita; Upadhyaya, KC; Khurana, SM Paul
2017-01-01
Posttranscriptional processes, such as splicing, play a crucial role in gene expression and are prevalent not only in nuclear genes but also in plant mitochondria where splicing of group II introns is catalyzed by a class of proteins termed maturases. In plant mitochondria, there are 22 mitochondrial group II introns. matR, nMAT1, nMAT2, nMAT3, and nMAT4 proteins have been shown to be required for efficient splicing of several group II introns in Arabidopsis thaliana. Nuclear maturases (nMATs) are necessary for splicing of mitochondrial genes, leading to normal oxidative phosphorylation. Sequence analysis through phylogenetic tree (including bootstrapping) revealed high homology with maturase sequences of A thaliana and other plants. This study shows the phylogenetic relationship of nMAT proteins between A thaliana and other nonredundant plant species taken from BLASTP analysis. PMID:28607538
Landscape of the spliced leader trans-splicing mechanism in Schistosoma mansoni.
Boroni, Mariana; Sammeth, Michael; Gava, Sandra Grossi; Jorge, Natasha Andressa Nogueira; Macedo, Andréa Mara; Machado, Carlos Renato; Mourão, Marina Moraes; Franco, Glória Regina
2018-03-01
Spliced leader dependent trans-splicing (SLTS) has been described as an important RNA regulatory process that occurs in different organisms, including the trematode Schistosoma mansoni. We identified more than seven thousand putative SLTS sites in the parasite, comprising genes with a wide spectrum of functional classes, which underlines the SLTS as a ubiquitous mechanism in the parasite. Also, SLTS gene expression levels span several orders of magnitude, showing that SLTS frequency is not determined by the expression level of the target gene, but by the presence of particular gene features facilitating or hindering the trans-splicing mechanism. Our in-depth investigation of SLTS events demonstrates widespread alternative trans-splicing (ATS) acceptor sites occurring in different regions along the entire gene body, highlighting another important role of SLTS generating alternative RNA isoforms in the parasite, besides the polycistron resolution. Particularly for introns where SLTS directly competes for the same acceptor substrate with cis-splicing, we identified for the first time additional and important features that might determine the type of splicing. Our study substantially extends the current knowledge of RNA processing by SLTS in S. mansoni, and provide basis for future studies on the trans-splicing mechanism in other eukaryotes.
Hadjikyriacou, Andrea; Yang, Yanzhong; Espejo, Alexsandra; Bedford, Mark T.; Clarke, Steven G.
2015-01-01
Human protein arginine methyltransferase (PRMT) 9 symmetrically dimethylates arginine residues on splicing factor SF3B2 (SAP145) and has been functionally linked to the regulation of alternative splicing of pre-mRNA. Site-directed mutagenesis studies on this enzyme and its substrate had revealed essential unique residues in the double E loop and the importance of the C-terminal duplicated methyltransferase domain. In contrast to what had been observed with other PRMTs and their physiological substrates, a peptide containing the methylatable Arg-508 of SF3B2 was not recognized by PRMT9 in vitro. Although amino acid substitutions of residues surrounding Arg-508 had no great effect on PRMT9 recognition of SF3B2, moving the arginine residue within this sequence abolished methylation. PRMT9 and PRMT5 are the only known mammalian enzymes capable of forming symmetric dimethylarginine (SDMA) residues as type II PRMTs. We demonstrate here that the specificity of these enzymes for their substrates is distinct and not redundant. The loss of PRMT5 activity in mouse embryo fibroblasts results in almost complete loss of SDMA, suggesting that PRMT5 is the primary SDMA-forming enzyme in these cells. PRMT9, with its duplicated methyltransferase domain and conserved sequence in the double E loop, appears to have a unique structure and specificity among PRMTs for methylating SF3B2 and potentially other polypeptides. PMID:25979344
A study of alternative splicing in the pig
2010-01-01
Background Since at least half of the genes in mammalian genomes are subjected to alternative splicing, alternative pre-mRNA splicing plays an important contribution to the complexity of the mammalian proteome. Expressed sequence tags (ESTs) provide evidence of a great number of possible alternative isoforms. With the EST resource for the domestic pig now containing more than one million porcine ESTs, it is possible to identify alternative splice forms of the individual transcripts in this species from the EST data with some confidence. Results The pig EST data generated by the Sino-Danish Pig Genome project has been assembled with publicly available ESTs and made available in the PigEST database. Using the Distiller package 2,515 EST clusters with candidate alternative isoforms were identified in the EST data with high confidence. In agreement with general observations in human and mouse, we find putative splice variants in about 30% of the contigs with more than 50 ESTs. Based on the criteria that a minimum of two EST sequences confirmed each splice event, a list of 100 genes with the most distinct tissue-specific alternative splice events was generated from the list of candidates. To confirm the tissue specificity of the splice events, 10 genes with functional annotation were randomly selected from which 16 individual splice events were chosen for experimental verification by quantitative PCR (qPCR). Six genes were shown to have tissue specific alternatively spliced transcripts with expression patterns matching those of the EST data. The remaining four genes had tissue-restricted expression of alternative spliced transcripts. Five out of the 16 splice events that were experimentally verified were found to be putative pig specific. Conclusions In accordance with human and rodent studies we estimate that approximately 30% of the porcine genes undergo alternative splicing. We found a good correlation between EST predicted tissue-specificity and experimentally validated splice events in different porcine tissue. This study indicates that a cluster size of around 50 ESTs is optimal for in silico detection of alternative splicing. Although based on a limited number of splice events, the study supports the notion that alternative splicing could have an important impact on species differentiation since 31% of the splice events studied appears to be species specific. PMID:20444244
Genetic diagnosis of familial hypercholesterolaemia by targeted next-generation sequencing
Maglio, C; Mancina, R M; Motta, B M; Stef, M; Pirazzi, C; Palacios, L; Askaryar, N; Borén, J; Wiklund, O; Romeo, S
2014-01-01
Maglio C., Mancina R. M., Motta B. M., Stef M., Pirazzi C., Palacios L., Askaryar N., Borén J., Wiklund O., Romeo S. (University of Gothenburg, Gothenburg, Sweden; University Magna Graecia of Catanzaro, Italy; University of Milan, Italy; Progenika Biopharma SA, Derio, Spain). Genetic diagnosis of familial hypercholesterolaemia by targeted next-generation sequencing. Objectives The aim of this study was to combine clinical criteria and next-generation sequencing (pyrosequencing) to establish a diagnosis of familial hypercholesterolaemia (FH). Design, setting and subjects A total of 77 subjects with a Dutch Lipid Clinic Network score of ≥3 (possible, probable or definite FH clinical diagnosis) were recruited from the Lipid Clinic at Sahlgrenska Hospital, Gothenburg, Sweden. Next-generation sequencing was performed in all subjects using SEQPRO LIPO RS, a kit that detects mutations in the low-density lipoprotein receptor (LDLR), apolipoprotein B (APOB), proprotein convertase subtilisin/kexin type 9 (PCSK9) and LDLR adapter protein 1 (LDLRAP1) genes; copy-number variations in the LDLR gene were also examined. Results A total of 26 mutations were detected in 50 subjects (65% success rate). Amongst these, 23 mutations were in the LDLR gene, two in the APOB gene and one in the PCSK9 gene. Four mutations with unknown pathogenicity were detected in LDLR. Of these, three mutations (Gly505Asp, Ile585Thr and Gln660Arg) have been previously reported in subjects with FH, but their pathogenicity has not been proved. The fourth, a mutation in LDLR affecting a splicing site (exon 6–intron 6) has not previously been reported; it was found to segregate with high cholesterol levels in the family of the proband. Conclusions Using a combination of clinical criteria and targeted next-generation sequencing, we have achieved FH diagnosis with a high success rate. Furthermore, we identified a new splicing-site mutation in the LDLR gene. PMID:24785115
The genomic structure of the human UFO receptor.
Schulz, A S; Schleithoff, L; Faust, M; Bartram, C R; Janssen, J W
1993-02-01
Using a DNA transfection-tumorigenicity assay we have recently identified the UFO oncogene. It encodes a tyrosine kinase receptor characterized by the juxtaposition of two immunoglobulin-like and two fibronectin type III repeats in its extracellular domain. Here we describe the genomic organization of the human UFO locus. The UFO receptor is encoded by 20 exons that are distributed over a region of 44 kb. Different isoforms of UFO mRNA are generated by alternative splicing of exon 10 and differential usage of two imperfect polyadenylation sites resulting in the presence or absence of 1.5-kb 3' untranslated sequences. Primer extension and S1 nuclease analyses revealed multiple transcriptional initiation sites including a major site 169 bp upstream of the translation start site. The promoter region is GC rich, lacks TATA and CAAT boxes, but contains potential recognition sites for a variety of trans-acting factors, including Sp1, AP-2 and the cyclic AMP response element-binding protein. Proto-UFO and its oncogenic counterpart exhibit identical cDNA and promoter regions sequences. Possible modes of UFO activation are discussed.
Functional domains of the human splicing factor ASF/SF2.
Zuo, P; Manley, J L
1993-01-01
The human splicing factor ASF/SF2 displays two predominant activities in in vitro splicing assays: (i) it is an essential factor apparently required for all splices and (ii) it is able to switch utilization of alternative 5' splice sites in a concentration-dependent manner. ASF/SF2 is the prototype of a family of proteins typified by the presence of one or two RNP-type RNA binding domains (RBDs) and a region highly enriched in repeating arginine-serine dipeptides (RS regions). Here we describe a functional analysis of ASF/SF2, which defines several regions essential for one, or both, of its two principal activities, and provides insights into how this type of protein functions in splicing. Two isoforms of the protein, which arise from alternative splicing, are by themselves inactive, but each can block the activity of ASF/SF2, thereby functioning as splicing repressors. Some, but not all, mutations in the RS region prevent ASF/SF2 from functioning as an essential splicing factor. However, the entire RS region can be deleted without reducing splice site switching activity, indicating that it is not absolutely required for interaction with other splicing factors. Experiments with deletion and substitution mutants reveal that the protein contains two related, but highly diverged, RBDs, and that both are essential for activity. Each RBD by itself retains the ability to bind RNA, although optimal binding requires both domains. Images PMID:8223481
Deep intronic GPR143 mutation in a Japanese family with ocular albinism
Naruto, Takuya; Okamoto, Nobuhiko; Masuda, Kiyoshi; Endo, Takao; Hatsukawa, Yoshikazu; Kohmoto, Tomohiro; Imoto, Issei
2015-01-01
Deep intronic mutations are often ignored as possible causes of human disease. Using whole-exome sequencing, we analysed genomic DNAs of a Japanese family with two male siblings affected by ocular albinism and congenital nystagmus. Although mutations or copy number alterations of coding regions were not identified in candidate genes, the novel intronic mutation c.659-131 T > G within GPR143 intron 5 was identified as hemizygous in affected siblings and as heterozygous in the unaffected mother. This mutation was predicted to create a cryptic splice donor site within intron 5 and activate a cryptic acceptor site at 41nt upstream, causing the insertion into the coding sequence of an out-of-frame 41-bp pseudoexon with a premature stop codon in the aberrant transcript, which was confirmed by minigene experiments. This result expands the mutational spectrum of GPR143 and suggests the utility of next-generation sequencing integrated with in silico and experimental analyses for improving the molecular diagnosis of this disease. PMID:26061757
Deep intronic GPR143 mutation in a Japanese family with ocular albinism.
Naruto, Takuya; Okamoto, Nobuhiko; Masuda, Kiyoshi; Endo, Takao; Hatsukawa, Yoshikazu; Kohmoto, Tomohiro; Imoto, Issei
2015-06-10
Deep intronic mutations are often ignored as possible causes of human disease. Using whole-exome sequencing, we analysed genomic DNAs of a Japanese family with two male siblings affected by ocular albinism and congenital nystagmus. Although mutations or copy number alterations of coding regions were not identified in candidate genes, the novel intronic mutation c.659-131 T > G within GPR143 intron 5 was identified as hemizygous in affected siblings and as heterozygous in the unaffected mother. This mutation was predicted to create a cryptic splice donor site within intron 5 and activate a cryptic acceptor site at 41nt upstream, causing the insertion into the coding sequence of an out-of-frame 41-bp pseudoexon with a premature stop codon in the aberrant transcript, which was confirmed by minigene experiments. This result expands the mutational spectrum of GPR143 and suggests the utility of next-generation sequencing integrated with in silico and experimental analyses for improving the molecular diagnosis of this disease.
Pan, Ling; Pasternak, David A.; Xu, Jin; Xu, Mingming; Lu, Zhigang; Pasternak, Gavril W.
2017-01-01
The sigma1 receptor acts as a chaperone at the endoplasmic reticulum, associates with multiple proteins in various cellular systems, and involves in a number of diseases, such as addiction, pain, cancer and psychiatric disorders. The sigma1 receptor is encoded by the single copy SIGMAR1 gene. The current study identifies five alternatively spliced variants of the mouse sigma1 receptor gene using a polymerase chain reaction cloning approach. All the splice variants are generated by exon skipping or alternative 3’ or 5’ splicing, producing the truncated sigma1 receptor. Similar alternative splicing has been observed in the human SIGMAR1 gene based on the molecular cloning or genome sequence prediction, suggesting conservation of alternative splicing of SIGMAR1 gene. Using quantitative polymerase chain reactions, we demonstrate differential expression of several splice variants in mouse tissues and brain regions. When expressed in HEK293 cells, all the splice variants fail to bind sigma ligands, implicating that each truncated region in these splice variants is important for ligand binding. However, co-immunoprecipitation (Co-IP) study in HEK293 cells co-transfected with tagged constructs reveals that all the splice variants maintain their ability to physically associate with a mu opioid receptor (mMOR-1), providing useful information to correlate the motifs/sequences necessary for their physical association. Furthermore, a competition Co-IP study showed that all the variants can disrupt in a dose-dependent manner the dimerization of the original sigma1 receptor with mMOR-1, suggesting a potential dominant negative function and providing significant insights into their function. PMID:28350844
77 FR 28240 - Airworthiness Directives; The Boeing Company Airplanes
Federal Register 2010, 2011, 2012, 2013, 2014
2012-05-14
... multiple site damage cracks in the radial web lap and tear strap splices of the aft pressure bulkhead at... multiple site damage cracks in the radial web lap and tear strap splices of the aft pressure bulkhead at...
Sarmiento, José M; Añazco, Carolina C; Campos, Danae M; Prado, Gregory N; Navarro, Javier; González, Carlos B
2004-11-05
In rat kidney, two alternatively spliced transcripts are generated from the V2 vasopressin receptor gene. The large transcript (1.2 kb) encodes the canonical V2 receptor, whereas the small transcript encodes a splice variant displaying a distinct sequence corresponding to the putative seventh transmembrane domain and the intracellular C terminus of the V2 receptor. This work showed that the small spliced transcript is translated in the rat kidney collecting tubules. However, the protein encoded by the small transcript (here called the V2b splice variant) is retained inside the cell, in contrast to the preferential surface distribution of the V2 receptor (here called the V2a receptor). Cells expressing the V2b splice variant do not exhibit binding to 3H-labeled vasopressin. Interestingly, we found that expression of the splice variant V2b down-regulates the surface expression of the V2a receptor, most likely via the formation of V2a.V2b heterodimers as demonstrated by co-immunoprecipitation and fluorescence resonance energy transfer experiments between the V2a receptor and the V2b splice variant. The V2b splice variant would then be acting as a dominant negative. The effect of the V2b splice variant is specific, as it does not affect the surface expression of the G protein-coupled interleukin-8 receptor (CXCR1). Furthermore, the sequence encompassing residues 242-339, corresponding to the C-terminal domain of the V2b splice variant, also down-regulates the surface expression of the V2a receptor. We suggest that some forms of nephrogenic diabetes insipidus are due to overexpression of the splice variant V2b, which could retain the wild-type V2a receptor inside the cell via the formation of V2a.V2b heterodimers.
Mutation analysis of pre-mRNA splicing genes in Chinese families with retinitis pigmentosa
Pan, Xinyuan; Chen, Xue; Liu, Xiaoxing; Gao, Xiang; Kang, Xiaoli; Xu, Qihua; Chen, Xuejuan; Zhao, Kanxing; Zhang, Xiumei; Chu, Qiaomei; Wang, Xiuying
2014-01-01
Purpose Seven genes involved in precursor mRNA (pre-mRNA) splicing have been implicated in autosomal dominant retinitis pigmentosa (adRP). We sought to detect mutations in all seven genes in Chinese families with RP, to characterize the relevant phenotypes, and to evaluate the prevalence of mutations in splicing genes in patients with adRP. Methods Six unrelated families from our adRP cohort (42 families) and two additional families with RP with uncertain inheritance mode were clinically characterized in the present study. Targeted sequence capture with next-generation massively parallel sequencing (NGS) was performed to screen mutations in 189 genes including all seven pre-mRNA splicing genes associated with adRP. Variants detected with NGS were filtered with bioinformatics analyses, validated with Sanger sequencing, and prioritized with pathogenicity analysis. Results Mutations in pre-mRNA splicing genes were identified in three individual families including one novel frameshift mutation in PRPF31 (p.Leu366fs*1) and two known mutations in SNRNP200 (p.Arg681His and p.Ser1087Leu). The patients carrying SNRNP200 p.R681H showed rapid disease progression, and the family carrying p.S1087L presented earlier onset ages and more severe phenotypes compared to another previously reported family with p.S1087L. In five other families, we identified mutations in other RP-related genes, including RP1 p. Ser781* (novel), RP2 p.Gln65* (novel) and p.Ile137del (novel), IMPDH1 p.Asp311Asn (recurrent), and RHO p.Pro347Leu (recurrent). Conclusions Mutations in splicing genes identified in the present and our previous study account for 9.5% in our adRP cohort, indicating the important role of pre-mRNA splicing deficiency in the etiology of adRP. Mutations in the same splicing gene, or even the same mutation, could correlate with different phenotypic severities, complicating the genotype–phenotype correlation and clinical prognosis. PMID:24940031
Kapahnke, Marcel; Banning, Antje; Tikkanen, Ritva
2016-12-14
The clustered regularly interspaced short palindromic repeats (CRISPR)-associated sequence 9 (CRISPR/Cas9) system is widely used for genome editing purposes as it facilitates an efficient knockout of a specific gene in, e.g. cultured cells. Targeted double-strand breaks are introduced to the target sequence of the guide RNAs, which activates the cellular DNA repair mechanism for non-homologous-end-joining, resulting in unprecise repair and introduction of small deletions or insertions. Due to this, sequence alterations in the coding region of the target gene frequently cause frame-shift mutations, facilitating degradation of the mRNA. We here show that such CRISPR/Cas9-mediated alterations in the target exon may also result in altered splicing of the respective pre-mRNA, most likely due to mutations of splice-regulatory sequences. Using the human FLOT-1 gene as an example, we demonstrate that such altered splicing products also give rise to aberrant protein products. These may potentially function as dominant-negative proteins and thus interfere with the interpretation of the data generated with these cell lines. Since most researchers only control the consequences of CRISPR knockout at genomic and protein level, our data should encourage to also check the alterations at the mRNA level.
Wang, Qi; Diao, Ying; Xu, Zhenping; Li, Xiaohui; Luo, Xiao Ping; Xu, Haibo; Ouyang, Ping; Liu, Mugen; Hu, Zhongli; Wang, Qing K; Liu, Jing Yu
2009-12-10
A Chinese family with autosomal recessive pituitary dwarfism was identified and the proband was evaluated by MRI and hormonal analysis, which revealed pituitary dwarfism with a complete growth hormone deficiency. MRI showed a pituitary gland with a small anterior pituitary of 2.2mm and evidence of hypoplastic pituitary. Linkage analysis with markers spanning 17 known genes for dwarfism revealed linkage of the family to the growth hormone-releasing hormone receptor (GHRHR) gene. Mutational analysis of all exons and exon-intron boundaries of GHRHR was carried out using direct DNA sequence analysis. A novel homozygosis mutation, a G to A transition located in the splice donor site at the beginning of intron 8 (IVS8+1G>A), was identified in the proband. The two other patients in the family are homozygous, whereas the living mother of the proband is heterozygous for the IVS8+1G>A mutation. The mutation was not found in 100 normal chromosomes from healthy Chinese individuals of Han nationality. An in vitro splicing assay using HeLa cells transfected with expression vectors containing the normal or the mutant GHRHR minigenes consisting of genomic fragments spanning exons 7-9 showed that the IVS8+1G>A mutation caused abnormal splicing, which is predicted to give rise to truncation or frameshift, leading to severely truncated GHRHR proteins. These results provide strong evidence that the splicing mutation IVS8+1G>A of GHRHR is a cause of pituitary dwarfism in the Chinese family.
Rensing, Stefan A; Fritzowsky, Dana; Lang, Daniel; Reski, Ralf
2005-01-01
Background The moss Physcomitrella patens is an emerging plant model system due to its high rate of homologous recombination, haploidy, simple body plan, physiological properties as well as phylogenetic position. Available EST data was clustered and assembled, and provided the basis for a genome-wide analysis of protein encoding genes. Results We have clustered and assembled Physcomitrella patens EST and CDS data in order to represent the transcriptome of this non-seed plant. Clustering of the publicly available data and subsequent prediction resulted in a total of 19,081 non-redundant ORF. Of these putative transcripts, approximately 30% have a homolog in both rice and Arabidopsis transcriptome. More than 130 transcripts are not present in seed plants but can be found in other kingdoms. These potential "retained genes" might have been lost during seed plant evolution. Functional annotation of these genes reveals unequal distribution among taxonomic groups and intriguing putative functions such as cytotoxicity and nucleic acid repair. Whereas introns in the moss are larger on average than in the seed plant Arabidopsis thaliana, position and amount of introns are approximately the same. Contrary to Arabidopsis, where CDS contain on average 44% G/C, in Physcomitrella the average G/C content is 50%. Interestingly, moss orthologs of Arabidopsis genes show a significant drift of codon fraction usage, towards the seed plant. While averaged codon bias is the same in Physcomitrella and Arabidopsis, the distribution pattern is different, with 15% of moss genes being unbiased. Species-specific, sensitive and selective splice site prediction for Physcomitrella has been developed using a dataset of 368 donor and acceptor sites, utilizing a support vector machine. The prediction accuracy is better than those achieved with tools trained on Arabidopsis data. Conclusion Analysis of the moss transcriptome displays differences in gene structure, codon and splice site usage in comparison with the seed plant Arabidopsis. Putative retained genes exhibit possible functions that might explain the peculiar physiological properties of mosses. Both the transcriptome representation (including a BLAST and retrieval service) and splice site prediction have been made available on , setting the basis for assembly and annotation of the Physcomitrella genome, of which draft shotgun sequences will become available in 2005. PMID:15784153
Circular RNA biogenesis can proceed through an exon-containing lariat precursor.
Barrett, Steven P; Wang, Peter L; Salzman, Julia
2015-06-09
Pervasive expression of circular RNA is a recently discovered feature of eukaryotic gene expression programs, yet its function remains largely unknown. The presumed biogenesis of these RNAs involves a non-canonical 'backsplicing' event. Recent studies in mammalian cell culture posit that backsplicing is facilitated by inverted repeats flanking the circularized exon(s). Although such sequence elements are common in mammals, they are rare in lower eukaryotes, making current models insufficient to describe circularization. Through systematic splice site mutagenesis and the identification of splicing intermediates, we show that circular RNA in Schizosaccharomyces pombe is generated through an exon-containing lariat precursor. Furthermore, we have performed high-throughput and comprehensive mutagenesis of a circle-forming exon, which enabled us to discover a systematic effect of exon length on RNA circularization. Our results uncover a mechanism for circular RNA biogenesis that may account for circularization in genes that lack noticeable flanking intronic secondary structure.
HITS-CLIP yields genome-wide insights into brain alternative RNA processing
NASA Astrophysics Data System (ADS)
Licatalosi, Donny D.; Mele, Aldo; Fak, John J.; Ule, Jernej; Kayikci, Melis; Chi, Sung Wook; Clark, Tyson A.; Schweitzer, Anthony C.; Blume, John E.; Wang, Xuning; Darnell, Jennifer C.; Darnell, Robert B.
2008-11-01
Protein-RNA interactions have critical roles in all aspects of gene expression. However, applying biochemical methods to understand such interactions in living tissues has been challenging. Here we develop a genome-wide means of mapping protein-RNA binding sites in vivo, by high-throughput sequencing of RNA isolated by crosslinking immunoprecipitation (HITS-CLIP). HITS-CLIP analysis of the neuron-specific splicing factor Nova revealed extremely reproducible RNA-binding maps in multiple mouse brains. These maps provide genome-wide in vivo biochemical footprints confirming the previous prediction that the position of Nova binding determines the outcome of alternative splicing; moreover, they are sufficiently powerful to predict Nova action de novo. HITS-CLIP revealed a large number of Nova-RNA interactions in 3' untranslated regions, leading to the discovery that Nova regulates alternative polyadenylation in the brain. HITS-CLIP, therefore, provides a robust, unbiased means to identify functional protein-RNA interactions in vivo.
Jia, Ying; Li, Xiaoge; Yang, Dong; Xu, Yi; Guo, Ying; Li, Xin
2018-01-01
The current study aims to identify the pathogenic sites in a core pedigree of Usher syndrome (USH). A core pedigree of USH was analyzed by whole exome sequencing (WES). Mutations were verified by polymerase chain reaction (PCR) amplification and Sanger sequencing. Two pathogenic variations (c.849+2T>C and c.5994G>A) in MYO7A were successfully identified and individually separated from parents. One variant (c.849+2T>C) was nonsense mutation, causing the protein terminated in advance, and the other one (c.5994G>A) located near the boundary of exon could cause aberrant splicing. This study provides a meaningful exploration for identification of clinical core genetic pedigrees. Copyright © 2017 Elsevier B.V. All rights reserved.
Bitar, Mainá; Boroni, Mariana; Macedo, Andréa M.; Machado, Carlos R.; Franco, Glória R.
2013-01-01
The spliced leader (SL) is a gene that generates a functional ncRNA that is composed of two regions: an intronic region of unknown function (SLi) and an exonic region (SLe), which is transferred to the 5′ end of independent transcripts yielding mature mRNAs, in a process known as spliced leader trans-splicing (SLTS). The best described function for SLTS is to solve polycistronic transcripts into monocistronic units, specifically in Trypanosomatids. In other metazoans, it is speculated that the SLe addition could lead to increased mRNA stability, differential recruitment of the translational machinery, modification of the 5′ region or a combination of these effects. Although important aspects of this mechanism have been revealed, several features remain to be elucidated. We have analyzed 157 SLe sequences from 148 species from seven phyla and found a high degree of conservation among the sequences of species from the same phylum, although no considerable similarity seems to exist between sequences of species from different phyla. When analyzing case studies, we found evidence that a given SLe will always be related to a given set of transcripts in different species from the same phylum, and therefore, different SLe sequences from the same species would regulate different sets of transcripts. In addition, we have observed distinct transcript categories to be preferential targets for the SLe addition in different phyla. This work sheds light into crucial and controversial aspects of the SLTS mechanism. It represents a comprehensive study concerning various species and different characteristics of this important post-transcriptional regulatory mechanism. PMID:24130571
Wang, Peter Lincoln; Lacayo, Norman; Brown, Patrick O.
2012-01-01
Most human pre-mRNAs are spliced into linear molecules that retain the exon order defined by the genomic sequence. By deep sequencing of RNA from a variety of normal and malignant human cells, we found RNA transcripts from many human genes in which the exons were arranged in a non-canonical order. Statistical estimates and biochemical assays provided strong evidence that a substantial fraction of the spliced transcripts from hundreds of genes are circular RNAs. Our results suggest that a non-canonical mode of RNA splicing, resulting in a circular RNA isoform, is a general feature of the gene expression program in human cells. PMID:22319583
A laboratory study of multiple site damage in fuselage lap splices
DOT National Transportation Integrated Search
1993-12-01
This report details an experimental study that was conducted to explore the causes of : fuselage lap splice multiple site damage (MSD), which has been observed in several : aging aircraft. MSD was partially responsible for the 1988 Aloha Airlines acc...
Molecular mechanisms of pathogenesis in hepatocellular carcinoma revealed by RNA‑sequencing.
Liu, Yao; Yang, Zhe; Du, Feng; Yang, Qiao; Hou, Jie; Yan, Xiaohong; Geng, Yi; Zhao, Yaning; Wang, Hua
2017-11-01
The present study aimed to explore the underlying molecular mechanisms of hepatocellular carcinoma (HCC). RNA‑sequencing profiles GSM629264 and GSM629265, from the GSE25599 data set, were downloaded from the Gene Expression Omnibus database and processed by quality evaluation. GSM629264 and GSM629265 were from HCC and adjacent non‑cancerous tissues, respectively. TopHat software was used for alignment analysis, followed by the detection of novel splicing sites. In addition, the Cufflinks software package was used to analyze gene expressions, and the Cuffdiff program was used to screen for differently expressed genes (DEGs) and differentially expressed splicing variants. Gene ontology functional enrichment and Kyoto Encyclopedia of Genes and Genomes pathway enrichment analyses of DEGs were also performed. Transcription factors (TFs) and microRNAs (miRNAs) that regulate DEGs were identified, and a protein‑protein interaction (PPI) network was constructed. The hub node in the PPI network was obtained, and the TFs and miRNAs that regulated the hub node were further predicted. The quality of the sequencing data met the standards for analysis, and the clean reads were ~65%. Most sequencing reads mapped into coding sequence exons (CDS_exons), whereas other reads mapped into exon 3' untranslated regions (UTR_Exons), 5'UTR_Exons and Introns. Upregulated and downregulated DEGs between HCC and adjacent non‑cancerous tissues were screened. Genes of differentially expressed splicing variants were identified, including vesicle‑associated membrane protein 4, phosphatidylinositol glycan anchor biosynthesis class C, protein disulfide isomerase family A member 4 and growth arrest specific 5. Screened DEGs were enriched in the complement pathway. In the PPI network, ubiquitin C (UBC) was the hub node. UBC was predicted to be regulated by several TFs, including specificity protein 1 (SP1), FBJ murine osteosarcoma viral oncogene homolog (FOS), proto‑oncogene c‑JUN (JUN), FOS‑like antigen 2 (FOSL2) and SWI/SNF‑related, matrix‑associated, actin‑dependent regulator of chromatin, subfamily A, member 4 (SMARCA4), and several miRNAs, including miR‑30 and miR‑181. Results from the present study demonstrated that UBC, SP1, FOS, JUN, FOSL2, SMARCA4, miR‑30 and miR‑181 may participate in the development of HCC.
Cloning a Chymotrypsin-Like 1 (CTRL-1) Protease cDNA from the Jellyfish Nemopilema nomurai
Heo, Yunwi; Kwon, Young Chul; Bae, Seong Kyeong; Hwang, Duhyeon; Yang, Hye Ryeon; Choudhary, Indu; Lee, Hyunkyoung; Yum, Seungshic; Shin, Kyoungsoon; Yoon, Won Duk; Kang, Changkeun; Kim, Euikyung
2016-01-01
An enzyme in a nematocyst extract of the Nemopilema nomurai jellyfish, caught off the coast of the Republic of Korea, catalyzed the cleavage of chymotrypsin substrate in an amidolytic kinetic assay, and this activity was inhibited by the serine protease inhibitor, phenylmethanesulfonyl fluoride. We isolated the full-length cDNA sequence of this enzyme, which contains 850 nucleotides, with an open reading frame of 801 encoding 266 amino acids. A blast analysis of the deduced amino acid sequence showed 41% identity with human chymotrypsin-like (CTRL) and the CTRL-1 precursor. Therefore, we designated this enzyme N. nomurai CTRL-1. The primary structure of N. nomurai CTRL-1 includes a leader peptide and a highly conserved catalytic triad of His69, Asp117, and Ser216. The disulfide bonds of chymotrypsin and the substrate-binding sites are highly conserved compared with the CTRLs of other species, including mammalian species. Nemopilema nomurai CTRL-1 is evolutionarily more closely related to Actinopterygii than to Scyphozoan (Aurelia aurita) or Hydrozoan (Hydra vulgaris). The N. nomurai CTRL1 was amplified from the genomic DNA with PCR using specific primers designed based on the full-length cDNA, and then sequenced. The N. nomurai CTRL1 gene contains 2434 nucleotides and four distinct exons. The 5′ donor splice (GT) and 3′ acceptor splice sequences (AG) are wholly conserved. This is the first report of the CTRL1 gene and cDNA structures in the jellyfish N. nomurai. PMID:27399771
Cloning a Chymotrypsin-Like 1 (CTRL-1) Protease cDNA from the Jellyfish Nemopilema nomurai.
Heo, Yunwi; Kwon, Young Chul; Bae, Seong Kyeong; Hwang, Duhyeon; Yang, Hye Ryeon; Choudhary, Indu; Lee, Hyunkyoung; Yum, Seungshic; Shin, Kyoungsoon; Yoon, Won Duk; Kang, Changkeun; Kim, Euikyung
2016-07-05
An enzyme in a nematocyst extract of the Nemopilema nomurai jellyfish, caught off the coast of the Republic of Korea, catalyzed the cleavage of chymotrypsin substrate in an amidolytic kinetic assay, and this activity was inhibited by the serine protease inhibitor, phenylmethanesulfonyl fluoride. We isolated the full-length cDNA sequence of this enzyme, which contains 850 nucleotides, with an open reading frame of 801 encoding 266 amino acids. A blast analysis of the deduced amino acid sequence showed 41% identity with human chymotrypsin-like (CTRL) and the CTRL-1 precursor. Therefore, we designated this enzyme N. nomurai CTRL-1. The primary structure of N. nomurai CTRL-1 includes a leader peptide and a highly conserved catalytic triad of His(69), Asp(117), and Ser(216). The disulfide bonds of chymotrypsin and the substrate-binding sites are highly conserved compared with the CTRLs of other species, including mammalian species. Nemopilema nomurai CTRL-1 is evolutionarily more closely related to Actinopterygii than to Scyphozoan (Aurelia aurita) or Hydrozoan (Hydra vulgaris). The N. nomurai CTRL1 was amplified from the genomic DNA with PCR using specific primers designed based on the full-length cDNA, and then sequenced. The N. nomurai CTRL1 gene contains 2434 nucleotides and four distinct exons. The 5' donor splice (GT) and 3' acceptor splice sequences (AG) are wholly conserved. This is the first report of the CTRL1 gene and cDNA structures in the jellyfish N. nomurai.
Sangermano, Riccardo; Khan, Mubeen; Cornelis, Stéphanie S; Richelle, Valerie; Albert, Silvia; Garanto, Alejandro; Elmelik, Duaa; Qamar, Raheel; Lugtenberg, Dorien; van den Born, L Ingeborgh; Collin, Rob W J; Cremers, Frans P M
2018-01-01
Stargardt disease is caused by variants in the ABCA4 gene, a significant part of which are noncanonical splice site (NCSS) variants. In case a gene of interest is not expressed in available somatic cells, small genomic fragments carrying potential disease-associated variants are tested for splice abnormalities using in vitro splice assays. We recently discovered that when using small minigenes lacking the proper genomic context, in vitro results do not correlate with splice defects observed in patient cells. We therefore devised a novel strategy in which a bacterial artificial chromosome was employed to generate midigenes, splice vectors of varying lengths (up to 11.7 kb) covering almost the entire ABCA4 gene. These midigenes were used to analyze the effect of all 44 reported and three novel NCSS variants on ABCA4 pre-mRNA splicing. Intriguingly, multi-exon skipping events were observed, as well as exon elongation and intron retention. The analysis of all reported NCSS variants in ABCA4 allowed us to reveal the nature of aberrant splicing events and to classify the severity of these mutations based on the residual fraction of wild-type mRNA. Our strategy to generate large overlapping splice vectors carrying multiple exons, creating a toolbox for robust and high-throughput analysis of splice variants, can be applied to all human genes. © 2018 Sangermano et al.; Published by Cold Spring Harbor Laboratory Press.
A variant Tc4 transposable element in the nematode C. elegans could encode a novel protein.
Li, W; Shaw, J E
1993-01-01
A variant C. elegans Tc4 transposable element, Tc4-rh1030, has been sequenced and is 3483 bp long. The Tc4 element that had been analyzed previously is 1605 bp long, consists of two 774-bp nearly perfect inverted terminal repeats connected by a 57-bp loop, and lacks significant open reading frames. In Tc4-rh1030, by comparison, a 2343-bp novel sequence is present in place of a 477-bp segment in one of the inverted repeats. The novel sequence of Tc4-rh1030 is present about five times per haploid genome and is invariably associated with Tc4 elements; we have used the designation Tc4v to denote this variant subfamily of Tc4 elements. Sequence analysis of three cDNA clones suggests that a Tc4v element contains at least five exons that could encode a novel basic protein of 537 amino acid residues. On northern blots, a 1.6-kb Tc4v-specific transcript was detected in the mutator strain TR679 but not in the wild-type strain N2; Tc4 elements are known to transpose in TR679 but appear to be quiescent in N2. We have analyzed transcripts produced by an unc-33 gene that has the Tc4-rh1030 insertional mutation in its transcribed region; all or almost all of the Tc4v sequence is frequently spliced out of the mutant unc-33 transcripts, sometimes by means of non-consensus splice acceptor sites. Images PMID:8382791
Kempers, M J E; van der Crabben, S N; de Vroede, M; Alfen-van der Velden, J; Netea-Maier, R T; Duim, R A J; Otten, B J; Losekoot, M; Wit, J M
2013-01-01
Congenital isolated growth hormone deficiency (IGHD) is a rare endocrine disorder that presents with severe proportionate growth failure. Dominant (type II) IGHD is usually caused by heterozygous mutations of GH1. The presentation of newly affected family members in 3 families with dominant IGHD in whom previous genetic testing had not demonstrated a GH1 mutation or had not been performed, prompted us to identify the underlying genetic cause. GH1 was sequenced in 3 Caucasian families with a clinical autosomal dominant IGHD. All affected family members had severe growth hormone (GH) deficiency that became apparent in the first 2 years of life. GH treatment led to a marked increase in height SDS. So far, no other pituitary dysfunctions have become apparent. In the first family a novel splice site mutation in GH1 was identified (c.172-1G>C, IVS2-1G>C). In two other families a previously reported splice site mutation (c.291+1G>A, IVS3+1G>A) was found. These data show that several years after negative genetic testing it was now possible to make a genetic diagnosis in these families with a well-defined, clearly heritable, autosomal dominant IGHD. This underscores the importance of clinical and genetic follow-up in a multidisciplinary setting. It also shows that even without a positive family history, genetic testing should be considered if the phenotype is strongly suggestive for a genetic syndrome. Identification of pathogenic mutations, like these GH1 mutations, has important clinical implications for the surveillance and genetic counseling of patients and expands our knowledge on the genotype-phenotype correlation. © 2013 S. Karger AG, Basel.
A Novel SLC27A4 Splice Acceptor Site Mutation in Great Danes with Ichthyosis.
Metzger, Julia; Wöhlke, Anne; Mischke, Reinhard; Hoffmann, Annalena; Hewicker-Trautwein, Marion; Küch, Eva-Maria; Naim, Hassan Y; Distl, Ottmar
2015-01-01
Ichthyoses are a group of various different types of hereditary disorders affecting skin cornification. They are characterized by hyperkeratoses of different severity levels and are associated with a dry and scaling skin. Genome-wide association analysis of nine affected and 13 unaffected Great Danes revealed a genome-wide significant peak on chromosome 9 at 57-58 Mb in the region of SLC27A4. Sequence analysis of genomic DNA of SLC27A4 revealed the non-synonymous SNV SLC27A4:g.8684G>A in perfect association with ichthyosis-affection in Great Danes. The mutant transcript of SLC27A4 showed an in-frame loss of 54 base pairs in exon 8 probably induced by a new splice acceptor site motif created by the mutated A- allele of the SNV. Genotyping 413 controls from 35 different breeds of dogs and seven wolves revealed that this mutation could not be found in other populations except in Great Danes. Affected dogs revealed high amounts of mutant transcript but only low levels of the wild type transcript. Targeted analyses of SLC27A4 protein from skin tissues of three affected and two unaffected Great Danes indicated a markedly reduced or not detectable wild type and truncated protein levels in affected dogs but a high expression of wild type SLC27A4 protein in unaffected controls. Our data provide evidence of a new splice acceptor site creating SNV that results in a reduction or loss of intact SLC27A4 protein and probably explains the severe skin phenotype in Great Danes. Genetic testing will allow selective breeding to prevent ichthyosis-affected puppies in the future.
Exome Sequencing Identifies a REEP1 Mutation Involved in Distal Hereditary Motor Neuropathy Type V
Beetz, Christian; Pieber, Thomas R.; Hertel, Nicole; Schabhüttl, Maria; Fischer, Carina; Trajanoski, Slave; Graf, Elisabeth; Keiner, Silke; Kurth, Ingo; Wieland, Thomas; Varga, Rita-Eva; Timmerman, Vincent; Reilly, Mary M.; Strom, Tim M.; Auer-Grumbach, Michaela
2012-01-01
The distal hereditary motor neuropathies (dHMNs) are a heterogeneous group of neurodegenerative disorders affecting the lower motoneuron. In a family with both autosomal-dominant dHMN and dHMN type V (dHMN/dHMN-V) present in three generations, we excluded mutations in all genes known to be associated with a dHMN phenotype through Sanger sequencing and defined three potential loci through linkage analysis. Whole-exome sequencing of two affected individuals revealed a single candidate variant within the linking regions, i.e., a splice-site alteration in REEP1 (c.304-2A>G). A minigene assay confirmed complete loss of splice-acceptor functionality and skipping of the in-frame exon 5. The resulting mRNA is predicted to be expressed at normal levels and to encode an internally shortened protein (p.102_139del). Loss-of-function REEP1 mutations have previously been identified in dominant hereditary spastic paraplegia (HSP), a disease associated with upper-motoneuron pathology. Consistent with our clinical-genetic data, we show that REEP1 is strongly expressed in the lower motoneurons as well. Upon exogeneous overexpression in cell lines we observe a subcellular localization defect for p.102_139del that differs from that observed for the known HSP-associated missense mutation c.59C>A (p.Ala20Glu). Moreover, we show that p.102_139del, but not p.Ala20Glu, recruits atlastin-1, i.e., one of the REEP1 binding partners, to the altered sites of localization. These data corroborate the loss-of-function nature of REEP1 mutations in HSP and suggest that a different mechanism applies in REEP1-associated dHMN. PMID:22703882
OCA2 splice site variant in German Spitz dogs with oculocutaneous albinism.
Caduff, Madleina; Bauer, Anina; Jagannathan, Vidhya; Leeb, Tosso
2017-01-01
We investigated a German Spitz family where the mating of a black male to a white female had yielded three puppies with an unexpected light brown coat color, lightly pigmented lips and noses, and blue eyes. Combined linkage and homozygosity analysis based on a fully penetrant monogenic autosomal recessive mode of inheritance identified a critical interval of 15 Mb on chromosome 3. We obtained whole genome sequence data from one affected dog, three wolves, and 188 control dogs. Filtering for private variants revealed a single variant with predicted high impact in the critical interval in LOC100855460 (XM_005618224.1:c.377+2T>G LT844587.1:c.-45+2T>G). The variant perfectly co-segregated with the phenotype in the family. We genotyped 181 control dogs with normal pigmentation from diverse breeds including 22 unrelated German Spitz dogs, which were all homozygous wildtype. Comparative sequence analyses revealed that LOC100855460 actually represents the 5'-end of the canine OCA2 gene. The CanFam 3.1 reference genome assembly is incorrect and separates the first two exons from the remaining exons of the OCA2 gene. We amplified a canine OCA2 cDNA fragment by RT-PCR and determined the correct full-length mRNA sequence (LT844587.1). Variants in the OCA2 gene cause oculocutaneous albinism type 2 (OCA2) in humans, pink-eyed dilution in mice, and similar phenotypes in corn snakes, medaka and Mexican cave tetra fish. We therefore conclude that the observed oculocutaneous albinism in German Spitz is most likely caused by the identified variant in the 5'-splice site of the first intron of the canine OCA2 gene.
Amelio, Antonio L; Caputi, Massimo; Conkright, Michael D
2009-01-01
The CREB regulated transcription co-activators (CRTCs) regulate many biological processes by integrating and converting environmental inputs into transcriptional responses. Although the mechanisms by which CRTCs sense cellular signals are characterized, little is known regarding how CRTCs contribute to the regulation of cAMP inducible genes. Here we show that these dynamic regulators, unlike other co-activators, independently direct either pre-mRNA splice-site selection or transcriptional activation depending on the cell type or promoter context. Moreover, in other scenarios, the CRTC co-activators coordinately regulate transcription and splicing. Mutational analyses showed that CRTCs possess distinct functional domains responsible for regulating either pre-mRNA splicing or transcriptional activation. Interestingly, the CRTC1–MAML2 oncoprotein lacks the splicing domain and is incapable of altering splice-site selection despite robustly activating transcription. The differential usage of these distinct domains allows CRTCs to selectively mediate multiple facets of gene regulation, indicating that co-activators are not solely restricted to coordinating alternative splicing with increase in transcriptional activity. PMID:19644446
Intragenic motifs regulate the transcriptional complexity of Pkhd1/PKHD1
Boddu, Ravindra; Yang, Chaozhe; O’Connor, Amber K.; Hendrickson, Robert Curtis; Boone, Braden; Cui, Xiangqin; Garcia-Gonzalez, Miguel; Igarashi, Peter; Onuchic, Luiz F.; Germino, Gregory G.
2014-01-01
Autosomal recessive polycystic kidney disease (ARPKD) results from mutations in the human PKHD1 gene. Both this gene, and its mouse ortholog, Pkhd1, are primarily expressed in renal and biliary ductal structures. The mouse protein product, fibrocystin/polyductin complex (FPC), is a 445-kDa protein encoded by a 67-exon transcript that spans >500 kb of genomic DNA. In the current study, we observed multiple alternatively spliced Pkhd1 transcripts that varied in size and exon composition in embryonic mouse kidney, liver, and placenta samples, as well as among adult mouse pancreas, brain, heart, lung, testes, liver, and kidney. Using reverse transcription PCR and RNASeq, we identified 22 novel Pkhd1 kidney transcripts with unique exon junctions. Various mechanisms of alternative splicing were observed, including exon skipping, use of alternate acceptor/donor splice sites, and inclusion of novel exons. Bioinformatic analyses identified, and exon-trapping minigene experiments validated, consensus binding sites for serine/arginine-rich proteins that modulate alternative splicing. Using site-directed mutagenesis, we examined the functional importance of selected splice enhancers. In addition, we demonstrated that many of the novel transcripts were polysome bound, thus likely translated. Finally, we determined that the human PKHD1 R760H missense variant alters a splice enhancer motif that disrupts exon splicing in vitro and is predicted to truncate the protein. Taken together, these data provide evidence of the complex transcriptional regulation of Pkhd1/PKHD1 and identified motifs that regulate its splicing. Our studies indicate that Pkhd1/PKHD1 transcription is modulated, in part by intragenic factors, suggesting that aberrant PKHD1 splicing represents an unappreciated pathogenic mechanism in ARPKD. PMID:24984783
Alternative Splicing in Neurogenesis and Brain Development.
Su, Chun-Hao; D, Dhananjaya; Tarn, Woan-Yuh
2018-01-01
Alternative splicing of precursor mRNA is an important mechanism that increases transcriptomic and proteomic diversity and also post-transcriptionally regulates mRNA levels. Alternative splicing occurs at high frequency in brain tissues and contributes to every step of nervous system development, including cell-fate decisions, neuronal migration, axon guidance, and synaptogenesis. Genetic manipulation and RNA sequencing have provided insights into the molecular mechanisms underlying the effects of alternative splicing in stem cell self-renewal and neuronal fate specification. Timely expression and perhaps post-translational modification of neuron-specific splicing regulators play important roles in neuronal development. Alternative splicing of many key transcription regulators or epigenetic factors reprograms the transcriptome and hence contributes to stem cell fate determination. During neuronal differentiation, alternative splicing also modulates signaling activity, centriolar dynamics, and metabolic pathways. Moreover, alternative splicing impacts cortical lamination and neuronal development and function. In this review, we focus on recent progress toward understanding the contributions of alternative splicing to neurogenesis and brain development, which has shed light on how splicing defects may cause brain disorders and diseases.
Malone, Andrew F; Funk, Steven D; Alhamad, Tarek; Miner, Jeffrey H
2017-06-01
Many COL4A5 splice region variants have been described in patients with X-linked Alport syndrome, but few have been confirmed by functional analysis to actually cause defective splicing. We sought to demonstrate that a novel COL4A5 splice region variant in a family with Alport syndrome is pathogenic using functional studies. We also describe an alternative method of diagnosis. Targeted next-generation sequencing results of an individual with Alport syndrome were analyzed and the results confirmed by Sanger sequencing in family members. A splicing reporter minigene assay was used to examine the variant's effect on splicing in transfected cells. Plucked hair follicles from patients and controls were examined for collagen IV proteins using immunofluorescence microscopy. A novel splice region mutation in COL4A5, c.1780-6T>G, was identified and segregated with disease in this family. This variant caused frequent skipping of exon 25, resulting in a frameshift and truncation of collagen α5(IV) protein. We also developed and validated a new approach to characterize the expression of collagen α5(IV) protein in the basement membranes of plucked hair follicles. Using this approach we demonstrated reduced collagen α5(IV) protein in affected male and female individuals in this family, supporting frequent failure of normal splicing. Differing normal to abnormal transcript ratios in affected individuals carrying splice region variants may contribute to variable disease severity observed in Alport families. Examination of plucked hair follicles in suspected X-linked Alport syndrome patients may offer a less invasive alternative method of diagnosis and serve as a pathogenicity test for COL4A5 variants of uncertain significance.
Malone, Andrew F.; Funk, Steven D.; Alhamad, Tarek; Miner, Jeffrey H.
2016-01-01
Introduction Many COL4A5 splice region variants have been described in patients with X-linked Alport syndrome, but few have been confirmed by functional analysis to actually cause defective splicing. We sought to demonstrate that a novel COL4A5 splice region variant in a family with Alport syndrome is pathogenic using functional studies. We also describe an alternative method of diagnosis. Methods We analyzed targeted next-generation sequencing results of an individual with Alport syndrome and confirmed results by Sanger sequencing in family members. A splicing reporter minigene assay was used to examine the variant’s effect on splicing in transfected cells. Plucked hair follicles from patients and controls were examined for collagen IV proteins using immunofluorescence microscopy. Results A novel splice region mutation in COL4A5, c.1780-6T>G, was identified and segregated with disease in this family. This variant caused frequent skipping of exon 25, resulting in a frameshift and truncation of collagen α5(IV) protein. We also developed and validated a new approach to characterize the expression of collagen α5(IV) protein in the basement membranes of plucked hair follicles. We demonstrated reduced collagen α5(IV) protein in affected male and female individuals in this family, supporting frequent failure of normal splicing. Conclusions Differing normal to abnormal transcript ratios in affected individuals carrying splice region variants may contribute to variable disease severity observed in Alport families. Examination of plucked hair follicles in suspected X-linked Alport syndrome patients may offer a less invasive alternative method of diagnosis and serve as a pathogenicity test for COL4A5 variants of uncertain significance. PMID:28013382
Conditional Toxin Splicing Using a Split Intein System.
Alford, Spencer C; O'Sullivan, Connor; Howard, Perry L
2017-01-01
Protein toxin splicing mediated by split inteins can be used as a strategy for conditional cell ablation. The approach requires artificial fragmentation of a potent protein toxin and tethering each toxin fragment to a split intein fragment. The toxin-intein fragments are, in turn, fused to dimerization domains, such that addition of a dimerizing agent reconstitutes the split intein. These chimeric toxin-intein fusions remain nontoxic until the dimerizer is added, resulting in activation of intein splicing and ligation of toxin fragments to form an active toxin. Considerations for the engineering and implementation of conditional toxin splicing (CTS) systems include: choice of toxin split site, split site (extein) chemistry, and temperature sensitivity. The following method outlines design criteria and implementation notes for CTS using a previously engineered system for splicing a toxin called sarcin, as well as for developing alternative CTS systems.
Greaves, Erin A; Copeland, Nikki A; Coverley, Dawn; Ainscough, Justin F X
2012-05-15
CIZ1 is a nuclear-matrix-associated DNA replication factor unique to higher eukaryotes, for which alternatively spliced isoforms have been associated with a range of disorders. In vitro, the CIZ1 N-terminus interacts with cyclin E and cyclin A at distinct sites, enabling functional cooperation with cyclin-A-Cdk2 to promote replication initiation. C-terminal sequences anchor CIZ1 to fixed sites on the nuclear matrix, imposing spatial constraint on cyclin-dependent kinase activity. Here we demonstrate that CIZ1 is predominantly expressed as a predicted full-length product throughout mouse development, consistent with a ubiquitous role in cell and tissue renewal. CIZ1 is expressed in proliferating stem cells of the testis, but is notably downregulated following commitment to differentiation. Significantly, CIZ1 is re-expressed at high levels in non-proliferative spermatocytes before meiotic division. Sequence analysis identifies at least seven alternatively spliced variants, including a dominant cancer-associated form and a set of novel isoforms. Furthermore, we show that in these post-replicative cells, CIZ1 interacts with germ-cell-specific cyclin A1, which has been implicated in the repair of DNA double-strand breaks. Consistent with this role, antibody depletion of CIZ1 reduces the capacity for testis extract to repair digested plasmid DNA in vitro. Together, the data imply post-replicative roles for CIZ1 in germ cell differentiation that might include meiotic recombination - a process intrinsic to genome stability and diversification.
Saitsu, Hirotomo; Osaka, Hitoshi; Sasaki, Masayuki; Takanashi, Jun-ichi; Hamada, Keisuke; Yamashita, Akio; Shibayama, Hidehiro; Shiina, Masaaki; Kondo, Yukiko; Nishiyama, Kiyomi; Tsurusaki, Yoshinori; Miyake, Noriko; Doi, Hiroshi; Ogata, Kazuhiro; Inoue, Ken; Matsumoto, Naomichi
2011-01-01
Congenital hypomyelinating disorders are a heterogeneous group of inherited leukoencephalopathies characterized by abnormal myelin formation. We have recently reported a hypomyelinating syndrome characterized by diffuse cerebral hypomyelination with cerebellar atrophy and hypoplasia of the corpus callosum (HCAHC). We performed whole-exome sequencing of three unrelated individuals with HCAHC and identified compound heterozygous mutations in POLR3B in two individuals. The mutations include a nonsense mutation, a splice-site mutation, and two missense mutations at evolutionally conserved amino acids. Using reverse transcription-PCR and sequencing, we demonstrated that the splice-site mutation caused deletion of exon 18 from POLR3B mRNA and that the transcript harboring the nonsense mutation underwent nonsense-mediated mRNA decay. We also identified compound heterozygous missense mutations in POLR3A in the remaining individual. POLR3A and POLR3B encode the largest and second largest subunits of RNA Polymerase III (Pol III), RPC1 and RPC2, respectively. RPC1 and RPC2 together form the active center of the polymerase and contribute to the catalytic activity of the polymerase. Pol III is involved in the transcription of small noncoding RNAs, such as 5S ribosomal RNA and all transfer RNAs (tRNA). We hypothesize that perturbation of Pol III target transcription, especially of tRNAs, could be a common pathological mechanism underlying POLR3A and POLR3B mutations. PMID:22036171
Elsayed, Liena E O; Mohammed, Inaam N; Hamed, Ahlam A A; Elseed, Maha A; Salih, Mustafa A M; Yahia, Ashraf; Siddig, Rayan A; Amin, Mutaz; Koko, Mahmoud; Elbashir, Mustafa I; Ibrahim, Muntaser E; Brice, Alexis; Ahmed, Ammar E; Stevanin, Giovanni
2018-05-08
Infantile neuroaxonal dystrophy (INAD) is a rare hereditary neurological disorder caused by mutations in PLA2G6. The disease commonly affects children below 3 years of age and presents with delay in motor skills, optic atrophy and progressive spastic tetraparesis. Studies of INAD in Africa are extremely rare, and genetic studies from Sub Saharan Africa are almost non-existent. Two Sudanese siblings presented, at ages 18 and 24 months, with regression in both motor milestones and speech development and hyper-reflexia. Brain MRI showed bilateral and symmetrical T2/FLAIR hyperintense signal changes in periventricular areas and basal ganglia and mild cerebellar atrophy. Whole exome sequencing with confirmatory Sanger sequencing were performed for the two patients and healthy family members. A novel variant (NM_003560.2 c.1427 + 2 T > C) acting on a splice donor site and predicted to lead to skipping of exon 10 was found in PLA2G6. It was found in a homozygous state in the two patients and homozygous reference or heterozygous in five healthy family members. This variant has one very strong (loss of function mutation) and three supporting evidences for its pathogenicity (segregation with the disease, multiple computational evidence and specific patients' phenotype). Therefore this variant can be currently annotated as "pathogenic". This is the first study to report mutations in PLA2G6 gene in patients from Sudan.
Shows, Kathryn H; Ward, Christy; Summers, Laura; Li, Lin; Ziegler, Gregory R; Hendrickx, Andrew G; Shiang, Rita
2006-02-01
Mutations in the human gene TCOF1 cause a mandibulofacial dysostosis known as Treacher Collins syndrome (TCS). An infant rhesus macaque (Macaca mulatta) that displayed the TCS phenotype was identified at the California National Primate Research Center. The TCOF1 coding region was cloned from a normal rhesus macaque and sequenced. The rhesus macaque homolog of TCOF1 is 91.6% identical in cDNA sequence and 93.8% identical in translated protein sequence compared to human TCOF1. Sequencing of TCOF1 in the TCS-affected rhesus macaque showed no mutations within the coding region or splice sites; however, real-time quantitative PCR showed an 87% reduction of spleen TCOF1 mRNA level in the TCS affected macaque when compared with normal macaque spleen.
Mansouri, Maria; Kayserili, Hülya; Elalaoui, Siham Chafai; Nishimura, Gen; Iida, Aritoshi; Lyahyai, Jaber; Miyake, Noriko; Matsumoto, Naomichi; Sefiani, Abdelaziz; Ikegawa, Shiro
2016-02-01
Spondylo-meta-epiphyseal dysplasia (SMED), short limb-abnormal calcification type (SMED, SL-AC), is a very rare autosomal recessive disorder with various skeletal changes characterized by premature calcification leading to severe disproportionate short stature. Twenty-two patients have been reported until now, but only five mutations (four missense and one splice-site) in the conserved sequence encoding the tyrosine kinase domain of the DDR2 gene has been identified. We report here a novel DDR2 missense mutation, c.370C > T (p.Arg124Trp) in a Moroccan girl with SMED, SL-AC, identified by whole exome sequencing. Our study has expanded the mutational spectrum of this rare disease and it has shown that exome sequencing is a powerful and cost-effective tool for the diagnosis of clinically heterogeneous disorders such as SMED. © 2015 Wiley Periodicals, Inc.
Lessons from non-canonical splicing
Ule, Jernej
2016-01-01
Recent improvements in experimental and computational techniques used to study the transcriptome have enabled an unprecedented view of RNA processing, revealing many previously unknown non-canonical splicing events. This includes cryptic events located far from the currently annotated exons, and unconventional splicing mechanisms that have important roles in regulating gene expression. These non-canonical splicing events are a major source of newly emerging transcripts during evolution, especially when they involve sequences derived from transposable elements. They are therefore under precise regulation and quality control, which minimises their potential to disrupt gene expression. While non-canonical splicing can lead to aberrant transcripts that cause many diseases, we also explain how it can be exploited for new therapeutic strategies. PMID:27240813
Calcium-activated potassium (BK) channels are encoded by duplicate slo1 genes in teleost fishes.
Rohmann, Kevin N; Deitcher, David L; Bass, Andrew H
2009-07-01
Calcium-activated, large conductance potassium (BK) channels in tetrapods are encoded by a single slo1 gene, which undergoes extensive alternative splicing. Alternative splicing generates a high level of functional diversity in BK channels that contributes to the wide range of frequencies electrically tuned by the inner ear hair cells of many tetrapods. To date, the role of BK channels in hearing among teleost fishes has not been investigated at the molecular level, although teleosts account for approximately half of all extant vertebrate species. We identified slo1 genes in teleost and nonteleost fishes using polymerase chain reaction and genetic sequence databases. In contrast to tetrapods, all teleosts examined were found to express duplicate slo1 genes in the central nervous system, whereas nonteleosts that diverged prior to the teleost whole-genome duplication event express a single slo1 gene. Phylogenetic analyses further revealed that whereas other slo1 duplicates were the result of a single duplication event, an independent duplication occurred in a basal teleost (Anguilla rostrata) following the slo1 duplication in teleosts. A third, independent slo1 duplication (autotetraploidization) occurred in salmonids. Comparison of teleost slo1 genomic sequences to their tetrapod orthologue revealed a reduced number of alternative splice sites in both slo1 co-orthologues. For the teleost Porichthys notatus, a focal study species that vocalizes with maximal spectral energy in the range electrically tuned by BK channels in the inner ear, peripheral tissues show the expression of either one (e.g., vocal muscle) or both (e.g., inner ear) slo1 paralogues with important implications for both auditory and vocal physiology. Additional loss of expression of one slo1 paralogue in nonneural tissues in P. notatus suggests that slo1 duplicates were retained via subfunctionalization. Together, the results predict that teleost fish achieve a diversity of BK channel subfunction via gene duplication, rather than increased alternative splicing as witnessed for the tetrapod and invertebrate orthologue.
Calcium-Activated Potassium (BK) Channels Are Encoded by Duplicate slo1 Genes in Teleost Fishes
Deitcher, David L.; Bass, Andrew H.
2009-01-01
Calcium-activated, large conductance potassium (BK) channels in tetrapods are encoded by a single slo1 gene, which undergoes extensive alternative splicing. Alternative splicing generates a high level of functional diversity in BK channels that contributes to the wide range of frequencies electrically tuned by the inner ear hair cells of many tetrapods. To date, the role of BK channels in hearing among teleost fishes has not been investigated at the molecular level, although teleosts account for approximately half of all extant vertebrate species. We identified slo1 genes in teleost and nonteleost fishes using polymerase chain reaction and genetic sequence databases. In contrast to tetrapods, all teleosts examined were found to express duplicate slo1 genes in the central nervous system, whereas nonteleosts that diverged prior to the teleost whole-genome duplication event express a single slo1 gene. Phylogenetic analyses further revealed that whereas other slo1 duplicates were the result of a single duplication event, an independent duplication occurred in a basal teleost (Anguilla rostrata) following the slo1 duplication in teleosts. A third, independent slo1 duplication (autotetraploidization) occurred in salmonids. Comparison of teleost slo1 genomic sequences to their tetrapod orthologue revealed a reduced number of alternative splice sites in both slo1 co-orthologues. For the teleost Porichthys notatus, a focal study species that vocalizes with maximal spectral energy in the range electrically tuned by BK channels in the inner ear, peripheral tissues show the expression of either one (e.g., vocal muscle) or both (e.g., inner ear) slo1 paralogues with important implications for both auditory and vocal physiology. Additional loss of expression of one slo1 paralogue in nonneural tissues in P. notatus suggests that slo1 duplicates were retained via subfunctionalization. Together, the results predict that teleost fish achieve a diversity of BK channel subfunction via gene duplication, rather than increased alternative splicing as witnessed for the tetrapod and invertebrate orthologue. PMID:19321796
Pla2g12b and Hpn Are Genes Identified by Mouse ENU Mutagenesis That Affect HDL Cholesterol
Aljakna, Aleksandra; Choi, Seungbum; Savage, Holly; Hageman Blair, Rachael; Gu, Tongjun; Svenson, Karen L.; Churchill, Gary A.; Hibbs, Matt; Korstanje, Ron
2012-01-01
Despite considerable progress understanding genes that affect the HDL particle, its function, and cholesterol content, genes identified to date explain only a small percentage of the genetic variation. We used N-ethyl-N-nitrosourea mutagenesis in mice to discover novel genes that affect HDL cholesterol levels. Two mutant lines (Hlb218 and Hlb320) with low HDL cholesterol levels were established. Causal mutations in these lines were mapped using linkage analysis: for line Hlb218 within a 12 Mbp region on Chr 10; and for line Hlb320 within a 21 Mbp region on Chr 7. High-throughput sequencing of Hlb218 liver RNA identified a mutation in Pla2g12b. The transition of G to A leads to a cysteine to tyrosine change and most likely causes a loss of a disulfide bridge. Microarray analysis of Hlb320 liver RNA showed a 7-fold downregulation of Hpn; sequencing identified a mutation in the 3′ splice site of exon 8. Northern blot confirmed lower mRNA expression level in Hlb320 and did not show a difference in splicing, suggesting that the mutation only affects the splicing rate. In addition to affecting HDL cholesterol, the mutated genes also lead to reduction in serum non-HDL cholesterol and triglyceride levels. Despite low HDL cholesterol levels, the mice from both mutant lines show similar atherosclerotic lesion sizes compared to control mice. These new mutant mouse models are valuable tools to further study the role of these genes, their affect on HDL cholesterol levels, and metabolism. PMID:22912808
Intron self-complementarity enforces exon inclusion in a yeast pre-mRNA
Howe, Kenneth James; Ares, Manuel
1997-01-01
Skipping of internal exons during removal of introns from pre-mRNA must be avoided for proper expression of most eukaryotic genes. Despite significant understanding of the mechanics of intron removal, mechanisms that ensure inclusion of internal exons in multi-intron pre-mRNAs remain mysterious. Using a natural two-intron yeast gene, we have identified distinct RNA–RNA complementarities within each intron that prevent exon skipping and ensure inclusion of internal exons. We show that these complementarities are positioned to act as intron identity elements, bringing together only the appropriate 5′ splice sites and branchpoints. Destroying either intron self-complementarity allows exon skipping to occur, and restoring the complementarity using compensatory mutations rescues exon inclusion, indicating that the elements act through formation of RNA secondary structure. Introducing new pairing potential between regions near the 5′ splice site of intron 1 and the branchpoint of intron 2 dramatically enhances exon skipping. Similar elements identified in single intron yeast genes contribute to splicing efficiency. Our results illustrate how intron secondary structure serves to coordinate splice site pairing and enforce exon inclusion. We suggest that similar elements in vertebrate genes could assist in the splicing of very large introns and in the evolution of alternative splicing. PMID:9356473
RNA splicing regulated by RBFOX1 is essential for cardiac function in zebrafish.
Frese, Karen S; Meder, Benjamin; Keller, Andreas; Just, Steffen; Haas, Jan; Vogel, Britta; Fischer, Simon; Backes, Christina; Matzas, Mark; Köhler, Doreen; Benes, Vladimir; Katus, Hugo A; Rottbauer, Wolfgang
2015-08-15
Alternative splicing is one of the major mechanisms through which the proteomic and functional diversity of eukaryotes is achieved. However, the complex nature of the splicing machinery, its associated splicing regulators and the functional implications of alternatively spliced transcripts are only poorly understood. Here, we investigated the functional role of the splicing regulator rbfox1 in vivo using the zebrafish as a model system. We found that loss of rbfox1 led to progressive cardiac contractile dysfunction and heart failure. By using deep-transcriptome sequencing and quantitative real-time PCR, we show that depletion of rbfox1 in zebrafish results in an altered isoform expression of several crucial target genes, such as actn3a and hug. This study underlines that tightly regulated splicing is necessary for unconstrained cardiac function and renders the splicing regulator rbfox1 an interesting target for investigation in human heart failure and cardiomyopathy. © 2015. Published by The Company of Biologists Ltd.
An alternative splicing program promotes adipose tissue thermogenesis
Vernia, Santiago; Edwards, Yvonne JK; Han, Myoung Sook; Cavanagh-Kyros, Julie; Barrett, Tamera; Kim, Jason K; Davis, Roger J
2016-01-01
Alternative pre-mRNA splicing expands the complexity of the transcriptome and controls isoform-specific gene expression. Whether alternative splicing contributes to metabolic regulation is largely unknown. Here we investigated the contribution of alternative splicing to the development of diet-induced obesity. We found that obesity-induced changes in adipocyte gene expression include alternative pre-mRNA splicing. Bioinformatics analysis associated part of this alternative splicing program with sequence specific NOVA splicing factors. This conclusion was confirmed by studies of mice with NOVA deficiency in adipocytes. Phenotypic analysis of the NOVA-deficient mice demonstrated increased adipose tissue thermogenesis and improved glycemia. We show that NOVA proteins mediate a splicing program that suppresses adipose tissue thermogenesis. Together, these data provide quantitative analysis of gene expression at exon-level resolution in obesity and identify a novel mechanism that contributes to the regulation of adipose tissue function and the maintenance of normal glycemia. DOI: http://dx.doi.org/10.7554/eLife.17672.001 PMID:27635635
Boric acid reversibly inhibits the second step of pre-mRNA splicing.
Shomron, Noam; Ast, Gil
2003-09-25
Several approaches have been used to identify the factors involved in mRNA splicing. None of them, however, comprises a straightforward reversible method for inhibiting the second step of splicing using an external reagent other than a chelator. This investigation demonstrates that the addition of boric acid to an in vitro pre-mRNA splicing reaction causes a dose-dependent reversible inhibition effect on the second step of splicing. The mechanism of action does not involve chelation of several metal ions; hindrance of 3' splice-site; or binding to hSlu7. This study presents a novel method for specific reversible inhibition of the second step of pre-mRNA splicing.
Mutations Affecting Expression of the rosy Locus in Drosophila melanogaster
Lee, Chong Sung; Curtis, Daniel; McCarron, Margaret; Love, Carol; Gray, Mark; Bender, Welcome; Chovnick, Arthur
1987-01-01
The rosy locus in Drosophila melanogaster codes for the enzyme xanthine dehydrogenase (XDH). Previous studies defined a "control element" near the 5' end of the gene, where variant sites affected the amount of rosy mRNA and protein produced. We have determined the DNA sequence of this region from both genomic and cDNA clones, and from the ry+10 underproducer strain. This variant strain had many sequence differences, so that the site of the regulatory change could not be fixed. A mutagenesis was also undertaken to isolate new regulatory mutations. We induced 376 new mutations with 1-ethyl-1-nitrosourea (ENU) and screened them to isolate those that reduced the amount of XDH protein produced, but did not change the properties of the enzyme. Genetic mapping was used to find mutations located near the 5' end of the gene. DNA from each of seven mutants was cloned and sequenced through the 5' region. Mutant base changes were identified in all seven; they appear to affect splicing and translation of the rosy mRNA. In a related study (T. P. Keith et al. 1987), the genomic and cDNA sequences are extended through the 3' end of the gene; the combined sequences define the processing pattern of the rosy transcript and predict the amino acid sequence of XDH. PMID:3036645
Zhang, Xiao-Ning; Shi, Yifei; Powers, Jordan J; Gowda, Nikhil B; Zhang, Chong; Ibrahim, Heba M M; Ball, Hannah B; Chen, Samuel L; Lu, Hua; Mount, Stephen M
2017-10-11
Regulation of pre-mRNA splicing diversifies protein products and affects many biological processes. Arabidopsis thaliana Serine/Arginine-rich 45 (SR45), regulates pre-mRNA splicing by interacting with other regulatory proteins and spliceosomal subunits. Although SR45 has orthologs in diverse eukaryotes, including human RNPS1, the sr45-1 null mutant is viable. Narrow flower petals and reduced seed formation suggest that SR45 regulates genes involved in diverse processes, including reproduction. To understand how SR45 is involved in the regulation of reproductive processes, we studied mRNA from the wild-type and sr45-1 inflorescences using RNA-seq, and identified SR45-bound RNAs by immunoprecipitation. Using a variety of bioinformatics tools, we identified a total of 358 SR45 differentially regulated (SDR) genes, 542 SR45-dependent alternative splicing (SAS) events, and 1812 SR45-associated RNAs (SARs). There is little overlap between SDR genes and SAS genes, and neither set of genes is enriched for flower or seed development. However, transcripts from reproductive process genes are significantly overrepresented in SARs. In exploring the fate of SARs, we found that a total of 81 SARs are subject to alternative splicing, while 14 of them are known Nonsense-Mediated Decay (NMD) targets. Motifs related to GGNGG are enriched both in SARs and near different types of SAS events, suggesting that SR45 recognizes this motif directly. Genes involved in plant defense are significantly over-represented among genes whose expression is suppressed by SR45, and sr45-1 plants do indeed show enhanced immunity. We find that SR45 is a suppressor of innate immunity. We find that a single motif (GGNGG) is highly enriched in both RNAs bound by SR45 and in sequences near SR45- dependent alternative splicing events in inflorescence tissue. We find that the alternative splicing events regulated by SR45 are enriched for this motif whether the effect of SR45 is activation or repression of the particular event. Thus, our data suggests that SR45 acts to control splice site choice in a way that defies simple categorization as an activator or repressor of splicing.
Is an observed non-co-linear RNA product spliced in trans, in cis or just in vitro?
Yu, Chun-Ying; Liu, Hsiao-Jung; Hung, Li-Yuan; Kuo, Hung-Chih; Chuang, Trees-Juen
2014-01-01
Global transcriptome investigations often result in the detection of an enormous number of transcripts composed of non-co-linear sequence fragments. Such ‘aberrant’ transcript products may arise from post-transcriptional events or genetic rearrangements, or may otherwise be false positives (sequencing/alignment errors or in vitro artifacts). Moreover, post-transcriptionally non-co-linear (‘PtNcl’) transcripts can arise from trans-splicing or back-splicing in cis (to generate so-called ‘circular RNA’). Here, we collected previously-predicted human non-co-linear RNA candidates, and designed a validation procedure integrating in silico filters with multiple experimental validation steps to examine their authenticity. We showed that >50% of the tested candidates were in vitro artifacts, even though some had been previously validated by RT-PCR. After excluding the possibility of genetic rearrangements, we distinguished between trans-spliced and circular RNAs, and confirmed that these two splicing forms can share the same non-co-linear junction. Importantly, the experimentally-confirmed PtNcl RNA events and their corresponding PtNcl splicing types (i.e. trans-splicing, circular RNA, or both sharing the same junction) were all expressed in rhesus macaque, and some were even expressed in mouse. Our study thus describes an essential procedure for confirming PtNcl transcripts, and provides further insight into the evolutionary role of PtNcl RNA events, opening up this important, but understudied, class of post-transcriptional events for comprehensive characterization. PMID:25053845
Ganaie, Safder S; Chen, Aaron Yun; Huang, Chun; Xu, Peng; Kleiboeker, Steve; Du, Aifang; Qiu, Jianming
2018-04-15
Human parvovirus B19 (B19V) expresses a single precursor mRNA (pre-mRNA), which undergoes alternative splicing and alternative polyadenylation to generate 12 viral mRNA transcripts that encode two structural proteins (VP1 and VP2) and three nonstructural proteins (NS1, 7.5-kDa protein, and 11-kDa protein). Splicing at the second 5' donor site (D2 site) of the B19V pre-mRNA is essential for the expression of VP2 and the 11-kDa protein. We previously identified that cis -acting intronic splicing enhancer 2 (ISE2) that lies immediately after the D2 site facilitates the recognition of the D2 donor for its efficient splicing. In this study, we report that ISE2 is critical for the expression of the 11-kDa viral nonstructural protein. We found that ISE2 harbors a consensus RNA binding motif protein 38 (RBM38) binding sequence, 5'-UGUGUG-3'. RBM38 is expressed during the middle stage of erythropoiesis. We first confirmed that RBM38 binds specifically with the ISE2 element in vitro The knockdown of RBM38 significantly decreases the level of spliced mRNA at D2 that encodes the 11-kDa protein but not that of the D2-spliced mRNA that encodes VP2. Importantly, we found that the 11-kDa protein enhances viral DNA replication and virion release. Accordingly, the knockdown of RBM38 decreases virus replication via downregulating 11-kDa protein expression. Taken together, these results suggest that the 11-kDa protein facilitates B19V DNA replication and that RBM38 is an essential host factor for B19V pre-mRNA splicing and for the expression of the 11-kDa protein. IMPORTANCE B19V is a human pathogen that can cause fifth disease, arthropathy, anemia in immunocompromised patients and sickle cell disease patients, myocarditis, and hydrops fetalis in pregnant women. Human erythroid progenitor cells (EPCs) are most susceptible to B19V infection and fully support viral DNA replication. The exclusive tropism of B19V for erythroid-lineage cells is dependent not only on the expression of viral receptors and coreceptors on the cell surface but also on the intracellular host factors that support B19V replication. Our present study shows that B19V uses a host factor, RNA binding motif protein 38 (RBM38), for the processing of its pre-mRNA during virus replication. Specifically, RBM38 interacts with the intronic splicing enhancer 2 (ISE2) element of B19V pre-mRNA and promotes 11-kDa protein expression, thereby regulating the 11-kDa protein-mediated augmentation of B19V replication. The identification of this novel host-pathogen interaction will provide mechanistic insights into B19V replication and aid in finding new targets for anti-B19V therapeutics. Copyright © 2018 American Society for Microbiology.
Leong, Ivone U.S.; Dryland, Philippa A.; Prosser, Debra O.; Lai, Stella W.-S.; Graham, Mandy; Stiles, Martin; Crawford, Jackie; Skinner, Jonathan R.; Love, Donald R.
2017-01-01
Background Approximately 75% of clinically definite long QT syndrome (LQTS) cases are caused by mutations in the KCNQ1, KCNH2 and SCN5A genes. Of these mutations, a small proportion (3.2-9.2%) are predicted to affect splicing. These mutations present a particular challenge in ascribing pathogenicity. Methods Here we report an analysis of the transcriptional consequences of two mutations, one in the KCNQ1 gene (c.781_782delinsTC) and one in the SCN5A gene (c.2437-5C>A), which are predicted to affect splicing. We isolated RNA from lymphocytes and used a directed PCR amplification strategy of cDNA to show mis-spliced transcripts in mutation-positive patients. Results The loss of an exon in each mis-spliced transcript had no deduced effect on the translational reading frame. The clinical phenotype corresponded closely with genotypic status in family members carrying the KCNQ1 splice variant, but not in family members with the SCN5A splice variant. These results are put in the context of a literature review, where only 20% of all splice variants reported in the KCNQ1, KCNH2 and SCN5A gene entries in the HGMDPro 2015.4 database have been evaluated using transcriptional assays. Conclusions Prediction programmes play a strong role in most diagnostic laboratories in classifying variants located at splice sites; however, transcriptional analysis should be considered critical to confirm mis-splicing. Critically, this study shows that genuine mis- splicing may not always imply clinical significance, and genotype/phenotype cosegregation remains important even when mis-splicing is confirmed. PMID:28725320
Min, Xiang Jia
2013-01-01
Expressed Sequence Tags (ESTs) are a rich resource for identifying Alternatively Splicing (AS) genes. The ASFinder webserver is designed to identify AS isoforms from EST-derived sequences. Two approaches are implemented in ASFinder. If no genomic sequences are provided, the server performs a local BLASTN to identify AS isoforms from ESTs having both ends aligned but an internal segment unaligned. Otherwise, ASFinder uses SIM4 to map ESTs to the genome, then the overlapping ESTs that are mapped to the same genomic locus and have internal variable exon/intron boundaries are identified as AS isoforms. The tool is available at http://proteomics.ysu.edu/tools/ASFinder.html.
Duyk, G M; Kim, S W; Myers, R M; Cox, D R
1990-11-01
Identification and recovery of transcribed sequences from cloned mammalian genomic DNA remains an important problem in isolating genes on the basis of their chromosomal location. We have developed a strategy that facilitates the recovery of exons from random pieces of cloned genomic DNA. The basis of this "exon trapping" strategy is that, during a retroviral life cycle, genomic sequences of nonviral origin are correctly spliced and may be recovered as a cDNA copy of the introduced segment. By using this genetic assay for cis-acting sequences required for RNA splicing, we have screened approximately 20 kilobase pairs of cloned genomic DNA and have recovered all four predicted exons.
Duyk, G M; Kim, S W; Myers, R M; Cox, D R
1990-01-01
Identification and recovery of transcribed sequences from cloned mammalian genomic DNA remains an important problem in isolating genes on the basis of their chromosomal location. We have developed a strategy that facilitates the recovery of exons from random pieces of cloned genomic DNA. The basis of this "exon trapping" strategy is that, during a retroviral life cycle, genomic sequences of nonviral origin are correctly spliced and may be recovered as a cDNA copy of the introduced segment. By using this genetic assay for cis-acting sequences required for RNA splicing, we have screened approximately 20 kilobase pairs of cloned genomic DNA and have recovered all four predicted exons. PMID:2247475
Jiang, Qiang; Yang, Chun Hong; Zhang, Yan; Sun, Yan; Li, Rong Ling; Wang, Chang Fa; Zhong, Ji Feng; Huang, Jin Ming
2016-01-01
Alternative splicing (AS) contributes to the complexity of the mammalian proteome and plays an important role in diseases, including infectious diseases. The differential AS patterns of these transcript sequences between the healthy (HS3A) and mastitic (HS8A) cows naturally infected by Staphylococcus aureus were compared to understand the molecular mechanisms underlying mastitis resistance and susceptibility. In this study, using the Illumina paired-end RNA sequencing method, 1352 differentially expressed genes (DEGs) with higher than twofold changes were found in the HS3A and HS8A mammary gland tissues. Gene ontology and KEGG pathway analyses revealed that the cytokine–cytokine receptor interaction pathway is the most significantly enriched pathway. Approximately 16k annotated unigenes were respectively identified in two libraries, based on the bovine Bos taurus UMD3.1 sequence assembly and search. A total of 52.62% and 51.24% annotated unigenes were alternatively spliced in term of exon skipping, intron retention, alternative 5′ splicing and alternative 3ʹ splicing. Additionally, 1,317 AS unigenes were HS3A-specific, whereas 1,093 AS unigenes were HS8A-specific. Some immune-related genes, such as ITGB6, MYD88, ADA, ACKR1, and TNFRSF1B, and their potential relationships with mastitis were highlighted. From Chromosome 2, 4, 6, 7, 10, 13, 14, 17, and 20, 3.66% (HS3A) and 5.4% (HS8A) novel transcripts, which harbor known quantitative trait locus associated with clinical mastitis, were identified. Many DEGs in the healthy and mastitic mammary glands are involved in immune, defense, and inflammation responses. These DEGs, which exhibit diverse and specific splicing patterns and events, can endow dairy cattle with the potential complex genetic resistance against mastitis. PMID:27459697
Genomic structure and expression of STM2, the chromosome 1 familial Alzheimer disease gene
DOE Office of Scientific and Technical Information (OSTI.GOV)
Levy-Lahad, E.; Wang, Kai; Fu, Ying Hui
1996-06-01
Mutations in the gene STM2 result in autosomal dominant familial Alzheimer disease. To screen for mutations and to identify regulatory elements for this gene, the genomic DNA sequence and intron-exon structure were determined. Twelve exons including 10 coding exons were identified in a genomic region spanning 23, 737 bp. The first 2 exons encode the 5{prime}-untranslated region. Expression analysis of STM2 indicates that two transcripts of 2.4 and 2.8 kb are found in skeletal muscle, pancreas, and heart. In addition, a splice variant of the 2.4-kb transcript was identified that is the result of the use of an alternative splicemore » acceptor site located in exon 10. The use of this site results in a transcript lacking a single glutamate. The promotor for this gene and the alternatively spliced exons leading to the 2.8-kb form of the gene remain to be identified. Expression of STM2 was high in skeletal muscle and pancreas, with comparatively low levels observed in brain. This expression pattern is intriguing since in Alzheimer disease, pathology and degeneration are observed only in the central nervous system. 19 refs., 2 figs., 3 tabs.« less
Structure, dynamics and RNA binding of the multi-domain splicing factor TIA-1
Wang, Iren; Hennig, Janosch; Jagtap, Pravin Kumar Ankush; Sonntag, Miriam; Valcárcel, Juan; Sattler, Michael
2014-01-01
Alternative pre-messenger ribonucleic acid (pre-mRNA) splicing is an essential process in eukaryotic gene regulation. The T-cell intracellular antigen-1 (TIA-1) is an apoptosis-promoting factor that modulates alternative splicing of transcripts, including the pre-mRNA encoding the membrane receptor Fas. TIA-1 is a multi-domain ribonucleic acid (RNA) binding protein that recognizes poly-uridine tract RNA sequences to facilitate 5′ splice site recognition by the U1 small nuclear ribonucleoprotein (snRNP). Here, we characterize the RNA interaction and conformational dynamics of TIA-1 by nuclear magnetic resonance (NMR), isothermal titration calorimetry (ITC) and small angle X-ray scattering (SAXS). Our NMR-derived solution structure of TIA-1 RRM2–RRM3 (RRM2,3) reveals that RRM2 adopts a canonical RNA recognition motif (RRM) fold, while RRM3 is preceded by an non-canonical helix α0. NMR and SAXS data show that all three RRMs are largely independent structural modules in the absence of RNA, while RNA binding induces a compact arrangement. RRM2,3 binds to pyrimidine-rich FAS pre-mRNA or poly-uridine (U9) RNA with nanomolar affinities. RRM1 has little intrinsic RNA binding affinity and does not strongly contribute to RNA binding in the context of RRM1,2,3. Our data unravel the role of binding avidity and the contributions of the TIA-1 RRMs for recognition of pyrimidine-rich RNAs. PMID:24682828
Miyata, Y; Sugita, C; Maruyama, K; Sugita, M
2008-03-01
RNA editing of cytidine (C) to uridine (U) transitions occurs in plastids and mitochondria of most land plants. In this study, we amplified and sequenced the group I intron-containing tRNA Leu gene, trnL-CAA, from Takakia lepidozioides, a moss. DNA sequence analysis revealed that the T. lepidozioides tRNA Leu gene consisted of a 35-bp 5' exon, a 469-bp group I intron and a 50-bp 3' exon. The intron was inserted between the first and second position of the tRNA Leu anticodon. In general, plastid tRNA Leu genes with a group I intron code for a TAA anticodon in most land plants. This strongly suggests that the first nucleotide of the CAA anticodon could be edited in T. lepidozioides plastids. To investigate this possibility, we analysed cDNAs derived from the trnL-CAA transcripts. We demonstrated that the first nucleotide C of the anticodon was edited to create a canonical UAA anticodon in T. lepidozioides plastids. cDNA sequencing analyses of the spliced or unspliced tRNA Leu transcripts revealed that, while the spliced tRNA was completely edited, editing in the unspliced tRNAs were only partial. This is the first experimental evidence that the anticodon editing of tRNA occurs before RNA splicing in plastids. This suggests that this editing is a prerequisite to splicing of pre-tRNA Leu.
regSNPs-splicing: a tool for prioritizing synonymous single-nucleotide substitution.
Zhang, Xinjun; Li, Meng; Lin, Hai; Rao, Xi; Feng, Weixing; Yang, Yuedong; Mort, Matthew; Cooper, David N; Wang, Yue; Wang, Yadong; Wells, Clark; Zhou, Yaoqi; Liu, Yunlong
2017-09-01
While synonymous single-nucleotide variants (sSNVs) have largely been unstudied, since they do not alter protein sequence, mounting evidence suggests that they may affect RNA conformation, splicing, and the stability of nascent-mRNAs to promote various diseases. Accurately prioritizing deleterious sSNVs from a pool of neutral ones can significantly improve our ability of selecting functional genetic variants identified from various genome-sequencing projects, and, therefore, advance our understanding of disease etiology. In this study, we develop a computational algorithm to prioritize sSNVs based on their impact on mRNA splicing and protein function. In addition to genomic features that potentially affect splicing regulation, our proposed algorithm also includes dozens structural features that characterize the functions of alternatively spliced exons on protein function. Our systematical evaluation on thousands of sSNVs suggests that several structural features, including intrinsic disorder protein scores, solvent accessible surface areas, protein secondary structures, and known and predicted protein family domains, show significant differences between disease-causing and neutral sSNVs. Our result suggests that the protein structure features offer an added dimension of information while distinguishing disease-causing and neutral synonymous variants. The inclusion of structural features increases the predictive accuracy for functional sSNV prioritization.
RNA splicing factors as oncoproteins and tumor suppressors
Dvinge, Heidi; Kim, Eunhee; Abdel-Wahab, Omar; Bradley, Robert K.
2016-01-01
Preface The recent genomic characterization of cancers has revealed recurrent somatic point mutations and copy number changes affecting genes encoding RNA splicing factors. Initial studies of these ‘spliceosomal mutations’ suggest that the proteins bearing these mutations exhibit altered splice site and/or exon recognition preferences relative to their wild-type counterparts, resulting in cancer-specific mis-splicing. Such changes in the splicing machinery may create novel vulnerabilities in cancer cells that can be therapeutically exploited using compounds that can influence the splicing process. Further studies to dissect the biochemical, genomic, and biological effects of spliceosomal mutations are critical for the development of cancer therapies targeted to these mutations. PMID:27282250
Sadek, Jouliana
2016-01-01
ABSTRACT During lytic herpes simplex virus (HSV) infections, the virion host shutoff (Vhs) (UL41) endoribonuclease degrades many cellular and viral mRNAs. In uninfected cells, spliced mRNAs emerge into the cytoplasm bound by exon junction complexes (EJCs) and are translated several times more efficiently than unspliced mRNAs that have the same sequence but lack EJCs. Notably, most cellular mRNAs are spliced, whereas most HSV mRNAs are not. To examine the effect of splicing on gene expression during HSV infection, cells were transfected with plasmids harboring an unspliced renilla luciferase (RLuc) reporter mRNA or RLuc constructs with introns near the 5′ or 3′ end of the gene. After splicing of intron-containing transcripts, all three RLuc mRNAs had the same primary sequence. Upon infection in the presence of actinomycin D, spliced mRNAs were much less sensitive to degradation by copies of Vhs from infecting virions than were unspliced mRNAs. During productive infections (in the absence of drugs), RLuc was expressed at substantially higher levels from spliced than from unspliced mRNAs. Interestingly, the stimulatory effect of splicing on RLuc expression was significantly greater in infected than in uninfected cells. The translational stimulatory effect of an intron during HSV-1 infections could be replicated by artificially tethering various EJC components to an unspliced RLuc transcript. Thus, the splicing history of an mRNA, and the consequent presence or absence of EJCs, affects its level of translation and sensitivity to Vhs cleavage during lytic HSV infections. IMPORTANCE Most mammalian mRNAs are spliced. In contrast, of the more than 80 mRNAs harbored by herpes simplex virus 1 (HSV-1), only 5 are spliced. In addition, synthesis of the immediate early protein ICP27 causes partial inhibition of pre-mRNA splicing, with the resultant accumulation of both spliced and unspliced versions of some mRNAs in the cytoplasm. A common perception is that HSV-1 infection necessarily inhibits the expression of spliced mRNAs. In contrast, this study demonstrates two instances in which pre-mRNA splicing actually enhances the synthesis of proteins from mRNAs during HSV-1 infections. Specifically, splicing stabilized an mRNA against degradation by copies of the Vhs endoribonuclease from infecting virions and greatly enhanced the amount of protein synthesized from spliced mRNAs at late times after infection. The data suggest that splicing, and the resultant presence of exon junction complexes on an mRNA, may play an important role in gene expression during HSV-1 infections. PMID:27681125
Xia, Zunjing; Lin, Jie; Lu, Lingping; Kim, Chol; Yu, Ping; Qi, Ming
2018-06-01
: Hemophilia A is a bleeding disorder caused by coagulation factor VIII protein deficiency or dysfunction, which is classified into severe, moderate, and mild according to factor clotting activity. An overwhelming majority of missense and nonsense mutations occur in exons of F8 gene, whereas mutations in introns can also be pathogenic. This study aimed to investigate the effect of an intronic mutation, c.6430-3C>G (IVS22-3C>G), on pre-mRNA splicing of the F8 gene. We applied DNA and cDNA sequencing in a Chinese boy with hemophilia A to search if any pathogenic mutation in the F8 gene. Functional analysis was performed to investigate the effect of an intronic mutation at the transcriptional level. Human Splicing Finder and PyMol were also used to predict its effect. We found the mutation c.6430-3C>G (IVS22-3C>G) in the F8 gene in the affected boy, with his mother being a carrier. cDNA from the mother and pSPL3 splicing assay showed that the mutation IVS22-3C>G results in a two-nucleotide AG inclusion at the 3' end of intron 22 and leads to a truncated coagulation factor VIII protein, with partial loss of the C1 domain and complete loss of the C2 domain. The in-silico tool predicted that the mutation induces altered pre-mRNA splicing by using a cryptic acceptor site in intron 22. The IVS22-3C>G mutation was confirmed to affect pre-mRNA splicing and produce a truncated protein, which reduces the stability of binding between the F8 protein and von Willebrand factor carrier protein due to the loss of an interaction domain.
Chen, Letian; Wang, Fengpin; Wang, Xiaoyu; Liu, Yao-Guang
2013-01-01
Functional genomics requires vector construction for protein expression and functional characterization of target genes; therefore, a simple, flexible and low-cost molecular manipulation strategy will be highly advantageous for genomics approaches. Here, we describe a Ω-PCR strategy that enables multiple types of sequence modification, including precise insertion, deletion and substitution, in any position of a circular plasmid. Ω-PCR is based on an overlap extension site-directed mutagenesis technique, and is named for its characteristic Ω-shaped secondary structure during PCR. Ω-PCR can be performed either in two steps, or in one tube in combination with exonuclease I treatment. These strategies have wide applications for protein engineering, gene function analysis and in vitro gene splicing. PMID:23335613
Pasion, S G; Hines, J C; Ou, X; Mahmood, R; Ray, D S
1996-01-01
Gene expression in trypanosomatids appears to be regulated largely at the posttranscriptional level and involves maturation of mRNA precursors by trans splicing of a 39-nucleotide miniexon sequence to the 5' end of the mRNA and cleavage and polyadenylation at the 3' end of the mRNA. To initiate the identification of sequences involved in the periodic expression of DNA replication genes in trypanosomatids, we have mapped splice acceptor sites in the 5' flanking region of the TOP2 gene, which encodes the kinetoplast DNA topoisomerase, and have carried out deletion analysis of this region on a plasmid-encoded TOP2 gene. Block deletions within the 5' untranslated region (UTR) identified two regions (-608 to -388 and -387 to -186) responsible for periodic accumulation of the mRNA. Deletion of one or the other of these sequences had no effect on periodic expression of the mRNA, while deletion of both regions resulted in constitutive expression of the mRNA throughout the cell cycle. Subcloning of these sequences into the 5' UTR of a construct lacking both regions of the TOP2 5' UTR has shown that an octamer consensus sequence present in the 5' UTR of the TOP2, RPA1, and DHFR-TS mRNAs is required for normal cycling of the TOP2 mRNA. Mutation of the consensus octamer sequence in the TOP2 5' UTR in a plasmid construct containing only a single consensus octamer and that shows normal cycling of the plasmid-encoded TOP2 mRNA resulted in substantial reduction of the cycling of the mRNA level. These results imply a negative regulation of TOP2 mRNA during the cell cycle by a mechanism involving redundant elements containing one or more copies of a conserved octamer sequence within the 5' UTR of TOP2 mRNA. PMID:8943327
Pasion, S G; Hines, J C; Ou, X; Mahmood, R; Ray, D S
1996-12-01
Gene expression in trypanosomatids appears to be regulated largely at the posttranscriptional level and involves maturation of mRNA precursors by trans splicing of a 39-nucleotide miniexon sequence to the 5' end of the mRNA and cleavage and polyadenylation at the 3' end of the mRNA. To initiate the identification of sequences involved in the periodic expression of DNA replication genes in trypanosomatids, we have mapped splice acceptor sites in the 5' flanking region of the TOP2 gene, which encodes the kinetoplast DNA topoisomerase, and have carried out deletion analysis of this region on a plasmid-encoded TOP2 gene. Block deletions within the 5' untranslated region (UTR) identified two regions (-608 to -388 and -387 to -186) responsible for periodic accumulation of the mRNA. Deletion of one or the other of these sequences had no effect on periodic expression of the mRNA, while deletion of both regions resulted in constitutive expression of the mRNA throughout the cell cycle. Subcloning of these sequences into the 5' UTR of a construct lacking both regions of the TOP2 5' UTR has shown that an octamer consensus sequence present in the 5' UTR of the TOP2, RPA1, and DHFR-TS mRNAs is required for normal cycling of the TOP2 mRNA. Mutation of the consensus octamer sequence in the TOP2 5' UTR in a plasmid construct containing only a single consensus octamer and that shows normal cycling of the plasmid-encoded TOP2 mRNA resulted in substantial reduction of the cycling of the mRNA level. These results imply a negative regulation of TOP2 mRNA during the cell cycle by a mechanism involving redundant elements containing one or more copies of a conserved octamer sequence within the 5' UTR of TOP2 mRNA.
Huntingtin gene evolution in Chordata and its peculiar features in the ascidian Ciona genus
Gissi, Carmela; Pesole, Graziano; Cattaneo, Elena; Tartari, Marzia
2006-01-01
Background To gain insight into the evolutionary features of the huntingtin (htt) gene in Chordata, we have sequenced and characterized the full-length htt mRNA in the ascidian Ciona intestinalis, a basal chordate emerging as new invertebrate model organism. Moreover, taking advantage of the availability of genomic and EST sequences, the htt gene structure of a number of chordate species, including the cogeneric ascidian Ciona savignyi, and the vertebrates Xenopus and Gallus was reconstructed. Results The C. intestinalis htt transcript exhibits some peculiar features, such as spliced leader trans-splicing in the 98 nt-long 5' untranslated region (UTR), an alternative splicing in the coding region, eight alternative polyadenylation sites, and no similarities of both 5' and 3'UTRs compared to homologs of the cogeneric C. savignyi. The predicted protein is 2946 amino acids long, shorter than its vertebrate homologs, and lacks the polyQ and the polyP stretches found in the the N-terminal regions of mammalian homologs. The exon-intron organization of the htt gene is almost identical among vertebrates, and significantly conserved between Ciona and vertebrates, allowing us to hypothesize an ancestral chordate gene consisting of at least 40 coding exons. Conclusion During chordate diversification, events of gain/loss, sliding, phase changes, and expansion of introns occurred in both vertebrate and ascidian lineages predominantly in the 5'-half of the htt gene, where there is also evidence of lineage-specific evolutionary dynamics in vertebrates. On the contrary, the 3'-half of the gene is highly conserved in all chordates at the level of both gene structure and protein sequence. Between the two Ciona species, a fast evolutionary rate and/or an early divergence time is suggested by the absence of significant similarity between UTRs, protein divergence comparable to that observed between mammals and fishes, and different distribution of repetitive elements. PMID:17092333
USDA-ARS?s Scientific Manuscript database
Alternate pathways of RNA processing play an important role in the expression of the secreted (S) and membrane (Mb) forms of immunoglobulin (Ig) heavy (H) chain isotypes in all vertebrates. Interestingly, while the differential splicing mechanism and the splice sites that generate the two forms of I...
Nuclear sequestration of COL1A1 mRNA transcript associated with type I osteogenesis imperfecta (OI)
DOE Office of Scientific and Technical Information (OSTI.GOV)
Primorac, D.; Stover, M.L.; McKinstry, M.B.
Previously we identified an OI type I patient with a splice donor mutation that resulted in intron 26 retention instead of exon skipping and sequestration of normal levels of the mutant transcript in the nuclear compartment. Intron retention was consistent with the exon definition hypothesis for splice site selection since the size of the exon-intron-exon unit was less than 300 bp. Furthermore, the retained intron contained in-frame stop codons which is thought to cause the mutant RNA to remain within the nucleus rather than appearing in the cytoplasm. To test these hypotheses, genomic fragments containing the normal sequence or themore » donor mutation were cloned into a collagen minigene and expressed in stably tansfected NIH 3T3 cells. None of the modifications to the normal intron altered the level of RNA that accumulated in the cytoplasm, as expected. However none of the modifications to the mutant intron allowed accumulation of normal levels of mRNA in the cytoplasm. Moreover, in contrast to our findings in the patient`s cells only low levels of mutant transcript were found in the nucleus; a fraction of the transcript did appear in the cytoplasm which had spliced the mutant donor site correctly. Nuclear run-on experiments demonstrated equal levels of transcription from each transgene. Expression of another donor mutation known to cause in-frame exon skipping in OI type IV was accurately reproduced in the minigene in transfected 3T3 cells. Our experience suggests that either mechanism can lead to formation of a null allele possibly related to the type of splicing events surrounding the potential stop codons. Understanding the rules governing inactivation of a collagen RNA transcript may be important in designing a strategy to inactivate a dominate negative mutation associated with the more severe forms of OI.« less
Ren, Xiaojun; Deng, Ruijie; Wang, Lida; Zhang, Kaixiang
2017-01-01
RNA splicing, which mainly involves two transesterification steps, is a fundamental process of gene expression and its abnormal regulation contributes to serious genetic diseases. Antisense oligonucleotides (ASOs) are genetic control tools that can be used to specifically control genes through alteration of the RNA splicing pathway. Despite intensive research, how ASOs or various other factors influence the multiple processes of RNA splicing still remains obscure. This is largely due to an inability to analyze the splicing efficiency of each step in the RNA splicing process with high sensitivity. We addressed this limitation by introducing a padlock probe-based isothermal amplification assay to achieve quantification of the specific products in different splicing steps. With this amplified assay, the roles that ASOs play in RNA splicing inhibition in the first and second steps could be distinguished. We identified that 5′-ASO could block RNA splicing by inhibiting the first step, while 3′-ASO could block RNA splicing by inhibiting the second step. This method provides a versatile tool for assisting efficient ASO design and discovering new splicing modulators and therapeutic drugs. PMID:28989608
Genomic overview of mRNA 5′-leader trans-splicing in the ascidian Ciona intestinalis
Satou, Yutaka; Hamaguchi, Makoto; Takeuchi, Keisuke; Hastings, Kenneth E. M.; Satoh, Nori
2006-01-01
Although spliced leader (SL) trans-splicing in the chordates was discovered in the tunicate Ciona intestinalis there has been no genomic overview analysis of the extent of trans-splicing or the make-up of the trans-spliced and non-trans-spliced gene populations of this model organism. Here we report such an analysis for Ciona based on the oligo-capping full-length cDNA approach. We randomly sampled 2078 5′-full-length ESTs representing 668 genes, or 4.2% of the entire genome. Our results indicate that Ciona contains a single major SL, which is efficiently trans-spliced to mRNAs transcribed from a specific set of genes representing ∼50% of the total number of expressed genes, and that individual trans-spliced mRNA species are, on average, 2–3-fold less abundant than non-trans-spliced mRNA species. Our results also identify a relationship between trans-splicing status and gene functional classification; ribosomal protein genes fall predominantly into the non-trans-spliced category. In addition, our data provide the first evidence for the occurrence of polycistronic transcription in Ciona. An interesting feature of the Ciona polycistronic transcription units is that the great majority entirely lack intercistronic sequences. PMID:16822859
Hartmann, Linda; Neveling, Kornelia; Borkens, Stephanie; Schneider, Hildegard; Freund, Marcel; Grassman, Elke; Theiss, Stephan; Wawer, Angela; Burdach, Stefan; Auerbach, Arleen D.; Schindler, Detlev; Hanenberg, Helmut; Schaal, Heiner
2010-01-01
The U1 small nuclear RNA (U1 snRNA) as a component of the major U2-dependent spliceosome recognizes 5′ splice sites (5′ss) containing GT as the canonical dinucleotide in the intronic positions +1 and +2. The c.165+1G>T germline mutation in the 5′ss of exon 2 of the Fanconi anemia C (FANCC) gene commonly predicted to prevent correct splicing was identified in nine FA patients from three pedigrees. RT-PCR analysis of the endogenous FANCC mRNA splicing pattern of patient-derived fibroblasts revealed aberrant mRNA processing, but surprisingly also correct splicing at the TT dinucleotide, albeit with lower efficiency. This consequently resulted in low levels of correctly spliced transcript and minute levels of normal posttranslationally processed FANCD2 protein, indicating that this naturally occurring TT splicing might contribute to the milder clinical manifestations of the disease in these patients. Functional analysis of this FANCC 5′ss within splicing reporters revealed that both the noncanonical TT dinucleotide and the genomic context of FANCC were required for the residual correct splicing at this mutant 5′ss. Finally, use of lentiviral vectors as a delivery system to introduce expression cassettes for TT-adapted U1 snRNAs into primary FANCC patient fibroblasts allowed the correction of the DNA-damage-induced G2 cell-cycle arrest in these cells, thus representing an alternative transcript-targeting approach for genetic therapy of inherited splice-site mutations. PMID:20869034
Sjursen, Wenche; Bjørnevoll, Inga; Engebretsen, Lars F; Fjelland, Kristine; Halvorsen, Tore; Myrvold, Helge E
2009-01-01
Turcot syndrome is a rare, inherited disease predisposing of tumours in the central nerve system and in the colorectal system. This report describes a Turcot patient with an extraordinary clinical history. The patient is still alive at the age of 43. She was operated at the age of 10 by brain tumour and at the age of 16 by colorectal cancer. She has since then been treated for multiple cancers (gastrointestinal, endometrial, basal cell carcinomas), and removal of adenomatous polyps at several occasions. The aim of this work was to investigate if there was any specific genotype that explains her remarkable clinical history. Microsatellite instability and immunohistochemistry analysis for four DNA mismatch repair proteins were performed. DNA mutation analysis was done for genes involved in polyposis and mismatch repair by denaturing high performance liquid chromatography and sequencing. cDNA analysis was carried out for the mismatch repair gene PMS2. The patients genotype was found to be a homozygous splice site mutation in the PMS2 gene, c.989-1G
Echigoya, Yusuke; Mouly, Vincent; Garcia, Luis; Yokota, Toshifumi; Duddy, William
2015-01-01
The use of antisense ‘splice-switching’ oligonucleotides to induce exon skipping represents a potential therapeutic approach to various human genetic diseases. It has achieved greatest maturity in exon skipping of the dystrophin transcript in Duchenne muscular dystrophy (DMD), for which several clinical trials are completed or ongoing, and a large body of data exists describing tested oligonucleotides and their efficacy. The rational design of an exon skipping oligonucleotide involves the choice of an antisense sequence, usually between 15 and 32 nucleotides, targeting the exon that is to be skipped. Although parameters describing the target site can be computationally estimated and several have been identified to correlate with efficacy, methods to predict efficacy are limited. Here, an in silico pre-screening approach is proposed, based on predictive statistical modelling. Previous DMD data were compiled together and, for each oligonucleotide, some 60 descriptors were considered. Statistical modelling approaches were applied to derive algorithms that predict exon skipping for a given target site. We confirmed (1) the binding energetics of the oligonucleotide to the RNA, and (2) the distance in bases of the target site from the splice acceptor site, as the two most predictive parameters, and we included these and several other parameters (while discounting many) into an in silico screening process, based on their capacity to predict high or low efficacy in either phosphorodiamidate morpholino oligomers (89% correctly predicted) and/or 2’O Methyl RNA oligonucleotides (76% correctly predicted). Predictions correlated strongly with in vitro testing for sixteen de novo PMO sequences targeting various positions on DMD exons 44 (R2 0.89) and 53 (R2 0.89), one of which represents a potential novel candidate for clinical trials. We provide these algorithms together with a computational tool that facilitates screening to predict exon skipping efficacy at each position of a target exon. PMID:25816009
Recurrent chimeric RNAs enriched in human prostate cancer identified by deep sequencing
Kannan, Kalpana; Wang, Liguo; Wang, Jianghua; Ittmann, Michael M.; Li, Wei; Yen, Laising
2011-01-01
Transcription-induced chimeric RNAs, possessing sequences from different genes, are expected to increase the proteomic diversity through chimeric proteins or altered regulation. Despite their importance, few studies have focused on chimeric RNAs especially regarding their presence/roles in human cancers. By deep sequencing the transcriptome of 20 human prostate cancer and 10 matched benign prostate tissues, we obtained 1.3 billion sequence reads, which led to the identification of 2,369 chimeric RNA candidates. Chimeric RNAs occurred in significantly higher frequency in cancer than in matched benign samples. Experimental investigation of a selected 46 set led to the confirmation of 32 chimeric RNAs, of which 27 were highly recurrent and previously undescribed in prostate cancer. Importantly, a subset of these chimeras was present in prostate cancer cell lines, but not detectable in primary human prostate epithelium cells, implying their associations with cancer. These chimeras contain discernable 5′ and 3′ splice sites at the RNA junction, indicating that their formation is mediated by splicing. Their presence is also largely independent of the expression of parental genes, suggesting that other factors are involved in their production and regulation. One chimera, TMEM79-SMG5, is highly differentially expressed in human cancer samples and therefore a potential biomarker. The prevalence of chimeric RNAs may allow the limited number of human genes to encode a substantially larger number of RNAs and proteins, forming an additional layer of cellular complexity. Together, our results suggest that chimeric RNAs are widespread, and increased chimeric RNA events could represent a unique class of molecular alteration in cancer. PMID:21571633
Camats, Núria; Pandey, Amit V; Fernández-Cancio, Mónica; Fernández, Juan M; Ortega, Ana M; Udhane, Sameer; Andaluz, Pilar; Audí, Laura; Flück, Christa E
2014-02-01
The steroidogenic acute regulatory protein (StAR) transports cholesterol to the mitochondria for steroidogenesis. Loss of StAR function causes lipoid congenital adrenal hyperplasia (LCAH) which is characterized by impaired synthesis of adrenal and gonadal steroids causing adrenal insufficiency, 46,XY disorder of sex development (DSD) and failure of pubertal development. Partial loss of StAR activity may cause adrenal insufficiency only. A newborn girl was admitted for mild dehydration, hyponatremia, hyperkalemia and hypoglycaemia and had normal external female genitalia without hyperpigmentation. Plasma cortisol, 17OH-progesterone, DHEA-S, androstendione and aldosterone were low, while ACTH and plasma renin activity were elevated, consistent with the diagnosis of primary adrenal insufficiency. Imaging showed normal adrenals, and cytogenetics revealed a 46,XX karyotype. She was treated with fluids, hydrocortisone and fludrocortisone. Genetic studies revealed a novel homozygous STAR mutation in the 3' acceptor splice site of intron 4, c.466-1G>A (IVS4-1G>A). To test whether this mutation would affect splicing, we performed a minigene experiment with a plasmid construct containing wild-type or mutant StAR gDNA of exons-introns 4-6 in COS-1 cells. The splicing was assessed on total RNA using RT-PCR for STAR cDNAs. The mutant STAR minigene skipped exon 5 completely and changed the reading frame. Thus, it is predicted to produce an aberrant and shorter protein (p.V156GfsX19). Computational analysis revealed that this mutant protein lacks wild-type exons 5-7 which are essential for StAR-cholesterol interaction. STAR c.466-1A skips exon 5 and causes a dramatic change in the C-terminal sequence of the protein, which is essential for StAR-cholesterol interaction. This splicing mutation is a loss-of-function mutation explaining the severe phenotype of our patient. Thus far, all reported splicing mutations of STAR cause a severe impairment of protein function and phenotype. © 2013 John Wiley & Sons Ltd.
X-linked hypophosphatemia attributable to pseudoexons of the PHEX gene.
Christie, P T; Harding, B; Nesbit, M A; Whyte, M P; Thakker, R V
2001-08-01
X-linked hypophosphatemia is commonly caused by mutations of the coding region of PHEX (phosphate-regulating gene with homologies to endopeptidases on the X chromosome). However, such PHEX mutations are not detected in approximately one third of X-linked hypophosphatemia patients who may harbor defects in the noncoding or intronic regions. We have therefore investigated 11 unrelated X-linked hypophosphatemia patients in whom coding region mutations had been excluded, for intronic mutations that may lead to mRNA splicing abnormalities, by the use of lymphoblastoid RNA and RT-PCRs. One X-linked hypophosphatemia patient was found to have 3 abnormally large transcripts, resulting from 51-bp, 100-bp, and 170-bp insertions, all of which would lead to missense peptides and premature termination codons. The origin of these transcripts was a mutation (g to t) at position +1268 of intron 7, which resulted in the occurrence of a high quality novel donor splice site (ggaagg to gtaagg). Splicing between this novel donor splice site and 3 preexisting, but normally silent, acceptor splice sites within intron 7 resulted in the occurrences of the 3 pseudoexons. This represents the first report of PHEX pseudoexons and reveals further the diversity of genetic abnormalities causing X-linked hypophosphatemia.
Characterization of the canine mda-7 gene, transcripts and expression patterns
Sandey, Maninder; Bird, R. Curtis; Das, Swadesh K.; Sarkar, Devanand; Curiel, David T.; Fisher, Paul B.; Smith, Bruce F.
2014-01-01
Human melanoma differentiation associated gene-7/interleukin-24 (mda-7/IL-24) displays potent growth suppressing and cell killing activity against a wide variety of human and rodent cancer cells. In this study, we identified a canine ortholog of the human mda-7/IL-24 gene located within a cluster of IL-10 family members on chromosome 7. The full-length mRNA sequence of canine mda-7 was determined, which encodes a 186-amino acid protein that has 66% similarity to human MDA-7/IL-24. Canine MDA-7 is constitutively expressed in cultured normal canine epidermal keratinocytes (NCEKs), and its expression levels are increased after lipopolysaccharide stimulation. In cultured NCEKs, the canine mda-7 pre-mRNA is differentially spliced, via exon skipping and alternate 5′-splice donor sites, to yield five splice variants (canine mda-7sv1, canine mda-7sv2, canine mda-7sv3, canine mda-7sv4 and canine mda-7sv5) that encode four protein isoforms of the canine MDA-7 protein. These protein isoforms have a conserved N-terminus (signal peptide sequence) and are dissimilar in amino acid sequences at their C-terminus. Canine MDA-7 is not expressed in primary canine tumor samples, and most tumor derived cancer cell lines tested, like its human counterpart. Unlike human MDA-7/IL-24, canine mda-7 mRNA is not expressed in unstimulated or lipopolysaccharide (LPS), concanavalin A (ConA) or phytohemagglutinin (PHA) stimulated canine peripheral blood mononuclear cells (PBMCs). Furthermore, in-silico analysis revealed that canonical canine MDA-7 has a potential 28 amino acid signal peptide sequence that can target it for active secretion. This data suggests that canine mda-7 is indeed an ortholog of human mda-7/IL-24, its protein product has high amino acid similarity to human MDA-7/IL-24 protein and it may possess similar biological properties to human MDA-7/IL-24, but its expression pattern is more restricted than its human ortholog. PMID:24865935
DOE Office of Scientific and Technical Information (OSTI.GOV)
Funkenstein, B.; Leary, S.L.; Stein, J.C.
1988-03-01
The Gus-s/sup ..cap alpha../ allele of the mouse ..beta..-glucuronidase gene exhibits a high degree of inducibility by androgens due to its linkage with the Gus-r/sup ..cap alpha../ regulatory locus. The authors isolated Gus-s/sup ..cap alpha../ on a 28-kilobase pair fragment of mouse chromosome 5 and found that it contains 12 exons and 11 intervening sequences spanning 14 kilobase pairs of this genomic segment. The mRNA cap site was identified by ribonuclease protection and primer extension analyses which revealed an unusually short 5' noncoding sequence of 12 nucleotides. Proximal regulatory sequences in the 5'-flanking DNA and the complete sequence of themore » Gus-s/sup ..cap alpha../ mRNA transcript were also determined. Comparison of the amino acid sequence determined from the Gus-s/sup ..cap alpha../ nucleotide sequence with that of human ..beta..-glucuronidase indicated that the two human mRNA species differ due to alternate splicing of an exon homologous to exon 6 of the mouse gene.« less
[Alternative splicing regulation: implications in cancer diagnosis and treatment].
Martínez-Montiel, Nancy; Rosas-Murrieta, Nora; Martínez-Contreras, Rebeca
2015-04-08
The accurate expression of the genetic information is regulated by processes like mRNA splicing, proposed after the discoveries of Phil Sharp and Richard Roberts, who demonstrated the existence of intronic sequences, present in almost every structural eukaryotic gene, which should be precisely removed. This intron removal is called "splicing", which generates different proteins from a single mRNA, with different or even antagonistic functions. We currently know that alternative splicing is the most important source of protein diversity, given that 70% of the human genes undergo splicing and that mutations causing defects in this process could originate up to 50% of genetic diseases, including cancer. When these defects occur in genes involved in cell adhesion, proliferation and cell cycle regulation, there is an impact on cancer progression, rising the opportunity to diagnose and treat some types of cancer according to a particular splicing profile. Copyright © 2013 Elsevier España, S.L.U. All rights reserved.
Evolution of a tissue-specific splicing network
Taliaferro, J. Matthew; Alvarez, Nehemiah; Green, Richard E.; Blanchette, Marco; Rio, Donald C.
2011-01-01
Alternative splicing of precursor mRNA (pre-mRNA) is a strategy employed by most eukaryotes to increase transcript and proteomic diversity. Many metazoan splicing factors are members of multigene families, with each member having different functions. How these highly related proteins evolve unique properties has been unclear. Here we characterize the evolution and function of a new Drosophila splicing factor, termed LS2 (Large Subunit 2), that arose from a gene duplication event of dU2AF50, the large subunit of the highly conserved heterodimeric general splicing factor U2AF (U2-associated factor). The quickly evolving LS2 gene has diverged from the splicing-promoting, ubiquitously expressed dU2AF50 such that it binds a markedly different RNA sequence, acts as a splicing repressor, and is preferentially expressed in testes. Target transcripts of LS2 are also enriched for performing testes-related functions. We therefore propose a path for the evolution of a new splicing factor in Drosophila that regulates specific pre-mRNAs and contributes to transcript diversity in a tissue-specific manner. PMID:21406555
Livingstone, Mark; Folkman, Lukas; Yang, Yuedong; Zhang, Ping; Mort, Matthew; Cooper, David N; Liu, Yunlong; Stantic, Bela; Zhou, Yaoqi
2017-10-01
Synonymous single-nucleotide variants (SNVs), although they do not alter the encoded protein sequences, have been implicated in many genetic diseases. Experimental studies indicate that synonymous SNVs can lead to changes in the secondary and tertiary structures of DNA and RNA, thereby affecting translational efficiency, cotranslational protein folding as well as the binding of DNA-/RNA-binding proteins. However, the importance of these various features in disease phenotypes is not clearly understood. Here, we have built a support vector machine (SVM) model (termed DDIG-SN) as a means to discriminate disease-causing synonymous variants. The model was trained and evaluated on nearly 900 disease-causing variants. The method achieves robust performance with the area under the receiver operating characteristic curve of 0.84 and 0.85 for protein-stratified 10-fold cross-validation and independent testing, respectively. We were able to show that the disease-causing effects in the immediate proximity to exon-intron junctions (1-3 bp) are driven by the loss of splicing motif strength, whereas the gain of splicing motif strength is the primary cause in regions further away from the splice site (4-69 bp). The method is available as a part of the DDIG server at http://sparks-lab.org/ddig. © 2017 Wiley Periodicals, Inc.
Informational structure of genetic sequences and nature of gene splicing
NASA Astrophysics Data System (ADS)
Trifonov, E. N.
1991-10-01
Only about 1/20 of DNA of higher organisms codes for proteins, by means of classical triplet code. The rest of DNA sequences is largely silent, with unclear functions, if any. The triplet code is not the only code (message) carried by the sequences. There are three levels of molecular communication, where the same sequence ``talks'' to various bimolecules, while having, respectively, three different appearances: DNA, RNA and protein. Since the molecular structures and, hence, sequence specific preferences of these are substantially different, the original DNA sequence has to carry simultaneously three types of sequence patterns (codes, messages), thus, being a composite structure in which one had the same letter (nucleotide) is frequently involved in several overlapping codes of different nature. This multiplicity and overlapping of the codes is a unique feature of the Gnomic, language of genetic sequences. The coexisting codes have to be degenerate in various degrees to allow an optimal and concerted performance of all the encoded functions. There is an obvious conflict between the best possible performance of a given function and necessity to compromise the quality of a given sequence pattern in favor of other patterns. It appears that the major role of various changes in the sequences on their ``ontogenetic'' way from DNA to RNA to protein, like RNA editing and splicing, or protein post-translational modifications is to resolve such conflicts. New data are presented strongly indicating that the gene splicing is such a device to resolve the conflict between the code of DNA folding in chromatin and the triplet code for protein synthesis.
Walline, Heather M; Komarck, Christine M; McHugh, Jonathan B; Tang, Alice L; Owen, John H; Teh, Bin T; McKean, Erin; Glover, Thomas; Graham, Martin P; Prince, Mark E; Chepeha, Douglas B; Chinn, Steven B; Ferris, Robert L; Gollin, Susanne M; Hoffmann, Thomas K; Bier, Henning; Brakenhoff, Ruud; Bradford, Carol R; Carey, Thomas E
2017-01-01
Background HPV-positive oropharyngeal cancer is generally associated with excellent response to therapy, but some HPV-positive tumors progress despite aggressive therapy. This study evaluates viral oncogene expression and viral integration sites in HPV16 and HPV18-positive squamous carcinoma cell lines. Methods E6-E7 alternate transcripts were assessed by RT-PCR. Detection of integrated papillomavirus sequences (DIPS-PCR) and sequencing identified viral insertion sites and affected host genes. Cellular gene expression was assessed across viral integration sites. Results All HPV-positive cell lines expressed alternate HPVE6/E7 splicing indicative of active viral oncogenesis. HPV integration occurred within cancer-related genes TP63, DCC, JAK1, TERT, ATR, ETV6, PGR, PTPRN2, and TMEM237 in 8 HNSCC lines but UM-SCC-105 and UM-GCC-1 had only intergenic integration. Conclusions HPV integration into cancer-related genes occurred in 7/9 HPV-positive cell lines and of these six were from tumors that progressed. HPV integration into cancer-related genes may be a secondary carcinogenic driver in HPV-driven tumors. PMID:28236344
Haas, Brian J; Salzberg, Steven L; Zhu, Wei; Pertea, Mihaela; Allen, Jonathan E; Orvis, Joshua; White, Owen; Buell, C Robin; Wortman, Jennifer R
2008-01-01
EVidenceModeler (EVM) is presented as an automated eukaryotic gene structure annotation tool that reports eukaryotic gene structures as a weighted consensus of all available evidence. EVM, when combined with the Program to Assemble Spliced Alignments (PASA), yields a comprehensive, configurable annotation system that predicts protein-coding genes and alternatively spliced isoforms. Our experiments on both rice and human genome sequences demonstrate that EVM produces automated gene structure annotation approaching the quality of manual curation. PMID:18190707
ERIC Educational Resources Information Center
Rice, Michael; Gladstone, William; Weir, Michael
2004-01-01
We discuss how relational databases constitute an ideal framework for representing and analyzing large-scale genomic data sets in biology. As a case study, we describe a Drosophila splice-site database that we recently developed at Wesleyan University for use in research and teaching. The database stores data about splice sites computed by a…
HSJ1-related hereditary neuropathies: novel mutations and extended clinical spectrum.
Gess, Burkhard; Auer-Grumbach, Michaela; Schirmacher, Anja; Strom, Tim; Zitzelsberger, Manuela; Rudnik-Schöneborn, Sabine; Röhr, Dominik; Halfter, Hartmut; Young, Peter; Senderek, Jan
2014-11-04
To determine the nature and frequency of HSJ1 mutations in patients with hereditary motor and hereditary motor and sensory neuropathies. Patients were screened for mutations by genome-wide or targeted linkage and homozygosity studies, whole-exome sequencing, and Sanger sequencing. RNA and protein studies of skin fibroblasts were used for functional characterization. We describe 2 additional mutations in the HSJ1 gene in a cohort of 90 patients with autosomal recessive distal hereditary motor neuropathy (dHMN) and Charcot-Marie-Tooth disease type 2 (CMT2). One family with a dHMN phenotype showed the homozygous splice-site mutation c.229+1G>A, which leads to retention of intron 4 in the HSJ1 messenger RNA with a premature stop codon and loss of protein expression. Another family, presenting with a CMT2 phenotype, carried the homozygous missense mutation c.14A>G (p.Tyr5Cys). This mutation was classified as likely disease-related by several automatic algorithms for prediction of possible impact of an amino acid substitution on the structure and function of proteins. Both mutations cosegregated with autosomal recessive inheritance of the disease and were absent from the general population. Taken together, in our cohort of 90 probands, we confirm that HSJ1 mutations are a rare but detectable cause of autosomal recessive dHMN and CMT2. We provide clinical and functional information on an HSJ1 splice-site mutation and report the detailed phenotype of 2 patients with CMT2, broadening the phenotypic spectrum of HSJ1-related neuropathies. © 2014 American Academy of Neurology.
76 FR 59590 - Airworthiness Directives; The Boeing Company Airplanes
Federal Register 2010, 2011, 2012, 2013, 2014
2011-09-27
... web lap and tear strap splices of the aft pressure bulkhead at STA 1582 due to fatigue. We are... radial web lap and tear strap splices of the aft pressure bulkhead at station (STA) 1582 due to fatigue... prompted by reports of multiple site damage cracks in the radial web lap and tear strap splices of the aft...