Xu, Dong-Qing; Mattox, William
2006-01-01
Exonic splicing enhancers (ESEs) are sequences that facilitate recognition of splice sites and prevent exon-skipping. Because ESEs are often embedded within proteincoding sequences, alterations in them can also often be interpreted as nonsense, missense or silent mutations. To correctly interpret exonic mutations and their roles in disease, it is important to develop strategies that identify ESE mutations. Potential ESEs can be found computationally in many exons but it has proven difficult to predict if a given mutation will have effects on splicing based on sequence alone. Here we describe a flexible in vitro method that can be used to functionally compare the effects of multiple sequence variants on ESE activity in a single in vitro splicing reaction. We have applied this method in parallel with conventional splicing assays to test for a splicing enhancer in exon 17 of the human MLH1 gene. Point mutations associated with hereditary nonpolyposis colorectal cancer (HNPCC) have previously been found to correlate with exon-skipping in both lymphocytes and tumors from patients. We show that sequences from this exon can replace an ESE from the mouse IgM gene to support RNA splicing in HeLa nuclear extracts. ESE activity was reduced by HNPCC point mutations in codon 659 indicating that their primary effect is on splicing. Surprisingly the strongest enhancer function mapped to a different region of the exon upstream of this codon. Together our results indicate that HNPCC point mutations in codon 659 affect an auxillary element that augments the enhancer function to ensure exon inclusion. PMID:16357104
A 5′ Splice Site-Proximal Enhancer Binds SF1 and Activates Exon Bridging of a Microexon
Carlo, Troy; Sierra, Rebecca; Berget, Susan M.
2000-01-01
Internal exon size in vertebrates occurs over a narrow size range. Experimentally, exons shorter than 50 nucleotides are poorly included in mRNA unless accompanied by strengthened splice sites or accessory sequences that act as splicing enhancers, suggesting steric interference between snRNPs and other splicing factors binding simultaneously to the 3′ and 5′ splice sites of microexons. Despite these problems, very small naturally occurring exons exist. Here we studied the factors and mechanism involved in recognizing a constitutively included six-nucleotide exon from the cardiac troponin T gene. Inclusion of this exon is dependent on an enhancer located downstream of the 5′ splice site. This enhancer contains six copies of the simple sequence GGGGCUG. The enhancer activates heterologous microexons and will work when located either upstream or downstream of the target exon, suggesting an ability to bind factors that bridge splicing units. A single copy of this sequence is sufficient for in vivo exon inclusion and is the binding site for the known bridging mammalian splicing factor 1 (SF1). The enhancer and its bound SF1 act to increase recognition of the upstream exon during exon definition, such that competition of in vitro reactions with RNAs containing the GGGGCUG repeated sequence depress splicing of the upstream intron, assembly of the spliceosome on the 3′ splice site of the exon, and cross-linking of SF1. These results suggest a model in which SF1 bridges the small exon during initial assembly, thereby effectively extending the domain of the exon. PMID:10805741
Multiple splicing defects in an intronic false exon.
Sun, H; Chasin, L A
2000-09-01
Splice site consensus sequences alone are insufficient to dictate the recognition of real constitutive splice sites within the typically large transcripts of higher eukaryotes, and large numbers of pseudoexons flanked by pseudosplice sites with good matches to the consensus sequences can be easily designated. In an attempt to identify elements that prevent pseudoexon splicing, we have systematically altered known splicing signals, as well as immediately adjacent flanking sequences, of an arbitrarily chosen pseudoexon from intron 1 of the human hprt gene. The substitution of a 5' splice site that perfectly matches the 5' consensus combined with mutation to match the CAG/G sequence of the 3' consensus failed to get this model pseudoexon included as the central exon in a dhfr minigene context. Provision of a real 3' splice site and a consensus 5' splice site and removal of an upstream inhibitory sequence were necessary and sufficient to confer splicing on the pseudoexon. This activated context also supported the splicing of a second pseudoexon sequence containing no apparent enhancer. Thus, both the 5' splice site sequence and the polypyrimidine tract of the pseudoexon are defective despite their good agreement with the consensus. On the other hand, the pseudoexon body did not exert a negative influence on splicing. The introduction into the pseudoexon of a sequence selected for binding to ASF/SF2 or its replacement with beta-globin exon 2 only partially reversed the effect of the upstream negative element and the defective polypyrimidine tract. These results support the idea that exon-bridging enhancers are not a prerequisite for constitutive exon definition and suggest that intrinsically defective splice sites and negative elements play important roles in distinguishing the real splicing signal from the vast number of false splicing signals.
Conservation of CD44 exon v3 functional elements in mammals
Vela, Elena; Hilari, Josep M; Delclaux, María; Fernández-Bellon, Hugo; Isamat, Marcos
2008-01-01
Background The human CD44 gene contains 10 variable exons (v1 to v10) that can be alternatively spliced to generate hundreds of different CD44 protein isoforms. Human CD44 variable exon v3 inclusion in the final mRNA depends on a multisite bipartite splicing enhancer located within the exon itself, which we have recently described, and provides the protein domain responsible for growth factor binding to CD44. Findings We have analyzed the sequence of CD44v3 in 95 mammalian species to report high conservation levels for both its splicing regulatory elements (the 3' splice site and the exonic splicing enhancer), and the functional glycosaminglycan binding site coded by v3. We also report the functional expression of CD44v3 isoforms in peripheral blood cells of different mammalian taxa with both consensus and variant v3 sequences. Conclusion CD44v3 mammalian sequences maintain all functional splicing regulatory elements as well as the GAG binding site with the same relative positions and sequence identity previously described during alternative splicing of human CD44. The sequence within the GAG attachment site, which in turn contains the Y motif of the exonic splicing enhancer, is more conserved relative to the rest of exon. Amplification of CD44v3 sequence from mammalian species but not from birds, fish or reptiles, may lead to classify CD44v3 as an exclusive mammalian gene trait. PMID:18710510
Smith, Lindsay D.; Dickinson, Rachel L.; Lucas, Christian M.; Cousins, Alex; Malygin, Alexey A.; Weldon, Carika; Perrett, Andrew J.; Bottrill, Andrew R.; Searle, Mark S.; Burley, Glenn A.; Eperon, Ian C.
2014-01-01
Summary The use of oligonucleotides to activate the splicing of selected exons is limited by a poor understanding of the mechanisms affected. A targeted bifunctional oligonucleotide enhancer of splicing (TOES) anneals to SMN2 exon 7 and carries an exonic splicing enhancer (ESE) sequence. We show that it stimulates splicing specifically of intron 6 in the presence of repressing sequences in intron 7. Complementarity to the 5′ end of exon 7 increases U2AF65 binding, but the ESE sequence is required for efficient recruitment of U2 snRNP. The ESE forms at least three coexisting discrete states: a quadruplex, a complex containing only hnRNP F/H, and a complex enriched in the activator SRSF1. Neither hnRNP H nor quadruplex formation contributes to ESE activity. The results suggest that splicing limited by weak signals can be rescued by rapid exchange of TOES oligonucleotides in various complexes and raise the possibility that SR proteins associate transiently with ESEs. PMID:25263560
Can the HIV-1 splicing machinery be targeted for drug discovery?
Dlamini, Zodwa; Hull, Rodney
2017-01-01
HIV-1 is able to express multiple protein types and isoforms from a single 9 kb mRNA transcript. These proteins are also expressed at particular stages of viral development, and this is achieved through the control of alternative splicing and the export of these transcripts from the nucleus. The nuclear export is controlled by the HIV protein Rev being required to transport incompletely spliced and partially spliced mRNA from the nucleus where they are normally retained. This implies a close relationship between the control of alternate splicing and the nuclear export of mRNA in the control of HIV-1 viral proliferation. This review discusses both the processes. The specificity and regulation of splicing in HIV-1 is controlled by the use of specific splice sites as well as exonic splicing enhancer and exonic splicing silencer sequences. The use of these silencer and enhancer sequences is dependent on the serine arginine family of proteins as well as the heterogeneous nuclear ribonucleoprotein family of proteins that bind to these sequences and increase or decrease splicing. Since alternative splicing is such a critical factor in viral development, it presents itself as a promising drug target. This review aims to discuss the inhibition of splicing, which would stall viral development, as an anti-HIV therapeutic strategy. In this review, the most recent knowledge of splicing in human immunodeficiency viral development and the latest therapeutic strategies targeting human immunodeficiency viral splicing are discussed. PMID:28331370
The power of fission: yeast as a tool for understanding complex splicing.
Fair, Benjamin Jung; Pleiss, Jeffrey A
2017-06-01
Pre-mRNA splicing is an essential component of eukaryotic gene expression. Many metazoans, including humans, regulate alternative splicing patterns to generate expansions of their proteome from a limited number of genes. Importantly, a considerable fraction of human disease causing mutations manifest themselves through altering the sequences that shape the splicing patterns of genes. Thus, understanding the mechanistic bases of this complex pathway will be an essential component of combating these diseases. Dating almost to the initial discovery of splicing, researchers have taken advantage of the genetic tractability of budding yeast to identify the components and decipher the mechanisms of splicing. However, budding yeast lacks the complex splicing machinery and alternative splicing patterns most relevant to humans. More recently, many researchers have turned their efforts to study the fission yeast, Schizosaccharomyces pombe, which has retained many features of complex splicing, including degenerate splice site sequences, the usage of exonic splicing enhancers, and SR proteins. Here, we review recent work using fission yeast genetics to examine pre-mRNA splicing, highlighting its promise for modeling the complex splicing seen in higher eukaryotes.
A survey of the sorghum transcriptome using single-molecule long reads
Abdel-Ghany, Salah E.; Hamilton, Michael; Jacobi, Jennifer L.; ...
2016-06-24
Alternative splicing and alternative polyadenylation (APA) of pre-mRNAs greatly contribute to transcriptome diversity, coding capacity of a genome and gene regulatory mechanisms in eukaryotes. Second-generation sequencing technologies have been extensively used to analyse transcriptomes. However, a major limitation of short-read data is that it is difficult to accurately predict full-length splice isoforms. Here we sequenced the sorghum transcriptome using Pacific Biosciences single-molecule real-time long-read isoform sequencing and developed a pipeline called TAPIS (Transcriptome Analysis Pipeline for Isoform Sequencing) to identify full-length splice isoforms and APA sites. Our analysis reveals transcriptome-wide full-length isoforms at an unprecedented scale with over 11,000 novelmore » splice isoforms. Additionally, we uncover APA ofB11,000 expressed genes and more than 2,100 novel genes. Lastly, these results greatly enhance sorghum gene annotations and aid in studying gene regulation in this important bioenergy crop. The TAPIS pipeline will serve as a useful tool to analyse Iso-Seq data from any organism.« less
A survey of the sorghum transcriptome using single-molecule long reads
Abdel-Ghany, Salah E.; Hamilton, Michael; Jacobi, Jennifer L.; Ngam, Peter; Devitt, Nicholas; Schilkey, Faye; Ben-Hur, Asa; Reddy, Anireddy S. N.
2016-01-01
Alternative splicing and alternative polyadenylation (APA) of pre-mRNAs greatly contribute to transcriptome diversity, coding capacity of a genome and gene regulatory mechanisms in eukaryotes. Second-generation sequencing technologies have been extensively used to analyse transcriptomes. However, a major limitation of short-read data is that it is difficult to accurately predict full-length splice isoforms. Here we sequenced the sorghum transcriptome using Pacific Biosciences single-molecule real-time long-read isoform sequencing and developed a pipeline called TAPIS (Transcriptome Analysis Pipeline for Isoform Sequencing) to identify full-length splice isoforms and APA sites. Our analysis reveals transcriptome-wide full-length isoforms at an unprecedented scale with over 11,000 novel splice isoforms. Additionally, we uncover APA of ∼11,000 expressed genes and more than 2,100 novel genes. These results greatly enhance sorghum gene annotations and aid in studying gene regulation in this important bioenergy crop. The TAPIS pipeline will serve as a useful tool to analyse Iso-Seq data from any organism. PMID:27339290
Mechanisms and Regulation of Alternative Pre-mRNA Splicing
Lee, Yeon
2015-01-01
Precursor messenger RNA (pre-mRNA) splicing is a critical step in the posttranscriptional regulation of gene expression, providing significant expansion of the functional proteome of eukaryotic organisms with limited gene numbers. Split eukaryotic genes contain intervening sequences or introns disrupting protein-coding exons, and intron removal occurs by repeated assembly of a large and highly dynamic ribonucleoprotein complex termed the spliceosome, which is composed of five small nuclear ribonucleoprotein particles, U1, U2, U4/U6, and U5. Biochemical studies over the past 10 years have allowed the isolation as well as compositional, functional, and structural analysis of splicing complexes at distinct stages along the spliceosome cycle. The average human gene contains eight exons and seven introns, producing an average of three or more alternatively spliced mRNA isoforms. Recent high-throughput sequencing studies indicate that 100% of human genes produce at least two alternative mRNA isoforms. Mechanisms of alternative splicing include RNA–protein interactions of splicing factors with regulatory sites termed silencers or enhancers, RNA–RNA base-pairing interactions, or chromatin-based effects that can change or determine splicing patterns. Disease-causing mutations can often occur in splice sites near intron borders or in exonic or intronic RNA regulatory silencer or enhancer elements, as well as in genes that encode splicing factors. Together, these studies provide mechanistic insights into how spliceosome assembly, dynamics, and catalysis occur; how alternative splicing is regulated and evolves; and how splicing can be disrupted by cis- and trans-acting mutations leading to disease states. These findings make the spliceosome an attractive new target for small-molecule, antisense, and genome-editing therapeutic interventions. PMID:25784052
Badr, Eman; ElHefnawi, Mahmoud; Heath, Lenwood S
2016-01-01
Alternative splicing is a vital process for regulating gene expression and promoting proteomic diversity. It plays a key role in tissue-specific expressed genes. This specificity is mainly regulated by splicing factors that bind to specific sequences called splicing regulatory elements (SREs). Here, we report a genome-wide analysis to study alternative splicing on multiple tissues, including brain, heart, liver, and muscle. We propose a pipeline to identify differential exons across tissues and hence tissue-specific SREs. In our pipeline, we utilize the DEXSeq package along with our previously reported algorithms. Utilizing the publicly available RNA-Seq data set from the Human BodyMap project, we identified 28,100 differentially used exons across the four tissues. We identified tissue-specific exonic splicing enhancers that overlap with various previously published experimental and computational databases. A complicated exonic enhancer regulatory network was revealed, where multiple exonic enhancers were found across multiple tissues while some were found only in specific tissues. Putative combinatorial exonic enhancers and silencers were discovered as well, which may be responsible for exon inclusion or exclusion across tissues. Some of the exonic enhancers are found to be co-occurring with multiple exonic silencers and vice versa, which demonstrates a complicated relationship between tissue-specific exonic enhancers and silencers.
Munroe, Stephen H.; Morales, Christopher H.; Duyck, Tessa H.; Waters, Paul D.
2015-01-01
The α-thyroid hormone receptor gene (TRα) codes for two functionally distinct proteins: TRα1, the α-thyroid hormone receptor; and TRα2, a non-hormone-binding variant. The final exon of TRα2 mRNA overlaps the 3’ end of Rev-erbα mRNA, which encodes another nuclear receptor on the opposite strand of DNA. To understand the evolution of this antisense overlap, we sequenced these genes and mRNAs in the platypus Orthorhynchus anatinus. Despite its strong homology with other mammals, the platypus TRα/Rev-erbα locus lacks elements essential for expression of TRα2. Comparative analysis suggests that alternative splicing of TRα2 mRNA expression evolved in a stepwise fashion before the divergence of eutherian and marsupial mammals. A short G-rich element (G30) located downstream of the alternative 3’splice site of TRα2 mRNA and antisense to the 3’UTR of Rev-erbα plays an important role in regulating TRα2 splicing. G30 is tightly conserved in eutherian mammals, but is absent in marsupials and monotremes. Systematic deletions and substitutions within G30 have dramatically different effects on TRα2 splicing, leading to either its inhibition or its enhancement. Mutations that disrupt one or more clusters of G residues enhance splicing two- to three-fold. These results suggest the G30 sequence can adopt a highly structured conformation, possibly a G-quadruplex, and that it is part of a complex splicing regulatory element which exerts both positive and negative effects on TRα2 expression. Since mutations that strongly enhance splicing in vivo have no effect on splicing in vitro, it is likely that the regulatory role of G30 is mediated through linkage of transcription and splicing. PMID:26368571
Genome-wide mapping of alternative splicing in Arabidopsis thaliana
Filichkin, Sergei A.; Priest, Henry D.; Givan, Scott A.; Shen, Rongkun; Bryant, Douglas W.; Fox, Samuel E.; Wong, Weng-Keen; Mockler, Todd C.
2010-01-01
Alternative splicing can enhance transcriptome plasticity and proteome diversity. In plants, alternative splicing can be manifested at different developmental stages, and is frequently associated with specific tissue types or environmental conditions such as abiotic stress. We mapped the Arabidopsis transcriptome at single-base resolution using the Illumina platform for ultrahigh-throughput RNA sequencing (RNA-seq). Deep transcriptome sequencing confirmed a majority of annotated introns and identified thousands of novel alternatively spliced mRNA isoforms. Our analysis suggests that at least ∼42% of intron-containing genes in Arabidopsis are alternatively spliced; this is significantly higher than previous estimates based on cDNA/expressed sequence tag sequencing. Random validation confirmed that novel splice isoforms empirically predicted by RNA-seq can be detected in vivo. Novel introns detected by RNA-seq were substantially enriched in nonconsensus terminal dinucleotide splice signals. Alternative isoforms with premature termination codons (PTCs) comprised the majority of alternatively spliced transcripts. Using an example of an essential circadian clock gene, we show that intron retention can generate relatively abundant PTC+ isoforms and that this specific event is highly conserved among diverse plant species. Alternatively spliced PTC+ isoforms can be potentially targeted for degradation by the nonsense mediated mRNA decay (NMD) surveillance machinery or regulate the level of functional transcripts by the mechanism of regulated unproductive splicing and translation (RUST). We demonstrate that the relative ratios of the PTC+ and reference isoforms for several key regulatory genes can be considerably shifted under abiotic stress treatments. Taken together, our results suggest that like in animals, NMD and RUST may be widespread in plants and may play important roles in regulating gene expression. PMID:19858364
An RNAi-Enhanced Logic Circuit for Cancer Specific Detection and Destruction
2013-02-01
monomeric protein secreted by Corynebacterium diphtheriae, and pro-apoptotic members of Bcl-2 family: mBax (Mus musculus), hBax ( Homo sapiens ), and its...Gata3 mStaple. Intron- feature sequences – donor site, branch point, poly- pyrimidine tract, and acceptor site – were selected based on previously...sequences found in literature our intron features were chosen according SplicePort [4], an online analyzer that detects the likelihood of splicing to
A conserved intronic U1 snRNP-binding sequence promotes trans-splicing in Drosophila
Gao, Jun-Li; Fan, Yu-Jie; Wang, Xiu-Ye; Zhang, Yu; Pu, Jia; Li, Liang; Shao, Wei; Zhan, Shuai; Hao, Jianjiang
2015-01-01
Unlike typical cis-splicing, trans-splicing joins exons from two separate transcripts to produce chimeric mRNA and has been detected in most eukaryotes. Trans-splicing in trypanosomes and nematodes has been characterized as a spliced leader RNA-facilitated reaction; in contrast, its mechanism in higher eukaryotes remains unclear. Here we investigate mod(mdg4), a classic trans-spliced gene in Drosophila, and report that two critical RNA sequences in the middle of the last 5′ intron, TSA and TSB, promote trans-splicing of mod(mdg4). In TSA, a 13-nucleotide (nt) core motif is conserved across Drosophila species and is essential and sufficient for trans-splicing, which binds U1 small nuclear RNP (snRNP) through strong base-pairing with U1 snRNA. In TSB, a conserved secondary structure acts as an enhancer. Deletions of TSA and TSB using the CRISPR/Cas9 system result in developmental defects in flies. Although it is not clear how the 5′ intron finds the 3′ introns, compensatory changes in U1 snRNA rescue trans-splicing of TSA mutants, demonstrating that U1 recruitment is critical to promote trans-splicing in vivo. Furthermore, TSA core-like motifs are found in many other trans-spliced Drosophila genes, including lola. These findings represent a novel mechanism of trans-splicing, in which RNA motifs in the 5′ intron are sufficient to bring separate transcripts into close proximity to promote trans-splicing. PMID:25838544
The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture.
Pai, Athma A; Henriques, Telmo; McCue, Kayla; Burkholder, Adam; Adelman, Karen; Burge, Christopher B
2017-12-27
Production of most eukaryotic mRNAs requires splicing of introns from pre-mRNA. The splicing reaction requires definition of splice sites, which are initially recognized in either intron-spanning ('intron definition') or exon-spanning ('exon definition') pairs. To understand how exon and intron length and splice site recognition mode impact splicing, we measured splicing rates genome-wide in Drosophila , using metabolic labeling/RNA sequencing and new mathematical models to estimate rates. We found that the modal intron length range of 60-70 nt represents a local maximum of splicing rates, but that much longer exon-defined introns are spliced even faster and more accurately. We observed unexpectedly low variation in splicing rates across introns in the same gene, suggesting the presence of gene-level influences, and we identified multiple gene level variables associated with splicing rate. Together our data suggest that developmental and stress response genes may have preferentially evolved exon definition in order to enhance the rate or accuracy of splicing.
Regulation of alternative splicing in Drosophila by 56 RNA binding proteins
Brooks, Angela N.; Duff, Michael O.; May, Gemma; ...
2015-08-20
Alternative splicing is regulated by RNA binding proteins (RBPs) that recognize pre-mRNA sequence elements and activate or repress adjacent exons. Here, we used RNA interference and RNA-seq to identify splicing events regulated by 56 Drosophila proteins, some previously unknown to regulate splicing. Nearly all proteins affected alternative first exons, suggesting that RBPs play important roles in first exon choice. Half of the splicing events were regulated by multiple proteins, demonstrating extensive combinatorial regulation. We observed that SR and hnRNP proteins tend to act coordinately with each other, not antagonistically. We also identified a cross-regulatory network where splicing regulators affected themore » splicing of pre-mRNAs encoding other splicing regulators. In conclusion, this large-scale study substantially enhances our understanding of recent models of splicing regulation and provides a resource of thousands of exons that are regulated by 56 diverse RBPs.« less
Identification of an Intronic Splicing Enhancer Essential for the Inclusion of FGFR2 Exon IIIc*S⃞
Seth, Puneet; Miller, Heather B.; Lasda, Erika L.; Pearson, James L.; Garcia-Blanco, Mariano A.
2008-01-01
The ligand specificity of fibroblast growth factor receptor 2 (FGFR2) is determined by the alternative splicing of exons 8 (IIIb) or 9 (IIIc). Exon IIIb is included in epithelial cells, whereas exon IIIc is included in mesenchymal cells. Although a number of cis elements and trans factors have been identified that play a role in exon IIIb inclusion in epithelium, little is known about the activation of exon IIIc in mesenchyme. We report here the identification of a splicing enhancer required for IIIc inclusion. This 24-nucleotide (nt) downstream intronic splicing enhancer (DISE) is located within intron 9 immediately downstream of exon IIIc. DISE was able to activate the inclusion of heterologous exons rat FGFR2 IIIb and human β-globin exon 2 in cell lines from different tissues and species and also in HeLa cell nuclear extracts in vitro. DISE was capable of replacing the intronic activator sequence 1 (IAS1), a known IIIb splicing enhancer and vice versa. This fact, together with the requirement for DISE to be close to the 5′-splice site and the ability of DISE to promote binding of U1 snRNP, suggested that IAS1 and DISE belong to the same class of cis-acting elements. PMID:18256031
Hayashi, Tetsutaro; Ozaki, Haruka; Sasagawa, Yohei; Umeda, Mana; Danno, Hiroki; Nikaido, Itoshi
2018-02-12
Total RNA sequencing has been used to reveal poly(A) and non-poly(A) RNA expression, RNA processing and enhancer activity. To date, no method for full-length total RNA sequencing of single cells has been developed despite the potential of this technology for single-cell biology. Here we describe random displacement amplification sequencing (RamDA-seq), the first full-length total RNA-sequencing method for single cells. Compared with other methods, RamDA-seq shows high sensitivity to non-poly(A) RNA and near-complete full-length transcript coverage. Using RamDA-seq with differentiation time course samples of mouse embryonic stem cells, we reveal hundreds of dynamically regulated non-poly(A) transcripts, including histone transcripts and long noncoding RNA Neat1. Moreover, RamDA-seq profiles recursive splicing in >300-kb introns. RamDA-seq also detects enhancer RNAs and their cell type-specific activity in single cells. Taken together, we demonstrate that RamDA-seq could help investigate the dynamics of gene expression, RNA-processing events and transcriptional regulation in single cells.
RBFOX and PTBP1 proteins regulate the alternative splicing of micro-exons in human brain transcripts
Sanchez-Pulido, Luis; Haerty, Wilfried
2015-01-01
Ninety-four percent of mammalian protein-coding exons exceed 51 nucleotides (nt) in length. The paucity of micro-exons (≤ 51 nt) suggests that their recognition and correct processing by the splicing machinery present greater challenges than for longer exons. Yet, because thousands of human genes harbor processed micro-exons, specialized mechanisms may be in place to promote their splicing. Here, we survey deep genomic data sets to define 13,085 micro-exons and to study their splicing mechanisms and molecular functions. More than 60% of annotated human micro-exons exhibit a high level of sequence conservation, an indicator of functionality. While most human micro-exons require splicing-enhancing genomic features to be processed, the splicing of hundreds of micro-exons is enhanced by the adjacent binding of splice factors in the introns of pre-messenger RNAs. Notably, splicing of a significant number of micro-exons was found to be facilitated by the binding of RBFOX proteins, which promote their inclusion in the brain, muscle, and heart. Our analyses suggest that accurate regulation of micro-exon inclusion by RBFOX proteins and PTBP1 plays an important role in the maintenance of tissue-specific protein–protein interactions. PMID:25524026
Fu, X Y; Colgan, J D; Manley, J L
1988-01-01
We have determined the effects of a number of mutations in the small-t antigen mRNA intron on the alternative splicing pattern of the simian virus 40 early transcript. Expansion of the distance separating the small-t pre-mRNA lariat branch point and the shared large T-small t 3' splice site from 18 to 29 nucleotides (nt) resulted in a relative enhancement of small-t splicing in vivo. This finding, coupled with the observation that large-T pre-RNA splicing in vitro was not affected by this expansion, suggests that small-t splicing is specifically constrained by a short branch point-3' splice site distance. Similarly, the distance separating the 5' splice site and branch point (48 nt) was found to be at or near a minimum for small-t splicing, because deletions in this region as small as 2 nt dramatically reduced the ratio of small-t to large-T mRNA that accumulated in transfected cells. Finally, a specific sequence within the small-t intron, encompassing the upstream branch sites used in large-T splicing, was found to be an important element in the cell-specific pattern of early alternative splicing. Substitutions within this region reduced the ratio of small-t to large-T mRNA produced in HeLa cells but had only minor effects in human 293 cells. Images PMID:2851720
Collins, Richard A; Stajich, Jason E; Field, Deborah J; Olive, Joan E; DeAbreu, Diane M
2015-05-01
When we expressed a small (0.9 kb) nonprotein-coding transcript derived from the mitochondrial VS plasmid in the nucleus of Neurospora we found that it was efficiently spliced at one or more of eight 5' splice sites and ten 3' splice sites, which are present apparently by chance in the sequence. Further experimental and bioinformatic analyses of other mitochondrial plasmids, random sequences, and natural nuclear genes in Neurospora and other fungi indicate that fungal spliceosomes recognize a wide range of 5' splice site and branchpoint sequences and predict introns to be present at high frequency in random sequence. In contrast, analysis of intronless fungal nuclear genes indicates that branchpoint, 5' splice site and 3' splice site consensus sequences are underrepresented compared with random sequences. This underrepresentation of splicing signals is sufficient to deplete the nuclear genome of splice sites at locations that do not comprise biologically relevant introns. Thus, the splicing machinery can recognize a wide range of splicing signal sequences, but splicing still occurs with great accuracy, not because the splicing machinery distinguishes correct from incorrect introns, but because incorrect introns are substantially depleted from the genome. © 2015 Collins et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture
Pai, Athma A; Henriques, Telmo; McCue, Kayla; Burkholder, Adam; Adelman, Karen
2017-01-01
Production of most eukaryotic mRNAs requires splicing of introns from pre-mRNA. The splicing reaction requires definition of splice sites, which are initially recognized in either intron-spanning (‘intron definition’) or exon-spanning (‘exon definition’) pairs. To understand how exon and intron length and splice site recognition mode impact splicing, we measured splicing rates genome-wide in Drosophila, using metabolic labeling/RNA sequencing and new mathematical models to estimate rates. We found that the modal intron length range of 60–70 nt represents a local maximum of splicing rates, but that much longer exon-defined introns are spliced even faster and more accurately. We observed unexpectedly low variation in splicing rates across introns in the same gene, suggesting the presence of gene-level influences, and we identified multiple gene level variables associated with splicing rate. Together our data suggest that developmental and stress response genes may have preferentially evolved exon definition in order to enhance the rate or accuracy of splicing. PMID:29280736
The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture
Pai, Athma A.; Henriques, Telmo; McCue, Kayla; ...
2017-12-27
Production of most eukaryotic mRNAs requires splicing of introns from pre-mRNA. The splicing reaction requires definition of splice sites, which are initially recognized in either intron-spanning (‘intron definition’) or exon-spanning (‘exon definition’) pairs. To understand how exon and intron length and splice site recognition mode impact splicing, we measured splicing rates genome-wide in Drosophila, using metabolic labeling/RNA sequencing and new mathematical models to estimate rates. We found that the modal intron length range of 60–70 nt represents a local maximum of splicing rates, but that much longer exon-defined introns are spliced even faster and more accurately. We observed unexpectedly lowmore » variation in splicing rates across introns in the same gene, suggesting the presence of gene-level influences, and we identified multiple gene level variables associated with splicing rate. Together our data suggest that developmental and stress response genes may have preferentially evolved exon definition in order to enhance the rate or accuracy of splicing.« less
The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pai, Athma A.; Henriques, Telmo; McCue, Kayla
Production of most eukaryotic mRNAs requires splicing of introns from pre-mRNA. The splicing reaction requires definition of splice sites, which are initially recognized in either intron-spanning (‘intron definition’) or exon-spanning (‘exon definition’) pairs. To understand how exon and intron length and splice site recognition mode impact splicing, we measured splicing rates genome-wide in Drosophila, using metabolic labeling/RNA sequencing and new mathematical models to estimate rates. We found that the modal intron length range of 60–70 nt represents a local maximum of splicing rates, but that much longer exon-defined introns are spliced even faster and more accurately. We observed unexpectedly lowmore » variation in splicing rates across introns in the same gene, suggesting the presence of gene-level influences, and we identified multiple gene level variables associated with splicing rate. Together our data suggest that developmental and stress response genes may have preferentially evolved exon definition in order to enhance the rate or accuracy of splicing.« less
Homologous SV40 RNA trans-splicing
Eul, Joachim; Patzel, Volker
2013-01-01
Simian Virus 40 (SV40) is a polyomavirus found in both monkeys and humans, which causes cancer in some animal models. In humans, SV40 has been reported to be associated with cancers but causality has not been proven yet. The transforming activity of SV40 is mainly due to its 94-kD large T antigen, which binds to the retinoblastoma (pRb) and p53 tumor suppressor proteins, and thereby perturbs their functions. Here we describe a 100 kD super T antigen harboring a duplication of the pRB binding domain that was associated with unusual high cell transformation activity and that was generated by a novel mechanism involving homologous RNA trans-splicing of SV40 early transcripts in transformed rodent cells. Enhanced trans-splice activity was observed in clones carrying a single point mutation in the large T antigen 5′ donor splice site (ss). This mutation impaired cis-splicing in favor of an alternative trans-splice reaction via a cryptic 5′ss within a second cis-spliced SV40 pre-mRNA molecule and enabled detectable gene expression. Next to the cryptic 5′ss we identified additional trans-splice helper functions, including putative dimerization domains and a splice enhancer sequence. Our findings suggest RNA trans-splicing as a SV40-intrinsic mechanism that supports the diversification of viral RNA and phenotypes. PMID:24178438
Purifying Selection on Exonic Splice Enhancers in Intronless Genes
Savisaar, Rosina; Hurst, Laurence D.
2016-01-01
Exonic splice enhancers (ESEs) are short nucleotide motifs, enriched near exon ends, that enhance the recognition of the splice site and thus promote splicing. Are intronless genes under selection to avoid these motifs so as not to attract the splicing machinery to an mRNA that should not be spliced, thereby preventing the production of an aberrant transcript? Consistent with this possibility, we find that ESEs in putative recent retrocopies are at a higher density and evolving faster than those in other intronless genes, suggesting that they are being lost. Moreover, intronless genes are less dense in putative ESEs than intron-containing ones. However, this latter difference is likely due to the skewed base composition of intronless sequences, a skew that is in line with the general GC richness of few exon genes. Indeed, after controlling for such biases, we find that both intronless and intron-containing genes are denser in ESEs than expected by chance. Importantly, nucleotide-controlled analysis of evolutionary rates at synonymous sites in ESEs indicates that the ESEs in intronless genes are under purifying selection in both human and mouse. We conclude that on the loss of introns, some but not all, ESE motifs are lost, the remainder having functions beyond a role in splice promotion. These results have implications for the design of intronless transgenes and for understanding the causes of selection on synonymous sites. PMID:26802218
Spliced RNA of woodchuck hepatitis virus.
Ogston, C W; Razman, D G
1992-07-01
Polymerase chain reaction was used to investigate RNA splicing in liver of woodchucks infected with woodchuck hepatitis virus (WHV). Two spliced species were detected, and the splice junctions were sequenced. The larger spliced RNA has an intron of 1300 nucleotides, and the smaller spliced sequence shows an additional downstream intron of 1104 nucleotides. We did not detect singly spliced sequences from which the smaller intron alone was removed. Control experiments showed that spliced sequences are present in both RNA and DNA in infected liver, showing that the viral reverse transcriptase can use spliced RNA as template. Spliced sequences were detected also in virion DNA prepared from serum. The upstream intron produces a reading frame that fuses the core to the polymerase polypeptide, while the downstream intron causes an inframe deletion in the polymerase open reading frame. Whereas the splicing patterns in WHV are superficially similar to those reported recently in hepatitis B virus, we detected no obvious homology in the coding capacity of spliced RNAs from these two viruses.
Li, Yang I; Sanchez-Pulido, Luis; Haerty, Wilfried; Ponting, Chris P
2015-01-01
Ninety-four percent of mammalian protein-coding exons exceed 51 nucleotides (nt) in length. The paucity of micro-exons (≤ 51 nt) suggests that their recognition and correct processing by the splicing machinery present greater challenges than for longer exons. Yet, because thousands of human genes harbor processed micro-exons, specialized mechanisms may be in place to promote their splicing. Here, we survey deep genomic data sets to define 13,085 micro-exons and to study their splicing mechanisms and molecular functions. More than 60% of annotated human micro-exons exhibit a high level of sequence conservation, an indicator of functionality. While most human micro-exons require splicing-enhancing genomic features to be processed, the splicing of hundreds of micro-exons is enhanced by the adjacent binding of splice factors in the introns of pre-messenger RNAs. Notably, splicing of a significant number of micro-exons was found to be facilitated by the binding of RBFOX proteins, which promote their inclusion in the brain, muscle, and heart. Our analyses suggest that accurate regulation of micro-exon inclusion by RBFOX proteins and PTBP1 plays an important role in the maintenance of tissue-specific protein-protein interactions. © 2015 Li et al.; Published by Cold Spring Harbor Laboratory Press.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Oestberg, Sara, E-mail: sara.ostberg@imbim.uu.se; Toermaenen Persson, Heidi, E-mail: heidi.tormanen.persson@imbim.uu.se; Akusjaervi, Goeran, E-mail: goran.akusjarvi@imbim.uu.se
2012-11-25
The adenovirus L4-33K protein is a key regulator involved in the temporal shift from early to late pattern of mRNA expression from the adenovirus major late transcription unit. L4-33K is a virus-encoded alternative splicing factor, which enhances processing of 3 Prime splice sites with a weak sequence context. Here we show that L4-33K expressed from a plasmid is localized at the nuclear margin of uninfected cells. During an infection L4-33K is relocalized to the periphery of E2A-72K containing viral replication centers. We also show that serine 192 in the tiny RS repeat of the conserved carboxy-terminus of L4-33K, which ismore » critical for the splicing enhancer function of L4-33K, is necessary for the nuclear localization and redistribution of the protein to viral replication sites. Collectively, our results show a good correlation between the activity of L4-33K as a splicing enhancer protein and its localization to the periphery of viral replication centers.« less
A mechanism for exon skipping caused by nonsense or missense mutations in BRCA1 and other genes.
Liu, H X; Cartegni, L; Zhang, M Q; Krainer, A R
2001-01-01
Point mutations can generate defective and sometimes harmful proteins. The nonsense-mediated mRNA decay (NMD) pathway minimizes the potential damage caused by nonsense mutations. In-frame nonsense codons located at a minimum distance upstream of the last exon-exon junction are recognized as premature termination codons (PTCs), targeting the mRNA for degradation. Some nonsense mutations cause skipping of one or more exons, presumably during pre-mRNA splicing in the nucleus; this phenomenon is termed nonsense-mediated altered splicing (NAS), and its underlying mechanism is unclear. By analyzing NAS in BRCA1, we show here that inappropriate exon skipping can be reproduced in vitro, and results from disruption of a splicing enhancer in the coding sequence. Enhancers can be disrupted by single nonsense, missense and translationally silent point mutations, without recognition of an open reading frame as such. These results argue against a nuclear reading-frame scanning mechanism for NAS. Coding-region single-nucleotide polymorphisms (cSNPs) within exonic splicing enhancers or silencers may affect the patterns or efficiency of mRNA splicing, which may in turn cause phenotypic variability and variable penetrance of mutations elsewhere in a gene.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ponthier, Julie L.; Schluepen, Christina; Chen, Weiguo
Activation of protein 4.1R exon 16 (E16) inclusion during erythropoiesis represents a physiologically important splicing switch that increases 4.1R affinity for spectrin and actin. Previous studies showed that negative regulation of E16 splicing is mediated by the binding of hnRNP A/B proteins to silencer elements in the exon and that downregulation of hnRNP A/B proteins in erythroblasts leads to activation of E16 inclusion. This paper demonstrates that positive regulation of E16 splicing can be mediated by Fox-2 or Fox-1, two closely related splicing factors that possess identical RNA recognition motifs. SELEX experiments with human Fox-1 revealed highly selective binding tomore » the hexamer UGCAUG. Both Fox-1 and Fox-2 were able to bind the conserved UGCAUG elements in the proximal intron downstream of E16, and both could activate E16 splicing in HeLa cell co-transfection assays in a UGCAUG-dependent manner. Conversely, knockdown of Fox-2 expression, achieved with two different siRNA sequences resulted in decreased E16 splicing. Moreover, immunoblot experiments demonstrate mouse erythroblasts express Fox-2, but not Fox-1. These findings suggest that Fox-2 is a physiological activator of E16 splicing in differentiating erythroid cells in vivo. Recent experiments show that UGCAUG is present in the proximal intron sequence of many tissue-specific alternative exons, and we propose that the Fox family of splicing enhancers plays an important role in alternative splicing switches during differentiation in metazoan organisms.« less
Sadek, Jouliana
2016-01-01
ABSTRACT During lytic herpes simplex virus (HSV) infections, the virion host shutoff (Vhs) (UL41) endoribonuclease degrades many cellular and viral mRNAs. In uninfected cells, spliced mRNAs emerge into the cytoplasm bound by exon junction complexes (EJCs) and are translated several times more efficiently than unspliced mRNAs that have the same sequence but lack EJCs. Notably, most cellular mRNAs are spliced, whereas most HSV mRNAs are not. To examine the effect of splicing on gene expression during HSV infection, cells were transfected with plasmids harboring an unspliced renilla luciferase (RLuc) reporter mRNA or RLuc constructs with introns near the 5′ or 3′ end of the gene. After splicing of intron-containing transcripts, all three RLuc mRNAs had the same primary sequence. Upon infection in the presence of actinomycin D, spliced mRNAs were much less sensitive to degradation by copies of Vhs from infecting virions than were unspliced mRNAs. During productive infections (in the absence of drugs), RLuc was expressed at substantially higher levels from spliced than from unspliced mRNAs. Interestingly, the stimulatory effect of splicing on RLuc expression was significantly greater in infected than in uninfected cells. The translational stimulatory effect of an intron during HSV-1 infections could be replicated by artificially tethering various EJC components to an unspliced RLuc transcript. Thus, the splicing history of an mRNA, and the consequent presence or absence of EJCs, affects its level of translation and sensitivity to Vhs cleavage during lytic HSV infections. IMPORTANCE Most mammalian mRNAs are spliced. In contrast, of the more than 80 mRNAs harbored by herpes simplex virus 1 (HSV-1), only 5 are spliced. In addition, synthesis of the immediate early protein ICP27 causes partial inhibition of pre-mRNA splicing, with the resultant accumulation of both spliced and unspliced versions of some mRNAs in the cytoplasm. A common perception is that HSV-1 infection necessarily inhibits the expression of spliced mRNAs. In contrast, this study demonstrates two instances in which pre-mRNA splicing actually enhances the synthesis of proteins from mRNAs during HSV-1 infections. Specifically, splicing stabilized an mRNA against degradation by copies of the Vhs endoribonuclease from infecting virions and greatly enhanced the amount of protein synthesized from spliced mRNAs at late times after infection. The data suggest that splicing, and the resultant presence of exon junction complexes on an mRNA, may play an important role in gene expression during HSV-1 infections. PMID:27681125
Zhu, Fu-Yuan; Chen, Mo-Xian; Ye, Neng-Hui; Shi, Lu; Ma, Kai-Long; Yang, Jing-Fang; Cao, Yun-Ying; Zhang, Youjun; Yoshida, Takuya; Fernie, Alisdair R; Fan, Guang-Yi; Wen, Bo; Zhou, Ruo; Liu, Tie-Yuan; Fan, Tao; Gao, Bei; Zhang, Di; Hao, Ge-Fei; Xiao, Shi; Liu, Ying-Gao; Zhang, Jianhua
2017-08-01
In eukaryotes, mechanisms such as alternative splicing (AS) and alternative translation initiation (ATI) contribute to organismal protein diversity. Specifically, splicing factors play crucial roles in responses to environment and development cues; however, the underlying mechanisms are not well investigated in plants. Here, we report the parallel employment of short-read RNA sequencing, single molecule long-read sequencing and proteomic identification to unravel AS isoforms and previously unannotated proteins in response to abscisic acid (ABA) treatment. Combining the data from the two sequencing methods, approximately 83.4% of intron-containing genes were alternatively spliced. Two AS types, which are referred to as alternative first exon (AFE) and alternative last exon (ALE), were more abundant than intron retention (IR); however, by contrast to AS events detected under normal conditions, differentially expressed AS isoforms were more likely to be translated. ABA extensively affects the AS pattern, indicated by the increasing number of non-conventional splicing sites. This work also identified thousands of unannotated peptides and proteins by ATI based on mass spectrometry and a virtual peptide library deduced from both strands of coding regions within the Arabidopsis genome. The results enhance our understanding of AS and alternative translation mechanisms under normal conditions, and in response to ABA treatment. © 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.
Ge, H; Noble, J; Colgan, J; Manley, J L
1990-01-01
We have studied splicing of the polyoma virus early region pre-mRNA in vitro. This RNA is alternatively spliced in vivo to produce mRNA encoding the large, middle-sized (MTAg), and small (StAg) tumor antigens. Our primary interest was to learn how the 48-nucleotide StAg intron is excised, because the length of this intron is significantly less than the apparent minimum established for mammalian introns. Although the products of all three splices are detected in vitro, characterization of the pathway and sequence requirements of StAg splicing suggests that splicing factors interact with the precursor RNA in an unexpected way to catalyze removal of this intron. Specifically, StAg splicing uses either of two lariat branch points, one of which is located only 4 nucleotides from the 3' splice site. Furthermore, the StAg splice absolutely requires that the alternative MTAg 3' splice site, located 14 nucleotides downstream of the StAg 3' splice site, be intact. Insertion mutations that increase or decrease the quality of the MTAg pyrimidine stretch enhance or repress StAg as well as MTAg splicing, and a single-base change in the MTAg AG splice acceptor totally blocks both splices. These results demonstrate the ability of two 3' splice sites to cooperate with each other to bring about removal of a single intron. Images PMID:2159146
hnRNP L regulates differences in expression of mouse integrin alpha2beta1.
Cheli, Yann; Kunicki, Thomas J
2006-06-01
There is a 2-fold variation in platelet integrin alpha2beta1 levels among inbred mouse strains. Decreased alpha2beta1 in 4 strains carrying Itga2 haplotype 2 results from decreased affinity of heterogeneous ribonucleoprotein L (hnRNP L) for a 6 CA repeat sequence (CA6) within intron 1. Seven strains bearing haplotype 1 and a 21 CA repeat sequence at this position (CA21) express twice the level of platelet alpha2beta1 and exhibit an equivalent gain of platelet function in vitro. By UV crosslinking and immunoprecipitation, hnRNP L binds more avidly to CA21, relative to CA6. By cell-free, in vitro mRNA splicing, decreased binding of hnRNP L results in decreased splicing efficiency and an increased proportion of alternatively spliced product. The splicing enhancer activity of CA21 in vivo is abolished by prior treatment with hnRNP L-specific siRNA. Thus, decreased surface alpha2beta1 results from decreased Itga2 pre-mRNA splicing regulated by hnRNP L and depends on CA repeat length at a specific site in intron 1.
hnRNP L regulates differences in expression of mouse integrin α2β1
Cheli, Yann; Kunicki, Thomas J.
2006-01-01
There is a 2-fold variation in platelet integrin α2β1 levels among inbred mouse strains. Decreased α2β1 in 4 strains carrying Itga2 haplotype 2 results from decreased affinity of heterogeneous ribonucleoprotein L (hnRNP L) for a 6 CA repeat sequence (CA6) within intron 1. Seven strains bearing haplotype 1 and a 21 CA repeat sequence at this position (CA21) express twice the level of platelet α2β1 and exhibit an equivalent gain of platelet function in vitro. By UV crosslinking and immunoprecipitation, hnRNP L binds more avidly to CA21, relative to CA6. By cell-free, in vitro mRNA splicing, decreased binding of hnRNP L results in decreased splicing efficiency and an increased proportion of alternatively spliced product. The splicing enhancer activity of CA21 in vivo is abolished by prior treatment with hnRNP L–specific siRNA. Thus, decreased surface α2β1 results from decreased Itga2 pre-mRNA splicing regulated by hnRNP L and depends on CA repeat length at a specific site in intron 1. PMID:16455949
Pettigrew, Christopher; Wayte, Nicola; Lovelock, Paul K; Tavtigian, Sean V; Chenevix-Trench, Georgia; Spurdle, Amanda B; Brown, Melissa A
2005-01-01
Introduction Aberrant pre-mRNA splicing can be more detrimental to the function of a gene than changes in the length or nature of the encoded amino acid sequence. Although predicting the effects of changes in consensus 5' and 3' splice sites near intron:exon boundaries is relatively straightforward, predicting the possible effects of changes in exonic splicing enhancers (ESEs) remains a challenge. Methods As an initial step toward determining which ESEs predicted by the web-based tool ESEfinder in the breast cancer susceptibility gene BRCA1 are likely to be functional, we have determined their evolutionary conservation and compared their location with known BRCA1 sequence variants. Results Using the default settings of ESEfinder, we initially detected 669 potential ESEs in the coding region of the BRCA1 gene. Increasing the threshold score reduced the total number to 464, while taking into consideration the proximity to splice donor and acceptor sites reduced the number to 211. Approximately 11% of these ESEs (23/211) either are identical at the nucleotide level in human, primates, mouse, cow, dog and opossum Brca1 (conserved) or are detectable by ESEfinder in the same position in the Brca1 sequence (shared). The frequency of conserved and shared predicted ESEs between human and mouse is higher in BRCA1 exons (2.8 per 100 nucleotides) than in introns (0.6 per 100 nucleotides). Of conserved or shared putative ESEs, 61% (14/23) were predicted to be affected by sequence variants reported in the Breast Cancer Information Core database. Applying the filters described above increased the colocalization of predicted ESEs with missense changes, in-frame deletions and unclassified variants predicted to be deleterious to protein function, whereas they decreased the colocalization with known polymorphisms or unclassified variants predicted to be neutral. Conclusion In this report we show that evolutionary conservation analysis may be used to improve the specificity of an ESE prediction tool. This is the first report on the prediction of the frequency and distribution of ESEs in the BRCA1 gene, and it is the first reported attempt to predict which ESEs are most likely to be functional and therefore which sequence variants in ESEs are most likely to be pathogenic. PMID:16280041
Pseudoexon activation increases phenotype severity in a Becker muscular dystrophy patient.
Greer, Kane; Mizzi, Kayla; Rice, Emily; Kuster, Lukas; Barrero, Roberto A; Bellgard, Matthew I; Lynch, Bryan J; Foley, Aileen Reghan; O Rathallaigh, Eoin; Wilton, Steve D; Fletcher, Sue
2015-07-01
We report a dystrophinopathy patient with an in-frame deletion of DMD exons 45-47, and therefore a genetic diagnosis of Becker muscular dystrophy, who presented with a more severe than expected phenotype. Analysis of the patient DMD mRNA revealed an 82 bp pseudoexon, derived from intron 44, that disrupts the reading frame and is expected to yield a nonfunctional dystrophin. Since the sequence of the pseudoexon and canonical splice sites does not differ from the reference sequence, we concluded that the genomic rearrangement promoted recognition of the pseudoexon, causing a severe dystrophic phenotype. We characterized the deletion breakpoints and identified motifs that might influence selection of the pseudoexon. We concluded that the donor splice site was strengthened by juxtaposition of intron 47, and loss of intron 44 silencer elements, normally located downstream of the pseudoexon donor splice site, further enhanced pseudoexon selection and inclusion in the DMD transcript in this patient.
[Deregulation of pre-messenger RNA splicing and rare diseases].
de la Grange, Pierre
2016-12-01
Most of protein-coding human genes are subjected to alternative pre-mRNA splicing. This mechanism is highly regulated to precisely modulate detection of specific splice sites. This regulation is under control of the spliceosome and several splicing factors are also required to modulate the alternative usage of splice sites. Splicing factors and spliceosome components recognize splicing signals and regulatory sequences of the pre-mRNAs. These splicing sequences make splicing susceptible to polymorphisms and mutations. Examples of associations between human rare diseases and defects in pre-messenger RNA splicing are accumulating. Although many alterations are caused by mutations in splicing sequence (i.e., cis acting mutations), recent studies described the disruptive impact of mutations within spliceosome components or splicing factors (i.e., trans acting mutations). Following growing of knowledge regarding splicing regulation, several approaches have been developed to compensate for the effect of deleterious mutations and to restore sufficient amounts of functional protein. © 2016 médecine/sciences – Inserm.
Dong, Qiongye; Wei, Lei; Zhang, Michael Q; Wang, Xiaowo
2018-06-24
Dysregulation of mRNA splicing has been observed in certain cellular senescence process. However, the common splicing alterations on the whole transcriptome shared by various types of senescence are poorly understood. In order to systematically identify senescence-associated transcriptomic changes in genome-wide scale, we collected RNA sequencing datasets of different human cell types with a variety of senescence-inducing methods from public databases and performed meta-analysis. First, we discovered that a group of RNA binding proteins were consistently down-regulated in diverse senescent samples and identified 406 senescence-associated common differential splicing events. Then, eight differentially expressed RNA binding proteins were predicted to regulate these senescence-associated splicing alterations through an enrichment analysis of their RNA binding information, including motif scanning and enhanced cross-linking immunoprecipitation data. In addition, we constructed the splicing regulatory modules that might contribute to senescence-associated biological processes. Finally, it was confirmed that knockdown of the predicted senescence-associated potential splicing regulators through shRNAs in HepG2 cell line could result in senescence-like splicing changes. Taken together, our work demonstrated a broad range of common changes in mRNA splicing switches and detected their central regulatory RNA binding proteins during senescence. These findings would help to better understand the coordinating splicing alterations in cellular senescence.
Bae, Yun Mi; Kim, Myung Hee; Yu, Gwang Sig; Um, Bong Ho; Park, Hee Kyung; Lee, Hyun-il; Lee, Kang Taek; Suh, Yung Doug; Choi, Joon Sig
2014-02-10
Peptide nucleic acids (PNAs) are synthetic structural analogues of DNA and RNA. They recognize specific cellular nucleic acid sequences and form stable complexes with complementary DNA or RNA. Here, we designed an oligo-aspartic acid-PNA conjugate and showed its enhanced delivery into cells with high gene correction efficiency using conventional cationic carriers, such as polyethylenimine (PEI) and Lipofectamine 2000. The negatively charged oligo-aspartic acid-PNA (Asp(n)-PNA) formed complexes with PEI and Lipofectamine, and the resulting Asp(n)-PNA/PEI and Asp(n)-PNA/Lipofectamine complexes were introduced into cells. We observed significantly enhanced cellular uptake of Asp(n)-PNA by cationic carriers and detected an active splicing correction effect even at nanomolar concentrations. We found that the splicing correction efficiency of the complex depended on the kind of the cationic carriers and on the number of repeating aspartic acid units. By enhancing the cellular uptake efficiency of PNAs, these results may provide a novel platform technology of PNAs as bioactive substances for their biological and therapeutic applications. Copyright © 2013 Elsevier B.V. All rights reserved.
Language study on Spliced Semigraph using Folding techniques
NASA Astrophysics Data System (ADS)
Thiagarajan, K.; Padmashree, J.
2018-04-01
In this paper, we proposed algorithm to identify cut vertices and cut edges for n-Cut Spliced Semigraph and splicing the n-Cut Spliced Semigraph using cut vertices else cut edges or combination of cut vertex and cut edge and applying sequence of folding to the spliced semigraph to obtain the semigraph quadruple η(S)=(2, 1, 1, 1). We observed that the splicing and folding using both cut vertices and cut edges is applicable only for n-Cut Spliced Semigraph where n > 2. Also, we transformed the spliced semigraph into tree structure and studied the language for the semigraph with n+2 vertices and n+1 semivertices using Depth First Edge Sequence algorithm and obtain the language structure with sequence of alphabet ‘a’ and ‘b’.
Rogan, P K; Schneider, T D
1995-01-01
Predicting the effects of nucleotide substitutions in human splice sites has been based on analysis of consensus sequences. We used a graphic representation of sequence conservation and base frequency, the sequence logo, to demonstrate that a change in a splice acceptor of hMSH2 (a gene associated with familial nonpolyposis colon cancer) probably does not reduce splicing efficiency. This confirms a population genetic study that suggested that this substitution is a genetic polymorphism. The information theory-based sequence logo is quantitative and more sensitive than the corresponding splice acceptor consensus sequence for detection of true mutations. Information analysis may potentially be used to distinguish polymorphisms from mutations in other types of transcriptional, translational, or protein-coding motifs.
Splicing of designer exons informs a biophysical model for exon definition
Arias, Mauricio A.; Chasin, Lawrence A.
2015-01-01
Pre-mRNA molecules in humans contain mostly short internal exons flanked by longer introns. To explain the removal of such introns, exon recognition instead of intron recognition has been proposed. We studied this exon definition using designer exons (DEs) made up of three prototype modules of our own design: an exonic splicing enhancer (ESE), an exonic splicing silencer (ESS), and a Reference Sequence (R) predicted to be neither. Each DE was examined as the central exon in a three-exon minigene. DEs made of R modules showed a sharp size dependence, with exons shorter than 14 nt and longer than 174 nt splicing poorly. Changing the strengths of the splice sites improved longer exon splicing but worsened shorter exon splicing, effectively displacing the curve to the right. For the ESE we found, unexpectedly, that its enhancement efficiency was independent of its position within the exon. For the ESS we found a step-wise positional increase in its effects; it was most effective at the 3′ end of the exon. To apply these results quantitatively, we developed a biophysical model for exon definition of internal exons undergoing cotranscriptional splicing. This model features commitment to inclusion before the downstream exon is synthesized and competition between skipping and inclusion fates afterward. Collision of both exon ends to form an exon definition complex was incorporated to account for the effect of size; ESE/ESS effects were modeled on the basis of stabilization/destabilization. This model accurately predicted the outcome of independent experiments on more complex DEs that combined ESEs and ESSs. PMID:25492963
Khoo, Bernard; Roca, Xavier; Chew, Shern L; Krainer, Adrian R
2007-01-17
Apolipoprotein B (APOB) is an integral part of the LDL, VLDL, IDL, Lp(a) and chylomicron lipoprotein particles. The APOB pre-mRNA consists of 29 constitutively-spliced exons. APOB exists as two natural isoforms: the full-length APOB100 isoform, assembled into LDL, VLDL, IDL and Lp(a) and secreted by the liver in humans; and the C-terminally truncated APOB48, assembled into chylomicrons and secreted by the intestine in humans. Down-regulation of APOB100 is a potential therapy to lower circulating LDL and cholesterol levels. We investigated the ability of 2'O-methyl RNA antisense oligonucleotides (ASOs) to induce the skipping of exon 27 in endogenous APOB mRNA in HepG2 cells. These ASOs are directed towards the 5' and 3' splice-sites of exon 27, the branch-point sequence (BPS) of intron 26-27 and several predicted exonic splicing enhancers within exon 27. ASOs targeting either the 5' or 3' splice-site, in combination with the BPS, are the most effective. The splicing of other alternatively spliced genes are not influenced by these ASOs, suggesting that the effects seen are not due to non-specific changes in alternative splicing. The skip 27 mRNA is translated into a truncated isoform, APOB87SKIP27. The induction of APOB87SKIP27 expression in vivo should lead to decreased LDL and cholesterol levels, by analogy to patients with hypobetalipoproteinemia. As intestinal APOB mRNA editing and APOB48 expression rely on sequences within exon 26, exon 27 skipping should not affect APOB48 expression unlike other methods of down-regulating APOB100 expression which also down-regulate APOB48.
A role for exon sequences in alternative splicing of the human fibronectin gene.
Mardon, H J; Sebastio, G; Baralle, F E
1987-01-01
Exon EDIIIA of the fibronectin (Fn) gene is alternatively spliced via pathways which either skip or include the whole exon in the messenger RNA (mRNA). We have investigated the role of EDIIIA exon sequences in the human Fn gene in determining alternative splicing of this exon during transient expression of alpha globin/Fn minigene hybrids in HeLa cells. We demonstrate that a DNA sequence of 81bp within the central region of exon EDIIIA is required for alternative splicing during processing of the primary transcript to generate both EDIIIA+ and EDIIIA- mRNA's. Furthermore, alternative splicing of EDIIIA only occurs when this sequence is present in the correct orientation since when it is in antisense orientation splicing always occurs via exon-skipping generating EDIIIA- mRNA. Images PMID:3671064
Survey of gene splicing algorithms based on reads.
Si, Xiuhua; Wang, Qian; Zhang, Lei; Wu, Ruo; Ma, Jiquan
2017-11-02
Gene splicing is the process of assembling a large number of unordered short sequence fragments to the original genome sequence as accurately as possible. Several popular splicing algorithms based on reads are reviewed in this article, including reference genome algorithms and de novo splicing algorithms (Greedy-extension, Overlap-Layout-Consensus graph, De Bruijn graph). We also discuss a new splicing method based on the MapReduce strategy and Hadoop. By comparing these algorithms, some conclusions are drawn and some suggestions on gene splicing research are made.
Unusual Intron Conservation near Tissue-Regulated Exons Found by Splicing Microarrays
Sugnet, Charles W; Srinivasan, Karpagam; Clark, Tyson A; O'Brien, Georgeann; Cline, Melissa S; Wang, Hui; Williams, Alan; Kulp, David; Blume, John E; Haussler, David; Ares, Manuel
2006-01-01
Alternative splicing contributes to both gene regulation and protein diversity. To discover broad relationships between regulation of alternative splicing and sequence conservation, we applied a systems approach, using oligonucleotide microarrays designed to capture splicing information across the mouse genome. In a set of 22 adult tissues, we observe differential expression of RNA containing at least two alternative splice junctions for about 40% of the 6,216 alternative events we could detect. Statistical comparisons identify 171 cassette exons whose inclusion or skipping is different in brain relative to other tissues and another 28 exons whose splicing is different in muscle. A subset of these exons is associated with unusual blocks of intron sequence whose conservation in vertebrates rivals that of protein-coding exons. By focusing on sets of exons with similar regulatory patterns, we have identified new sequence motifs implicated in brain and muscle splicing regulation. Of note is a motif that is strikingly similar to the branchpoint consensus but is located downstream of the 5′ splice site of exons included in muscle. Analysis of three paralogous membrane-associated guanylate kinase genes reveals that each contains a paralogous tissue-regulated exon with a similar tissue inclusion pattern. While the intron sequences flanking these exons remain highly conserved among mammalian orthologs, the paralogous flanking intron sequences have diverged considerably, suggesting unusually complex evolution of the regulation of alternative splicing in multigene families. PMID:16424921
Alternative Splicing as a Target for Cancer Treatment.
Martinez-Montiel, Nancy; Rosas-Murrieta, Nora Hilda; Anaya Ruiz, Maricruz; Monjaraz-Guzman, Eduardo; Martinez-Contreras, Rebeca
2018-02-11
Alternative splicing is a key mechanism determinant for gene expression in metazoan. During alternative splicing, non-coding sequences are removed to generate different mature messenger RNAs due to a combination of sequence elements and cellular factors that contribute to splicing regulation. A different combination of splicing sites, exonic or intronic sequences, mutually exclusive exons or retained introns could be selected during alternative splicing to generate different mature mRNAs that could in turn produce distinct protein products. Alternative splicing is the main source of protein diversity responsible for 90% of human gene expression, and it has recently become a hallmark for cancer with a full potential as a prognostic and therapeutic tool. Currently, more than 15,000 alternative splicing events have been associated to different aspects of cancer biology, including cell proliferation and invasion, apoptosis resistance and susceptibility to different chemotherapeutic drugs. Here, we present well established and newly discovered splicing events that occur in different cancer-related genes, their modification by several approaches and the current status of key tools developed to target alternative splicing with diagnostic and therapeutic purposes.
Nuzzo, F; Bulato, C; Nielsen, B I; Lee, K; Wielders, S J; Simioni, P; Key, N S; Castoldi, E
2015-03-01
Coagulation factor V (FV) deficiency is a rare autosomal recessive bleeding disorder. We investigated a patient with severe FV deficiency (FV:C < 3%) and moderate bleeding symptoms. Thrombin generation experiments showed residual FV expression in the patient's plasma, which was quantified as 0.7 ± 0.3% by a sensitive prothrombinase-based assay. F5 gene sequencing identified a novel missense mutation in exon 4 (c.578G>C, p.Cys193Ser), predicting the abolition of a conserved disulphide bridge, and an apparently synonymous variant in exon 8 (c.1281C>G). The observation that half of the patient's F5 mRNA lacked the last 18 nucleotides of exon 8 prompted us to re-evaluate the c.1281C>G variant for its possible effects on splicing. Bioinformatics sequence analysis predicted that this transversion would activate a cryptic donor splice site and abolish an exonic splicing enhancer. Characterization in a F5 minigene model confirmed that the c.1281C>G variant was responsible for the patient's splicing defect, which could be partially corrected by a mutation-specific morpholino antisense oligonucleotide. The aberrantly spliced F5 mRNA, whose stability was similar to that of the normal mRNA, encoded a putative FV mutant lacking amino acids 427-432. Expression in COS-1 cells indicated that the mutant protein is poorly secreted and not functional. In conclusion, the c.1281C>G mutation, which was predicted to be translationally silent and hence neutral, causes FV deficiency by impairing pre-mRNA splicing. This finding underscores the importance of cDNA analysis for the correct assessment of exonic mutations. © 2014 John Wiley & Sons Ltd.
Takeda, Jun-ichi; Suzuki, Yutaka; Nakao, Mitsuteru; Barrero, Roberto A.; Koyanagi, Kanako O.; Jin, Lihua; Motono, Chie; Hata, Hiroko; Isogai, Takao; Nagai, Keiichi; Otsuki, Tetsuji; Kuryshev, Vladimir; Shionyu, Masafumi; Yura, Kei; Go, Mitiko; Thierry-Mieg, Jean; Thierry-Mieg, Danielle; Wiemann, Stefan; Nomura, Nobuo; Sugano, Sumio; Gojobori, Takashi; Imanishi, Tadashi
2006-01-01
We report the first genome-wide identification and characterization of alternative splicing in human gene transcripts based on analysis of the full-length cDNAs. Applying both manual and computational analyses for 56 419 completely sequenced and precisely annotated full-length cDNAs selected for the H-Invitational human transcriptome annotation meetings, we identified 6877 alternative splicing genes with 18 297 different alternative splicing variants. A total of 37 670 exons were involved in these alternative splicing events. The encoded protein sequences were affected in 6005 of the 6877 genes. Notably, alternative splicing affected protein motifs in 3015 genes, subcellular localizations in 2982 genes and transmembrane domains in 1348 genes. We also identified interesting patterns of alternative splicing, in which two distinct genes seemed to be bridged, nested or having overlapping protein coding sequences (CDSs) of different reading frames (multiple CDS). In these cases, completely unrelated proteins are encoded by a single locus. Genome-wide annotations of alternative splicing, relying on full-length cDNAs, should lay firm groundwork for exploring in detail the diversification of protein function, which is mediated by the fast expanding universe of alternative splicing variants. PMID:16914452
Designing oligo libraries taking alternative splicing into account
NASA Astrophysics Data System (ADS)
Shoshan, Avi; Grebinskiy, Vladimir; Magen, Avner; Scolnicov, Ariel; Fink, Eyal; Lehavi, David; Wasserman, Alon
2001-06-01
We have designed sequences for DNA microarrays and oligo libraries, taking alternative splicing into account. Alternative splicing is a common phenomenon, occurring in more than 25% of the human genes. In many cases, different splice variants have different functions, are expressed in different tissues or may indicate different stages of disease. When designing sequences for DNA microarrays or oligo libraries, it is very important to take into account the sequence information of all the mRNA transcripts. Therefore, when a gene has more than one transcript (as a result of alternative splicing, alternative promoter sites or alternative poly-adenylation sites), it is very important to take all of them into account in the design. We have used the LEADS transcriptome prediction system to cluster and assemble the human sequences in GenBank and design optimal oligonucleotides for all the human genes with a known mRNA sequence based on the LEADS predictions.
Qu, Wen; Cingolani, Pablo; Zeeberg, Barry R; Ruden, Douglas M
2017-01-01
Deep sequencing of cDNAs made from spliced mRNAs indicates that most coding genes in many animals and plants have pre-mRNA transcripts that are alternatively spliced. In pre-mRNAs, in addition to invariant exons that are present in almost all mature mRNA products, there are at least 6 additional types of exons, such as exons from alternative promoters or with alternative polyA sites, mutually exclusive exons, skipped exons, or exons with alternative 5' or 3' splice sites. Our bioinformatics-based hypothesis is that, in analogy to the genetic code, there is an "alternative-splicing code" in introns and flanking exon sequences, analogous to the genetic code, that directs alternative splicing of many of the 36 types of introns. In humans, we identified 42 different consensus sequences that are each present in at least 100 human introns. 37 of the 42 top consensus sequences are significantly enriched or depleted in at least one of the 36 types of introns. We further supported our hypothesis by showing that 96 out of 96 analyzed human disease mutations that affect RNA splicing, and change alternative splicing from one class to another, can be partially explained by a mutation altering a consensus sequence from one type of intron to that of another type of intron. Some of the alternative splicing consensus sequences, and presumably their small-RNA or protein targets, are evolutionarily conserved from 50 plant to animal species. We also noticed the set of introns within a gene usually share the same splicing codes, thus arguing that one sub-type of splicesosome might process all (or most) of the introns in a given gene. Our work sheds new light on a possible mechanism for generating the tremendous diversity in protein structure by alternative splicing of pre-mRNAs.
Widespread alternative and aberrant splicing revealed by lariat sequencing
Stepankiw, Nicholas; Raghavan, Madhura; Fogarty, Elizabeth A.; Grimson, Andrew; Pleiss, Jeffrey A.
2015-01-01
Alternative splicing is an important and ancient feature of eukaryotic gene structure, the existence of which has likely facilitated eukaryotic proteome expansions. Here, we have used intron lariat sequencing to generate a comprehensive profile of splicing events in Schizosaccharomyces pombe, amongst the simplest organisms that possess mammalian-like splice site degeneracy. We reveal an unprecedented level of alternative splicing, including alternative splice site selection for over half of all annotated introns, hundreds of novel exon-skipping events, and thousands of novel introns. Moreover, the frequency of these events is far higher than previous estimates, with alternative splice sites on average activated at ∼3% the rate of canonical sites. Although a subset of alternative sites are conserved in related species, implying functional potential, the majority are not detectably conserved. Interestingly, the rate of aberrant splicing is inversely related to expression level, with lowly expressed genes more prone to erroneous splicing. Although we validate many events with RNAseq, the proportion of alternative splicing discovered with lariat sequencing is far greater, a difference we attribute to preferential decay of aberrantly spliced transcripts. Together, these data suggest the spliceosome possesses far lower fidelity than previously appreciated, highlighting the potential contributions of alternative splicing in generating novel gene structures. PMID:26261211
Spinelli, Roberta; Pirola, Alessandra; Redaelli, Sara; Sharma, Nitesh; Raman, Hima; Valletta, Simona; Magistroni, Vera; Piazza, Rocco; Gambacorti-Passerini, Carlo
2013-11-01
Point mutations in intronic regions near mRNA splice junctions can affect the splicing process. To identify novel splicing variants from exome sequencing data, we developed a bioinformatics splice-site prediction procedure to analyze next-generation sequencing (NGS) data (SpliceFinder). SpliceFinder integrates two functional annotation tools for NGS, ANNOVAR and MutationTaster and two canonical splice site prediction programs for single mutation analysis, SSPNN and NetGene2. By SpliceFinder, we identified somatic mutations affecting RNA splicing in a colon cancer sample, in eight atypical chronic myeloid leukemia (aCML), and eight CML patients. A novel homozygous splicing mutation was found in APC (NM_000038.4:c.1312+5G>A) and six heterozygous in GNAQ (NM_002072.2:c.735+1C>T), ABCC 3 (NM_003786.3:c.1783-1G>A), KLHDC 1 (NM_172193.1:c.568-2A>G), HOOK 1 (NM_015888.4:c.1662-1G>A), SMAD 9 (NM_001127217.2:c.1004-1C>T), and DNAH 9 (NM_001372.3:c.10242+5G>A). Integrating whole-exome and RNA sequencing in aCML and CML, we assessed the phenotypic effect of mutations on mRNA splicing for GNAQ, ABCC 3, HOOK 1. In ABCC 3 and HOOK 1, RNA-Seq showed the presence of aberrant transcripts with activation of a cryptic splice site or intron retention, validated by the reverse transcription-polymerase chain reaction (RT-PCR) in the case of HOOK 1. In GNAQ, RNA-Seq showed 22% of wild-type transcript and 78% of mRNA skipping exon 5, resulting in a 4-6 frameshift fusion confirmed by RT-PCR. The pipeline can be useful to identify intronic variants affecting RNA sequence by complementing conventional exome analysis.
Cell cycle dependent intracellular distribution of two spliced isoforms of TCP/ILF3 proteins.
Xu, You Hai; Leonova, Tatyana; Grabowski, Gregory A
2003-12-01
TCP80 is an approximately 80kDa mammalian cytoplasmic protein that binds to a set of mRNAs and inhibits their translation in vitro and ex vivo. This protein has high sequence similarity to interleukin-2 enhancer-binding factors (NF90/ILF3) and the M-phase phosphoprotein (MPP4)/DRBP76. A 110kDa immunologic isoform of TCP80/NF90/MPP4/DRBP76, termed TCP110, also is present in cytoplasm and nuclei of many types of cells. A cDNA sequence coding for TCP110 was derived by 5(')RACE. The TCP110 sequence is identical to ILF3. The gene coding for TCP110/ILF3 mapped to human chromosome 19 and the gene organization was analyzed using TCP80 and TCP110/ILF3 cDNA sequences. The TCP/ILF3 gene spans >34.8kb and contains 21 exons. At least one alternatively spliced product involving exons 19-21 exists and predicts the formation of either TCP80 or TCP110/ILF3. However, the functional relationships of TCP80 and TCP110/ILF3 required elucidation. The metabolic turnover rates and subcellular distribution of TCP80 and TCP110/ILF3 during the cell cycle showed TCP80 to be relatively stable (t(1/2)=5 days) in the cytoplasmic compartment. In comparison, TCP110/ILF3 migrated between the cytoplasmic and nuclear compartments during the cell cycle. The TCP110 C-terminal segment contains an additional nuclear localizing signal that plays a role in its nuclear translocation. This study indicates that the multiple cellular functions, i.e., translation control, interleukin-2 enhancer binding, or cell division, of TCP/ILF3 are fulfilled by alternatively spliced isoforms.
Human Splicing Finder: an online bioinformatics tool to predict splicing signals.
Desmet, François-Olivier; Hamroun, Dalil; Lalande, Marine; Collod-Béroud, Gwenaëlle; Claustres, Mireille; Béroud, Christophe
2009-05-01
Thousands of mutations are identified yearly. Although many directly affect protein expression, an increasing proportion of mutations is now believed to influence mRNA splicing. They mostly affect existing splice sites, but synonymous, non-synonymous or nonsense mutations can also create or disrupt splice sites or auxiliary cis-splicing sequences. To facilitate the analysis of the different mutations, we designed Human Splicing Finder (HSF), a tool to predict the effects of mutations on splicing signals or to identify splicing motifs in any human sequence. It contains all available matrices for auxiliary sequence prediction as well as new ones for binding sites of the 9G8 and Tra2-beta Serine-Arginine proteins and the hnRNP A1 ribonucleoprotein. We also developed new Position Weight Matrices to assess the strength of 5' and 3' splice sites and branch points. We evaluated HSF efficiency using a set of 83 intronic and 35 exonic mutations known to result in splicing defects. We showed that the mutation effect was correctly predicted in almost all cases. HSF could thus represent a valuable resource for research, diagnostic and therapeutic (e.g. therapeutic exon skipping) purposes as well as for global studies, such as the GEN2PHEN European Project or the Human Variome Project.
Human Splicing Finder: an online bioinformatics tool to predict splicing signals
Desmet, François-Olivier; Hamroun, Dalil; Lalande, Marine; Collod-Béroud, Gwenaëlle; Claustres, Mireille; Béroud, Christophe
2009-01-01
Thousands of mutations are identified yearly. Although many directly affect protein expression, an increasing proportion of mutations is now believed to influence mRNA splicing. They mostly affect existing splice sites, but synonymous, non-synonymous or nonsense mutations can also create or disrupt splice sites or auxiliary cis-splicing sequences. To facilitate the analysis of the different mutations, we designed Human Splicing Finder (HSF), a tool to predict the effects of mutations on splicing signals or to identify splicing motifs in any human sequence. It contains all available matrices for auxiliary sequence prediction as well as new ones for binding sites of the 9G8 and Tra2-β Serine-Arginine proteins and the hnRNP A1 ribonucleoprotein. We also developed new Position Weight Matrices to assess the strength of 5′ and 3′ splice sites and branch points. We evaluated HSF efficiency using a set of 83 intronic and 35 exonic mutations known to result in splicing defects. We showed that the mutation effect was correctly predicted in almost all cases. HSF could thus represent a valuable resource for research, diagnostic and therapeutic (e.g. therapeutic exon skipping) purposes as well as for global studies, such as the GEN2PHEN European Project or the Human Variome Project. PMID:19339519
SplicePlot: a utility for visualizing splicing quantitative trait loci.
Wu, Eric; Nance, Tracy; Montgomery, Stephen B
2014-04-01
RNA sequencing has provided unprecedented resolution of alternative splicing and splicing quantitative trait loci (sQTL). However, there are few tools available for visualizing the genotype-dependent effects of splicing at a population level. SplicePlot is a simple command line utility that produces intuitive visualization of sQTLs and their effects. SplicePlot takes mapped RNA sequencing reads in BAM format and genotype data in VCF format as input and outputs publication-quality Sashimi plots, hive plots and structure plots, enabling better investigation and understanding of the role of genetics on alternative splicing and transcript structure. Source code and detailed documentation are available at http://montgomerylab.stanford.edu/spliceplot/index.html under Resources and at Github. SplicePlot is implemented in Python and is supported on Linux and Mac OS. A VirtualBox virtual machine running Ubuntu with SplicePlot already installed is also available.
New discoveries of old SON: a link between RNA splicing and cancer.
Hickey, Christopher J; Kim, Jung-Hyun; Ahn, Eun-Young Erin
2014-02-01
The SON protein is a ubiquitously expressed DNA- and RNA-binding protein primarily localized to nuclear speckles. Although several early studies implicated SON in DNA-binding, tumorigenesis and apoptosis, functional significance of this protein had not been recognized until recent studies discovered SON as a novel RNA splicing co-factor. During constitutive RNA splicing, SON ensures efficient intron removal from the transcripts containing suboptimal splice sites. Importantly, SON-mediated splicing is required for proper processing of selective transcripts related to cell cycle, microtubules, centrosome maintenance, and genome stability. Moreover, SON regulates alternative splicing of RNAs from the genes involved in apoptosis and epigenetic modification. In addition to the role in RNA splicing, SON has an ability to suppress transcriptional activation at certain promoter/enhancer DNA sequences. Considering the multiple SON target genes which are directly involved in cell proliferation, genome stability and chromatin modifications, SON is an emerging player in gene regulation during cancer development and progression. Here, we summarize available information from several early studies on SON, and highlight recent discoveries describing molecular mechanisms of SON-mediated gene regulation. We propose that our future effort on better understanding of diverse SON functions would reveal novel targets for cancer therapy. © 2013 Wiley Periodicals, Inc.
Awan, Ali R; Manfredo, Amanda; Pleiss, Jeffrey A
2013-07-30
Alternative splicing is a potent regulator of gene expression that vastly increases proteomic diversity in multicellular eukaryotes and is associated with organismal complexity. Although alternative splicing is widespread in vertebrates, little is known about the evolutionary origins of this process, in part because of the absence of phylogenetically conserved events that cross major eukaryotic clades. Here we describe a lariat-sequencing approach, which offers high sensitivity for detecting splicing events, and its application to the unicellular fungus, Schizosaccharomyces pombe, an organism that shares many of the hallmarks of alternative splicing in mammalian systems but for which no previous examples of exon-skipping had been demonstrated. Over 200 previously unannotated splicing events were identified, including examples of regulated alternative splicing. Remarkably, an evolutionary analysis of four of the exons identified here as subject to skipping in S. pombe reveals high sequence conservation and perfect length conservation with their homologs in scores of plants, animals, and fungi. Moreover, alternative splicing of two of these exons have been documented in multiple vertebrate organisms, making these the first demonstrations of identical alternative-splicing patterns in species that are separated by over 1 billion y of evolution.
Suh, E R; Waring, R B
1990-01-01
It has been proposed that recognition of the 3' splice site in many group I introns involves base pairing between the start of the 3' exon and a region of the intron known as the internal guide sequence (R. W. Davies, R. B. Waring, J. Ray, T. A. Brown, and C. Scazzocchio, Nature [London] 300:719-724, 1982). We have examined this hypothesis, using the self-splicing rRNA intron from Tetrahymena thermophila. Mutations in the 3' exon that weaken this proposed pairing increased use of a downstream cryptic 3' splice site. Compensatory mutations in the guide sequence that restore this pairing resulted in even stronger selection of the normal 3' splice site. These changes in 3' splice site usage were more pronounced in the background of a mutation (414A) which resulted in an adenine instead of a guanine being the last base of the intron. These results show that the proposed pairing (P10) plays an important role in ensuring that cryptic 3' splice sites are selected against. Surprisingly, the 414A mutation alone did not result in activation of the cryptic 3' splice site. Images PMID:2342465
Splicing predictions reliably classify different types of alternative splicing
Busch, Anke; Hertel, Klemens J.
2015-01-01
Alternative splicing is a key player in the creation of complex mammalian transcriptomes and its misregulation is associated with many human diseases. Multiple mRNA isoforms are generated from most human genes, a process mediated by the interplay of various RNA signature elements and trans-acting factors that guide spliceosomal assembly and intron removal. Here, we introduce a splicing predictor that evaluates hundreds of RNA features simultaneously to successfully differentiate between exons that are constitutively spliced, exons that undergo alternative 5′ or 3′ splice-site selection, and alternative cassette-type exons. Surprisingly, the splicing predictor did not feature strong discriminatory contributions from binding sites for known splicing regulators. Rather, the ability of an exon to be involved in one or multiple types of alternative splicing is dictated by its immediate sequence context, mainly driven by the identity of the exon's splice sites, the conservation around them, and its exon/intron architecture. Thus, the splicing behavior of human exons can be reliably predicted based on basic RNA sequence elements. PMID:25805853
Spinelli, Roberta; Pirola, Alessandra; Redaelli, Sara; Sharma, Nitesh; Raman, Hima; Valletta, Simona; Magistroni, Vera; Piazza, Rocco; Gambacorti-Passerini, Carlo
2013-01-01
Point mutations in intronic regions near mRNA splice junctions can affect the splicing process. To identify novel splicing variants from exome sequencing data, we developed a bioinformatics splice-site prediction procedure to analyze next-generation sequencing (NGS) data (SpliceFinder). SpliceFinder integrates two functional annotation tools for NGS, ANNOVAR and MutationTaster and two canonical splice site prediction programs for single mutation analysis, SSPNN and NetGene2. By SpliceFinder, we identified somatic mutations affecting RNA splicing in a colon cancer sample, in eight atypical chronic myeloid leukemia (aCML), and eight CML patients. A novel homozygous splicing mutation was found in APC (NM_000038.4:c.1312+5G>A) and six heterozygous in GNAQ (NM_002072.2:c.735+1C>T), ABCC3 (NM_003786.3:c.1783-1G>A), KLHDC1 (NM_172193.1:c.568-2A>G), HOOK1 (NM_015888.4:c.1662-1G>A), SMAD9 (NM_001127217.2:c.1004-1C>T), and DNAH9 (NM_001372.3:c.10242+5G>A). Integrating whole-exome and RNA sequencing in aCML and CML, we assessed the phenotypic effect of mutations on mRNA splicing for GNAQ, ABCC3, HOOK1. In ABCC3 and HOOK1, RNA-Seq showed the presence of aberrant transcripts with activation of a cryptic splice site or intron retention, validated by the reverse transcription-polymerase chain reaction (RT-PCR) in the case of HOOK1. In GNAQ, RNA-Seq showed 22% of wild-type transcript and 78% of mRNA skipping exon 5, resulting in a 4–6 frameshift fusion confirmed by RT-PCR. The pipeline can be useful to identify intronic variants affecting RNA sequence by complementing conventional exome analysis. PMID:24498620
Ben Rebeh, Imen; Morinière, Madeleine; Ayadi, Leila; Benzina, Zeineb; Charfedine, Ilhem; Feki, Jamel; Ayadi, Hammadi; Ghorbel, Abdelmonem; Baklouti, Faouzi; Masmoudi, Saber
2010-09-30
Recessive mutations of the myosin VIIA (MYO7A) gene are reported to be responsible for both a deaf-blindness syndrome (Usher type 1B [USH1B] and atypical Usher syndrome) and nonsyndromic hearing loss (HL; Deafness, Neurosensory, Autosomal Recessive 2 [DFNB2]). The existence of DFNB2 is controversial, and often there is no relationship between the type and location of the MYO7A mutations corresponding to the USH1B and DFNB2 phenotype. We investigated the molecular determinant of a mild form of retinopathy in association with a subtle splicing modulation of MYO7A mRNA. Affected members underwent detailed audiologic and ocular characterization. DNA samples from family members were genotyped with polymorphic microsatellite markers. Sequencing of MYO7A was performed. Endogenous lymphoid RNA analysis and a splicing minigene assay were used to study the effect of the c.1935G>A mutation. Funduscopy showed mild retinitis pigmentosa in adults with HL. Microsatellite analysis showed linkage to markers in the region on chromosome 11q13.5. Sequencing of MYO7A revealed a mutation in the last nucleotide of exon 16 (c.1935G>A), which corresponds to a substitution of a methionine to an isoleucine residue at amino acid 645 of the myosin VIIA. However, structural prediction of the molecular model of myosin VIIA shows that this amino acid replacement induces only minor structural changes in the immediate environment of the mutation and thus does not alter the overall native structure. We found that, although predominantly included in mature mRNA, exon 16 is in fact alternatively spliced in control cells and that the mutation at the very last position is associated with a switch toward a predominant exclusion of that exon. This observation was further supported using a splicing minigene transfection assay; the c.1935G>A mutation was found to trigger a partial impairment of the adjacent donor splice site, suggesting that the unique change at the last position of the exon is responsible for the enhanced exon exclusion in this family. This study shows how an exonic mutation that weakens the 5' splice site enhances a minor alternative splicing without abolishing a complete exclusion of the exon and therefore causes a less severe retinitis pigmentosa than the USH1B-associated alleles. It would be interesting to examine a possible correlation between intrafamilial phenotypic variability and the subtle variation in exon 16 inclusion, probably related to genetic background specificities.
Human Splice-Site Prediction with Deep Neural Networks.
Naito, Tatsuhiko
2018-04-18
Accurate splice-site prediction is essential to delineate gene structures from sequence data. Several computational techniques have been applied to create a system to predict canonical splice sites. For classification tasks, deep neural networks (DNNs) have achieved record-breaking results and often outperformed other supervised learning techniques. In this study, a new method of splice-site prediction using DNNs was proposed. The proposed system receives an input sequence data and returns an answer as to whether it is splice site. The length of input is 140 nucleotides, with the consensus sequence (i.e., "GT" and "AG" for the donor and acceptor sites, respectively) in the middle. Each input sequence model is applied to the pretrained DNN model that determines the probability that an input is a splice site. The model consists of convolutional layers and bidirectional long short-term memory network layers. The pretraining and validation were conducted using the data set tested in previously reported methods. The performance evaluation results showed that the proposed method can outperform the previous methods. In addition, the pattern learned by the DNNs was visualized as position frequency matrices (PFMs). Some of PFMs were very similar to the consensus sequence. The trained DNN model and the brief source code for the prediction system are uploaded. Further improvement will be achieved following the further development of DNNs.
Zhao, Yunjun; Sun, Jiayan; Xu, Peng; Zhang, Rui; Li, Laigeng
2014-02-01
Alternative splicing is an important mechanism involved in regulating the development of multicellular organisms. Although many genes in plants undergo alternative splicing, little is understood of its significance in regulating plant growth and development. In this study, alternative splicing of black cottonwood (Populus trichocarpa) wood-associated NAC domain transcription factor (PtrWNDs), PtrWND1B, is shown to occur exclusively in secondary xylem fiber cells. PtrWND1B is expressed with a normal short-transcript PtrWND1B-s as well as its alternative long-transcript PtrWND1B-l. The intron 2 structure of the PtrWND1B gene was identified as a critical sequence that causes PtrWND1B alternative splicing. Suppression of PtrWND1B expression specifically inhibited fiber cell wall thickening. The two PtrWND1B isoforms play antagonistic roles in regulating cell wall thickening during fiber cell differentiation in Populus spp. PtrWND1B-s overexpression enhanced fiber cell wall thickening, while overexpression of PtrWND1B-l repressed fiber cell wall thickening. Alternative splicing may enable more specific regulation of processes such as fiber cell wall thickening during wood formation.
Zhao, Yunjun; Sun, Jiayan; Xu, Peng; Zhang, Rui; Li, Laigeng
2014-01-01
Alternative splicing is an important mechanism involved in regulating the development of multicellular organisms. Although many genes in plants undergo alternative splicing, little is understood of its significance in regulating plant growth and development. In this study, alternative splicing of black cottonwood (Populus trichocarpa) wood-associated NAC domain transcription factor (PtrWNDs), PtrWND1B, is shown to occur exclusively in secondary xylem fiber cells. PtrWND1B is expressed with a normal short-transcript PtrWND1B-s as well as its alternative long-transcript PtrWND1B-l. The intron 2 structure of the PtrWND1B gene was identified as a critical sequence that causes PtrWND1B alternative splicing. Suppression of PtrWND1B expression specifically inhibited fiber cell wall thickening. The two PtrWND1B isoforms play antagonistic roles in regulating cell wall thickening during fiber cell differentiation in Populus spp. PtrWND1B-s overexpression enhanced fiber cell wall thickening, while overexpression of PtrWND1B-l repressed fiber cell wall thickening. Alternative splicing may enable more specific regulation of processes such as fiber cell wall thickening during wood formation. PMID:24394777
HSA: a heuristic splice alignment tool.
Bu, Jingde; Chi, Xuebin; Jin, Zhong
2013-01-01
RNA-Seq methodology is a revolutionary transcriptomics sequencing technology, which is the representative of Next generation Sequencing (NGS). With the high throughput sequencing of RNA-Seq, we can acquire much more information like differential expression and novel splice variants from deep sequence analysis and data mining. But the short read length brings a great challenge to alignment, especially when the reads span two or more exons. A two steps heuristic splice alignment tool is generated in this investigation. First, map raw reads to reference with unspliced aligner--BWA; second, split initial unmapped reads into three equal short reads (seeds), align each seed to the reference, filter hits, search possible split position of read and extend hits to a complete match. Compare with other splice alignment tools like SOAPsplice and Tophat2, HSA has a better performance in call rate and efficiency, but its results do not as accurate as the other software to some extent. HSA is an effective spliced aligner of RNA-Seq reads mapping, which is available at https://github.com/vlcc/HSA.
Lücke, S; Xu, G L; Palfi, Z; Cross, M; Bellofatto, V; Bindereif, A
1996-01-01
In trypanosomes mRNAs are generated through trans splicing. The spliced leader (SL) RNA, which donates the 5'-terminal mini-exon to each of the protein coding exons, plays a central role in the trans splicing process. We have established in vivo assays to study in detail trans splicing, cap4 modification, and RNP assembly of the SL RNA in the trypanosomatid species Leptomonas seymouri. First, we found that extensive sequences within the mini-exon are required for SL RNA function in vivo, although a conserved length of 39 nt is not essential. In contrast, the intron sequence appears to be surprisingly tolerant to mutation; only the stem-loop II structure is indispensable. The asymmetry of the sequence requirements in the stem I region suggests that this domain may exist in different functional conformations. Second, distinct mini-exon sequences outside the modification site are important for efficient cap4 formation. Third, all SL RNA mutations tested allowed core RNP assembly, suggesting flexible requirements for core protein binding. In sum, the results of our mutational analysis provide evidence for a discrete domain structure of the SL RNA and help to explain the strong phylogenetic conservation of the mini-exon sequence and of the overall SL RNA secondary structure; they also suggest that there may be certain differences between trans splicing in nematodes and trypanosomes. This approach provides a basis for studying RNA-RNA interactions in the trans spliceosome. Images PMID:8861965
PRP5: a helicase-like protein required for mRNA splicing in yeast.
Dalbadie-McFarland, G; Abelson, J
1990-01-01
A 96-kDa protein predicted by the DNA sequence of the Saccharomyces cerevisiae PRP5 gene contains a domain that bears a striking resemblance to a family of RNA helicases characterized by the conserved amino acid sequence Asp-Glu-Ala-Asp (D-E-A-D). Previous work indicated that the product of the PRP5 gene is required for splicing and that spliceosome assembly does not occur in its absence. However, its precise role in splicing and the nature of its biochemical activity remained unknown. To examine the role of PRP5 in splicing, we cloned the gene by complementation of a temperature-sensitive mutation and determined its DNA sequence. We discuss here the possible roles for an RNA helicase in splicing and for the activity of the PRP5 protein. Images PMID:2349233
SpliceRover: Interpretable Convolutional Neural: Networks for Improved Splice Site Prediction.
Zuallaert, Jasper; Godin, Fréderic; Kim, Mijung; Soete, Arne; Saeys, Yvan; De Neve, Wesley
2018-06-21
During the last decade, improvements in high-throughput sequencing have generated a wealth of genomic data. Functionally interpreting these sequences and finding the biological signals that are hallmarks of gene function and regulation is currently mostly done using automated genome annotation platforms, which mainly rely on integrated machine learning frameworks to identify different functional sites of interest, including splice sites. Splicing is an essential step in the gene regulation process, and the correct identification of splice sites is a major cornerstone in a genome annotation system. In this paper, we present SpliceRover, a predictive deep learning approach that outperforms the state-of-the-art in splice site prediction. SpliceRover uses convolutional neural networks (CNNs), which have been shown to obtain cutting edge performance on a wide variety of prediction tasks. We adapted this approach to deal with genomic sequence inputs, and show it consistently outperforms already existing approaches, with relative improvements in prediction effectiveness of up to 80.9% when measured in terms of false discovery rate. However, a major criticism of CNNs concerns their "black box" nature, as mechanisms to obtain insight into their reasoning processes are limited. To facilitate interpretability of the SpliceRover models, we introduce an approach to visualize the biologically relevant information learnt. We show that our visualization approach is able to recover features known to be important for splice site prediction (binding motifs around the splice site, presence of polypyrimidine tracts and branch points), as well as reveal new features (e.g., several types of exclusion patterns near splice sites). SpliceRover is available as a web service. The prediction tool and instructions can be found at http://bioit2.irc.ugent.be/splicerover/. Supplementary materials are available at Bioinformatics online.
Context-dependent control of alternative splicing by RNA-binding proteins
Fu, Xiang-Dong; Ares, Manuel
2015-01-01
Sequence-specific RNA-binding proteins (RBPs) bind to pre-mRNA to control alternative splicing, but it is not yet possible to read the ‘splicing code’ that dictates splicing regulation on the basis of genome sequence. Each alternative splicing event is controlled by multiple RBPs, the combined action of which creates a distribution of alternatively spliced products in a given cell type. As each cell type expresses a distinct array of RBPs, the interpretation of regulatory information on a given RNA target is exceedingly dependent on the cell type. RBPs also control each other’s functions at many levels, including by mutual modulation of their binding activities on specific regulatory RNA elements. In this Review, we describe some of the emerging rules that govern the highly context-dependent and combinatorial nature of alternative splicing regulation. PMID:25112293
Identification of true EST alignments for recognising transcribed regions.
Ma, Chuang; Wang, Jia; Li, Lun; Duan, Mo-Jie; Zhou, Yan-Hong
2011-01-01
Transcribed regions can be determined by aligning Expressed Sequence Tags (ESTs) with genome sequences. The kernel of this strategy is to effectively distinguish true EST alignments from spurious ones. In this study, three measures including Direction Check, Identity Check and Terminal Check were introduced to more effectively eliminate spurious EST alignments. On the basis of these introduced measures and other widely used measures, a computational tool, named ESTCleanser, has been developed to identify true EST alignments for obtaining reliable transcribed regions. The performance of ESTCleanser has been evaluated on the well-annotated human ENCyclopedia of DNA Elements (ENCODE) regions using human ESTs in the dbEST database. The evaluation results show that the accuracy of ESTCleanser at exon and intron levels is more remarkably enhanced than that of UCSC-spliced EST alignments. This work would be helpful to EST-based researches on finding new genes, complementing genome annotation, recognising alternative splicing events and Single Nucleotide Polymorphisms (SNPs), etc.
Hong, Yoonki; Kim, Woo Jin; Bang, Chi Young; Lee, Jae Cheol; Oh, Yeon-Mok
2016-04-01
Lung cancer is the most common cause of cancer related death. Alterations in gene sequence, structure, and expression have an important role in the pathogenesis of lung cancer. Fusion genes and alternative splicing of cancer-related genes have the potential to be oncogenic. In the current study, we performed RNA-sequencing (RNA-seq) to investigate potential fusion genes and alternative splicing in non-small cell lung cancer. RNA was isolated from lung tissues obtained from 86 subjects with lung cancer. The RNA samples from lung cancer and normal tissues were processed with RNA-seq using the HiSeq 2000 system. Fusion genes were evaluated using Defuse and ChimeraScan. Candidate fusion transcripts were validated by Sanger sequencing. Alternative splicing was analyzed using multivariate analysis of transcript sequencing and validated using quantitative real time polymerase chain reaction. RNA-seq data identified oncogenic fusion genes EML4-ALK and SLC34A2-ROS1 in three of 86 normal-cancer paired samples. Nine distinct fusion transcripts were selected using DeFuse and ChimeraScan; of which, four fusion transcripts were validated by Sanger sequencing. In 33 squamous cell carcinoma, 29 tumor specific skipped exon events and six mutually exclusive exon events were identified. ITGB4 and PYCR1 were top genes that showed significant tumor specific splice variants. In conclusion, RNA-seq data identified novel potential fusion transcripts and splice variants. Further evaluation of their functional significance in the pathogenesis of lung cancer is required.
Lan, Susan; Kamel, Wael; Punga, Tanel; Akusjärvi, Göran
2017-02-28
The adenovirus L4-22K protein both activates and suppresses transcription from the adenovirus major late promoter (MLP) by binding to DNA elements located downstream of the MLP transcriptional start site: the so-called DE element (positive) and the R1 region (negative). Here we show that L4-22K preferentially binds to the RNA form of the R1 region, both to the double-stranded RNA and the single-stranded RNA of the same polarity as the nascent MLP transcript. Further, L4-22K binds to a 5΄-CAAA-3΄ motif in the single-stranded RNA, which is identical to the sequence motif characterized for L4-22K DNA binding. L4-22K binding to single-stranded RNA results in an enhancement of U1 snRNA recruitment to the major late first leader 5΄ splice site. This increase in U1 snRNA binding results in a suppression of MLP transcription and a concurrent stimulation of major late first intron splicing. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Berkers, Celia R.; de Jong, Annemieke; Schuurman, Karianne G.; Linnemann, Carsten; Meiring, Hugo D.; Janssen, Lennert; Neefjes, Jacques J.; Schumacher, Ton N. M.; Rodenko, Boris
2015-01-01
Peptide splicing, in which two distant parts of a protein are excised and then ligated to form a novel peptide, can generate unique MHC class I–restricted responses. Because these peptides are not genetically encoded and the rules behind proteasomal splicing are unknown, it is difficult to predict these spliced Ags. In the current study, small libraries of short peptides were used to identify amino acid sequences that affect the efficiency of this transpeptidation process. We observed that splicing does not occur at random, neither in terms of the amino acid sequences nor through random splicing of peptides from different sources. In contrast, splicing followed distinct rules that we deduced and validated both in vitro and in cells. Peptide ligation was quantified using a model peptide and demonstrated to occur with up to 30% ligation efficiency in vitro, provided that optimal structural requirements for ligation were met by both ligating partners. In addition, many splicing products could be formed from a single protein. Our splicing rules will facilitate prediction and detection of new spliced Ags to expand the peptidome presented by MHC class I Ags. PMID:26401003
Bonizzoni, Paola; Rizzi, Raffaella; Pesole, Graziano
2005-10-05
Currently available methods to predict splice sites are mainly based on the independent and progressive alignment of transcript data (mostly ESTs) to the genomic sequence. Apart from often being computationally expensive, this approach is vulnerable to several problems--hence the need to develop novel strategies. We propose a method, based on a novel multiple genome-EST alignment algorithm, for the detection of splice sites. To avoid limitations of splice sites prediction (mainly, over-predictions) due to independent single EST alignments to the genomic sequence our approach performs a multiple alignment of transcript data to the genomic sequence based on the combined analysis of all available data. We recast the problem of predicting constitutive and alternative splicing as an optimization problem, where the optimal multiple transcript alignment minimizes the number of exons and hence of splice site observations. We have implemented a splice site predictor based on this algorithm in the software tool ASPIC (Alternative Splicing PredICtion). It is distinguished from other methods based on BLAST-like tools by the incorporation of entirely new ad hoc procedures for accurate and computationally efficient transcript alignment and adopts dynamic programming for the refinement of intron boundaries. ASPIC also provides the minimal set of non-mergeable transcript isoforms compatible with the detected splicing events. The ASPIC web resource is dynamically interconnected with the Ensembl and Unigene databases and also implements an upload facility. Extensive bench marking shows that ASPIC outperforms other existing methods in the detection of novel splicing isoforms and in the minimization of over-predictions. ASPIC also requires a lower computation time for processing a single gene and an EST cluster. The ASPIC web resource is available at http://aspic.algo.disco.unimib.it/aspic-devel/.
Spliced DNA Sequences in the Paramecium Germline: Their Properties and Evolutionary Potential
Catania, Francesco; McGrath, Casey L.; Doak, Thomas G.; Lynch, Michael
2013-01-01
Despite playing a crucial role in germline-soma differentiation, the evolutionary significance of developmentally regulated genome rearrangements (DRGRs) has received scant attention. An example of DRGR is DNA splicing, a process that removes segments of DNA interrupting genic and/or intergenic sequences. Perhaps, best known for shaping immune-system genes in vertebrates, DNA splicing plays a central role in the life of ciliated protozoa, where thousands of germline DNA segments are eliminated after sexual reproduction to regenerate a functional somatic genome. Here, we identify and chronicle the properties of 5,286 sequences that putatively undergo DNA splicing (i.e., internal eliminated sequences [IESs]) across the genomes of three closely related species of the ciliate Paramecium (P. tetraurelia, P. biaurelia, and P. sexaurelia). The study reveals that these putative IESs share several physical characteristics. Although our results are consistent with excision events being largely conserved between species, episodes of differential IES retention/excision occur, may have a recent origin, and frequently involve coding regions. Our findings indicate interconversion between somatic—often coding—DNA sequences and noncoding IESs, and provide insights into the role of DNA splicing in creating potentially functional genetic innovation. PMID:23737328
The intron 1 of HPV 16 has a suboptimal branch point at a guanosine.
De la Rosa-Rios, Marco Antonio; Martínez-Salazar, Martha; Martínez-Garcia, Martha; González-Bonilla, César; Villegas-Sepúlveda, Nicolás
2006-06-01
The branch point sequence (BPS) of intron 1 of the HPV-16 was determined via RT-PCR in a cell free system, using lariat intermediates obtained by in vitro splicing reactions. We used synthetic E6/E7 transcripts and HeLa nuclear protein extracts to obtain the splicing intermediates. Then, a divergent oligonucleotide primer set, pairing on the lariat RNA that encompassed the 2'-5' phosphodiester bond formed between the 5' end of the intron and the BPS, was used for cDNA synthesis and PCR amplification. Subsequent RT-PCR assays revealed four splicing intermediates, made up of a major intermediary corresponding to the BPS and four cryptic branched sequences. Only intermediates bound at the 5' end of the intron are probably the authentic branch point sequence, and all of them branch at guanosine 328 instead of the typical adenosine. Unusually, the BPS of intron 1 of HPV-16 is a suboptimal sequence (AGUGAGU) that differs from the eukaryotic consensus BPS, which correlates with the splicing profile observed for early transcripts of HPV-16 in tumors and tumor derived cell lines. The implications of this unusual branch point sequence for splicing of the HPV-16 pre-mRNA are discussed.
An RNAi-enhanced Logic Circuit for Cancer Specific Detection and Destruction
2010-07-01
Bcl-2 family: mBax (Mus musculus), hBax ( Homo sapiens ), and its mutant hBax-S184A [4]. A plasmid containing the tested gene was transfected into HEK...the far-red fluorescent protein mKate to express the Gata3 mStaple. Intron- feature sequences – donor site, branch point, poly- pyrimidine tract, and...intron-exon junction. Among the donor and acceptor sequences found in literature our intron features were chosen according SplicePort [5], an
Huang, J M; Wang, Z Y; Ju, Z H; Wang, C F; Li, Q L; Sun, T; Hou, Q L; Hang, S Q; Hou, M H; Zhong, J F
2011-12-21
Bovine lactoferrin (bLF) is a member of the transferrin family; it plays an important role in the innate immune response. We identified novel splice variants of the bLF gene in mastitis-infected and healthy cows. Reverse transcription-polymerase chain reaction (RT-PCR) and clone sequencing analysis were used to screen the splice variants of the bLF gene in the mammary gland, spleen and liver tissues. One main transcript corresponding to the bLF reference sequence was found in three tissues in both healthy and mastitis-infected cows. Quantitative real-time PCR analysis showed that the expression levels of the LF gene's main transcript were not significantly different in tissues from healthy versus mastitis-infected cows. However, the new splice variant, LF-AS2, which has the exon-skipping alternative splicing pattern, was only identified in mammary glands infected with Staphylococcus aureus. Sequencing analysis showed that the new splice variant was 251 bp in length, including exon 1, part of exon 2, part of exon 16, and exon 17. We conclude that bLF may play a role in resistance to mastitis through alternative splicing mechanisms.
Tran, Trung T; Bollineni, Ravi C; Strozynski, Margarita; Koehler, Christian J; Thiede, Bernd
2017-07-07
Alternative splicing is a mechanism in eukaryotes by which different forms of mRNAs are generated from the same gene. Identification of alternative splice variants requires the identification of peptides specific for alternative splice forms. For this purpose, we generated a human database that contains only unique tryptic peptides specific for alternative splice forms from Swiss-Prot entries. Using this database allows an easy access to splice variant-specific peptide sequences that match to MS data. Furthermore, we combined this database without alternative splice variant-1-specific peptides with human Swiss-Prot. This combined database can be used as a general database for searching of LC-MS data. LC-MS data derived from in-solution digests of two different cell lines (LNCaP, HeLa) and phosphoproteomics studies were analyzed using these two databases. Several nonalternative splice variant-1-specific peptides were found in both cell lines, and some of them seemed to be cell-line-specific. Control and apoptotic phosphoproteomes from Jurkat T cells revealed several nonalternative splice variant-1-specific peptides, and some of them showed clear quantitative differences between the two states.
Pre-mRNA splicing repression triggers abiotic stress signaling in plants.
Ling, Yu; Alshareef, Sahar; Butt, Haroon; Lozano-Juste, Jorge; Li, Lixin; Galal, Aya A; Moustafa, Ahmed; Momin, Afaque A; Tashkandi, Manal; Richardson, Dale N; Fujii, Hiroaki; Arold, Stefan; Rodriguez, Pedro L; Duque, Paula; Mahfouz, Magdy M
2017-01-01
Alternative splicing (AS) of precursor RNAs enhances transcriptome plasticity and proteome diversity in response to diverse growth and stress cues. Recent work has shown that AS is pervasive across plant species, with more than 60% of intron-containing genes producing different isoforms. Mammalian cell-based assays have discovered various inhibitors of AS. Here, we show that the macrolide pladienolide B (PB) inhibits constitutive splicing and AS in plants. Also, our RNA sequencing (RNA-seq) data revealed that PB mimics abiotic stress signals including salt, drought and abscisic acid (ABA). PB activates the abiotic stress- and ABA-responsive reporters RD29A::LUC and MAPKKK18::uidA in Arabidopsis thaliana and mimics the effects of ABA on stomatal aperture. Genome-wide analysis of AS by RNA-seq revealed that PB perturbs the splicing machinery and leads to a striking increase in intron retention and a reduction in other forms of AS. Interestingly, PB treatment activates the ABA signaling pathway by inhibiting the splicing of clade A PP2C phosphatases while still maintaining to some extent the splicing of ABA-activated SnRK2 kinases. Taken together, our data establish PB as an inhibitor and modulator of splicing and a mimic of abiotic stress signals in plants. Thus, PB reveals the molecular underpinnings of the interplay between stress responses, ABA signaling and post-transcriptional regulation in plants. © 2016 The Authors The Plant Journal © 2016 John Wiley & Sons Ltd.
SplicingTypesAnno: annotating and quantifying alternative splicing events for RNA-Seq data.
Sun, Xiaoyong; Zuo, Fenghua; Ru, Yuanbin; Guo, Jiqiang; Yan, Xiaoyan; Sablok, Gaurav
2015-04-01
Alternative splicing plays a key role in the regulation of the central dogma. Four major types of alternative splicing have been classified as intron retention, exon skipping, alternative 5 splice sites or alternative donor sites, and alternative 3 splice sites or alternative acceptor sites. A few algorithms have been developed to detect splice junctions from RNA-Seq reads. However, there are few tools targeting at the major alternative splicing types at the exon/intron level. This type of analysis may reveal subtle, yet important events of alternative splicing, and thus help gain deeper understanding of the mechanism of alternative splicing. This paper describes a user-friendly R package, extracting, annotating and analyzing alternative splicing types for sequence alignment files from RNA-Seq. SplicingTypesAnno can: (1) provide annotation for major alternative splicing at exon/intron level. By comparing the annotation from GTF/GFF file, it identifies the novel alternative splicing sites; (2) offer a convenient two-level analysis: genome-scale annotation for users with high performance computing environment, and gene-scale annotation for users with personal computers; (3) generate a user-friendly web report and additional BED files for IGV visualization. SplicingTypesAnno is a user-friendly R package for extracting, annotating and analyzing alternative splicing types at exon/intron level for sequence alignment files from RNA-Seq. It is publically available at https://sourceforge.net/projects/splicingtypes/files/ or http://genome.sdau.edu.cn/research/software/SplicingTypesAnno.html. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
SpliceDisease database: linking RNA splicing and disease.
Wang, Juan; Zhang, Jie; Li, Kaibo; Zhao, Wei; Cui, Qinghua
2012-01-01
RNA splicing is an important aspect of gene regulation in many organisms. Splicing of RNA is regulated by complicated mechanisms involving numerous RNA-binding proteins and the intricate network of interactions among them. Mutations in cis-acting splicing elements or its regulatory proteins have been shown to be involved in human diseases. Defects in pre-mRNA splicing process have emerged as a common disease-causing mechanism. Therefore, a database integrating RNA splicing and disease associations would be helpful for understanding not only the RNA splicing but also its contribution to disease. In SpliceDisease database, we manually curated 2337 splicing mutation disease entries involving 303 genes and 370 diseases, which have been supported experimentally in 898 publications. The SpliceDisease database provides information including the change of the nucleotide in the sequence, the location of the mutation on the gene, the reference Pubmed ID and detailed description for the relationship among gene mutations, splicing defects and diseases. We standardized the names of the diseases and genes and provided links for these genes to NCBI and UCSC genome browser for further annotation and genomic sequences. For the location of the mutation, we give direct links of the entry to the respective position/region in the genome browser. The users can freely browse, search and download the data in SpliceDisease at http://cmbi.bjmu.edu.cn/sdisease.
Katz, R A; Kotler, M; Skalka, A M
1988-01-01
The full-length retroviral RNA transcript serves as (i) mRNA for the gag and pol gene products, (ii) genomic RNA that is assembled into progeny virions, and (iii) a pre-mRNA for spliced subgenomic mRNAs. Therefore, a balance of spliced and unspliced RNA is required to generate the appropriate levels of protein and RNA products for virion production. We have introduced an insertion mutation near the avian sarcoma virus env splice acceptor site that results in a significant increase in splicing to form functional env mRNA. The mutant virus is replication defective, but phenotypic revertant viruses that have acquired second-site mutations near the splice acceptor site can be isolated readily. Detailed analysis of one of these viruses revealed that a single nucleotide change at -20 from the splice acceptor site, within the original mutagenic insert, was sufficient to restore viral growth and significantly decrease splicing efficiency compared with the original mutant and wild-type viruses. Thus, minor sequence alterations near the env splice acceptor site can produce major changes in the balance of spliced and unspliced RNAs. Our results suggest a mechanism of control in which splicing is modulated by cis-acting sequences at the env splice acceptor site. Furthermore, this retroviral system provides a powerful genetic method for selection and analysis of mutations that affect splicing control. Images PMID:2839694
NASA Astrophysics Data System (ADS)
Zhang, Chunxi; Zhang, Zuchen; Song, Jingming; Wu, Chunxiao; Song, Ningfang
2015-03-01
A splicing parameter optimization method to increase the tensile strength of splicing joint between photonic crystal fiber (PCF) and conventional fiber is demonstrated. Based on the splicing recipes provided by splicer or fiber manufacturers, the optimal values of some major splicing parameters are obtained in sequence, and a conspicuous improvement in the mechanical strength of splicing joints between PCFs and conventional fibers is validated through experiments.
Stanescu, Ana; Caragea, Doina
2015-01-01
Recent biochemical advances have led to inexpensive, time-efficient production of massive volumes of raw genomic data. Traditional machine learning approaches to genome annotation typically rely on large amounts of labeled data. The process of labeling data can be expensive, as it requires domain knowledge and expert involvement. Semi-supervised learning approaches that can make use of unlabeled data, in addition to small amounts of labeled data, can help reduce the costs associated with labeling. In this context, we focus on the problem of predicting splice sites in a genome using semi-supervised learning approaches. This is a challenging problem, due to the highly imbalanced distribution of the data, i.e., small number of splice sites as compared to the number of non-splice sites. To address this challenge, we propose to use ensembles of semi-supervised classifiers, specifically self-training and co-training classifiers. Our experiments on five highly imbalanced splice site datasets, with positive to negative ratios of 1-to-99, showed that the ensemble-based semi-supervised approaches represent a good choice, even when the amount of labeled data consists of less than 1% of all training data. In particular, we found that ensembles of co-training and self-training classifiers that dynamically balance the set of labeled instances during the semi-supervised iterations show improvements over the corresponding supervised ensemble baselines. In the presence of limited amounts of labeled data, ensemble-based semi-supervised approaches can successfully leverage the unlabeled data to enhance supervised ensembles learned from highly imbalanced data distributions. Given that such distributions are common for many biological sequence classification problems, our work can be seen as a stepping stone towards more sophisticated ensemble-based approaches to biological sequence annotation in a semi-supervised framework.
2015-01-01
Background Recent biochemical advances have led to inexpensive, time-efficient production of massive volumes of raw genomic data. Traditional machine learning approaches to genome annotation typically rely on large amounts of labeled data. The process of labeling data can be expensive, as it requires domain knowledge and expert involvement. Semi-supervised learning approaches that can make use of unlabeled data, in addition to small amounts of labeled data, can help reduce the costs associated with labeling. In this context, we focus on the problem of predicting splice sites in a genome using semi-supervised learning approaches. This is a challenging problem, due to the highly imbalanced distribution of the data, i.e., small number of splice sites as compared to the number of non-splice sites. To address this challenge, we propose to use ensembles of semi-supervised classifiers, specifically self-training and co-training classifiers. Results Our experiments on five highly imbalanced splice site datasets, with positive to negative ratios of 1-to-99, showed that the ensemble-based semi-supervised approaches represent a good choice, even when the amount of labeled data consists of less than 1% of all training data. In particular, we found that ensembles of co-training and self-training classifiers that dynamically balance the set of labeled instances during the semi-supervised iterations show improvements over the corresponding supervised ensemble baselines. Conclusions In the presence of limited amounts of labeled data, ensemble-based semi-supervised approaches can successfully leverage the unlabeled data to enhance supervised ensembles learned from highly imbalanced data distributions. Given that such distributions are common for many biological sequence classification problems, our work can be seen as a stepping stone towards more sophisticated ensemble-based approaches to biological sequence annotation in a semi-supervised framework. PMID:26356316
Kalyna, Maria; Lopato, Sergiy; Voronin, Viktor; Barta, Andrea
2006-01-01
Alternative splicing is an important mechanism for fine tuning of gene expression at the post-transcriptional level. SR proteins govern splice site selection and spliceosome assembly. The Arabidopsis genome encodes 19 SR proteins, several of which have no orthologues in metazoan. Three of the plant specific subfamilies are characterized by the presence of a relatively long alternatively spliced intron located in their first RNA recognition motif, which potentially results in an extremely truncated protein. In atRSZ33, a member of the RS2Z subfamily, this alternative splicing event was shown to be autoregulated. Here we show that atRSp31, a member of the RS subfamily, does not autoregulate alternative splicing of its similarily positioned intron. Interestingly, this alternative splicing event is regulated by atRSZ33. We demonstrate that the positions of these long introns and their capability for alternative splicing are conserved from green algae to flowering plants. Moreover, in particular alternative splicing events the splicing signals are embedded into highly conserved sequences. In different taxa, these conserved sequences occur in at least one gene within a subfamily. The evolutionary preservation of alternative splice forms together with highly conserved intron features argues for additional functions hidden in the genes of these plant-specific SR proteins. PMID:16936312
Cheriyan, Manoj; Chan, Siu-Hong; Perler, Francine
2014-12-12
Inteins self-catalytically cleave out of precursor proteins while ligating the surrounding extein fragments with a native peptide bond. Much attention has been lavished on these molecular marvels with the hope of understanding and harnessing their chemistry for novel biochemical transformations including coupling peptides from synthetic or biological origins and controlling protein function. Despite an abundance of powerful applications, the use of inteins is still hampered by limitations in our understanding of their specificity (defined as flanking sequences that permit splicing) and the challenge of inserting inteins into target proteins. We examined the frequently used Nostoc punctiforme Npu DnaE intein after the C-extein cysteine nucleophile (Cys+1) was mutated to serine or threonine. Previous studies demonstrated reduced rates and/or splicing yields with the Npu DnaE intein after mutation of Cys+1 to Ser+1. In this study, genetic selection identified extein sequences with Ser+1 that enabled the Npu DnaE intein to splice with only a 5-fold reduction in rate compared to the wild-type Cys+1 intein and without mutation of the intein itself to activate Ser+1 as a nucleophile. Three different proteins spliced efficiently after insertion of the intein flanked by the selected sequences. We then used this selected specificity to achieve traceless splicing in a targeted enzyme at a location predicted by primary sequence similarity to only the selected C-extein sequence. This study highlights the latent catalytic potential of the Npu DnaE intein to splice with an alternative nucleophile and enables broader intein utility by increasing insertion site choices. Copyright © 2014. Published by Elsevier Ltd.
Bacrot, Séverine; Doyard, Mathilde; Huber, Céline; Alibeu, Olivier; Feldhahn, Niklas; Lehalle, Daphné; Lacombe, Didier; Marlin, Sandrine; Nitschke, Patrick; Petit, Florence; Vazquez, Marie-Paule; Munnich, Arnold; Cormier-Daire, Valérie
2015-02-01
Cerebro-costo-mandibular syndrome (CCMS) is a developmental disorder characterized by the association of Pierre Robin sequence and posterior rib defects. Exome sequencing and Sanger sequencing in five unrelated CCMS patients revealed five heterozygous variants in the small nuclear ribonucleoprotein polypeptides B and B1 (SNRPB) gene. This gene includes three transcripts, namely transcripts 1 and 2, encoding components of the core spliceosomal machinery (SmB' and SmB) and transcript 3 undergoing nonsense-mediated mRNA decay. All variants were located in the premature termination codon (PTC)-introducing alternative exon of transcript 3. Quantitative RT-PCR analysis revealed a significant increase in transcript 3 levels in leukocytes of CCMS individuals compared to controls. We conclude that CCMS is due to heterozygous mutations in SNRPB, enhancing inclusion of a SNRPB PTC-introducing alternative exon, and show that this developmental disease is caused by defects in the splicing machinery. Our finding confirms the report of SNRPB mutations in CCMS patients by Lynch et al. (2014) and further extends the clinical and molecular observations. © 2014 WILEY PERIODICALS, INC.
Analysis and recognition of 5′ UTR intron splice sites in human pre-mRNA
Eden, E.; Brunak, S.
2004-01-01
Prediction of splice sites in non-coding regions of genes is one of the most challenging aspects of gene structure recognition. We perform a rigorous analysis of such splice sites embedded in human 5′ untranslated regions (UTRs), and investigate correlations between this class of splice sites and other features found in the adjacent exons and introns. By restricting the training of neural network algorithms to ‘pure’ UTRs (not extending partially into protein coding regions), we for the first time investigate the predictive power of the splicing signal proper, in contrast to conventional splice site prediction, which typically relies on the change in sequence at the transition from protein coding to non-coding. By doing so, the algorithms were able to pick up subtler splicing signals that were otherwise masked by ‘coding’ noise, thus enhancing significantly the prediction of 5′ UTR splice sites. For example, the non-coding splice site predicting networks pick up compositional and positional bias in the 3′ ends of non-coding exons and 5′ non-coding intron ends, where cytosine and guanine are over-represented. This compositional bias at the true UTR donor sites is also visible in the synaptic weights of the neural networks trained to identify UTR donor sites. Conventional splice site prediction methods perform poorly in UTRs because the reading frame pattern is absent. The NetUTR method presented here performs 2–3-fold better compared with NetGene2 and GenScan in 5′ UTRs. We also tested the 5′ UTR trained method on protein coding regions, and discovered, surprisingly, that it works quite well (although it cannot compete with NetGene2). This indicates that the local splicing pattern in UTRs and coding regions is largely the same. The NetUTR method is made publicly available at www.cbs.dtu.dk/services/NetUTR. PMID:14960723
iCLIP Predicts the Dual Splicing Effects of TIA-RNA Interactions
Briese, Michael; Zarnack, Kathi; Luscombe, Nicholas M.; Rot, Gregor; Zupan, Blaž; Curk, Tomaž; Ule, Jernej
2010-01-01
The regulation of alternative splicing involves interactions between RNA-binding proteins and pre-mRNA positions close to the splice sites. T-cell intracellular antigen 1 (TIA1) and TIA1-like 1 (TIAL1) locally enhance exon inclusion by recruiting U1 snRNP to 5′ splice sites. However, effects of TIA proteins on splicing of distal exons have not yet been explored. We used UV-crosslinking and immunoprecipitation (iCLIP) to find that TIA1 and TIAL1 bind at the same positions on human RNAs. Binding downstream of 5′ splice sites was used to predict the effects of TIA proteins in enhancing inclusion of proximal exons and silencing inclusion of distal exons. The predictions were validated in an unbiased manner using splice-junction microarrays, RT-PCR, and minigene constructs, which showed that TIA proteins maintain splicing fidelity and regulate alternative splicing by binding exclusively downstream of 5′ splice sites. Surprisingly, TIA binding at 5′ splice sites silenced distal cassette and variable-length exons without binding in proximity to the regulated alternative 3′ splice sites. Using transcriptome-wide high-resolution mapping of TIA-RNA interactions we evaluated the distal splicing effects of TIA proteins. These data are consistent with a model where TIA proteins shorten the time available for definition of an alternative exon by enhancing recognition of the preceding 5′ splice site. Thus, our findings indicate that changes in splicing kinetics could mediate the distal regulation of alternative splicing. PMID:21048981
DOE Office of Scientific and Technical Information (OSTI.GOV)
Willing, M.; Deschenes, S.
We have identified a G to A substitution in the 5{prime} donor splice site of intron 18 of one COL1A1 allele in two unrelated families with osteogenesis imperfecta (OI) type I. A third OI type I family has a G to A substitution at the identical position in intron 48 of one COL1A1 allele. Both mutations abolish normal splicing and lead to reduced steady-state levels of mRNA from the mutant COL1A1 allele. The intron 18 mutation leads to both exon 18 skipping in the mRNA and to utilization of a single alternative splice site near the 3{prime} end of exonmore » 18. The latter results in deletion of the last 8 nucleotides of exon 18 from the mRNA, a shift in the translational reading-frame, and the creation of a premature termination codon in exon 19. Of the potential alternative 5{prime} splice sites in exon 18 and intron 18, the one utilized has a surrounding nucleotide sequence which most closely resembles that of the natural splice site. Although a G to A mutation was detected at the identical position in intron 48 of one COL1A1 allele in another OI type I family, nine complex alternative splicing patterns were identified by sequence analysis of cDNA clones derived from fibroblast mRNA from this cell strain. All result in partial or complete skipping of exon 48, with in-frame deletions of portions of exons 47 and/or 49. The different patterns of RNA splicing were not explained by their sequence homology with naturally occuring 5{prime} splice sites, but rather by recombination between highly homologous exon sequences, suggesting that we may not have identified the major splicing alternative(s) in this cell strain. Both G to A mutations result in decreased production of type I collagen, the common biochemical correlate of OI type I.« less
Defective control of pre–messenger RNA splicing in human disease
Shkreta, Lulzim
2016-01-01
Examples of associations between human disease and defects in pre–messenger RNA splicing/alternative splicing are accumulating. Although many alterations are caused by mutations in splicing signals or regulatory sequence elements, recent studies have noted the disruptive impact of mutated generic spliceosome components and splicing regulatory proteins. This review highlights recent progress in our understanding of how the altered splicing function of RNA-binding proteins contributes to myelodysplastic syndromes, cancer, and neuropathologies. PMID:26728853
Manananggal - a novel viewer for alternative splicing events.
Barann, Matthias; Zimmer, Ralf; Birzele, Fabian
2017-02-21
Alternative splicing is an important cellular mechanism that can be analyzed by RNA sequencing. However, identification of splicing events in an automated fashion is error-prone. Thus, further validation is required to select reliable instances of alternative splicing events (ASEs). There are only few tools specifically designed for interactive inspection of ASEs and available visualization approaches can be significantly improved. Here, we present Manananggal, an application specifically designed for the identification of splicing events in next generation sequencing data. Manananggal includes a web application for visual inspection and a command line tool that allows for ASE detection. We compare the sashimi plots available in the IGV Viewer, the DEXSeq splicing plots and SpliceSeq to the Manananggal interface and discuss the advantages and drawbacks of these tools. We show that sashimi plots (such as those used by the IGV Viewer and SpliceSeq) offer a practical solution for simple ASEs, but also indicate short-comings for highly complex genes. Manananggal is an interactive web application that offers functions specifically tailored to the identification of alternative splicing events that other tools are lacking. The ability to select a subset of isoforms allows an easier interpretation of complex alternative splicing events. In contrast to SpliceSeq and the DEXSeq splicing plot, Manananggal does not obscure the gene structure by showing full transcript models that makes it easier to determine which isoforms are expressed and which are not.
Dynamic ASXL1 Exon Skipping and Alternative Circular Splicing in Single Human Cells
Natarajan, Sivaraman; Carter, Robert; Brown, Patrick O.
2016-01-01
Circular RNAs comprise a poorly understood new class of noncoding RNA. In this study, we used a combination of targeted deletion, high-resolution splicing detection, and single-cell sequencing to deeply probe ASXL1 circular splicing. We found that efficient circular splicing required the canonical transcriptional start site and inverted AluSx elements. Sequencing-based interrogation of isoforms after ASXL1 overexpression identified promiscuous linear splicing between all exons, with the two most abundant non-canonical linear products skipping the exons that produced the circular isoforms. Single-cell sequencing revealed a strong preference for either the linear or circular ASXL1 isoforms in each cell, and found the predominant exon skipping product is frequently co-expressed with its reciprocal circular isoform. Finally, absolute quantification of ASXL1 isoforms confirmed our findings and suggests that standard methods overestimate circRNA abundance. Taken together, these data reveal a dynamic new view of circRNA genesis, providing additional framework for studying their roles in cellular biology. PMID:27736885
Larson, Amy; Fair, Benjamin Jung; Pleiss, Jeffrey A
2016-06-01
Pre-mRNA splicing is an essential component of eukaryotic gene expression and is highly conserved from unicellular yeasts to humans. Here, we present the development and implementation of a sequencing-based reverse genetic screen designed to identify nonessential genes that impact pre-mRNA splicing in the fission yeast Schizosaccharomyces pombe, an organism that shares many of the complex features of splicing in higher eukaryotes. Using a custom-designed barcoding scheme, we simultaneously queried ∼3000 mutant strains for their impact on the splicing efficiency of two endogenous pre-mRNAs. A total of 61 nonessential genes were identified whose deletions resulted in defects in pre-mRNA splicing; enriched among these were factors encoding known or predicted components of the spliceosome. Included among the candidates identified here are genes with well-characterized roles in other RNA-processing pathways, including heterochromatic silencing and 3' end processing. Splicing-sensitive microarrays confirm broad splicing defects for many of these factors, revealing novel functional connections between these pathways. Copyright © 2016 Larson et al.
Larson, Amy; Fair, Benjamin Jung; Pleiss, Jeffrey A.
2016-01-01
Pre-mRNA splicing is an essential component of eukaryotic gene expression and is highly conserved from unicellular yeasts to humans. Here, we present the development and implementation of a sequencing-based reverse genetic screen designed to identify nonessential genes that impact pre-mRNA splicing in the fission yeast Schizosaccharomyces pombe, an organism that shares many of the complex features of splicing in higher eukaryotes. Using a custom-designed barcoding scheme, we simultaneously queried ∼3000 mutant strains for their impact on the splicing efficiency of two endogenous pre-mRNAs. A total of 61 nonessential genes were identified whose deletions resulted in defects in pre-mRNA splicing; enriched among these were factors encoding known or predicted components of the spliceosome. Included among the candidates identified here are genes with well-characterized roles in other RNA-processing pathways, including heterochromatic silencing and 3ʹ end processing. Splicing-sensitive microarrays confirm broad splicing defects for many of these factors, revealing novel functional connections between these pathways. PMID:27172183
Kawarai, Toshitaka; Miyamoto, Ryosuke; Mori, Atsuko; Oki, Ryosuke; Tsukamoto-Miyashiro, Ai; Matsui, Naoko; Miyazaki, Yoshimichi; Orlacchio, Antonio; Izumi, Yuishin; Nishida, Yoshihiko; Kaji, Ryuji
2015-12-15
We identified a novel homozygous mutation in the splice site donor (SSD) of intron 30 (c.5866+1G>A) in consanguineous Japanese SPG11 siblings showing late-onset spastic paraplegia using the whole-exome sequencing. Phenotypic variability was observed, including age-at-onset, dysarthria and pes cavus. Coding DNA sequencing revealed that the mutation affected the recognition of the constitutive SSD of intron 30, splicing upstream onto a nearby cryptic SSD in exon 30. The use of constitutive splice sites of intron 29 was confirmed by sequencing. The mutant transcripts are mostly subject to degradation by the nonsense-mediated mRNA decay system. SPG11 transcripts, escaping from the nonsense-mediated mRNA decay pathway, would generate a truncated protein (p.Tyr1900Phefs5X) containing the first 1899 amino acids and followed by 4 aberrant amino acids. This study showed a successful clinical application of whole-exome sequencing in spastic paraplegia and demonstrated a further evidence of allelic heterogeneity in SPG11. The confirmation of aberrant transcript by splice site mutation is a prerequisite for a more precise molecular diagnosis. Copyright © 2015 Elsevier B.V. All rights reserved.
Reading the tea leaves: Dead transposon copies reveal novel host and transposon biology.
McLaughlin, Richard N
2018-03-01
Transposable elements comprise a huge portion of most animal genomes. Unlike many pathogens, these elements leave a mark of their impact via their insertion into host genomes. With proper teasing, these sequences can relay information about the evolutionary history of transposons and their hosts. In a new publication, Larson and colleagues describe a previously unappreciated density of long interspersed element-1 (LINE-1) sequences that have been spliced (LINE-1 and other reverse transcribing elements are necessarily intronless). They provide data to suggest that the retention of these potentially deleterious splice sites in LINE-1 results from the sites' overlap with an important transcription factor binding site. These spliced LINE-1s (i.e., spliced integrated retrotransposed elements [SpiREs]) lose their ability to replicate, suggesting they are evolutionary dead ends. However, the lethality of this splicing could be an efficient means of blocking continued replication of LINE-1. In this way, the record of inactive LINE-1 sequences in the human genome revealed a new, though infrequent, event in the LINE-1 replication cycle and motivates future studies to test whether splicing might be another weapon in the anti-LINE-1 arsenal of host genomes.
Crotti, Lia; Lewandowska, Marzena A; Schwartz, Peter J; Insolia, Roberto; Pedrazzini, Matteo; Bussani, Erica; Dagradi, Federica; George, Alfred L; Pagani, Franco
2009-02-01
Genetic screening of long QT syndrome (LQTS) fails to identify disease-causing mutations in about 30% of patients. So far, molecular screening has focused mainly on coding sequence mutations or on substitutions at canonical splice sites. The purpose of this study was to explore the possibility that intronic variants not at canonical splice sites might affect splicing regulatory elements, lead to aberrant transcripts, and cause LQTS. Molecular screening was performed through DHPLC and sequence analysis. The role of the intronic mutation identified was assessed with a hybrid minigene splicing assay. A three-generation LQTS family was investigated. Molecular screening failed to identify an obvious disease-causing mutation in the coding sequences of the major LQTS genes but revealed an intronic A-to-G substitution in KCNH2 (IVS9-28A/G) cosegregating with the clinical phenotype in family members. In vitro analysis proved that the mutation disrupts the acceptor splice site definition by affecting the branch point (BP) sequence and promoting intron retention. We further demonstrated a tight functional relationship between the BP and the polypyrimidine tract, whose weakness is responsible for the pathological effect of the IVS9-28A/G mutation. We identified a novel BP mutation in KCNH2 that disrupts the intron 9 acceptor splice site definition and causes LQT2. The present finding demonstrates that intronic mutations affecting pre-mRNA processing may contribute to the failure of traditional molecular screening in identifying disease-causing mutations in LQTS subjects and offers a rationale strategy for the reduction of genotype-negative cases.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Miller, S.I.; Wirth, D.F.
1988-06-01
The 5' ends of Leishmania mRNAs contain an identical 35-nucleotide sequence termed the spliced leader (SL) or 5' mini-exon. The SL sequence is at the 5' end of an 85-nucleotide primary transcript that contains a consensus eucaryotic 5' intron-exon splice junction immediately 3' to the SL. The SL is added to protein-coding genes immediately 3' to a consensus eucaryotic 3' intron-exon splice junction. The authors' previous work demonstrated possible intermediates in discontinuous mRNA processing that contain the 50 nucleotides of the SL primary transcript 3' to the SL, the SL intron sequence (SLIS). These RNAs have a 5' terminus atmore » the splice junction of the SL and the SLIS. The authors examined a Leishmania nuclear extract for these RNAs in ribonucleoprotein (RNP) particles. Density centrifugation analysis showed that the SL RNA is predominately in RNP complexes at 60S, while the SLIS-containing RNAs are in complexes at 40S. They also demonstrated that the SLIS can be released from polyadenylated RNA by incubation with a HeLa cell extract containing debranching enzymatic activity. These data suggested that Leishmania enriettii mRNAs are assembled by bimolecular or trans splicing as has been recently demonstrated for Trypanosoma brucei. Furthermore, they determined the partial sequence of the Leishmania U2 equivalent RNA and demonstrated that it cosediments with the SL RNA at 60S in a nuclear extract. These RNP particles may be analogous to so-called spliceosomes that have been demonstrated in other systems.« less
Fine-Scale Variation and Genetic Determinants of Alternative Splicing across Individuals
Coulombe-Huntington, Jasmin; Lam, Kevin C. L.; Dias, Christel; Majewski, Jacek
2009-01-01
Recently, thanks to the increasing throughput of new technologies, we have begun to explore the full extent of alternative pre–mRNA splicing (AS) in the human transcriptome. This is unveiling a vast layer of complexity in isoform-level expression differences between individuals. We used previously published splicing sensitive microarray data from lymphoblastoid cell lines to conduct an in-depth analysis on splicing efficiency of known and predicted exons. By combining publicly available AS annotation with a novel algorithm designed to search for AS, we show that many real AS events can be detected within the usually unexploited, speculative majority of the array and at significance levels much below standard multiple-testing thresholds, demonstrating that the extent of cis-regulated differential splicing between individuals is potentially far greater than previously reported. Specifically, many genes show subtle but significant genetically controlled differences in splice-site usage. PCR validation shows that 42 out of 58 (72%) candidate gene regions undergo detectable AS, amounting to the largest scale validation of isoform eQTLs to date. Targeted sequencing revealed a likely causative SNP in most validated cases. In all 17 incidences where a SNP affected a splice-site region, in silico splice-site strength modeling correctly predicted the direction of the micro-array and PCR results. In 13 other cases, we identified likely causative SNPs disrupting predicted splicing enhancers. Using Fst and REHH analysis, we uncovered significant evidence that 2 putative causative SNPs have undergone recent positive selection. We verified the effect of five SNPs using in vivo minigene assays. This study shows that splicing differences between individuals, including quantitative differences in isoform ratios, are frequent in human populations and that causative SNPs can be identified using in silico predictions. Several cases affected disease-relevant genes and it is likely some of these differences are involved in phenotypic diversity and susceptibility to complex diseases. PMID:20011102
Seim, Inge; Jeffery, Penny L; Thomas, Patrick B; Walpole, Carina M; Maugham, Michelle; Fung, Jenny N T; Yap, Pei-Yi; O'Keeffe, Angela J; Lai, John; Whiteside, Eliza J; Herington, Adrian C; Chopin, Lisa K
2016-06-01
The peptide hormone ghrelin is a potent orexigen produced predominantly in the stomach. It has a number of other biological actions, including roles in appetite stimulation, energy balance, the stimulation of growth hormone release and the regulation of cell proliferation. Recently, several ghrelin gene splice variants have been described. Here, we attempted to identify conserved alternative splicing of the ghrelin gene by cross-species sequence comparisons. We identified a novel human exon 2-deleted variant and provide preliminary evidence that this splice variant and in1-ghrelin encode a C-terminally truncated form of the ghrelin peptide, termed minighrelin. These variants are expressed in humans and mice, demonstrating conservation of alternative splicing spanning 90 million years. Minighrelin appears to have similar actions to full-length ghrelin, as treatment with exogenous minighrelin peptide stimulates appetite and feeding in mice. Forced expression of the exon 2-deleted preproghrelin variant mirrors the effect of the canonical preproghrelin, stimulating cell proliferation and migration in the PC3 prostate cancer cell line. This is the first study to characterise an exon 2-deleted preproghrelin variant and to demonstrate sequence conservation of ghrelin gene-derived splice variants that encode a truncated ghrelin peptide. This adds further impetus for studies into the alternative splicing of the ghrelin gene and the function of novel ghrelin peptides in vertebrates.
Lisboa, Bianca Cristina Garcia; Machado, Tamara da Rocha; Pimenta, Daniel Carvalho; Han, Sang Won
2007-02-01
Human cytidine deaminase (HCD) catalyzes the deamination of cytidine or deoxycytidine to uridine or deoxyuridine, respectively. The genomic sequence of HCD is formed by 31 kb with 4 exons and several alternative splicing signals, but an alternative form of HCD has yet to be reported. Here we describe the cloning and characterization of a small form of HCD, HSCD, and it is likely to be a product of alternative splicing of HCD. The alignment of DNA sequences shows that the HSCD matches HCD in 2 parts, except for a deletion of 170 bp. Based on the HCD genome organization, exons 1 and 4 should be joined and all sequences of introns and exons 2 and 3 should be deleted by splicing. This alternative splicing shifted the translation of the reading frame from the point of splicing. The estimated molecular mass is 9.8 kDa, and this value was confirmed by Western blot and mass spectroscopy after expressing the gene fused with glutathionine-S-transferase in the pGEX vector. The deletion and shift of the reading frame caused a loss of HCD activity, which was confirmed by enzyme assay and also with NIH3T3 cells modified to express HSCD and challenged against cytosine arabinoside. In this work we describe the identification and characterization of HSCD, which is the product of alternative splicing of the HCD gene.
SEQassembly: A Practical Tools Program for Coding Sequences Splicing
NASA Astrophysics Data System (ADS)
Lee, Hongbin; Yang, Hang; Fu, Lei; Qin, Long; Li, Huili; He, Feng; Wang, Bo; Wu, Xiaoming
CDS (Coding Sequences) is a portion of mRNA sequences, which are composed by a number of exon sequence segments. The construction of CDS sequence is important for profound genetic analysis such as genotyping. A program in MATLAB environment is presented, which can process batch of samples sequences into code segments under the guide of reference exon models, and splice these code segments of same sample source into CDS according to the exon order in queue file. This program is useful in transcriptional polymorphism detection and gene function study.
Kon, Ayana; Yamazaki, Satoshi; Nannya, Yasuhito; Kataoka, Keisuke; Ota, Yasunori; Nakagawa, Masahiro Marshall; Yoshida, Kenichi; Shiozawa, Yusuke; Morita, Maiko; Yoshizato, Tetsuichi; Sanada, Masashi; Nakayama, Manabu; Koseki, Haruhiko; Nakauchi, Hiromitsu; Ogawa, Seishi
2018-02-08
Splicing factor mutations are characteristic of myelodysplastic syndromes (MDS) and related myeloid neoplasms and implicated in their pathogenesis, but their roles in the development of MDS have not been fully elucidated. In the present study, we investigated the consequence of mutant Srsf2 expression using newly generated Vav1-Cre -mediated conditional knockin mice. Mice carrying a heterozygous Srsf2 P95H mutation showed significantly reduced numbers of hematopoietic stem and progenitor cells (HSPCs) and differentiation defects both in the steady-state condition and transplantation settings. Srsf2 -mutated hematopoietic stem cells (HSCs) showed impaired long-term reconstitution compared with control mice in competitive repopulation assays. Although the Srsf2 mutant mice did not develop MDS under the steady-state condition, when their stem cells were transplanted into lethally irradiated mice, the recipients developed anemia, leukopenia, and erythroid dysplasia, which suggests the role of replicative stress in the development of an MDS-like phenotype in Srsf2 -mutated mice. RNA sequencing of the Srsf2 -mutated HSPCs revealed a number of abnormal splicing events and differentially expressed genes, including several potential targets implicated in the pathogenesis of hematopoietic malignancies, such as Csf3r , Fyn , Gnas , Nsd1 , Hnrnpa2b1 , and Trp53bp1 Among the mutant Srsf2 -associated splicing events, most commonly observed were the enhanced inclusion and/or exclusion of cassette exons, which were caused by the altered consensus motifs for the recognition of exonic splicing enhancers. Our findings suggest that the mutant Srsf2 leads to a compromised HSC function by causing abnormal RNA splicing and expression, contributing to the deregulated hematopoiesis that recapitulates the MDS phenotypes, possibly as a result of additional genetic and/or environmental insults. © 2018 by The American Society of Hematology.
The GRK4 subfamily of G protein-coupled receptor kinases. Alternative splicing, gene organization, and sequence conservation.
Premont RT, Macrae AD, Aparicio SA, Kendall HE, Welch JE, Lefkowitz RJ.
Department of Medicine, Howard Hughes Medical Institute, Duke Univer...
iSS-PseDNC: identifying splicing sites using pseudo dinucleotide composition.
Chen, Wei; Feng, Peng-Mian; Lin, Hao; Chou, Kuo-Chen
2014-01-01
In eukaryotic genes, exons are generally interrupted by introns. Accurately removing introns and joining exons together are essential processes in eukaryotic gene expression. With the avalanche of genome sequences generated in the postgenomic age, it is highly desired to develop automated methods for rapid and effective detection of splice sites that play important roles in gene structure annotation and even in RNA splicing. Although a series of computational methods were proposed for splice site identification, most of them neglected the intrinsic local structural properties. In the present study, a predictor called "iSS-PseDNC" was developed for identifying splice sites. In the new predictor, the sequences were formulated by a novel feature-vector called "pseudo dinucleotide composition" (PseDNC) into which six DNA local structural properties were incorporated. It was observed by the rigorous cross-validation tests on two benchmark datasets that the overall success rates achieved by iSS-PseDNC in identifying splice donor site and splice acceptor site were 85.45% and 87.73%, respectively. It is anticipated that iSS-PseDNC may become a useful tool for identifying splice sites and that the six DNA local structural properties described in this paper may provide novel insights for in-depth investigations into the mechanism of RNA splicing.
Yeakley, J M; Hedjran, F; Morfin, J P; Merillat, N; Rosenfeld, M G; Emeson, R B
1993-01-01
The calcitonin/calcitonin gene-related peptide (CGRP) primary transcript is alternatively spliced in thyroid C cells and neurons, resulting in the tissue-specific production of calcitonin and CGRP mRNAs. Analyses of mutated calcitonin/CGRP transcription units in permanently transfected cell lines have indicated that alternative splicing is regulated by a differential capacity to utilize the calcitonin-specific splice acceptor. The analysis of an extensive series of mutations suggests that tissue-specific regulation of calcitonin mRNA production does not depend on the presence of a single, unique cis-active element but instead appears to be a consequence of suboptimal constitutive splicing signals. While only those mutations that altered constitutive splicing signals affected splice choices, the action of multiple regulatory sequences cannot be formally excluded. Further, we have identified a 13-nucleotide purine-rich element from a constitutive exon that, when placed in exon 4, entirely switches splice site usage in CGRP-producing cells. These data suggest that specific exon recruitment sequences, in combination with other constitutive elements, serve an important function in exon recognition. These results are consistent with the hypothesis that tissue-specific alternative splicing of the calcitonin/CGRP primary transcript is mediated by cell-specific differences in components of the constitutive splicing machinery. Images PMID:8413203
Quantitation of normal CFTR mRNA in CF patients with splice-site mutations
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhou, Z.; Olsen, J.C.; Silverman, L.M.
Previously we identified two mutations in introns of the CFTR gene associated with partially active splice sites and unusual clinical phenotypes. One mutation in intron 19 (3849+10 kb C to T) is common in CF patients with normal sweat chloride values; an 84 bp sequence from intron 19, which contains a stop codon, is inserted between exon 19 and exon 20 in most nasal CFTR transcripts. The other mutation in intron 14B (2789+5 G to A) is associated with elevated sweat chloride levels, but mild pulmonary disease; exon 14B (38 bp) is spliced out of most nasal CFTR transcipts. Themore » remaining CFTR cDNA sequences, other than the 84 bp insertion of exon 14B deletion, are identical to the published sequence. To correlate genotype and phenotype, we used quantitative RT-PCR to determine the levels of normally-spliced CFTR mRNA in nasal epithelia from these patients. CFTR cDNA was amplified (25 cycles) by using primers specific for normally-spliced species, {gamma}-actin cDNA was amplified as a standard.« less
Indel detection from DNA and RNA sequencing data with transIndel.
Yang, Rendong; Van Etten, Jamie L; Dehm, Scott M
2018-04-19
Insertions and deletions (indels) are a major class of genomic variation associated with human disease. Indels are primarily detected from DNA sequencing (DNA-seq) data but their transcriptional consequences remain unexplored due to challenges in discriminating medium-sized and large indels from splicing events in RNA-seq data. Here, we developed transIndel, a splice-aware algorithm that parses the chimeric alignments predicted by a short read aligner and reconstructs the mid-sized insertions and large deletions based on the linear alignments of split reads from DNA-seq or RNA-seq data. TransIndel exhibits competitive or superior performance over eight state-of-the-art indel detection tools on benchmarks using both synthetic and real DNA-seq data. Additionally, we applied transIndel to DNA-seq and RNA-seq datasets from 333 primary prostate cancer patients from The Cancer Genome Atlas (TCGA) and 59 metastatic prostate cancer patients from AACR-PCF Stand-Up- To-Cancer (SU2C) studies. TransIndel enhanced the taxonomy of DNA- and RNA-level alterations in prostate cancer by identifying recurrent FOXA1 indels as well as exitron splicing in genes implicated in disease progression. Our study demonstrates that transIndel is a robust tool for elucidation of medium- and large-sized indels from DNA-seq and RNA-seq data. Including RNA-seq in indel discovery efforts leads to significant improvements in sensitivity for identification of med-sized and large indels missed by DNA-seq, and reveals non-canonical RNA-splicing events in genes associated with disease pathology.
Long-read sequencing of nascent RNA reveals coupling among RNA processing events.
Herzel, Lydia; Straube, Korinna; Neugebauer, Karla M
2018-06-14
Pre-mRNA splicing is accomplished by the spliceosome, a megadalton complex that assembles de novo on each intron. Because spliceosome assembly and catalysis occur cotranscriptionally, we hypothesized that introns are removed in the order of their transcription in genomes dominated by constitutive splicing. Remarkably little is known about splicing order and the regulatory potential of nascent transcript remodeling by splicing, due to the limitations of existing methods that focus on analysis of mature splicing products (mRNAs) rather than substrates and intermediates. Here, we overcome this obstacle through long-read RNA sequencing of nascent, multi-intron transcripts in the fission yeast Schizosaccharomyces pombe Most multi-intron transcripts were fully spliced, consistent with rapid cotranscriptional splicing. However, an unexpectedly high proportion of transcripts were either fully spliced or fully unspliced, suggesting that splicing of any given intron is dependent on the splicing status of other introns in the transcript. Supporting this, mild inhibition of splicing by a temperature-sensitive mutation in prp2 , the homolog of vertebrate U2AF65, increased the frequency of fully unspliced transcripts. Importantly, fully unspliced transcripts displayed transcriptional read-through at the polyA site and were degraded cotranscriptionally by the nuclear exosome. Finally, we show that cellular mRNA levels were reduced in genes with a high number of unspliced nascent transcripts during caffeine treatment, showing regulatory significance of cotranscriptional splicing. Therefore, overall splicing of individual nascent transcripts, 3' end formation, and mRNA half-life depend on the splicing status of neighboring introns, suggesting crosstalk among spliceosomes and the polyA cleavage machinery during transcription elongation. © 2018 Herzel et al.; Published by Cold Spring Harbor Laboratory Press.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Conrad, R.; Thomas, J.; Spieth, J.
In nematodes, the RNA products of some genes are trans-spliced to a 22-nucleotide spliced leader (SL), while the RNA products of other genes are not. In Caenorhabditis elegans, there are two SLs, Sl1 and SL2, donated by two distinct small nuclear ribonucleoprotein particles in a process functionally quite similar to nuclear intron removal. The authors demonstrate here that it is possible to convert a non-trans-spliced gene into a trans-spliced gene by placement of an intron missing only the 5[prime] splice site into the 5[prime] untranslated region. Stable transgenic strains were isolated expressing a gene in which 69 nucleotides of amore » vit-5 intron, including the 3[prime] splice site, were inserted into the 5[prime] untranslated region of a vit-2/vit-6 fusion gene. The RNA product of this gene was examined by primer extension and PCR amplification. Although the vit-2/vit-6 transgene product is not normally trans-spliced, the majority of transcripts from this altered gene were trans-spliced to SL1. They termed the region of a trans-spliced mRNA precursor between the 5[prime] end and the first 3[prime] splice site an 'outrun'. The results suggest that if a transcript begins with intronlike sequence followed by a 3[prime] splice site, this alone may constitute an outrun and be sufficient to demarcate a transcript as a trans-splice acceptor. These findings leave open the possibility that specific sequences are required to increase the efficiency of trans-splicing.« less
Liu, Kaiyu; Li, Yi; Jousset, Françoise-Xavière; Zadori, Zoltan; Szelei, Jozsef; Yu, Qian; Pham, Hanh Thi; Lépine, François; Bergoin, Max; Tijssen, Peter
2011-01-01
The Acheta domesticus densovirus (AdDNV), isolated from crickets, has been endemic in Europe for at least 35 years. Severe epizootics have also been observed in American commercial rearings since 2009 and 2010. The AdDNV genome was cloned and sequenced for this study. The transcription map showed that splicing occurred in both the nonstructural (NS) and capsid protein (VP) multicistronic RNAs. The splicing pattern of NS mRNA predicted 3 nonstructural proteins (NS1 [576 codons], NS2 [286 codons], and NS3 [213 codons]). The VP gene cassette contained two VP open reading frames (ORFs), of 597 (ORF-A) and 268 (ORF-B) codons. The VP2 sequence was shown by N-terminal Edman degradation and mass spectrometry to correspond with ORF-A. Mass spectrometry, sequencing, and Western blotting of baculovirus-expressed VPs versus native structural proteins demonstrated that the VP1 structural protein was generated by joining ORF-A and -B via splicing (splice II), eliminating the N terminus of VP2. This splice resulted in a nested set of VP1 (816 codons), VP3 (467 codons), and VP4 (429 codons) structural proteins. In contrast, the two splices within ORF-B (Ia and Ib) removed the donor site of intron II and resulted in VP2, VP3, and VP4 expression. ORF-B may also code for several nonstructural proteins, of 268, 233, and 158 codons. The small ORF-B contains the coding sequence for a phospholipase A2 motif found in VP1, which was shown previously to be critical for cellular uptake of the virus. These splicing features are unique among parvoviruses and define a new genus of ambisense densoviruses. PMID:21775445
On the path to genetic novelties: insights from programmed DNA elimination and RNA splicing.
Catania, Francesco; Schmitz, Jürgen
2015-01-01
Understanding how genetic novelties arise is a central goal of evolutionary biology. To this end, programmed DNA elimination and RNA splicing deserve special consideration. While programmed DNA elimination reshapes genomes by eliminating chromatin during organismal development, RNA splicing rearranges genetic messages by removing intronic regions during transcription. Small RNAs help to mediate this class of sequence reorganization, which is not error-free. It is this imperfection that makes programmed DNA elimination and RNA splicing excellent candidates for generating evolutionary novelties. Leveraging a number of these two processes' mechanistic and evolutionary properties, which have been uncovered over the past years, we present recently proposed models and empirical evidence for how splicing can shape the structure of protein-coding genes in eukaryotes. We also chronicle a number of intriguing similarities between the processes of programmed DNA elimination and RNA splicing, and highlight the role that the variation in the population-genetic environment may play in shaping their target sequences. © 2015 Wiley Periodicals, Inc.
QUANTIFYING ALTERNATIVE SPLICING FROM PAIRED-END RNA-SEQUENCING DATA.
Rossell, David; Stephan-Otto Attolini, Camille; Kroiss, Manuel; Stöcker, Almond
2014-03-01
RNA-sequencing has revolutionized biomedical research and, in particular, our ability to study gene alternative splicing. The problem has important implications for human health, as alternative splicing may be involved in malfunctions at the cellular level and multiple diseases. However, the high-dimensional nature of the data and the existence of experimental biases pose serious data analysis challenges. We find that the standard data summaries used to study alternative splicing are severely limited, as they ignore a substantial amount of valuable information. Current data analysis methods are based on such summaries and are hence sub-optimal. Further, they have limited flexibility in accounting for technical biases. We propose novel data summaries and a Bayesian modeling framework that overcome these limitations and determine biases in a non-parametric, highly flexible manner. These summaries adapt naturally to the rapid improvements in sequencing technology. We provide efficient point estimates and uncertainty assessments. The approach allows to study alternative splicing patterns for individual samples and can also be the basis for downstream analyses. We found a several fold improvement in estimation mean square error compared popular approaches in simulations, and substantially higher consistency between replicates in experimental data. Our findings indicate the need for adjusting the routine summarization and analysis of alternative splicing RNA-seq studies. We provide a software implementation in the R package casper.
2013-01-01
Background The production of multiple transcript isoforms from one gene is a major source of transcriptome complexity. RNA-Seq experiments, in which transcripts are converted to cDNA and sequenced, allow the resolution and quantification of alternative transcript isoforms. However, methods to analyze splicing are underdeveloped and errors resulting in incorrect splicing calls occur in every experiment. Results We used RNA-Seq data to develop sequencing and aligner error models. By applying these error models to known input from simulations, we found that errors result from false alignment to minor splice motifs and antisense stands, shifted junction positions, paralog joining, and repeat induced gaps. By using a series of quantitative and qualitative filters, we eliminated diagnosed errors in the simulation, and applied this to RNA-Seq data from Drosophila melanogaster heads. We used high-confidence junction detections to specifically interrogate local splicing differences between transcripts. This method out-performed commonly used RNA-seq methods to identify known alternative splicing events in the Drosophila sex determination pathway. We describe a flexible software package to perform these tasks called Splicing Analysis Kit (Spanki), available at http://www.cbcb.umd.edu/software/spanki. Conclusions Splice-junction centric analysis of RNA-Seq data provides advantages in specificity for detection of alternative splicing. Our software provides tools to better understand error profiles in RNA-Seq data and improve inference from this new technology. The splice-junction centric approach that this software enables will provide more accurate estimates of differentially regulated splicing than current tools. PMID:24209455
Sturgill, David; Malone, John H; Sun, Xia; Smith, Harold E; Rabinow, Leonard; Samson, Marie-Laure; Oliver, Brian
2013-11-09
The production of multiple transcript isoforms from one gene is a major source of transcriptome complexity. RNA-Seq experiments, in which transcripts are converted to cDNA and sequenced, allow the resolution and quantification of alternative transcript isoforms. However, methods to analyze splicing are underdeveloped and errors resulting in incorrect splicing calls occur in every experiment. We used RNA-Seq data to develop sequencing and aligner error models. By applying these error models to known input from simulations, we found that errors result from false alignment to minor splice motifs and antisense stands, shifted junction positions, paralog joining, and repeat induced gaps. By using a series of quantitative and qualitative filters, we eliminated diagnosed errors in the simulation, and applied this to RNA-Seq data from Drosophila melanogaster heads. We used high-confidence junction detections to specifically interrogate local splicing differences between transcripts. This method out-performed commonly used RNA-seq methods to identify known alternative splicing events in the Drosophila sex determination pathway. We describe a flexible software package to perform these tasks called Splicing Analysis Kit (Spanki), available at http://www.cbcb.umd.edu/software/spanki. Splice-junction centric analysis of RNA-Seq data provides advantages in specificity for detection of alternative splicing. Our software provides tools to better understand error profiles in RNA-Seq data and improve inference from this new technology. The splice-junction centric approach that this software enables will provide more accurate estimates of differentially regulated splicing than current tools.
Detection of Splice Sites Using Support Vector Machine
NASA Astrophysics Data System (ADS)
Varadwaj, Pritish; Purohit, Neetesh; Arora, Bhumika
Automatic identification and annotation of exon and intron region of gene, from DNA sequences has been an important research area in field of computational biology. Several approaches viz. Hidden Markov Model (HMM), Artificial Intelligence (AI) based machine learning and Digital Signal Processing (DSP) techniques have extensively and independently been used by various researchers to cater this challenging task. In this work, we propose a Support Vector Machine based kernel learning approach for detection of splice sites (the exon-intron boundary) in a gene. Electron-Ion Interaction Potential (EIIP) values of nucleotides have been used for mapping character sequences to corresponding numeric sequences. Radial Basis Function (RBF) SVM kernel is trained using EIIP numeric sequences. Furthermore this was tested on test gene dataset for detection of splice site by window (of 12 residues) shifting. Optimum values of window size, various important parameters of SVM kernel have been optimized for a better accuracy. Receiver Operating Characteristic (ROC) curves have been utilized for displaying the sensitivity rate of the classifier and results showed 94.82% accuracy for splice site detection on test dataset.
Modelling reveals kinetic advantages of co-transcriptional splicing.
Aitken, Stuart; Alexander, Ross D; Beggs, Jean D
2011-10-01
Messenger RNA splicing is an essential and complex process for the removal of intron sequences. Whereas the composition of the splicing machinery is mostly known, the kinetics of splicing, the catalytic activity of splicing factors and the interdependency of transcription, splicing and mRNA 3' end formation are less well understood. We propose a stochastic model of splicing kinetics that explains data obtained from high-resolution kinetic analyses of transcription, splicing and 3' end formation during induction of an intron-containing reporter gene in budding yeast. Modelling reveals co-transcriptional splicing to be the most probable and most efficient splicing pathway for the reporter transcripts, due in part to a positive feedback mechanism for co-transcriptional second step splicing. Model comparison is used to assess the alternative representations of reactions. Modelling also indicates the functional coupling of transcription and splicing, because both the rate of initiation of transcription and the probability that step one of splicing occurs co-transcriptionally are reduced, when the second step of splicing is abolished in a mutant reporter.
Harrison, Neale; Kalbfleisch, Andreas; Connolly, Bernadette; Pettitt, Jonathan; Müller, Berndt
2010-08-01
Spliced-leader (SL) trans-splicing has been found in all molecularly characterized nematode species to date, and it is likely to be a nematode synapomorphy. Most information regarding SL trans-splicing has come from the study of nematodes from a single monophyletic group, the Rhabditida, all of which employ SL RNAs that are identical to, or variants of, the SL1 RNA first characterized in Caenorhabditis elegans. In contrast, the more distantly related Trichinella spiralis, belonging to the subclass Dorylaimia, utilizes a distinct set of SL RNAs that display considerable sequence diversity. To investigate whether this is true of other members of the Dorylaimia, we have characterized SL RNAs from Prionchulus punctatus. Surprisingly, this revealed the presence of a set of SLs that show clear sequence similarity to the SL2 family of spliced leaders, which have previously only been found within the rhabditine group (which includes C. elegans). Expression of one of the P. punctatus SL RNAs in C. elegans reveals that it can compete specifically with the endogenous C. elegans SL2 spliced leaders, being spliced to the pre-mRNAs derived from downstream genes in operons, but does not compete with the SL1 spliced leaders. This discovery raises the possibility that SL2-like spliced leaders were present in the last common ancestor of the nematode phylum.
Sharma, Neeraj; Sosnay, Patrick R.; Ramalho, Anabela S.; Douville, Christopher; Franca, Arianna; Gottschalk, Laura B.; Park, Jeenah; Lee, Melissa; Vecchio-Pagan, Briana; Raraigh, Karen S.; Amaral, Margarida D.; Karchin, Rachel; Cutting, Garry R.
2015-01-01
Assessment of the functional consequences of variants near splice sites is a major challenge in the diagnostic laboratory. To address this issue, we created expression minigenes (EMGs) to determine the RNA and protein products generated by splice site variants (n = 10) implicated in cystic fibrosis (CF). Experimental results were compared with the splicing predictions of eight in silico tools. EMGs containing the full-length Cystic Fibrosis Transmembrane Conductance Regulator (CFTR) coding sequence and flanking intron sequences generated wild-type transcript and fully processed protein in Human Embryonic Kidney (HEK293) and CF bronchial epithelial (CFBE41o-) cells. Quantification of variant induced aberrant mRNA isoforms was concordant using fragment analysis and pyrosequencing. The splicing patterns of c.1585−1G>A and c.2657+5G>A were comparable to those reported in primary cells from individuals bearing these variants. Bioinformatics predictions were consistent with experimental results for 9/10 variants (MES), 8/10 variants (NNSplice), and 7/10 variants (SSAT and Sroogle). Programs that estimate the consequences of mis-splicing predicted 11/16 (HSF and ASSEDA) and 10/16 (Fsplice and SplicePort) experimentally observed mRNA isoforms. EMGs provide a robust experimental approach for clinical interpretation of splice site variants and refinement of in silico tools. PMID:25066652
Quaking and PTB control overlapping splicing regulatory networks during muscle cell differentiation
Hall, Megan P.; Nagel, Roland J.; Fagg, W. Samuel; Shiue, Lily; Cline, Melissa S.; Perriman, Rhonda J.; Donohue, John Paul; Ares, Manuel
2013-01-01
Alternative splicing contributes to muscle development, but a complete set of muscle-splicing factors and their combinatorial interactions are unknown. Previous work identified ACUAA (“STAR” motif) as an enriched intron sequence near muscle-specific alternative exons such as Capzb exon 9. Mass spectrometry of myoblast proteins selected by the Capzb exon 9 intron via RNA affinity chromatography identifies Quaking (QK), a protein known to regulate mRNA function through ACUAA motifs in 3′ UTRs. We find that QK promotes inclusion of Capzb exon 9 in opposition to repression by polypyrimidine tract-binding protein (PTB). QK depletion alters inclusion of 406 cassette exons whose adjacent intron sequences are also enriched in ACUAA motifs. During differentiation of myoblasts to myotubes, QK levels increase two- to threefold, suggesting a mechanism for QK-responsive exon regulation. Combined analysis of the PTB- and QK-splicing regulatory networks during myogenesis suggests that 39% of regulated exons are under the control of one or both of these splicing factors. This work provides the first evidence that QK is a global regulator of splicing during muscle development in vertebrates and shows how overlapping splicing regulatory networks contribute to gene expression programs during differentiation. PMID:23525800
Li, Xiaoze; Johansson, Cecilia; Cardoso Palacios, Carlos; Mossberg, Anki; Dhanjal, Soniya; Bergvall, Monika; Schwartz, Stefan
2013-01-01
The most commonly used 3′-splice site on the human papillomavirus type 16 (HPV-16) genome named SA3358 is used to produce HPV-16 early mRNAs encoding E4, E5, E6 and E7, and late mRNAs encoding L1 and L2. We have previously shown that SA3358 is suboptimal and is totally dependent on a downstream splicing enhancer containingmultiple potential ASF/SF2 binding sites. Here weshow that only one of the predicted ASF/SF2 sites accounts for the majority of the enhancer activity. We demonstrate that single nucleotide substitutions in this predicted ASF/SF2 site impair enhancer function and that this correlates with less efficient binding to ASF/SF2 in vitro. We provide evidence that HPV-16 mRNAs that arespliced to SA3358 interact with ASF/SF2 in living cells. In addition,mutational inactivation of the ASF/SF2 site weakened the enhancer at SA3358 in episomal forms of the HPV-16 genome, indicating that the enhancer is active in the context of the full HPV-16 genome.This resulted in induction of HPV-16 late gene expression as a result of competition from late splice site SA5639. Furthermore, inactivation of the ASF/SF2 site of the SA3358 splicing enhancer reduced the ability of E6- and E7-encoding HPV-16 plasmids to increase the life span of primary keratinocytes in vitro, demonstrating arequirement for an intact splicing enhancer of SA3358 forefficient production of the E6 and E7 mRNAs. These results link the strength of the HPV-16 SA3358 splicing enhancer to expression of E6 and E7 and to the pathogenic properties of HPV-16. PMID:24039800
Sun, Xiaoyong; Wang, Lin; Ding, Jiechao; Wang, Yanru; Wang, Jiansheng; Zhang, Xiaoyang; Che, Yulei; Liu, Ziwei; Zhang, Xinran; Ye, Jiazhen; Wang, Jie; Sablok, Gaurav; Deng, Zhiping; Zhao, Hongwei
2016-10-01
A new regulatory class of small endogenous RNAs called circular RNAs (circRNAs) has been described as miRNA sponges in animals. Using 16 Arabidopsis thaliana RNA-Seq data sets, we identified 803 circRNAs in RNase R-/non-RNase R-treated samples. The results revealed the following features: Canonical and noncanonical splicing can generate circRNAs; chloroplasts are a hotspot for circRNA generation; furthermore, limited complementary sequences exist not only in introns, but also in the sequences flanking splice sites. The latter finding suggests that multiple combinations between complementary sequences may facilitate the formation of the circular structure. Our results contribute to a better understanding of this novel class of plant circRNAs. © 2016 Federation of European Biochemical Societies.
Splicing-related genes are alternatively spliced upon changes in ambient temperatures in plants
Bucher, Johan; Lammers, Michiel; Busscher-Lange, Jacqueline; Bonnema, Guusje; Rodenburg, Nicole; Proveniers, Marcel C. G.; Angenent, Gerco C.
2017-01-01
Plants adjust their development and architecture to small variations in ambient temperature. In a time in which temperatures are rising world-wide, the mechanism by which plants are able to sense temperature fluctuations and adapt to it, is becoming of special interest. By performing RNA-sequencing on two Arabidopsis accession and one Brassica species exposed to temperature alterations, we showed that alternative splicing is an important mechanism in ambient temperature sensing and adaptation. We found that amongst the differentially alternatively spliced genes, splicing related genes are enriched, suggesting that the splicing machinery itself is targeted for alternative splicing when temperature changes. Moreover, we showed that many different components of the splicing machinery are targeted for ambient temperature regulated alternative splicing. Mutant analysis of a splicing related gene that was differentially spliced in two of the genotypes showed an altered flowering time response to different temperatures. We propose a two-step mechanism where temperature directly influences alternative splicing of the splicing machinery genes, followed by a second step where the altered splicing machinery affects splicing of downstream genes involved in the adaptation to altered temperatures. PMID:28257507
Genetic therapies for RNA mis-splicing diseases.
Hammond, Suzan M; Wood, Matthew J A
2011-05-01
RNA mis-splicing diseases account for up to 15% of all inherited diseases, ranging from neurological to myogenic and metabolic disorders. With greatly increased genomic sequencing being performed for individual patients, the number of known mutations affecting splicing has risen to 50-60% of all disease-causing mutations. During the past 10years, genetic therapy directed toward correction of RNA mis-splicing in disease has progressed from theoretical work in cultured cells to promising clinical trials. In this review, we discuss the use of antisense oligonucleotides to modify splicing as well as the principles and latest work in bifunctional RNA, trans-splicing and modification of U1 and U7 snRNA to target splice sites. The success of clinical trials for modifying splicing to treat Duchenne muscular dystrophy opens the door for the use of splicing modification for most of the mis-splicing diseases. Copyright © 2011 Elsevier Ltd. All rights reserved.
Li, Wencheng; You, Bei; Hoque, Mainul; Zheng, Dinghai; Luo, Wenting; Ji, Zhe; Park, Ji Yeon; Gunderson, Samuel I.; Kalsotra, Auinash; Manley, James L.; Tian, Bin
2015-01-01
Alternative cleavage and polyadenylation (APA) results in mRNA isoforms containing different 3’ untranslated regions (3’UTRs) and/or coding sequences. How core cleavage/polyadenylation (C/P) factors regulate APA is not well understood. Using siRNA knockdown coupled with deep sequencing, we found that several C/P factors can play significant roles in 3’UTR-APA. Whereas Pcf11 and Fip1 enhance usage of proximal poly(A) sites (pAs), CFI-25/68, PABPN1 and PABPC1 promote usage of distal pAs. Strong cis element biases were found for pAs regulated by CFI-25/68 or Fip1, and the distance between pAs plays an important role in APA regulation. In addition, intronic pAs are substantially regulated by splicing factors, with U1 mostly inhibiting C/P events in introns near the 5’ end of gene and U2 suppressing those in introns with features for efficient splicing. Furthermore, PABPN1 inhibits expression of transcripts with pAs near the transcription start site (TSS), a property possibly related to its role in RNA degradation. Finally, we found that groups of APA events regulated by C/P factors are also modulated in cell differentiation and development with distinct trends. Together, our results support an APA code where an APA event in a given cellular context is regulated by a number of parameters, including relative location to the TSS, splicing context, distance between competing pAs, surrounding cis elements and concentrations of core C/P factors. PMID:25906188
Förch, Patrik; Merendino, Livia; Martínez, Concepción; Valcárcel, Juan
2003-01-01
The splicing factor U2AF(65), U2 small nuclear ribonucleoprotein particle (snRNP) auxillary factor of 65 kDa, binds to pyrimidine-rich sequences at 3' splice sites to recruit U2 snRNP to pre-mRNAs. We report that U2AF(65) can also promote the recruitment of U1 snRNP to weak 5' splice sites that are followed by uridine-rich sequences. The arginine- and serine-rich domain of U2AF(65) is critical for U1 recruitment, and we discuss the role of its RNA-RNA annealing activity in this novel function of U2AF(65). PMID:12558503
López-Urrutia, Eduardo; Valdés, Jesús; Bonilla-Moreno, Raúl; Martínez-Salazar, Martha; Martínez-Garcia, Martha; Berumen, Jaime; Villegas-Sepúlveda, Nicolás
2012-06-01
The HPV-16 E6/E7 genes, which contain intron 1, are processed by alternative splicing and its transcripts are detected with a heterogeneous profile in tumours cells. Frequently, the HPV-16 positive carcinoma cells bear viral variants that contain single nucleotide polymorphisms into its DNA sequence. We were interested in analysing the contribution of this polymorphism to the heterogeneity in the pattern of the E6/E7 spliced transcripts. Using the E6/E7 sequences from three closely related HPV-16 variants, we have shown that a few nucleotide changes are sufficient to produce heterogeneity in the splicing profile. Furthermore, using mutants that contained a single SNP, we also showed that one nucleotide change was sufficient to reproduce the heterogeneous splicing profile. Additionally, a difference of two or three SNPs among these viral sequences was sufficient to recruit differentially several splicing factors to the polymorphic E6/E7 transcripts. Moreover, only one SNP was sufficient to alter the binding site of at least one splicing factor, changing the ability of splicing factors to bind the transcript. Finally, the factors that were differentially bound to the short form of intron 1 of one of these E6/E7 variants were identified as TIA1 and/or TIAR and U1-70k, while U2AF65, U5-52k and PTB were preferentially bound to the transcript of the other variants. Copyright © 2012 Elsevier B.V. All rights reserved.
Detection of alternative splice variants at the proteome level in Aspergillus flavus.
Chang, Kung-Yen; Georgianna, D Ryan; Heber, Steffen; Payne, Gary A; Muddiman, David C
2010-03-05
Identification of proteins from proteolytic peptides or intact proteins plays an essential role in proteomics. Researchers use search engines to match the acquired peptide sequences to the target proteins. However, search engines depend on protein databases to provide candidates for consideration. Alternative splicing (AS), the mechanism where the exon of pre-mRNAs can be spliced and rearranged to generate distinct mRNA and therefore protein variants, enable higher eukaryotic organisms, with only a limited number of genes, to have the requisite complexity and diversity at the proteome level. Multiple alternative isoforms from one gene often share common segments of sequences. However, many protein databases only include a limited number of isoforms to keep minimal redundancy. As a result, the database search might not identify a target protein even with high quality tandem MS data and accurate intact precursor ion mass. We computationally predicted an exhaustive list of putative isoforms of Aspergillus flavus proteins from 20 371 expressed sequence tags to investigate whether an alternative splicing protein database can assign a greater proportion of mass spectrometry data. The newly constructed AS database provided 9807 new alternatively spliced variants in addition to 12 832 previously annotated proteins. The searches of the existing tandem MS spectra data set using the AS database identified 29 new proteins encoded by 26 genes. Nine fungal genes appeared to have multiple protein isoforms. In addition to the discovery of splice variants, AS database also showed potential to improve genome annotation. In summary, the introduction of an alternative splicing database helps identify more proteins and unveils more information about a proteome.
MutPred Splice: machine learning-based prediction of exonic variants that disrupt splicing
2014-01-01
We have developed a novel machine-learning approach, MutPred Splice, for the identification of coding region substitutions that disrupt pre-mRNA splicing. Applying MutPred Splice to human disease-causing exonic mutations suggests that 16% of mutations causing inherited disease and 10 to 14% of somatic mutations in cancer may disrupt pre-mRNA splicing. For inherited disease, the main mechanism responsible for the splicing defect is splice site loss, whereas for cancer the predominant mechanism of splicing disruption is predicted to be exon skipping via loss of exonic splicing enhancers or gain of exonic splicing silencer elements. MutPred Splice is available at http://mutdb.org/mutpredsplice. PMID:24451234
X-linked Alport syndrome caused by splicing mutations in COL4A5.
Nozu, Kandai; Vorechovsky, Igor; Kaito, Hiroshi; Fu, Xue Jun; Nakanishi, Koichi; Hashimura, Yuya; Hashimoto, Fusako; Kamei, Koichi; Ito, Shuichi; Kaku, Yoshitsugu; Imasawa, Toshiyuki; Ushijima, Katsumi; Shimizu, Junya; Makita, Yoshio; Konomoto, Takao; Yoshikawa, Norishige; Iijima, Kazumoto
2014-11-07
X-linked Alport syndrome is caused by mutations in the COL4A5 gene. Although many COL4A5 mutations have been detected, the mutation detection rate has been unsatisfactory. Some men with X-linked Alport syndrome show a relatively mild phenotype, but molecular basis investigations have rarely been conducted to clarify the underlying mechanism. In total, 152 patients with X-linked Alport syndrome who were suspected of having Alport syndrome through clinical and pathologic investigations and referred to the hospital for mutational analysis between January of 2006 and January of 2013 were genetically diagnosed. Among those patients, 22 patients had suspected splice site mutations. Transcripts are routinely examined when suspected splice site mutations for abnormal transcripts are detected; 11 of them showed expected exon skipping, but others showed aberrant splicing patterns. The mutation detection strategy had two steps: (1) genomic DNA analysis using PCR and direct sequencing and (2) mRNA analysis using RT-PCR to detect RNA processing abnormalities. Six splicing consensus site mutations resulting in aberrant splicing patterns, one exonic mutation leading to exon skipping, and four deep intronic mutations producing cryptic splice site activation were identified. Interestingly, one case produced a cryptic splice site with a single nucleotide substitution in the deep intron that led to intronic exonization containing a stop codon; however, the patient showed a clearly milder phenotype for X-linked Alport syndrome in men with a truncating mutation. mRNA extracted from the kidney showed both normal and abnormal transcripts, with the normal transcript resulting in the milder phenotype. This novel mechanism leads to mild clinical characteristics. This report highlights the importance of analyzing transcripts to enhance the mutation detection rate and provides insight into genotype-phenotype correlations. This approach can clarify the cause of atypically mild phenotypes in X-linked Alport syndrome. Copyright © 2014 by the American Society of Nephrology.
Tanaka, Arisa; Aoki, Fugaku; Suzuki, Masataka G
2018-05-26
The transformer (tra) gene, which is a female-determining master gene in the housefly Musca domestica, acts as a memory device for sex determination via its auto-regulatory function, i.e., through the contribution of the TRA protein to female-specific splicing of its own pre-mRNA. The TRA protein contains 4 small domains that are specifically conserved among TRA proteins (domains 1-4). Domain 2, also named TRA-CAM domain, is the most conserved, but its function remains unknown. To examine whether these domains are involved in the auto-regulatory function, we performed in vitro splicing assays using a tra minigene containing a partial genomic sequence of the M. domestica tra (Mdtra) gene. Co-transfection of the Mdtra minigene and an MdTRA protein expression vector into cultured insect cells strongly induced female-specific splicing of the minigene. A series of deletion mutation analyses demonstrated that these domains act complementarily to induce female-specific splicing. Domain 1 and the TRA-CAM domain were necessary for the female-specific splicing when the MdTRA protein lacked both domains 3 and 4. In this situation, mutation of the well-conserved 3 amino acids (GEG) in the TRA-CAM domain significantly reduced the female-specific splicing activity of MdTRA. GST-pull down analyses demonstrated that the MdTRA protein specifically enriched on the male-specific exonic region (exon 2b), which contains the putative TRA/TRA-2 binding sites, and that the GEG mutation disrupts this enrichment. Since the MdTRA protein interacts with its own pre-mRNA through TRA-2, our findings suggest that the conserved amino acid residues in the TRA-CAM domain may be crucial for the interaction between MdTRA and TRA-2, enhancing MdTRA recruitment on its pre-mRNA to induce female-specific splicing of tra in the housefly. © 2018 S. Karger AG, Basel.
A novel protein factor is required for use of distal alternative 5' splice sites in vitro.
Harper, J E; Manley, J L
1991-01-01
Adenovirus E1A pre-mRNA was used as a model to examine alternative 5' splice site selection during in vitro splicing reactions. Strong preference for the downstream 13S 5' splice site over the upstream 12S or 9S 5' splice sites was observed. However, the 12S 5' splice site was used efficiently when a mutant pre-mRNA lacking the 13S 5' splice site was processed, and 12S splicing from this substrate was not reduced by 13S splicing from a separate pre-mRNA, demonstrating that 13S splicing reduced 12S 5' splice site selection through a bona fide cis-competition. DEAE-cellulose chromatography of nuclear extract yielded two fractions with different splicing activities. The bound fraction contained all components required for efficient splicing of simple substrates but was unable to utilize alternative 5' splice sites. In contrast, the flow-through fraction, which by itself was inactive, contained an activity required for alternative splicing and was shown to stimulate 12S and 9S splicing, while reducing 13S splicing, when added to reactions carried out by the bound fraction. Furthermore, the activity, which we have called distal splicing factor (DSF), enhanced utilization of an upstream 5' splice site on a simian virus 40 early pre-mRNA, suggesting that the factor acts in a position-dependent, substrate-independent fashion. Several lines of evidence are presented suggesting that DSF is a non-small nuclear ribonucleoprotein protein. Finally, we describe a functional interaction between DSF and ASF, a protein that enhances use of downstream 5' splice sites. Images PMID:1658620
Spliced synthetic genes as internal controls in RNA sequencing experiments.
Hardwick, Simon A; Chen, Wendy Y; Wong, Ted; Deveson, Ira W; Blackburn, James; Andersen, Stacey B; Nielsen, Lars K; Mattick, John S; Mercer, Tim R
2016-09-01
RNA sequencing (RNA-seq) can be used to assemble spliced isoforms, quantify expressed genes and provide a global profile of the transcriptome. However, the size and diversity of the transcriptome, the wide dynamic range in gene expression and inherent technical biases confound RNA-seq analysis. We have developed a set of spike-in RNA standards, termed 'sequins' (sequencing spike-ins), that represent full-length spliced mRNA isoforms. Sequins have an entirely artificial sequence with no homology to natural reference genomes, but they align to gene loci encoded on an artificial in silico chromosome. The combination of multiple sequins across a range of concentrations emulates alternative splicing and differential gene expression, and it provides scaling factors for normalization between samples. We demonstrate the use of sequins in RNA-seq experiments to measure sample-specific biases and determine the limits of reliable transcript assembly and quantification in accompanying human RNA samples. In addition, we have designed a complementary set of sequins that represent fusion genes arising from rearrangements of the in silico chromosome to aid in cancer diagnosis. RNA sequins provide a qualitative and quantitative reference with which to navigate the complexity of the human transcriptome.
Pan, Ling; Pasternak, David A; Xu, Jin; Xu, Mingming; Lu, Zhigang; Pasternak, Gavril W; Pan, Ying-Xian
2017-01-01
The sigma1 receptor acts as a chaperone at the endoplasmic reticulum, associates with multiple proteins in various cellular systems, and involves in a number of diseases, such as addiction, pain, cancer and psychiatric disorders. The sigma1 receptor is encoded by the single copy SIGMAR1 gene. The current study identifies five alternatively spliced variants of the mouse sigma1 receptor gene using a polymerase chain reaction cloning approach. All the splice variants are generated by exon skipping or alternative 3' or 5' splicing, producing the truncated sigma1 receptor. Similar alternative splicing has been observed in the human SIGMAR1 gene based on the molecular cloning or genome sequence prediction, suggesting conservation of alternative splicing of SIGMAR1 gene. Using quantitative polymerase chain reactions, we demonstrate differential expression of several splice variants in mouse tissues and brain regions. When expressed in HEK293 cells, all the splice variants fail to bind sigma ligands, implicating that each truncated region in these splice variants is important for ligand binding. However, co-immunoprecipitation (Co-IP) study in HEK293 cells co-transfected with tagged constructs reveals that all the splice variants maintain their ability to physically associate with a mu opioid receptor (mMOR-1), providing useful information to correlate the motifs/sequences necessary for their physical association. Furthermore, a competition Co-IP study showed that all the variants can disrupt in a dose-dependent manner the dimerization of the original sigma1 receptor with mMOR-1, suggesting a potential dominant negative function and providing significant insights into their function.
PASTA: splice junction identification from RNA-Sequencing data
2013-01-01
Background Next generation transcriptome sequencing (RNA-Seq) is emerging as a powerful experimental tool for the study of alternative splicing and its regulation, but requires ad-hoc analysis methods and tools. PASTA (Patterned Alignments for Splicing and Transcriptome Analysis) is a splice junction detection algorithm specifically designed for RNA-Seq data, relying on a highly accurate alignment strategy and on a combination of heuristic and statistical methods to identify exon-intron junctions with high accuracy. Results Comparisons against TopHat and other splice junction prediction software on real and simulated datasets show that PASTA exhibits high specificity and sensitivity, especially at lower coverage levels. Moreover, PASTA is highly configurable and flexible, and can therefore be applied in a wide range of analysis scenarios: it is able to handle both single-end and paired-end reads, it does not rely on the presence of canonical splicing signals, and it uses organism-specific regression models to accurately identify junctions. Conclusions PASTA is a highly efficient and sensitive tool to identify splicing junctions from RNA-Seq data. Compared to similar programs, it has the ability to identify a higher number of real splicing junctions, and provides highly annotated output files containing detailed information about their location and characteristics. Accurate junction data in turn facilitates the reconstruction of the splicing isoforms and the analysis of their expression levels, which will be performed by the remaining modules of the PASTA pipeline, still under development. Use of PASTA can therefore enable the large-scale investigation of transcription and alternative splicing. PMID:23557086
DOE Office of Scientific and Technical Information (OSTI.GOV)
Solovyev, V.V.; Salamov, A.A.; Lawrence, C.B.
1994-12-31
Discriminant analysis is applied to the problem of recognition 5`-, internal and 3`-exons in human DNA sequences. Specific recognition functions were developed for revealing exons of particular types. The method based on a splice site prediction algorithm that uses the linear Fisher discriminant to combine the information about significant triplet frequencies of various functional parts of splice site regions and preferences of oligonucleotide in protein coding and nation regions. The accuracy of our splice site recognition function is about 97%. A discriminant function for 5`-exon prediction includes hexanucleotide composition of upstream region, triplet composition around the ATG codon, ORF codingmore » potential, donor splice site potential and composition of downstream introit region. For internal exon prediction, we combine in a discriminant function the characteristics describing the 5`- intron region, donor splice site, coding region, acceptor splice site and Y-intron region for each open reading frame flanked by GT and AG base pairs. The accuracy of precise internal exon recognition on a test set of 451 exon and 246693 pseudoexon sequences is 77% with a specificity of 79% and a level of pseudoexon ORF prediction of 99.96%. The recognition quality computed at the level of individual nucleotides is 89%, for exon sequences and 98% for intron sequences. A discriminant function for 3`-exon prediction includes octanucleolide composition of upstream nation region, triplet composition around the stop codon, ORF coding potential, acceptor splice site potential and hexanucleotide composition of downstream region. We unite these three discriminant functions in exon predicting program FEX (find exons). FEX exactly predicts 70% of 1016 exons from the test of 181 complete genes with specificity 73%, and 89% exons are exactly or partially predicted. On the average, 85% of nucleotides were predicted accurately with specificity 91%.« less
Regulation of alternative mRNA splicing: old players and new perspectives.
Dvinge, Heidi
2018-06-01
Nearly all human multi-exon genes are subject to alternative splicing in one or more cell types. The splicing machinery, therefore, has to select between multiple splice sites in a context-dependent manner, relying on sequence features in cis and trans-acting splicing regulators that either promote or repress splice site recognition and spliceosome assembly. However, the functional coupling between multiple gene regulatory layers signifies that splicing can also be modulated by transcriptional or epigenetic characteristics. Other, less obvious, aspects of alternative splicing have come to light in recent years, often involving core components of the spliceosome previously thought to perform a basal rather than a regulatory role in splicing. Together this paints a highly dynamic picture of splicing regulation, where the final splice site choice is governed by the entire transcriptional environment of a gene and its cellular context. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
NMR studies of two spliced leader RNAs using isotope labeling
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lapham, J.; Crothers, D.M.
1994-12-01
Spliced leader RNAs are a class of RNA molecules (<200 nts) involved in the trans splicing of messenger RNA found in trypanosomes, nematodes, and other lower eukaryotes. The spliced leader RNA from the trypanosome Leptomonas Collosoma exists in two alternate structural forms with similar thermal stabilities. The 54 nucleotides on the 5{prime} end of the SL molecule is structurally independent from the 3{prime} half of the RNA, and displays the two structural forms. Furthermore, the favored of the two structures was shown to contain anomalous nuclease sensitivity and thermal stability features, which suggests that there may be tertiary interactions betweenmore » the splice site and other nucleotides in the 5{prime} end. Multidimensional NMR studies are underway to elucidate the structural elements present in the SL RNAs that give rise to their physical properties. Two spliced leader sequences have been studied. The first, the 54 nucleotides on the 5{prime} end of the L. Collosoma sequence, was selected because of earlier studies in our laboratory. The second sequence is the 5{prime} end of the trypanosome Crithidia Fasciculata, which was chosen because of its greater sequence homology to other SL sequences. Given the complexity of the NMR spectra for RNA molecules of this size, we have incorporated {sup 15}N/{sup 13}C-labeled nucleotides into the RNA. One of the techniques we have developed to simplify the spectra of these RNA molecules is isotope labeling of specific regions of the RNA. This has been especially helpful in assigning the secondary structure of molecules that may be able to adopt multiple conformations. Using this technique one can examine a part of the molecule without spectral interference from the unlabeled portion. We hope this approach will promote an avenue for studying the structure of larger RNAs in their native surroundings.« less
The Human Splicing Factor ASF/SF2 can Specifically Recognize Pre-mRNA 5' Splice Sites
NASA Astrophysics Data System (ADS)
Zuo, Ping; Manley, James L.
1994-04-01
ASF/SF2 is a human protein previously shown to function in in vitro pre-mRNA splicing as an essential factor necessary for all splices and also as an alternative splicing factor, capable of switching selection of 5' splice sites. To begin to study the protein's mechanism of action, we have investigated the RNA binding properties of purified recombinant ASF/SF2. Using UV crosslinking and gel shift assays, we demonstrate that the RNA binding region of ASF/SF2 can interact with RNA in a sequence-specific manner, recognizing the 5' splice site in each of two different pre-mRNAs. Point mutations in the 5' splice site consensus can reduce binding by as much as a factor of 100, with the largest effects observed in competition assays. These findings support a model in which ASF/SF2 aids in the recognition of pre-mRNA 5' splice sites.
In silico prediction of splice-altering single nucleotide variants in the human genome.
Jian, Xueqiu; Boerwinkle, Eric; Liu, Xiaoming
2014-12-16
In silico tools have been developed to predict variants that may have an impact on pre-mRNA splicing. The major limitation of the application of these tools to basic research and clinical practice is the difficulty in interpreting the output. Most tools only predict potential splice sites given a DNA sequence without measuring splicing signal changes caused by a variant. Another limitation is the lack of large-scale evaluation studies of these tools. We compared eight in silico tools on 2959 single nucleotide variants within splicing consensus regions (scSNVs) using receiver operating characteristic analysis. The Position Weight Matrix model and MaxEntScan outperformed other methods. Two ensemble learning methods, adaptive boosting and random forests, were used to construct models that take advantage of individual methods. Both models further improved prediction, with outputs of directly interpretable prediction scores. We applied our ensemble scores to scSNVs from the Catalogue of Somatic Mutations in Cancer database. Analysis showed that predicted splice-altering scSNVs are enriched in recurrent scSNVs and known cancer genes. We pre-computed our ensemble scores for all potential scSNVs across the human genome, providing a whole genome level resource for identifying splice-altering scSNVs discovered from large-scale sequencing studies.
Another heritage from the RNA world: self-excision of intron sequence from nuclear pre-tRNAs.
Weber, U; Beier, H; Gross, H J
1996-06-15
The intervening sequences of nuclear tRNA precursors are known to be excised by tRNA splicing endonuclease. We show here that a T7 transcript corresponding to a pre-tRNA(Tyr) from Arabidopsis thaliana has a highly specific activity for autolytic intron excision. Self-cleavage occurs precisely at the authentic 3'-splice site and at the phosphodiester bond one nucleotide downstream of the authentic 5'-splice site. The reaction results in fragments with 2',3'-cyclic phosphate and 5'-OH termini. It is resistant to proteinase K and/or SDS treatment and is not inhibited by added tRNA. The self-cleavage depends on Mg2+ and is stimulated by spermine and Triton X-100. A set of sequence variants at the cleavage sites has been analysed for autolytic intron excision and, in parallel, for enzymatic in vitro splicing in wheat germ S23 extract. Single-stranded loops are a prerequisite for both reactions. Self-cleavage not only occurs at pyrimidine-A but also at U-U bonds. Since intron self-excision is only about five times slower than the enzymatic intron excision in a wheat germ S23 extract, we propose that the splicing endonuclease may function by improving the preciseness and efficiency of an inherent pre-tRNA self-cleavage activity.
Phylogenetic Analysis of Nuclear-Encoded RNA Maturases
Malik, Sunita; Upadhyaya, KC; Khurana, SM Paul
2017-01-01
Posttranscriptional processes, such as splicing, play a crucial role in gene expression and are prevalent not only in nuclear genes but also in plant mitochondria where splicing of group II introns is catalyzed by a class of proteins termed maturases. In plant mitochondria, there are 22 mitochondrial group II introns. matR, nMAT1, nMAT2, nMAT3, and nMAT4 proteins have been shown to be required for efficient splicing of several group II introns in Arabidopsis thaliana. Nuclear maturases (nMATs) are necessary for splicing of mitochondrial genes, leading to normal oxidative phosphorylation. Sequence analysis through phylogenetic tree (including bootstrapping) revealed high homology with maturase sequences of A thaliana and other plants. This study shows the phylogenetic relationship of nMAT proteins between A thaliana and other nonredundant plant species taken from BLASTP analysis. PMID:28607538
Fernandez-Valverde, Selene L; Calcino, Andrew D; Degnan, Bernard M
2015-05-15
The demosponge Amphimedon queenslandica is amongst the few early-branching metazoans with an assembled and annotated draft genome, making it an important species in the study of the origin and early evolution of animals. Current gene models in this species are largely based on in silico predictions and low coverage expressed sequence tag (EST) evidence. Amphimedon queenslandica protein-coding gene models are improved using deep RNA-Seq data from four developmental stages and CEL-Seq data from 82 developmental samples. Over 86% of previously predicted genes are retained in the new gene models, although 24% have additional exons; there is also a marked increase in the total number of annotated 3' and 5' untranslated regions (UTRs). Importantly, these new developmental transcriptome data reveal numerous previously unannotated protein-coding genes in the Amphimedon genome, increasing the total gene number by 25%, from 30,060 to 40,122. In general, Amphimedon genes have introns that are markedly smaller than those in other animals and most of the alternatively spliced genes in Amphimedon undergo intron-retention; exon-skipping is the least common mode of alternative splicing. Finally, in addition to canonical polyadenylation signal sequences, Amphimedon genes are enriched in a number of unique AT-rich motifs in their 3' UTRs. The inclusion of developmental transcriptome data has substantially improved the structure and composition of protein-coding gene models in Amphimedon queenslandica, providing a more accurate and comprehensive set of genes for functional and comparative studies. These improvements reveal the Amphimedon genome is comprised of a remarkably high number of tightly packed genes. These genes have small introns and there is pervasive intron retention amongst alternatively spliced transcripts. These aspects of the sponge genome are more similar unicellular opisthokont genomes than to other animal genomes.
A study of alternative splicing in the pig
2010-01-01
Background Since at least half of the genes in mammalian genomes are subjected to alternative splicing, alternative pre-mRNA splicing plays an important contribution to the complexity of the mammalian proteome. Expressed sequence tags (ESTs) provide evidence of a great number of possible alternative isoforms. With the EST resource for the domestic pig now containing more than one million porcine ESTs, it is possible to identify alternative splice forms of the individual transcripts in this species from the EST data with some confidence. Results The pig EST data generated by the Sino-Danish Pig Genome project has been assembled with publicly available ESTs and made available in the PigEST database. Using the Distiller package 2,515 EST clusters with candidate alternative isoforms were identified in the EST data with high confidence. In agreement with general observations in human and mouse, we find putative splice variants in about 30% of the contigs with more than 50 ESTs. Based on the criteria that a minimum of two EST sequences confirmed each splice event, a list of 100 genes with the most distinct tissue-specific alternative splice events was generated from the list of candidates. To confirm the tissue specificity of the splice events, 10 genes with functional annotation were randomly selected from which 16 individual splice events were chosen for experimental verification by quantitative PCR (qPCR). Six genes were shown to have tissue specific alternatively spliced transcripts with expression patterns matching those of the EST data. The remaining four genes had tissue-restricted expression of alternative spliced transcripts. Five out of the 16 splice events that were experimentally verified were found to be putative pig specific. Conclusions In accordance with human and rodent studies we estimate that approximately 30% of the porcine genes undergo alternative splicing. We found a good correlation between EST predicted tissue-specificity and experimentally validated splice events in different porcine tissue. This study indicates that a cluster size of around 50 ESTs is optimal for in silico detection of alternative splicing. Although based on a limited number of splice events, the study supports the notion that alternative splicing could have an important impact on species differentiation since 31% of the splice events studied appears to be species specific. PMID:20444244
Dai, Gucan; Sherpa, Tshering; Varnum, Michael D.
2014-01-01
Precursor mRNA encoding CNGA3 subunits of cone photoreceptor cyclic nucleotide-gated (CNG) channels undergoes alternative splicing, generating isoforms differing in the N-terminal cytoplasmic region of the protein. In humans, four variants arise from alternative splicing, but the functional significance of these changes has been a persistent mystery. Heterologous expression of the four possible CNGA3 isoforms alone or with CNGB3 subunits did not reveal significant differences in basic channel properties. However, inclusion of optional exon 3, with or without optional exon 5, produced heteromeric CNGA3 + CNGB3 channels exhibiting an ∼2-fold greater shift in K1/2,cGMP after phosphatidylinositol 4,5-biphosphate or phosphatidylinositol 3,4,5-trisphosphate application compared with channels lacking the sequence encoded by exon 3. We have previously identified two structural features within CNGA3 that support phosphoinositides (PIPn) regulation of cone CNG channels: N- and C-terminal regulatory modules. Specific mutations within these regions eliminated PIPn sensitivity of CNGA3 + CNGB3 channels. The exon 3 variant enhanced the component of PIPn regulation that depends on the C-terminal region rather than the nearby N-terminal region, consistent with an allosteric effect on PIPn sensitivity because of altered N-C coupling. Alternative splicing of CNGA3 occurs in multiple species, although the exact variants are not conserved across CNGA3 orthologs. Optional exon 3 appears to be unique to humans, even compared with other primates. In parallel, we found that a specific splice variant of canine CNGA3 removes a region of the protein that is necessary for high sensitivity to PIPn. CNGA3 alternative splicing may have evolved, in part, to tune the interactions between cone CNG channels and membrane-bound phosphoinositides. PMID:24675082
Dai, Gucan; Sherpa, Tshering; Varnum, Michael D
2014-05-09
Precursor mRNA encoding CNGA3 subunits of cone photoreceptor cyclic nucleotide-gated (CNG) channels undergoes alternative splicing, generating isoforms differing in the N-terminal cytoplasmic region of the protein. In humans, four variants arise from alternative splicing, but the functional significance of these changes has been a persistent mystery. Heterologous expression of the four possible CNGA3 isoforms alone or with CNGB3 subunits did not reveal significant differences in basic channel properties. However, inclusion of optional exon 3, with or without optional exon 5, produced heteromeric CNGA3 + CNGB3 channels exhibiting an ∼2-fold greater shift in K1/2,cGMP after phosphatidylinositol 4,5-biphosphate or phosphatidylinositol 3,4,5-trisphosphate application compared with channels lacking the sequence encoded by exon 3. We have previously identified two structural features within CNGA3 that support phosphoinositides (PIPn) regulation of cone CNG channels: N- and C-terminal regulatory modules. Specific mutations within these regions eliminated PIPn sensitivity of CNGA3 + CNGB3 channels. The exon 3 variant enhanced the component of PIPn regulation that depends on the C-terminal region rather than the nearby N-terminal region, consistent with an allosteric effect on PIPn sensitivity because of altered N-C coupling. Alternative splicing of CNGA3 occurs in multiple species, although the exact variants are not conserved across CNGA3 orthologs. Optional exon 3 appears to be unique to humans, even compared with other primates. In parallel, we found that a specific splice variant of canine CNGA3 removes a region of the protein that is necessary for high sensitivity to PIPn. CNGA3 alternative splicing may have evolved, in part, to tune the interactions between cone CNG channels and membrane-bound phosphoinositides.
Spliced integrated retrotransposed element (SpIRE) formation in the human genome.
Larson, Peter A; Moldovan, John B; Jasti, Naveen; Kidd, Jeffrey M; Beck, Christine R; Moran, John V
2018-03-01
Human Long interspersed element-1 (L1) retrotransposons contain an internal RNA polymerase II promoter within their 5' untranslated region (UTR) and encode two proteins, (ORF1p and ORF2p) required for their mobilization (i.e., retrotransposition). The evolutionary success of L1 relies on the continuous retrotransposition of full-length L1 mRNAs. Previous studies identified functional splice donor (SD), splice acceptor (SA), and polyadenylation sequences in L1 mRNA and provided evidence that a small number of spliced L1 mRNAs retrotransposed in the human genome. Here, we demonstrate that the retrotransposition of intra-5'UTR or 5'UTR/ORF1 spliced L1 mRNAs leads to the generation of spliced integrated retrotransposed elements (SpIREs). We identified a new intra-5'UTR SpIRE that is ten times more abundant than previously identified SpIREs. Functional analyses demonstrated that both intra-5'UTR and 5'UTR/ORF1 SpIREs lack Cis-acting transcription factor binding sites and exhibit reduced promoter activity. The 5'UTR/ORF1 SpIREs also produce nonfunctional ORF1p variants. Finally, we demonstrate that sequence changes within the L1 5'UTR over evolutionary time, which permitted L1 to evade the repressive effects of a host protein, can lead to the generation of new L1 splicing events, which, upon retrotransposition, generates a new SpIRE subfamily. We conclude that splicing inhibits L1 retrotransposition, SpIREs generally represent evolutionary "dead-ends" in the L1 retrotransposition process, mutations within the L1 5'UTR alter L1 splicing dynamics, and that retrotransposition of the resultant spliced transcripts can generate interindividual genomic variation.
Spliced integrated retrotransposed element (SpIRE) formation in the human genome
Larson, Peter A.; Moldovan, John B.; Jasti, Naveen; Kidd, Jeffrey M.; Beck, Christine R.; Moran, John V.
2018-01-01
Human Long interspersed element-1 (L1) retrotransposons contain an internal RNA polymerase II promoter within their 5′ untranslated region (UTR) and encode two proteins, (ORF1p and ORF2p) required for their mobilization (i.e., retrotransposition). The evolutionary success of L1 relies on the continuous retrotransposition of full-length L1 mRNAs. Previous studies identified functional splice donor (SD), splice acceptor (SA), and polyadenylation sequences in L1 mRNA and provided evidence that a small number of spliced L1 mRNAs retrotransposed in the human genome. Here, we demonstrate that the retrotransposition of intra-5′UTR or 5′UTR/ORF1 spliced L1 mRNAs leads to the generation of spliced integrated retrotransposed elements (SpIREs). We identified a new intra-5′UTR SpIRE that is ten times more abundant than previously identified SpIREs. Functional analyses demonstrated that both intra-5′UTR and 5′UTR/ORF1 SpIREs lack Cis-acting transcription factor binding sites and exhibit reduced promoter activity. The 5′UTR/ORF1 SpIREs also produce nonfunctional ORF1p variants. Finally, we demonstrate that sequence changes within the L1 5′UTR over evolutionary time, which permitted L1 to evade the repressive effects of a host protein, can lead to the generation of new L1 splicing events, which, upon retrotransposition, generates a new SpIRE subfamily. We conclude that splicing inhibits L1 retrotransposition, SpIREs generally represent evolutionary “dead-ends” in the L1 retrotransposition process, mutations within the L1 5′UTR alter L1 splicing dynamics, and that retrotransposition of the resultant spliced transcripts can generate interindividual genomic variation. PMID:29505568
Pan, Ling; Pasternak, David A.; Xu, Jin; Xu, Mingming; Lu, Zhigang; Pasternak, Gavril W.
2017-01-01
The sigma1 receptor acts as a chaperone at the endoplasmic reticulum, associates with multiple proteins in various cellular systems, and involves in a number of diseases, such as addiction, pain, cancer and psychiatric disorders. The sigma1 receptor is encoded by the single copy SIGMAR1 gene. The current study identifies five alternatively spliced variants of the mouse sigma1 receptor gene using a polymerase chain reaction cloning approach. All the splice variants are generated by exon skipping or alternative 3’ or 5’ splicing, producing the truncated sigma1 receptor. Similar alternative splicing has been observed in the human SIGMAR1 gene based on the molecular cloning or genome sequence prediction, suggesting conservation of alternative splicing of SIGMAR1 gene. Using quantitative polymerase chain reactions, we demonstrate differential expression of several splice variants in mouse tissues and brain regions. When expressed in HEK293 cells, all the splice variants fail to bind sigma ligands, implicating that each truncated region in these splice variants is important for ligand binding. However, co-immunoprecipitation (Co-IP) study in HEK293 cells co-transfected with tagged constructs reveals that all the splice variants maintain their ability to physically associate with a mu opioid receptor (mMOR-1), providing useful information to correlate the motifs/sequences necessary for their physical association. Furthermore, a competition Co-IP study showed that all the variants can disrupt in a dose-dependent manner the dimerization of the original sigma1 receptor with mMOR-1, suggesting a potential dominant negative function and providing significant insights into their function. PMID:28350844
Sarmiento, José M; Añazco, Carolina C; Campos, Danae M; Prado, Gregory N; Navarro, Javier; González, Carlos B
2004-11-05
In rat kidney, two alternatively spliced transcripts are generated from the V2 vasopressin receptor gene. The large transcript (1.2 kb) encodes the canonical V2 receptor, whereas the small transcript encodes a splice variant displaying a distinct sequence corresponding to the putative seventh transmembrane domain and the intracellular C terminus of the V2 receptor. This work showed that the small spliced transcript is translated in the rat kidney collecting tubules. However, the protein encoded by the small transcript (here called the V2b splice variant) is retained inside the cell, in contrast to the preferential surface distribution of the V2 receptor (here called the V2a receptor). Cells expressing the V2b splice variant do not exhibit binding to 3H-labeled vasopressin. Interestingly, we found that expression of the splice variant V2b down-regulates the surface expression of the V2a receptor, most likely via the formation of V2a.V2b heterodimers as demonstrated by co-immunoprecipitation and fluorescence resonance energy transfer experiments between the V2a receptor and the V2b splice variant. The V2b splice variant would then be acting as a dominant negative. The effect of the V2b splice variant is specific, as it does not affect the surface expression of the G protein-coupled interleukin-8 receptor (CXCR1). Furthermore, the sequence encompassing residues 242-339, corresponding to the C-terminal domain of the V2b splice variant, also down-regulates the surface expression of the V2a receptor. We suggest that some forms of nephrogenic diabetes insipidus are due to overexpression of the splice variant V2b, which could retain the wild-type V2a receptor inside the cell via the formation of V2a.V2b heterodimers.
Novel splice mutation in microthalmia-associated transcription factor in Waardenburg Syndrome.
Brenner, Laura; Burke, Kelly; Leduc, Charles A; Guha, Saurav; Guo, Jiancheng; Chung, Wendy K
2011-01-01
Waardenburg Syndrome (WS) is a syndromic form of hearing loss associated with mutations in six different genes. We identified a large family with WS that had previously undergone clinical testing, with no reported pathogenic mutation. Using linkage analysis, a region on 3p14.1 with an LOD score of 6.6 was identified. Microthalmia-Associated Transcription Factor, a gene known to cause WS, is located within this region of linkage. Sequencing of Microthalmia-Associated Transcription Factor demonstrated a c.1212 G>A synonymous variant that segregated with the WS in the family and was predicted to cause a novel splicing site that was confirmed with expression analysis of the mRNA. This case illustrates the need to computationally analyze novel synonymous sequence variants for possible effects on splicing to maximize the clinical sensitivity of sequence-based genetic testing.
Mutation analysis of pre-mRNA splicing genes in Chinese families with retinitis pigmentosa
Pan, Xinyuan; Chen, Xue; Liu, Xiaoxing; Gao, Xiang; Kang, Xiaoli; Xu, Qihua; Chen, Xuejuan; Zhao, Kanxing; Zhang, Xiumei; Chu, Qiaomei; Wang, Xiuying
2014-01-01
Purpose Seven genes involved in precursor mRNA (pre-mRNA) splicing have been implicated in autosomal dominant retinitis pigmentosa (adRP). We sought to detect mutations in all seven genes in Chinese families with RP, to characterize the relevant phenotypes, and to evaluate the prevalence of mutations in splicing genes in patients with adRP. Methods Six unrelated families from our adRP cohort (42 families) and two additional families with RP with uncertain inheritance mode were clinically characterized in the present study. Targeted sequence capture with next-generation massively parallel sequencing (NGS) was performed to screen mutations in 189 genes including all seven pre-mRNA splicing genes associated with adRP. Variants detected with NGS were filtered with bioinformatics analyses, validated with Sanger sequencing, and prioritized with pathogenicity analysis. Results Mutations in pre-mRNA splicing genes were identified in three individual families including one novel frameshift mutation in PRPF31 (p.Leu366fs*1) and two known mutations in SNRNP200 (p.Arg681His and p.Ser1087Leu). The patients carrying SNRNP200 p.R681H showed rapid disease progression, and the family carrying p.S1087L presented earlier onset ages and more severe phenotypes compared to another previously reported family with p.S1087L. In five other families, we identified mutations in other RP-related genes, including RP1 p. Ser781* (novel), RP2 p.Gln65* (novel) and p.Ile137del (novel), IMPDH1 p.Asp311Asn (recurrent), and RHO p.Pro347Leu (recurrent). Conclusions Mutations in splicing genes identified in the present and our previous study account for 9.5% in our adRP cohort, indicating the important role of pre-mRNA splicing deficiency in the etiology of adRP. Mutations in the same splicing gene, or even the same mutation, could correlate with different phenotypic severities, complicating the genotype–phenotype correlation and clinical prognosis. PMID:24940031
Schernthaner-Reiter, Marie Helene; Adams, David; Trivellin, Giampaolo; Ramnitz, Mary Scott; Raygada, Margarita; Golas, Gretchen; Faucz, Fabio R; Nilsson, Ola; Nella, Aikaterini A; Dileepan, Kavitha; Lodish, Maya; Lee, Paul; Tifft, Cynthia; Markello, Thomas; Gahl, William; Stratakis, Constantine A
2016-05-01
X-linked nephrogenic diabetes insipidus (NDI, OMIM#304800) is caused by mutations in the arginine vasopressin (AVP, OMIM*192340) receptor type 2 (AVPR2, OMIM*300538) gene. A 20-month-old boy and his 8-year-old brother presented with polyuria, polydipsia, and failure to thrive. Both boys demonstrated partial DDAVP (1-desamino-8-D AVP or desmopressin) responses; thus, NDI diagnosis was delayed. While routine sequencing of AVPR2 showed a potential splice site variant, it was not until exome sequencing confirmed the AVPR2 splice site variant and did not reveal any more likely candidates that the patients' diagnosis was made and proper treatment was instituted. Both patients were hemizygous for two AVPR2 variants predicted in silico to affect AVPR2 messenger RNA (mRNA) splicing. A minigene assay revealed that the novel AVPR2 c.276A>G mutation creates a novel splice acceptor site leading to 5' truncation of AVPR2 exon 2 in HEK293 human kidney cells. Both patients have been treated with high-dose DDAVP with a remarkable improvement of their symptoms and accelerated linear growth and weight gain. We present here a unique case of partial X-linked NDI due to an AVPR2 splice site mutation; patients with diabetes insipidus of unknown etiology may harbor splice site mutations that are initially underestimated in their pathogenicity on sequence analysis. • X-linked nephrogenic diabetes insipidus is caused by AVPR2 mutations, and disease severity can vary depending on the functional effect of the mutation. What is New: • We demonstrate here that a splice site mutation in AVPR2 leads to partial X-linked NDI in two brothers. • Treatment with high-dose DDAVP led to improvement of polyuria and polydipsia, weight gain, and growth.
Postnatal Expression of V2 Vasopressin Receptor Splice Variants in the Rat Cerebellum
Vargas, Karina J.; Sarmiento, José M.; Ehrenfeld, Pamela; Añazco, Carolina C.; Villanueva, Carolina I.; Carmona, Pamela L.; Brenet, Marianne; Navarro, Javier; Müller-Esterl, Werner; Figueroa, Carlos D.; González, Carlos B.
2010-01-01
The V2 vasopressin receptor gene contains an alternative splice site in exon-3, which leads to the generation of two splice variants (V2a and V2b) first identified in the kidney. The open reading frame of the alternatively spliced V2b transcripten codes a truncated receptor, showing the same amino acid sequence as the canonical V2a receptor up to the 6th transmembrane segment, but displaying a distinct sequence to the corresponding 7th transmembrane segment and C-terminal domain relative to the V2a receptor. Here, we demonstrate the postnatal expression of V2a and V2b variants in the rat cerebellum. Most importantly, we showed by in situ hybridization and immunocytochemistry that both V2 splice variants were preferentially expressed in Purkinje cells, from early to late postnatal development. In addition, both variants were transiently expressed in the neuroblastic external granule cells and Bergmann fibers. These results indicate that the cellular distributions of both splice variants are developmentally regulated, and suggest that the transient expression of the V2 receptor is involved in the mechanisms of cerebellar cytodifferentiation by AVP. Finally, transfected CHO-K1 .expressing similar amounts of both V2 splice variants, as that found in the cerebellum, showed a significant reduction in the surface expression of V2a receptors, suggesting that the differential expression of the V2 splice variants regulate the vasopressin signaling in the cerebellum. PMID:19281786
Kapahnke, Marcel; Banning, Antje; Tikkanen, Ritva
2016-12-14
The clustered regularly interspaced short palindromic repeats (CRISPR)-associated sequence 9 (CRISPR/Cas9) system is widely used for genome editing purposes as it facilitates an efficient knockout of a specific gene in, e.g. cultured cells. Targeted double-strand breaks are introduced to the target sequence of the guide RNAs, which activates the cellular DNA repair mechanism for non-homologous-end-joining, resulting in unprecise repair and introduction of small deletions or insertions. Due to this, sequence alterations in the coding region of the target gene frequently cause frame-shift mutations, facilitating degradation of the mRNA. We here show that such CRISPR/Cas9-mediated alterations in the target exon may also result in altered splicing of the respective pre-mRNA, most likely due to mutations of splice-regulatory sequences. Using the human FLOT-1 gene as an example, we demonstrate that such altered splicing products also give rise to aberrant protein products. These may potentially function as dominant-negative proteins and thus interfere with the interpretation of the data generated with these cell lines. Since most researchers only control the consequences of CRISPR knockout at genomic and protein level, our data should encourage to also check the alterations at the mRNA level.
Khan, Shahid Y.; Ali, Shahbaz; Naeem, Muhammad Asif; Khan, Shaheen N.; Husnain, Tayyab; Butt, Nadeem H.; Qazi, Zaheeruddin A.; Akram, Javed; Riazuddin, Sheikh; Ayyagari, Radha; Hejtmancik, J. Fielding
2015-01-01
Purpose This study was conducted to localize and identify causal mutations associated with autosomal recessive retinitis pigmentosa (RP) in consanguineous familial cases of Pakistani origin. Methods Ophthalmic examinations that included funduscopy and electroretinography (ERG) were performed to confirm the affectation status. Blood samples were collected from all participating individuals, and genomic DNA was extracted. A genome-wide scan was performed, and two-point logarithm of odds (LOD) scores were calculated. Sanger sequencing was performed to identify the causative variants. Subsequently, we performed whole exome sequencing to rule out the possibility of a second causal variant within the linkage interval. Sequence conservation was performed with alignment analyses of PDE6A orthologs, and in silico splicing analysis was completed with Human Splicing Finder version 2.4.1. Results A large multigenerational consanguineous family diagnosed with early-onset RP was ascertained. An ophthalmic clinical examination consisting of fundus photography and electroretinography confirmed the diagnosis of RP. A genome-wide scan was performed, and suggestive two-point LOD scores were observed with markers on chromosome 5q. Haplotype analyses identified the region; however, the region did not segregate with the disease phenotype in the family. Subsequently, we performed a second genome-wide scan that excluded the entire genome except the chromosome 5q region harboring PDE6A. Next-generation whole exome sequencing identified a splice acceptor site mutation in intron 16: c.2028–1G>A, which was completely conserved in PDE6A orthologs and was absent in ethnically matched 350 control chromosomes, the 1000 Genomes database, and the NHLBI Exome Sequencing Project. Subsequently, we investigated our entire cohort of RP familial cases and identified a second family who harbored a splice acceptor site mutation in intron 10: c.1408–2A>G. In silico analysis suggested that these mutations will result in the elimination of wild-type splice acceptor sites that would result in either skipping of the respective exon or the creation of a new cryptic splice acceptor site; both possibilities would result in retinal photoreceptor cells that lack PDE6A wild-type protein. Conclusions we report two splice acceptor site variations in PDE6A in consanguineous Pakistani families who manifested cardinal symptoms of RP. Taken together with our previously published work, our data suggest that mutations in PDE6A account for about 2% of the total genetic load of RP in our cohort and possibly in the Pakistani population as well. PMID:26321862
The in vivo use of alternate 3'-splice sites in group I introns.
Sellem, C H; Belcour, L
1994-04-11
Alternative splicing of group I introns has been postulated as a possible mechanism that would ensure the translation of proteins encoded into intronic open reading frames, discontinuous with the upstream exon and lacking an initiation signal. Alternate splice sites were previously depicted according to secondary structures of several group I introns. We present here strong evidence that, in the case of Podospora anserina nad 1-i4 and cox1-i7 mitochondrial introns, alternative splicing events do occur in vivo. Indeed, by PCR experiments we have detected molecules whose sequence is precisely that expected if the predicted alternate 3'-splice sites were used.
Colwill, Karen; Wells, Clark D; Elder, Kelly; Goudreault, Marilyn; Hersi, Kadija; Kulkarni, Sarang; Hardy, W Rod; Pawson, Tony; Morin, Gregg B
2006-03-06
Recombinational systems have been developed to rapidly shuttle Open Reading Frames (ORFs) into multiple expression vectors in order to analyze the large number of cDNAs available in the post-genomic era. In the Creator system, an ORF introduced into a donor vector can be transferred with Cre recombinase to a library of acceptor vectors optimized for different applications. Usability of the Creator system is impacted by the ability to easily manipulate DNA, the number of acceptor vectors for downstream applications, and the level of protein expression from Creator vectors. To date, we have developed over 20 novel acceptor vectors that employ a variety of promoters and epitope tags commonly employed for proteomics applications and gene function analysis. We also made several enhancements to the donor vectors including addition of different multiple cloning sites to allow shuttling from pre-existing vectors and introduction of the lacZ alpha reporter gene to allow for selection. Importantly, in order to ameliorate any effects on protein expression of the loxP site between a 5' tag and ORF, we introduced a splicing event into our expression vectors. The message produced from the resulting 'Creator Splice' vector undergoes splicing in mammalian systems to remove the loxP site. Upon analysis of our Creator Splice constructs, we discovered that protein expression levels were also significantly increased. The development of new donor and acceptor vectors has increased versatility during the cloning process and made this system compatible with a wider variety of downstream applications. The modifications introduced in our Creator Splice system were designed to remove extraneous sequences due to recombination but also aided in downstream analysis by increasing protein expression levels. As a result, we can now employ epitope tags that are detected less efficiently and reduce our assay scale to allow for higher throughput. The Creator Splice system appears to be an extremely useful tool for proteomics.
Colwill, Karen; Wells, Clark D; Elder, Kelly; Goudreault, Marilyn; Hersi, Kadija; Kulkarni, Sarang; Hardy, W Rod; Pawson, Tony; Morin, Gregg B
2006-01-01
Background Recombinational systems have been developed to rapidly shuttle Open Reading Frames (ORFs) into multiple expression vectors in order to analyze the large number of cDNAs available in the post-genomic era. In the Creator system, an ORF introduced into a donor vector can be transferred with Cre recombinase to a library of acceptor vectors optimized for different applications. Usability of the Creator system is impacted by the ability to easily manipulate DNA, the number of acceptor vectors for downstream applications, and the level of protein expression from Creator vectors. Results To date, we have developed over 20 novel acceptor vectors that employ a variety of promoters and epitope tags commonly employed for proteomics applications and gene function analysis. We also made several enhancements to the donor vectors including addition of different multiple cloning sites to allow shuttling from pre-existing vectors and introduction of the lacZ alpha reporter gene to allow for selection. Importantly, in order to ameliorate any effects on protein expression of the loxP site between a 5' tag and ORF, we introduced a splicing event into our expression vectors. The message produced from the resulting 'Creator Splice' vector undergoes splicing in mammalian systems to remove the loxP site. Upon analysis of our Creator Splice constructs, we discovered that protein expression levels were also significantly increased. Conclusion The development of new donor and acceptor vectors has increased versatility during the cloning process and made this system compatible with a wider variety of downstream applications. The modifications introduced in our Creator Splice system were designed to remove extraneous sequences due to recombination but also aided in downstream analysis by increasing protein expression levels. As a result, we can now employ epitope tags that are detected less efficiently and reduce our assay scale to allow for higher throughput. The Creator Splice system appears to be an extremely useful tool for proteomics. PMID:16519801
Bitar, Mainá; Boroni, Mariana; Macedo, Andréa M.; Machado, Carlos R.; Franco, Glória R.
2013-01-01
The spliced leader (SL) is a gene that generates a functional ncRNA that is composed of two regions: an intronic region of unknown function (SLi) and an exonic region (SLe), which is transferred to the 5′ end of independent transcripts yielding mature mRNAs, in a process known as spliced leader trans-splicing (SLTS). The best described function for SLTS is to solve polycistronic transcripts into monocistronic units, specifically in Trypanosomatids. In other metazoans, it is speculated that the SLe addition could lead to increased mRNA stability, differential recruitment of the translational machinery, modification of the 5′ region or a combination of these effects. Although important aspects of this mechanism have been revealed, several features remain to be elucidated. We have analyzed 157 SLe sequences from 148 species from seven phyla and found a high degree of conservation among the sequences of species from the same phylum, although no considerable similarity seems to exist between sequences of species from different phyla. When analyzing case studies, we found evidence that a given SLe will always be related to a given set of transcripts in different species from the same phylum, and therefore, different SLe sequences from the same species would regulate different sets of transcripts. In addition, we have observed distinct transcript categories to be preferential targets for the SLe addition in different phyla. This work sheds light into crucial and controversial aspects of the SLTS mechanism. It represents a comprehensive study concerning various species and different characteristics of this important post-transcriptional regulatory mechanism. PMID:24130571
Gniadkowski, M; Hemmings-Mieszczak, M; Klahre, U; Liu, H X; Filipowicz, W
1996-02-15
Introns of nuclear pre-mRNAs in dicotyledonous plants, unlike introns in vertebrates or yeast, are distinctly rich in A+U nucleotides and this feature is essential for their processing. In order to define more precisely sequence elements important for intron recognition in plants, we investigated the effects of short insertions, either U-rich or A-rich, on splicing of synthetic introns in transfected protoplast of Nicotiana plumbaginifolia. It was found that insertions of U-rich (sequence UUUUUAU) but not A-rich (AUAAAAA) segments can activate splicing of a GC-rich synthetic infron, and that U-rich segments, or multimers thereof, can function irrespective of the site of insertion within the intron. Insertions of multiple U-rich segments, either at the same or different locations, generally had an additive, stimulatory effect on splicing. Mutational analysis showed that replacement of one or two U residues in the UUUUUAU sequence with A or C residues had only a small effect on splicing, but replacement with G residues was strongly inhibitory. Proteins that interact with fragments of natural and synthetic pre-mRNAs in vitro were identified in nuclear extracts of N.plumbaginifolia by UV cross- linking. The profile of cross-linked plant proteins was considerably less complex than that obtained with a HeLa cell nuclear extract. Two major cross-linkable plant proteins had apparent molecular mass of 50 and 54 kDa and showed affinity for oligouridilates present in synGC introns or for poly(U).
Gniadkowski, M; Hemmings-Mieszczak, M; Klahre, U; Liu, H X; Filipowicz, W
1996-01-01
Introns of nuclear pre-mRNAs in dicotyledonous plants, unlike introns in vertebrates or yeast, are distinctly rich in A+U nucleotides and this feature is essential for their processing. In order to define more precisely sequence elements important for intron recognition in plants, we investigated the effects of short insertions, either U-rich or A-rich, on splicing of synthetic introns in transfected protoplast of Nicotiana plumbaginifolia. It was found that insertions of U-rich (sequence UUUUUAU) but not A-rich (AUAAAAA) segments can activate splicing of a GC-rich synthetic infron, and that U-rich segments, or multimers thereof, can function irrespective of the site of insertion within the intron. Insertions of multiple U-rich segments, either at the same or different locations, generally had an additive, stimulatory effect on splicing. Mutational analysis showed that replacement of one or two U residues in the UUUUUAU sequence with A or C residues had only a small effect on splicing, but replacement with G residues was strongly inhibitory. Proteins that interact with fragments of natural and synthetic pre-mRNAs in vitro were identified in nuclear extracts of N.plumbaginifolia by UV cross- linking. The profile of cross-linked plant proteins was considerably less complex than that obtained with a HeLa cell nuclear extract. Two major cross-linkable plant proteins had apparent molecular mass of 50 and 54 kDa and showed affinity for oligouridilates present in synGC introns or for poly(U). PMID:8604302
DOE Office of Scientific and Technical Information (OSTI.GOV)
Vidaud, M.; Vidaud, D.; Amselem, S.
The authors have characterized a Mediterranean {beta}-thalassemia allele containing a sequence change at codon 30 that alters both {beta}-globin pre-mRNA splicing and the structure of the homoglobin product. Presumably, this G {yields} C transversion at position {minus}1 of intron 1 reduces severely the utilization of the normal 5{prime} splice site since the level of the Arg {yields} Thr mutant hemoglobin (designated hemoglobin Kairouan) found in the erythrocytes of the patient is very low (2% of total hemoglobin). Since no natural mutations of the guanine located at position {minus}1 of the CAG/GTAAGT consensus sequence had been isolated previously. They investigated themore » role of this nucleotide in the constitution of an active 5{prime} splice site by studying the splicing of the pre-mRNA in cell-free extracts. They demonstrate that correct splicing of the mutant pre-mRNA is 98% inhibited. Their results provide further insights into the mechanisms of pre-mRNA maturation by revealing that the last residue of the exon plays a role at least equivalent to that of the intron residue at position +5.« less
Cryptic splice site in the complementary DNA of glucocerebrosidase causes inefficient expression.
Bukovac, Scott W; Bagshaw, Richard D; Rigat, Brigitte A; Callahan, John W; Clarke, Joe T R; Mahuran, Don J
2008-10-15
The low levels of human lysosomal glucocerebrosidase activity expressed in transiently transfected Chinese hamster ovary (CHO) cells were investigated. Reverse transcription PCR (RT-PCR) demonstrated that a significant portion of the transcribed RNA was misspliced owing to the presence of a cryptic splice site in the complementary DNA (cDNA). Missplicing results in the deletion of 179 bp of coding sequence and a premature stop codon. A repaired cDNA was constructed abolishing the splice site without changing the amino acid sequence. The level of glucocerebrosidase expression was increased sixfold. These data demonstrate that for maximum expression of any cDNA construct, the transcription products should be examined.
Wang, Peter Lincoln; Lacayo, Norman; Brown, Patrick O.
2012-01-01
Most human pre-mRNAs are spliced into linear molecules that retain the exon order defined by the genomic sequence. By deep sequencing of RNA from a variety of normal and malignant human cells, we found RNA transcripts from many human genes in which the exons were arranged in a non-canonical order. Statistical estimates and biochemical assays provided strong evidence that a substantial fraction of the spliced transcripts from hundreds of genes are circular RNAs. Our results suggest that a non-canonical mode of RNA splicing, resulting in a circular RNA isoform, is a general feature of the gene expression program in human cells. PMID:22319583
Preedagasamzin, Sarinthip; Nualkaew, Tiwaporn; Pongrujikorn, Tanjitti; Jinawath, Natini; Kole, Ryszard; Fucharoen, Suthat; Jearawiriyapaisarn, Natee; Svasti, Saovaros
2018-04-30
Repair of a splicing defect of β-globin pre-mRNA harboring hemoglobin E (HbE) mutation was successfully accomplished in erythroid cells from patients with β-thalassemia/HbE disorder by a synthetic splice-switching oligonucleotide (SSO). However, its application is limited by short-term effectiveness and requirement of lifelong periodic administration of SSO, especially for chronic diseases like thalassemias. Here, we engineered lentiviral vectors that stably express U7 small nuclear RNA (U7 snRNA) carrying the splice-switching sequence of the SSO that restores correct splicing of β E -globin pre-mRNA and achieves a long-term therapeutic effect. Using a two-step tiling approach, we systematically screened U7 snRNAs carrying splice-switching SSO sequences targeted to the cryptic 5' splice site created by HbE mutation. We tested this approach and identified the most responsive element for mediating splicing correction in engineered U7 snRNAs in HeLa-β E cell model cell line. Remarkably, the U7 snRNA lentiviral vector (U7 βE4+1) targeted to this region effectively restored the correctly-spliced β E -globin mRNA for at least 5 months. Moreover, the effects of the U7 βE4+1 snRNA lentiviral vector were also evident as upregulation of the correctly-spliced β E -globin mRNA in erythroid progenitor cells from β-thalassemia/HbE patients treated with the vector, which led to improvements of pathologies in erythroid progenitor cells from thalassemia patients. These results suggest that the splicing correction of β E -globin pre-mRNA by the engineered U7 snRNA lentiviral vector provides a promising, long-term treatment for β-thalassemia/HbE. Copyright © 2018 Elsevier Inc. All rights reserved.
Non-exomic and synonymous variants in ABCA4 are an important cause of Stargardt disease
Braun, Terry A.; Mullins, Robert F.; Wagner, Alex H.; Andorf, Jeaneen L.; Johnston, Rebecca M.; Bakall, Benjamin B.; Deluca, Adam P.; Fishman, Gerald A.; Lam, Byron L.; Weleber, Richard G.; Cideciyan, Artur V.; Jacobson, Samuel G.; Sheffield, Val C.; Tucker, Budd A.; Stone, Edwin M.
2013-01-01
Mutations in ABCA4 cause Stargardt disease and other blinding autosomal recessive retinal disorders. However, sequencing of the complete coding sequence in patients with clinical features of Stargardt disease sometimes fails to detect one or both mutations. For example, among 208 individuals with clear clinical evidence of ABCA4 disease ascertained at a single institution, 28 had only one disease-causing allele identified in the exons and splice junctions of the primary retinal transcript of the gene. Haplotype analysis of these 28 probands revealed 3 haplotypes shared among ten families, suggesting that 18 of the 28 missing alleles were rare enough to be present only once in the cohort. We hypothesized that mutations near rare alternate splice junctions in ABCA4 might cause disease by increasing the probability of mis-splicing at these sites. Next-generation sequencing of RNA extracted from human donor eyes revealed more than a dozen alternate exons that are occasionally incorporated into the ABCA4 transcript in normal human retina. We sequenced the genomic DNA containing 15 of these minor exons in the 28 one-allele subjects and observed five instances of two different variations in the splice signals of exon 36.1 that were not present in normal individuals (P < 10−6). Analysis of RNA obtained from the keratinocytes of patients with these mutations revealed the predicted alternate transcript. This study illustrates the utility of RNA sequence analysis of human donor tissue and patient-derived cell lines to identify mutations that would be undetectable by exome sequencing. PMID:23918662
Alternative Splicing in Neurogenesis and Brain Development.
Su, Chun-Hao; D, Dhananjaya; Tarn, Woan-Yuh
2018-01-01
Alternative splicing of precursor mRNA is an important mechanism that increases transcriptomic and proteomic diversity and also post-transcriptionally regulates mRNA levels. Alternative splicing occurs at high frequency in brain tissues and contributes to every step of nervous system development, including cell-fate decisions, neuronal migration, axon guidance, and synaptogenesis. Genetic manipulation and RNA sequencing have provided insights into the molecular mechanisms underlying the effects of alternative splicing in stem cell self-renewal and neuronal fate specification. Timely expression and perhaps post-translational modification of neuron-specific splicing regulators play important roles in neuronal development. Alternative splicing of many key transcription regulators or epigenetic factors reprograms the transcriptome and hence contributes to stem cell fate determination. During neuronal differentiation, alternative splicing also modulates signaling activity, centriolar dynamics, and metabolic pathways. Moreover, alternative splicing impacts cortical lamination and neuronal development and function. In this review, we focus on recent progress toward understanding the contributions of alternative splicing to neurogenesis and brain development, which has shed light on how splicing defects may cause brain disorders and diseases.
Malone, Andrew F; Funk, Steven D; Alhamad, Tarek; Miner, Jeffrey H
2017-06-01
Many COL4A5 splice region variants have been described in patients with X-linked Alport syndrome, but few have been confirmed by functional analysis to actually cause defective splicing. We sought to demonstrate that a novel COL4A5 splice region variant in a family with Alport syndrome is pathogenic using functional studies. We also describe an alternative method of diagnosis. Targeted next-generation sequencing results of an individual with Alport syndrome were analyzed and the results confirmed by Sanger sequencing in family members. A splicing reporter minigene assay was used to examine the variant's effect on splicing in transfected cells. Plucked hair follicles from patients and controls were examined for collagen IV proteins using immunofluorescence microscopy. A novel splice region mutation in COL4A5, c.1780-6T>G, was identified and segregated with disease in this family. This variant caused frequent skipping of exon 25, resulting in a frameshift and truncation of collagen α5(IV) protein. We also developed and validated a new approach to characterize the expression of collagen α5(IV) protein in the basement membranes of plucked hair follicles. Using this approach we demonstrated reduced collagen α5(IV) protein in affected male and female individuals in this family, supporting frequent failure of normal splicing. Differing normal to abnormal transcript ratios in affected individuals carrying splice region variants may contribute to variable disease severity observed in Alport families. Examination of plucked hair follicles in suspected X-linked Alport syndrome patients may offer a less invasive alternative method of diagnosis and serve as a pathogenicity test for COL4A5 variants of uncertain significance.
Malone, Andrew F.; Funk, Steven D.; Alhamad, Tarek; Miner, Jeffrey H.
2016-01-01
Introduction Many COL4A5 splice region variants have been described in patients with X-linked Alport syndrome, but few have been confirmed by functional analysis to actually cause defective splicing. We sought to demonstrate that a novel COL4A5 splice region variant in a family with Alport syndrome is pathogenic using functional studies. We also describe an alternative method of diagnosis. Methods We analyzed targeted next-generation sequencing results of an individual with Alport syndrome and confirmed results by Sanger sequencing in family members. A splicing reporter minigene assay was used to examine the variant’s effect on splicing in transfected cells. Plucked hair follicles from patients and controls were examined for collagen IV proteins using immunofluorescence microscopy. Results A novel splice region mutation in COL4A5, c.1780-6T>G, was identified and segregated with disease in this family. This variant caused frequent skipping of exon 25, resulting in a frameshift and truncation of collagen α5(IV) protein. We also developed and validated a new approach to characterize the expression of collagen α5(IV) protein in the basement membranes of plucked hair follicles. We demonstrated reduced collagen α5(IV) protein in affected male and female individuals in this family, supporting frequent failure of normal splicing. Conclusions Differing normal to abnormal transcript ratios in affected individuals carrying splice region variants may contribute to variable disease severity observed in Alport families. Examination of plucked hair follicles in suspected X-linked Alport syndrome patients may offer a less invasive alternative method of diagnosis and serve as a pathogenicity test for COL4A5 variants of uncertain significance. PMID:28013382
RNA editing in nascent RNA affects pre-mRNA splicing
Hsiao, Yun-Hua Esther; Bahn, Jae Hoon; Yang, Yun; Lin, Xianzhi; Tran, Stephen; Yang, Ei-Wen; Quinones-Valdez, Giovanni
2018-01-01
In eukaryotes, nascent RNA transcripts undergo an intricate series of RNA processing steps to achieve mRNA maturation. RNA editing and alternative splicing are two major RNA processing steps that can introduce significant modifications to the final gene products. By tackling these processes in isolation, recent studies have enabled substantial progress in understanding their global RNA targets and regulatory pathways. However, the interplay between individual steps of RNA processing, an essential aspect of gene regulation, remains poorly understood. By sequencing the RNA of different subcellular fractions, we examined the timing of adenosine-to-inosine (A-to-I) RNA editing and its impact on alternative splicing. We observed that >95% A-to-I RNA editing events occurred in the chromatin-associated RNA prior to polyadenylation. We report about 500 editing sites in the 3′ acceptor sequences that can alter splicing of the associated exons. These exons are highly conserved during evolution and reside in genes with important cellular function. Furthermore, we identified a second class of exons whose splicing is likely modulated by RNA secondary structures that are recognized by the RNA editing machinery. The genome-wide analyses, supported by experimental validations, revealed remarkable interplay between RNA editing and splicing and expanded the repertoire of functional RNA editing sites. PMID:29724793
RNA editing in nascent RNA affects pre-mRNA splicing.
Hsiao, Yun-Hua Esther; Bahn, Jae Hoon; Yang, Yun; Lin, Xianzhi; Tran, Stephen; Yang, Ei-Wen; Quinones-Valdez, Giovanni; Xiao, Xinshu
2018-06-01
In eukaryotes, nascent RNA transcripts undergo an intricate series of RNA processing steps to achieve mRNA maturation. RNA editing and alternative splicing are two major RNA processing steps that can introduce significant modifications to the final gene products. By tackling these processes in isolation, recent studies have enabled substantial progress in understanding their global RNA targets and regulatory pathways. However, the interplay between individual steps of RNA processing, an essential aspect of gene regulation, remains poorly understood. By sequencing the RNA of different subcellular fractions, we examined the timing of adenosine-to-inosine (A-to-I) RNA editing and its impact on alternative splicing. We observed that >95% A-to-I RNA editing events occurred in the chromatin-associated RNA prior to polyadenylation. We report about 500 editing sites in the 3' acceptor sequences that can alter splicing of the associated exons. These exons are highly conserved during evolution and reside in genes with important cellular function. Furthermore, we identified a second class of exons whose splicing is likely modulated by RNA secondary structures that are recognized by the RNA editing machinery. The genome-wide analyses, supported by experimental validations, revealed remarkable interplay between RNA editing and splicing and expanded the repertoire of functional RNA editing sites. © 2018 Hsiao et al.; Published by Cold Spring Harbor Laboratory Press.
Sequence variation between 462 human individuals fine-tunes functional sites of RNA processing
NASA Astrophysics Data System (ADS)
Ferreira, Pedro G.; Oti, Martin; Barann, Matthias; Wieland, Thomas; Ezquina, Suzana; Friedländer, Marc R.; Rivas, Manuel A.; Esteve-Codina, Anna; Estivill, Xavier; Guigó, Roderic; Dermitzakis, Emmanouil; Antonarakis, Stylianos; Meitinger, Thomas; Strom, Tim M.; Palotie, Aarno; François Deleuze, Jean; Sudbrak, Ralf; Lerach, Hans; Gut, Ivo; Syvänen, Ann-Christine; Gyllensten, Ulf; Schreiber, Stefan; Rosenstiel, Philip; Brunner, Han; Veltman, Joris; Hoen, Peter A. C. T.; Jan van Ommen, Gert; Carracedo, Angel; Brazma, Alvis; Flicek, Paul; Cambon-Thomsen, Anne; Mangion, Jonathan; Bentley, David; Hamosh, Ada; Rosenstiel, Philip; Strom, Tim M.; Lappalainen, Tuuli; Guigó, Roderic; Sammeth, Michael
2016-09-01
Recent advances in the cost-efficiency of sequencing technologies enabled the combined DNA- and RNA-sequencing of human individuals at the population-scale, making genome-wide investigations of the inter-individual genetic impact on gene expression viable. Employing mRNA-sequencing data from the Geuvadis Project and genome sequencing data from the 1000 Genomes Project we show that the computational analysis of DNA sequences around splice sites and poly-A signals is able to explain several observations in the phenotype data. In contrast to widespread assessments of statistically significant associations between DNA polymorphisms and quantitative traits, we developed a computational tool to pinpoint the molecular mechanisms by which genetic markers drive variation in RNA-processing, cataloguing and classifying alleles that change the affinity of core RNA elements to their recognizing factors. The in silico models we employ further suggest RNA editing can moonlight as a splicing-modulator, albeit less frequently than genomic sequence diversity. Beyond existing annotations, we demonstrate that the ultra-high resolution of RNA-Seq combined from 462 individuals also provides evidence for thousands of bona fide novel elements of RNA processing—alternative splice sites, introns, and cleavage sites—which are often rare and lowly expressed but in other characteristics similar to their annotated counterparts.
Short intronic repeat sequences facilitate circular RNA production
Liang, Dongming
2014-01-01
Recent deep sequencing studies have revealed thousands of circular noncoding RNAs generated from protein-coding genes. These RNAs are produced when the precursor messenger RNA (pre-mRNA) splicing machinery “backsplices” and covalently joins, for example, the two ends of a single exon. However, the mechanism by which the spliceosome selects only certain exons to circularize is largely unknown. Using extensive mutagenesis of expression plasmids, we show that miniature introns containing the splice sites along with short (∼30- to 40-nucleotide) inverted repeats, such as Alu elements, are sufficient to allow the intervening exons to circularize in cells. The intronic repeats must base-pair to one another, thereby bringing the splice sites into close proximity to each other. More than simple thermodynamics is clearly at play, however, as not all repeats support circularization, and increasing the stability of the hairpin between the repeats can sometimes inhibit circular RNA biogenesis. The intronic repeats and exonic sequences must collaborate with one another, and a functional 3′ end processing signal is required, suggesting that circularization may occur post-transcriptionally. These results suggest detailed and generalizable models that explain how the splicing machinery determines whether to produce a circular noncoding RNA or a linear mRNA. PMID:25281217
Lessons from non-canonical splicing
Ule, Jernej
2016-01-01
Recent improvements in experimental and computational techniques used to study the transcriptome have enabled an unprecedented view of RNA processing, revealing many previously unknown non-canonical splicing events. This includes cryptic events located far from the currently annotated exons, and unconventional splicing mechanisms that have important roles in regulating gene expression. These non-canonical splicing events are a major source of newly emerging transcripts during evolution, especially when they involve sequences derived from transposable elements. They are therefore under precise regulation and quality control, which minimises their potential to disrupt gene expression. While non-canonical splicing can lead to aberrant transcripts that cause many diseases, we also explain how it can be exploited for new therapeutic strategies. PMID:27240813
Drosha Promotes Splicing of a Pre-microRNA-like Alternative Exon
Havens, Mallory A.; Reich, Ashley A.; Hastings, Michelle L.
2014-01-01
The ribonuclease III enzyme Drosha has a central role in the biogenesis of microRNA (miRNA) by binding and cleaving hairpin structures in primary RNA transcripts into precursor miRNAs (pre-miRNAs). Many miRNA genes are located within protein-coding host genes and cleaved by Drosha in a manner that is coincident with splicing of introns by the spliceosome. The close proximity of splicing and pre-miRNA biogenesis suggests a potential for co-regulation of miRNA and host gene expression, though this relationship is not completely understood. Here, we describe a cleavage-independent role for Drosha in the splicing of an exon that has a predicted hairpin structure resembling a Drosha substrate. We find that Drosha can cleave the alternatively spliced exon 5 of the eIF4H gene into a pre-miRNA both in vitro and in cells. However, the primary role of Drosha in eIF4H gene expression is to promote the splicing of exon 5. Drosha binds to the exon and enhances splicing in a manner that depends on RNA structure but not on cleavage by Drosha. We conclude that Drosha can function like a splicing enhancer and promote exon inclusion. Our results reveal a new mechanism of alternative splicing regulation involving a cleavage-independent role for Drosha in splicing. PMID:24786770
Diverse alternative back-splicing and alternative splicing landscape of circular RNAs
Zhang, Xiao-Ou; Dong, Rui; Zhang, Yang; Zhang, Jia-Lin; Luo, Zheng; Zhang, Jun; Chen, Ling-Ling; Yang, Li
2016-01-01
Circular RNAs (circRNAs) derived from back-spliced exons have been widely identified as being co-expressed with their linear counterparts. A single gene locus can produce multiple circRNAs through alternative back-splice site selection and/or alternative splice site selection; however, a detailed map of alternative back-splicing/splicing in circRNAs is lacking. Here, with the upgraded CIRCexplorer2 pipeline, we systematically annotated different types of alternative back-splicing and alternative splicing events in circRNAs from various cell lines. Compared with their linear cognate RNAs, circRNAs exhibited distinct patterns of alternative back-splicing and alternative splicing. Alternative back-splice site selection was correlated with the competition of putative RNA pairs across introns that bracket alternative back-splice sites. In addition, all four basic types of alternative splicing that have been identified in the (linear) mRNA process were found within circRNAs, and many exons were predominantly spliced in circRNAs. Unexpectedly, thousands of previously unannotated exons were detected in circRNAs from the examined cell lines. Although these novel exons had similar splice site strength, they were much less conserved than known exons in sequences. Finally, both alternative back-splicing and circRNA-predominant alternative splicing were highly diverse among the examined cell lines. All of the identified alternative back-splicing and alternative splicing in circRNAs are available in the CIRCpedia database (http://www.picb.ac.cn/rnomics/circpedia). Collectively, the annotation of alternative back-splicing and alternative splicing in circRNAs provides a valuable resource for depicting the complexity of circRNA biogenesis and for studying the potential functions of circRNAs in different cells. PMID:27365365
RNA splicing regulated by RBFOX1 is essential for cardiac function in zebrafish.
Frese, Karen S; Meder, Benjamin; Keller, Andreas; Just, Steffen; Haas, Jan; Vogel, Britta; Fischer, Simon; Backes, Christina; Matzas, Mark; Köhler, Doreen; Benes, Vladimir; Katus, Hugo A; Rottbauer, Wolfgang
2015-08-15
Alternative splicing is one of the major mechanisms through which the proteomic and functional diversity of eukaryotes is achieved. However, the complex nature of the splicing machinery, its associated splicing regulators and the functional implications of alternatively spliced transcripts are only poorly understood. Here, we investigated the functional role of the splicing regulator rbfox1 in vivo using the zebrafish as a model system. We found that loss of rbfox1 led to progressive cardiac contractile dysfunction and heart failure. By using deep-transcriptome sequencing and quantitative real-time PCR, we show that depletion of rbfox1 in zebrafish results in an altered isoform expression of several crucial target genes, such as actn3a and hug. This study underlines that tightly regulated splicing is necessary for unconstrained cardiac function and renders the splicing regulator rbfox1 an interesting target for investigation in human heart failure and cardiomyopathy. © 2015. Published by The Company of Biologists Ltd.
An alternative splicing program promotes adipose tissue thermogenesis
Vernia, Santiago; Edwards, Yvonne JK; Han, Myoung Sook; Cavanagh-Kyros, Julie; Barrett, Tamera; Kim, Jason K; Davis, Roger J
2016-01-01
Alternative pre-mRNA splicing expands the complexity of the transcriptome and controls isoform-specific gene expression. Whether alternative splicing contributes to metabolic regulation is largely unknown. Here we investigated the contribution of alternative splicing to the development of diet-induced obesity. We found that obesity-induced changes in adipocyte gene expression include alternative pre-mRNA splicing. Bioinformatics analysis associated part of this alternative splicing program with sequence specific NOVA splicing factors. This conclusion was confirmed by studies of mice with NOVA deficiency in adipocytes. Phenotypic analysis of the NOVA-deficient mice demonstrated increased adipose tissue thermogenesis and improved glycemia. We show that NOVA proteins mediate a splicing program that suppresses adipose tissue thermogenesis. Together, these data provide quantitative analysis of gene expression at exon-level resolution in obesity and identify a novel mechanism that contributes to the regulation of adipose tissue function and the maintenance of normal glycemia. DOI: http://dx.doi.org/10.7554/eLife.17672.001 PMID:27635635
Evaluating approaches to find exon chains based on long reads.
Kuosmanen, Anna; Norri, Tuukka; Mäkinen, Veli
2018-05-01
Transcript prediction can be modeled as a graph problem where exons are modeled as nodes and reads spanning two or more exons are modeled as exon chains. Pacific Biosciences third-generation sequencing technology produces significantly longer reads than earlier second-generation sequencing technologies, which gives valuable information about longer exon chains in a graph. However, with the high error rates of third-generation sequencing, aligning long reads correctly around the splice sites is a challenging task. Incorrect alignments lead to spurious nodes and arcs in the graph, which in turn lead to incorrect transcript predictions. We survey several approaches to find the exon chains corresponding to long reads in a splicing graph, and experimentally study the performance of these methods using simulated data to allow for sensitivity/precision analysis. Our experiments show that short reads from second-generation sequencing can be used to significantly improve exon chain correctness either by error-correcting the long reads before splicing graph creation, or by using them to create a splicing graph on which the long-read alignments are then projected. We also study the memory and time consumption of various modules, and show that accurate exon chains lead to significantly increased transcript prediction accuracy. The simulated data and in-house scripts used for this article are available at http://www.cs.helsinki.fi/group/gsa/exon-chains/exon-chains-bib.tar.bz2.
LEDGF/p75 interacts with mRNA splicing factors and targets HIV-1 integration to highly spliced genes
Singh, Parmit Kumar; Plumb, Matthew R.; Ferris, Andrea L.; Iben, James R.; Wu, Xiaolin; Fadel, Hind J.; Luke, Brian T.; Esnault, Caroline; Poeschla, Eric M.; Hughes, Stephen H.; Kvaratskhelia, Mamuka; Levin, Henry L.
2015-01-01
The host chromatin-binding factor LEDGF/p75 interacts with HIV-1 integrase and directs integration to active transcription units. To understand how LEDGF/p75 recognizes transcription units, we sequenced 1 million HIV-1 integration sites isolated from cultured HEK293T cells. Analysis of integration sites showed that cancer genes were preferentially targeted, raising concerns about using lentivirus vectors for gene therapy. Additional analysis led to the discovery that introns and alternative splicing contributed significantly to integration site selection. These correlations were independent of transcription levels, size of transcription units, and length of the introns. Multivariate analysis with five parameters previously found to predict integration sites showed that intron density is the strongest predictor of integration density in transcription units. Analysis of previously published HIV-1 integration site data showed that integration density in transcription units in mouse embryonic fibroblasts also correlated strongly with intron number, and this correlation was absent in cells lacking LEDGF. Affinity purification showed that LEDGF/p75 is associated with a number of splicing factors, and RNA sequencing (RNA-seq) analysis of HEK293T cells lacking LEDGF/p75 or the LEDGF/p75 integrase-binding domain (IBD) showed that LEDGF/p75 contributes to splicing patterns in half of the transcription units that have alternative isoforms. Thus, LEDGF/p75 interacts with splicing factors, contributes to exon choice, and directs HIV-1 integration to transcription units that are highly spliced. PMID:26545813
Is an observed non-co-linear RNA product spliced in trans, in cis or just in vitro?
Yu, Chun-Ying; Liu, Hsiao-Jung; Hung, Li-Yuan; Kuo, Hung-Chih; Chuang, Trees-Juen
2014-01-01
Global transcriptome investigations often result in the detection of an enormous number of transcripts composed of non-co-linear sequence fragments. Such ‘aberrant’ transcript products may arise from post-transcriptional events or genetic rearrangements, or may otherwise be false positives (sequencing/alignment errors or in vitro artifacts). Moreover, post-transcriptionally non-co-linear (‘PtNcl’) transcripts can arise from trans-splicing or back-splicing in cis (to generate so-called ‘circular RNA’). Here, we collected previously-predicted human non-co-linear RNA candidates, and designed a validation procedure integrating in silico filters with multiple experimental validation steps to examine their authenticity. We showed that >50% of the tested candidates were in vitro artifacts, even though some had been previously validated by RT-PCR. After excluding the possibility of genetic rearrangements, we distinguished between trans-spliced and circular RNAs, and confirmed that these two splicing forms can share the same non-co-linear junction. Importantly, the experimentally-confirmed PtNcl RNA events and their corresponding PtNcl splicing types (i.e. trans-splicing, circular RNA, or both sharing the same junction) were all expressed in rhesus macaque, and some were even expressed in mouse. Our study thus describes an essential procedure for confirming PtNcl transcripts, and provides further insight into the evolutionary role of PtNcl RNA events, opening up this important, but understudied, class of post-transcriptional events for comprehensive characterization. PMID:25053845
Min, Xiang Jia
2013-01-01
Expressed Sequence Tags (ESTs) are a rich resource for identifying Alternatively Splicing (AS) genes. The ASFinder webserver is designed to identify AS isoforms from EST-derived sequences. Two approaches are implemented in ASFinder. If no genomic sequences are provided, the server performs a local BLASTN to identify AS isoforms from ESTs having both ends aligned but an internal segment unaligned. Otherwise, ASFinder uses SIM4 to map ESTs to the genome, then the overlapping ESTs that are mapped to the same genomic locus and have internal variable exon/intron boundaries are identified as AS isoforms. The tool is available at http://proteomics.ysu.edu/tools/ASFinder.html.
Duyk, G M; Kim, S W; Myers, R M; Cox, D R
1990-11-01
Identification and recovery of transcribed sequences from cloned mammalian genomic DNA remains an important problem in isolating genes on the basis of their chromosomal location. We have developed a strategy that facilitates the recovery of exons from random pieces of cloned genomic DNA. The basis of this "exon trapping" strategy is that, during a retroviral life cycle, genomic sequences of nonviral origin are correctly spliced and may be recovered as a cDNA copy of the introduced segment. By using this genetic assay for cis-acting sequences required for RNA splicing, we have screened approximately 20 kilobase pairs of cloned genomic DNA and have recovered all four predicted exons.
Duyk, G M; Kim, S W; Myers, R M; Cox, D R
1990-01-01
Identification and recovery of transcribed sequences from cloned mammalian genomic DNA remains an important problem in isolating genes on the basis of their chromosomal location. We have developed a strategy that facilitates the recovery of exons from random pieces of cloned genomic DNA. The basis of this "exon trapping" strategy is that, during a retroviral life cycle, genomic sequences of nonviral origin are correctly spliced and may be recovered as a cDNA copy of the introduced segment. By using this genetic assay for cis-acting sequences required for RNA splicing, we have screened approximately 20 kilobase pairs of cloned genomic DNA and have recovered all four predicted exons. PMID:2247475
Srivastava, Vaibhav; Srivastava, Manoj Kumar; Chibani, Kamel; Nilsson, Robert; Rouhier, Nicolas; Melzer, Michael; Wingsle, Gunnar
2009-01-01
Recent evidence has shown that alternative splicing (AS) is widely involved in the regulation of gene expression, substantially extending the diversity of numerous proteins. In this study, a subset of expressed sequence tags representing members of the reactive oxygen species gene network was selected from the PopulusDB database to investigate AS mechanisms in Populus. Examples of all known types of AS were detected, but intron retention was the most common. Interestingly, the closest Arabidopsis (Arabidopsis thaliana) homologs of half of the AS genes identified in Populus are not reportedly alternatively spliced. Two genes encoding the protein of most interest in our study (high-isoelectric-point superoxide dismutase [hipI-SOD]) have been found in black cottonwood (Populus trichocarpa), designated PthipI-SODC1 and PthipI-SODC2. Analysis of the expressed sequence tag libraries has indicated the presence of two transcripts of PthipI-SODC1 (hipI-SODC1b and hipI-SODC1s). Alignment of these sequences with the PthipI-SODC1 gene showed that hipI-SODC1b was 69 bp longer than hipI-SODC1s due to an AS event involving the use of an alternative donor splice site in the sixth intron. Transcript analysis showed that the splice variant hipI-SODC1b was differentially expressed, being clearly expressed in cambial and xylem, but not phloem, regions. In addition, immunolocalization and mass spectrometric data confirmed the presence of hipI-SOD proteins in vascular tissue. The functionalities of the spliced gene products were assessed by expressing recombinant hipI-SOD proteins and in vitro SOD activity assays. PMID:19176719
Srivastava, Vaibhav; Srivastava, Manoj Kumar; Chibani, Kamel; Nilsson, Robert; Rouhier, Nicolas; Melzer, Michael; Wingsle, Gunnar
2009-04-01
Recent evidence has shown that alternative splicing (AS) is widely involved in the regulation of gene expression, substantially extending the diversity of numerous proteins. In this study, a subset of expressed sequence tags representing members of the reactive oxygen species gene network was selected from the PopulusDB database to investigate AS mechanisms in Populus. Examples of all known types of AS were detected, but intron retention was the most common. Interestingly, the closest Arabidopsis (Arabidopsis thaliana) homologs of half of the AS genes identified in Populus are not reportedly alternatively spliced. Two genes encoding the protein of most interest in our study (high-isoelectric-point superoxide dismutase [hipI-SOD]) have been found in black cottonwood (Populus trichocarpa), designated PthipI-SODC1 and PthipI-SODC2. Analysis of the expressed sequence tag libraries has indicated the presence of two transcripts of PthipI-SODC1 (hipI-SODC1b and hipI-SODC1s). Alignment of these sequences with the PthipI-SODC1 gene showed that hipI-SODC1b was 69 bp longer than hipI-SODC1s due to an AS event involving the use of an alternative donor splice site in the sixth intron. Transcript analysis showed that the splice variant hipI-SODC1b was differentially expressed, being clearly expressed in cambial and xylem, but not phloem, regions. In addition, immunolocalization and mass spectrometric data confirmed the presence of hipI-SOD proteins in vascular tissue. The functionalities of the spliced gene products were assessed by expressing recombinant hipI-SOD proteins and in vitro SOD activity assays.
Jiang, Qiang; Yang, Chun Hong; Zhang, Yan; Sun, Yan; Li, Rong Ling; Wang, Chang Fa; Zhong, Ji Feng; Huang, Jin Ming
2016-01-01
Alternative splicing (AS) contributes to the complexity of the mammalian proteome and plays an important role in diseases, including infectious diseases. The differential AS patterns of these transcript sequences between the healthy (HS3A) and mastitic (HS8A) cows naturally infected by Staphylococcus aureus were compared to understand the molecular mechanisms underlying mastitis resistance and susceptibility. In this study, using the Illumina paired-end RNA sequencing method, 1352 differentially expressed genes (DEGs) with higher than twofold changes were found in the HS3A and HS8A mammary gland tissues. Gene ontology and KEGG pathway analyses revealed that the cytokine–cytokine receptor interaction pathway is the most significantly enriched pathway. Approximately 16k annotated unigenes were respectively identified in two libraries, based on the bovine Bos taurus UMD3.1 sequence assembly and search. A total of 52.62% and 51.24% annotated unigenes were alternatively spliced in term of exon skipping, intron retention, alternative 5′ splicing and alternative 3ʹ splicing. Additionally, 1,317 AS unigenes were HS3A-specific, whereas 1,093 AS unigenes were HS8A-specific. Some immune-related genes, such as ITGB6, MYD88, ADA, ACKR1, and TNFRSF1B, and their potential relationships with mastitis were highlighted. From Chromosome 2, 4, 6, 7, 10, 13, 14, 17, and 20, 3.66% (HS3A) and 5.4% (HS8A) novel transcripts, which harbor known quantitative trait locus associated with clinical mastitis, were identified. Many DEGs in the healthy and mastitic mammary glands are involved in immune, defense, and inflammation responses. These DEGs, which exhibit diverse and specific splicing patterns and events, can endow dairy cattle with the potential complex genetic resistance against mastitis. PMID:27459697
A Novel Subgenomic Murine Leukemia Virus RNA Transcript Results from Alternative Splicing
Déjardin, Jérôme; Bompard-Maréchal, Guillaume; Audit, Muriel; Hope, Thomas J.; Sitbon, Marc; Mougel, Marylène
2000-01-01
Here we show the existence of a novel subgenomic 4.4-kb RNA in cells infected with the prototypic replication-competent Friend or Moloney murine leukemia viruses (MuLV). This RNA derives by splicing from an alternative donor site (SD′) within the capsid-coding region to the canonical envelope splice acceptor site. The position and the sequence of SD′ was highly conserved among mammalian type C and D oncoviruses. Point mutations used to inactivate SD′ without changing the capsid-coding ability affected viral RNA splicing and reduced viral replication in infected cells. PMID:10729146
Haut, Donald D.; Pintel, D. J.
1998-01-01
Alternative splicing of pre-mRNAs plays a critical role in maximizing the coding capacity of the small parvovirus genome. The small-intron region of minute virus of mice (MVM) pre-mRNAs undergoes an unusual pattern of overlapping alternative splicing—using two donors (D1 and D2) and two acceptors (A1 and A2) within a region of 120 nucleotides—that determines the steady-state ratios of the various viral mRNAs. In this report, we show that the determinants that govern excision of the small intron are complex and are also required for efficient definition of the upstream exon. For the MVM small intron in its natural context, the two donors appear to compete for the splicing machinery: the position of D1 favors its usage, while the primary sequence of D2 must be more like the consensus sequence than is D1 to be used efficiently. We have genetically defined the branch points that are used for generation of the major and minor spliced forms and show that recognition of components of the small-intron acceptors is likely to be the dominant determinant in alternative small-intron excision. We have also identified a G-rich intronic enhancer sequence within the small intron that is essential for splicing of the minor form (D2 to A2) but not the major form (D1 to A1) of MVM mRNAs and is required for efficient definition of the upstream NS2-specific exon. In its natural context, the small intron appears to be excised by a mechanism consistent with intron definition. When the MVM small intron is expanded, various parameters of its excision are altered, indicating that critical cis-acting signals are context dependent. Relative use of the donors and acceptors is altered, and the upstream NS2-specific exon is no longer efficiently defined. The fact that definition of the upstream NS2-specific exon can be achieved by the MVM small intron in its natural context, but not when it is expanded, suggests that the multiple determinants that govern definition and excision of the small intron are required, in concert, for upstream exon definition. Our data are consistent with a model in which alternative splicing of the MVM P4-generated pre-mRNAs is governed by a hybrid of intron- and exon-defining mechanisms. PMID:9499034
Saravanaperumal, Siva Arumugam; Pediconi, Dario; Renieri, Carlo; La Terza, Antonietta
2012-01-01
Stem cell factor (SCF) is a growth factor, essential for haemopoiesis, mast cell development and melanogenesis. In the hematopoietic microenvironment (HM), SCF is produced either as a membrane-bound (−) or soluble (+) forms. Skin expression of SCF stimulates melanocyte migration, proliferation, differentiation, and survival. We report for the first time, a novel mRNA splice variant of SCF from the skin of white merino sheep via cloning and sequencing. Reverse transcriptase (RT)-PCR and molecular prediction revealed two different cDNA products of SCF. Full-length cDNA libraries were enriched by the method of rapid amplification of cDNA ends (RACE-PCR). Nucleotide sequencing and molecular prediction revealed that the primary 1519 base pair (bp) cDNA encodes a precursor protein of 274 amino acids (aa), commonly known as ‘soluble’ isoform. In contrast, the shorter (835 and/or 725 bp) cDNA was found to be a ‘novel’ mRNA splice variant. It contains an open reading frame (ORF) corresponding to a truncated protein of 181 aa (vs 245 aa) with an unique C-terminus lacking the primary proteolytic segment (28 aa) right after the D175G site which is necessary to produce ‘soluble’ form of SCF. This alternative splice (AS) variant was explained by the complete nucleotide sequencing of splice junction covering exon 5-intron (5)-exon 6 (948 bp) with a premature termination codon (PTC) whereby exons 6 to 9/10 are skipped (Cassette Exon, CE 6–9/10). We also demonstrated that the Northern blot analysis at transcript level is mediated via an intron-5 splicing event. Our data refine the structure of SCF gene; clarify the presence (+) and/or absence (−) of primary proteolytic-cleavage site specific SCF splice variants. This work provides a basis for understanding the functional role and regulation of SCF in hair follicle melanogenesis in sheep beyond what was known in mice, humans and other mammals. PMID:22719917
Rodríguez-Martín, Carlos; Cidre, Florencia; Fernández-Teijeiro, Ana; Gómez-Mariano, Gema; de la Vega, Leticia; Ramos, Patricia; Zaballos, Ángel; Monzón, Sara; Alonso, Javier
2016-05-01
Retinoblastoma (RB, MIM 180200) is the paradigm of hereditary cancer. Individuals harboring a constitutional mutation in one allele of the RB1 gene have a high predisposition to develop RB. Here, we present the first case of familial RB caused by a de novo insertion of a full-length long interspersed element-1 (LINE-1) into intron 14 of the RB1 gene that caused a highly heterogeneous splicing pattern of RB1 mRNA. LINE-1 insertion was inferred by mRNA studies and full-length sequenced by massive parallel sequencing. Some of the aberrant mRNAs were produced by noncanonical acceptor splice sites, a new finding that up to date has not been described to occur upon LINE-1 retrotransposition. Our results clearly show that RNA-based strategies have the potential to detect disease-causing transposon insertions. It also confirms that the incorporation of new genetic approaches, such as massive parallel sequencing, contributes to characterize at the sequence level these unique and exceptional genetic alterations.
Miyata, Y; Sugita, C; Maruyama, K; Sugita, M
2008-03-01
RNA editing of cytidine (C) to uridine (U) transitions occurs in plastids and mitochondria of most land plants. In this study, we amplified and sequenced the group I intron-containing tRNA Leu gene, trnL-CAA, from Takakia lepidozioides, a moss. DNA sequence analysis revealed that the T. lepidozioides tRNA Leu gene consisted of a 35-bp 5' exon, a 469-bp group I intron and a 50-bp 3' exon. The intron was inserted between the first and second position of the tRNA Leu anticodon. In general, plastid tRNA Leu genes with a group I intron code for a TAA anticodon in most land plants. This strongly suggests that the first nucleotide of the CAA anticodon could be edited in T. lepidozioides plastids. To investigate this possibility, we analysed cDNAs derived from the trnL-CAA transcripts. We demonstrated that the first nucleotide C of the anticodon was edited to create a canonical UAA anticodon in T. lepidozioides plastids. cDNA sequencing analyses of the spliced or unspliced tRNA Leu transcripts revealed that, while the spliced tRNA was completely edited, editing in the unspliced tRNAs were only partial. This is the first experimental evidence that the anticodon editing of tRNA occurs before RNA splicing in plastids. This suggests that this editing is a prerequisite to splicing of pre-tRNA Leu.
Zhang, Xiao-Ning; Shi, Yifei; Powers, Jordan J; Gowda, Nikhil B; Zhang, Chong; Ibrahim, Heba M M; Ball, Hannah B; Chen, Samuel L; Lu, Hua; Mount, Stephen M
2017-10-11
Regulation of pre-mRNA splicing diversifies protein products and affects many biological processes. Arabidopsis thaliana Serine/Arginine-rich 45 (SR45), regulates pre-mRNA splicing by interacting with other regulatory proteins and spliceosomal subunits. Although SR45 has orthologs in diverse eukaryotes, including human RNPS1, the sr45-1 null mutant is viable. Narrow flower petals and reduced seed formation suggest that SR45 regulates genes involved in diverse processes, including reproduction. To understand how SR45 is involved in the regulation of reproductive processes, we studied mRNA from the wild-type and sr45-1 inflorescences using RNA-seq, and identified SR45-bound RNAs by immunoprecipitation. Using a variety of bioinformatics tools, we identified a total of 358 SR45 differentially regulated (SDR) genes, 542 SR45-dependent alternative splicing (SAS) events, and 1812 SR45-associated RNAs (SARs). There is little overlap between SDR genes and SAS genes, and neither set of genes is enriched for flower or seed development. However, transcripts from reproductive process genes are significantly overrepresented in SARs. In exploring the fate of SARs, we found that a total of 81 SARs are subject to alternative splicing, while 14 of them are known Nonsense-Mediated Decay (NMD) targets. Motifs related to GGNGG are enriched both in SARs and near different types of SAS events, suggesting that SR45 recognizes this motif directly. Genes involved in plant defense are significantly over-represented among genes whose expression is suppressed by SR45, and sr45-1 plants do indeed show enhanced immunity. We find that SR45 is a suppressor of innate immunity. We find that a single motif (GGNGG) is highly enriched in both RNAs bound by SR45 and in sequences near SR45- dependent alternative splicing events in inflorescence tissue. We find that the alternative splicing events regulated by SR45 are enriched for this motif whether the effect of SR45 is activation or repression of the particular event. Thus, our data suggests that SR45 acts to control splice site choice in a way that defies simple categorization as an activator or repressor of splicing.
regSNPs-splicing: a tool for prioritizing synonymous single-nucleotide substitution.
Zhang, Xinjun; Li, Meng; Lin, Hai; Rao, Xi; Feng, Weixing; Yang, Yuedong; Mort, Matthew; Cooper, David N; Wang, Yue; Wang, Yadong; Wells, Clark; Zhou, Yaoqi; Liu, Yunlong
2017-09-01
While synonymous single-nucleotide variants (sSNVs) have largely been unstudied, since they do not alter protein sequence, mounting evidence suggests that they may affect RNA conformation, splicing, and the stability of nascent-mRNAs to promote various diseases. Accurately prioritizing deleterious sSNVs from a pool of neutral ones can significantly improve our ability of selecting functional genetic variants identified from various genome-sequencing projects, and, therefore, advance our understanding of disease etiology. In this study, we develop a computational algorithm to prioritize sSNVs based on their impact on mRNA splicing and protein function. In addition to genomic features that potentially affect splicing regulation, our proposed algorithm also includes dozens structural features that characterize the functions of alternatively spliced exons on protein function. Our systematical evaluation on thousands of sSNVs suggests that several structural features, including intrinsic disorder protein scores, solvent accessible surface areas, protein secondary structures, and known and predicted protein family domains, show significant differences between disease-causing and neutral sSNVs. Our result suggests that the protein structure features offer an added dimension of information while distinguishing disease-causing and neutral synonymous variants. The inclusion of structural features increases the predictive accuracy for functional sSNV prioritization.
Hereditary cancer genes are highly susceptible to splicing mutations
Soemedi, Rachel; Maguire, Samantha; Murray, Michael F.; Monaghan, Sean F.
2018-01-01
Substitutions that disrupt pre-mRNA splicing are a common cause of genetic disease. On average, 13.4% of all hereditary disease alleles are classified as splicing mutations mapping to the canonical 5′ and 3′ splice sites. However, splicing mutations present in exons and deeper intronic positions are vastly underreported. A recent re-analysis of coding mutations in exon 10 of the Lynch Syndrome gene, MLH1, revealed an extremely high rate (77%) of mutations that lead to defective splicing. This finding is confirmed by extending the sampling to five other exons in the MLH1 gene. Further analysis suggests a more general phenomenon of defective splicing driving Lynch Syndrome. Of the 36 mutations tested, 11 disrupted splicing. Furthermore, analyzing past reports suggest that MLH1 mutations in canonical splice sites also occupy a much higher fraction (36%) of total mutations than expected. When performing a comprehensive analysis of splicing mutations in human disease genes, we found that three main causal genes of Lynch Syndrome, MLH1, MSH2, and PMS2, belonged to a class of 86 disease genes which are enriched for splicing mutations. Other cancer genes were also enriched in the 86 susceptible genes. The enrichment of splicing mutations in hereditary cancers strongly argues for additional priority in interpreting clinical sequencing data in relation to cancer and splicing. PMID:29505604
Splicing fidelity: DEAD/H-box ATPases as molecular clocks.
Koodathingal, Prakash; Staley, Jonathan P
2013-07-01
The spliceosome discriminates against suboptimal substrates, both during assembly and catalysis, thereby enhancing specificity during pre-mRNA splicing. Central to such fidelity mechanisms are a conserved subset of the DEAD- and DEAH-box ATPases, which belong to a superfamily of proteins that mediate RNP rearrangements in almost all RNA-dependent processes in the cell. Through an investigation of the mechanisms contributing to the specificity of 5' splice site cleavage, two related reports, one from our lab and the other from the Cheng lab, have provided insights into fidelity mechanisms utilized by the spliceosome. In our work, we found evidence for a kinetic proofreading mechanism in splicing in which the DEAH-box ATPase Prp16 discriminates against substrates undergoing slow 5' splice site cleavage. Additionally, our study revealed that discriminated substrates are discarded through a general spliceosome disassembly pathway, mediated by another DEAH-box ATPase Prp43. In their work, Tseng et al. described the underlying molecular events through which Prp16 discriminates against a splicing substrate during 5' splice site cleavage. Here, we present a synthesis of these two studies and, additionally, provide the first biochemical evidence for discrimination of a suboptimal splicing substrate just prior to 5' splice site cleavage. Together, these findings support a general mechanism for a ubiquitous superfamily of ATPases in enhancing specificity during RNA-dependent processes in the cell.
Liu, Jun; Bhadra, Malini; Sinnakannu, Joanna Rajeswary; Yue, Wan Lin; Tan, Cheryl Weiqi; Rigo, Frank; Ong, S.Tiong; Roca, Xavier
2017-01-01
Many tyrosine kinase-driven cancers, including chronic myeloid leukemia (CML), are characterized by high response rates to specific tyrosine kinase inhibitors (TKIs) like imatinib. In East Asians, primary imatinib resistance is caused by a deletion polymorphism in Intron 2 of the BIM gene, whose product is required for TKI-induced apoptosis. The deletion biases BIM splicing from exon 4 to exon 3, generating splice isoforms lacking the exon 4-encoded pro-apoptotic BH3 domain, which impairs the ability of TKIs to induce apoptosis. We sought to identify splice-switching antisense oligonucleotides (ASOs) that block exon 3 but enhance exon 4 splicing, and thereby resensitize BIM deletion-containing cancers to imatinib. First, we mapped multiple cis-acting splicing elements around BIM exon 3 by minigene mutations, and found an exonic splicing enhancer acting via SRSF1. Second, by a systematic ASO walk, we isolated ASOs that corrected the aberrant BIM splicing. Eight of 67 ASOs increased exon 4 levels in BIM deletion-containing cells, and restored imatinib-induced apoptosis and TKI sensitivity. This proof-of-principle study proves that resistant CML cells by BIM deletion polymorphism can be resensitized to imatinib via splice-switching BIM ASOs. Future optimizations might yield a therapeutic ASO as precision-medicine adjuvant treatment for BIM-polymorphism-associated TKI-resistant CML and other cancers. PMID:29100409
Liu, Jun; Bhadra, Malini; Sinnakannu, Joanna Rajeswary; Yue, Wan Lin; Tan, Cheryl Weiqi; Rigo, Frank; Ong, S Tiong; Roca, Xavier
2017-09-29
Many tyrosine kinase-driven cancers, including chronic myeloid leukemia (CML), are characterized by high response rates to specific tyrosine kinase inhibitors (TKIs) like imatinib. In East Asians, primary imatinib resistance is caused by a deletion polymorphism in Intron 2 of the BIM gene, whose product is required for TKI-induced apoptosis. The deletion biases BIM splicing from exon 4 to exon 3, generating splice isoforms lacking the exon 4-encoded pro-apoptotic BH3 domain, which impairs the ability of TKIs to induce apoptosis. We sought to identify splice-switching antisense oligonucleotides (ASOs) that block exon 3 but enhance exon 4 splicing, and thereby resensitize BIM deletion-containing cancers to imatinib. First, we mapped multiple cis -acting splicing elements around BIM exon 3 by minigene mutations, and found an exonic splicing enhancer acting via SRSF1. Second, by a systematic ASO walk, we isolated ASOs that corrected the aberrant BIM splicing. Eight of 67 ASOs increased exon 4 levels in BIM deletion-containing cells, and restored imatinib-induced apoptosis and TKI sensitivity. This proof-of-principle study proves that resistant CML cells by BIM deletion polymorphism can be resensitized to imatinib via splice-switching BIM ASOs. Future optimizations might yield a therapeutic ASO as precision-medicine adjuvant treatment for BIM -polymorphism-associated TKI-resistant CML and other cancers.
Ganaie, Safder S; Chen, Aaron Yun; Huang, Chun; Xu, Peng; Kleiboeker, Steve; Du, Aifang; Qiu, Jianming
2018-04-15
Human parvovirus B19 (B19V) expresses a single precursor mRNA (pre-mRNA), which undergoes alternative splicing and alternative polyadenylation to generate 12 viral mRNA transcripts that encode two structural proteins (VP1 and VP2) and three nonstructural proteins (NS1, 7.5-kDa protein, and 11-kDa protein). Splicing at the second 5' donor site (D2 site) of the B19V pre-mRNA is essential for the expression of VP2 and the 11-kDa protein. We previously identified that cis -acting intronic splicing enhancer 2 (ISE2) that lies immediately after the D2 site facilitates the recognition of the D2 donor for its efficient splicing. In this study, we report that ISE2 is critical for the expression of the 11-kDa viral nonstructural protein. We found that ISE2 harbors a consensus RNA binding motif protein 38 (RBM38) binding sequence, 5'-UGUGUG-3'. RBM38 is expressed during the middle stage of erythropoiesis. We first confirmed that RBM38 binds specifically with the ISE2 element in vitro The knockdown of RBM38 significantly decreases the level of spliced mRNA at D2 that encodes the 11-kDa protein but not that of the D2-spliced mRNA that encodes VP2. Importantly, we found that the 11-kDa protein enhances viral DNA replication and virion release. Accordingly, the knockdown of RBM38 decreases virus replication via downregulating 11-kDa protein expression. Taken together, these results suggest that the 11-kDa protein facilitates B19V DNA replication and that RBM38 is an essential host factor for B19V pre-mRNA splicing and for the expression of the 11-kDa protein. IMPORTANCE B19V is a human pathogen that can cause fifth disease, arthropathy, anemia in immunocompromised patients and sickle cell disease patients, myocarditis, and hydrops fetalis in pregnant women. Human erythroid progenitor cells (EPCs) are most susceptible to B19V infection and fully support viral DNA replication. The exclusive tropism of B19V for erythroid-lineage cells is dependent not only on the expression of viral receptors and coreceptors on the cell surface but also on the intracellular host factors that support B19V replication. Our present study shows that B19V uses a host factor, RNA binding motif protein 38 (RBM38), for the processing of its pre-mRNA during virus replication. Specifically, RBM38 interacts with the intronic splicing enhancer 2 (ISE2) element of B19V pre-mRNA and promotes 11-kDa protein expression, thereby regulating the 11-kDa protein-mediated augmentation of B19V replication. The identification of this novel host-pathogen interaction will provide mechanistic insights into B19V replication and aid in finding new targets for anti-B19V therapeutics. Copyright © 2018 American Society for Microbiology.
Ono, Hiroyuki; Saitsu, Hirotomo; Horikawa, Reiko; Nakashima, Shinichi; Ohkubo, Yumiko; Yanagi, Kumiko; Nakabayashi, Kazuhiko; Fukami, Maki; Fujisawa, Yasuko; Ogata, Tsutomu
2018-02-02
Although partial androgen insensitivity syndrome (PAIS) is caused by attenuated responsiveness to androgens, androgen receptor gene (AR) mutations on the coding regions and their splice sites have been identified only in <25% of patients with a diagnosis of PAIS. We performed extensive molecular studies including whole exome sequencing in a Japanese family with PAIS, identifying a deep intronic variant beyond the branch site at intron 6 of AR (NM_000044.4:c.2450-42 G > A). This variant created the splice acceptor motif that was accompanied by pyrimidine-rich sequence and two candidate branch sites. Consistent with this, reverse transcriptase (RT)-PCR experiments for cycloheximide-treated lymphoblastoid cell lines revealed a relatively large amount of aberrant mRNA produced by the newly created splice acceptor site and a relatively small amount of wildtype mRNA produced by the normal splice acceptor site. Furthermore, most of the aberrant mRNA was shown to undergo nonsense mediated decay (NMD) and, if a small amount of aberrant mRNA may have escaped NMD, such mRNA was predicted to generate a truncated AR protein missing some functional domains. These findings imply that the deep intronic mutation creating an alternative splice acceptor site resulted in the production of a relatively small amount of wildtype AR mRNA, leading to PAIS.
Lisbin, Michael J.; Qiu, Jan; White, Kalpana
2001-01-01
Drosophila melanogaster neural-specific protein, ELAV, has been shown to regulate the neural-specific splicing of three genes: neuroglian (nrg), erect wing, and armadillo. Alternative splicing of the nrg transcript involves alternative inclusion of a 3′-terminal exon. Here, using a minigene reporter, we show that the nrg alternatively spliced intron (nASI) has all the determinants required to recreate proper neural-specific RNA processing seen with the endogenous nrg transcript, including regulation by ELAV. An in vitro UV cross-linking assay revealed that ELAV from nuclear extracts cross-links to four distinct sites along the 3200 nucleotide long nASI; one EXS is positioned at the polypyrimidine tract of the default 3′ splice site. ELAV cross-linking sites (EXSs) have in common long tracts of (U)-rich sequence rather than a precise consensus; moreover, each tract has at least two 8/10U elements; their importance is validated by mutant transgene reporter analysis. Further, we propose criteria for ELAV target sequence recognition based on the four EXSs, sites within the nASI that are (U) rich but do not cross-link with ELAV, and predicted EXSs from a phylogenetic comparison with Drosophila virilis nASI. These results suggest that ELAV regulates nrg alternative splicing by direct interaction with the nASI. PMID:11581160
Lisbin, M J; Qiu, J; White, K
2001-10-01
Drosophila melanogaster neural-specific protein, ELAV, has been shown to regulate the neural-specific splicing of three genes: neuroglian (nrg), erect wing, and armadillo. Alternative splicing of the nrg transcript involves alternative inclusion of a 3'-terminal exon. Here, using a minigene reporter, we show that the nrg alternatively spliced intron (nASI) has all the determinants required to recreate proper neural-specific RNA processing seen with the endogenous nrg transcript, including regulation by ELAV. An in vitro UV cross-linking assay revealed that ELAV from nuclear extracts cross-links to four distinct sites along the 3200 nucleotide long nASI; one EXS is positioned at the polypyrimidine tract of the default 3' splice site. ELAV cross-linking sites (EXSs) have in common long tracts of (U)-rich sequence rather than a precise consensus; moreover, each tract has at least two 8/10U elements; their importance is validated by mutant transgene reporter analysis. Further, we propose criteria for ELAV target sequence recognition based on the four EXSs, sites within the nASI that are (U) rich but do not cross-link with ELAV, and predicted EXSs from a phylogenetic comparison with Drosophila virilis nASI. These results suggest that ELAV regulates nrg alternative splicing by direct interaction with the nASI.
iSS-PC: Identifying Splicing Sites via Physical-Chemical Properties Using Deep Sparse Auto-Encoder.
Xu, Zhao-Chun; Wang, Peng; Qiu, Wang-Ren; Xiao, Xuan
2017-08-15
Gene splicing is one of the most significant biological processes in eukaryotic gene expression, such as RNA splicing, which can cause a pre-mRNA to produce one or more mature messenger RNAs containing the coded information with multiple biological functions. Thus, identifying splicing sites in DNA/RNA sequences is significant for both the bio-medical research and the discovery of new drugs. However, it is expensive and time consuming based only on experimental technique, so new computational methods are needed. To identify the splice donor sites and splice acceptor sites accurately and quickly, a deep sparse auto-encoder model with two hidden layers, called iSS-PC, was constructed based on minimum error law, in which we incorporated twelve physical-chemical properties of the dinucleotides within DNA into PseDNC to formulate given sequence samples via a battery of cross-covariance and auto-covariance transformations. In this paper, five-fold cross-validation test results based on the same benchmark data-sets indicated that the new predictor remarkably outperformed the existing prediction methods in this field. Furthermore, it is expected that many other related problems can be also studied by this approach. To implement classification accurately and quickly, an easy-to-use web-server for identifying slicing sites has been established for free access at: http://www.jci-bioinfo.cn/iSS-PC.
TopHat: discovering splice junctions with RNA-Seq
Trapnell, Cole; Pachter, Lior; Salzberg, Steven L.
2009-01-01
Motivation: A new protocol for sequencing the messenger RNA in a cell, known as RNA-Seq, generates millions of short sequence fragments in a single run. These fragments, or ‘reads’, can be used to measure levels of gene expression and to identify novel splice variants of genes. However, current software for aligning RNA-Seq data to a genome relies on known splice junctions and cannot identify novel ones. TopHat is an efficient read-mapping algorithm designed to align reads from an RNA-Seq experiment to a reference genome without relying on known splice sites. Results: We mapped the RNA-Seq reads from a recent mammalian RNA-Seq experiment and recovered more than 72% of the splice junctions reported by the annotation-based software from that study, along with nearly 20 000 previously unreported junctions. The TopHat pipeline is much faster than previous systems, mapping nearly 2.2 million reads per CPU hour, which is sufficient to process an entire RNA-Seq experiment in less than a day on a standard desktop computer. We describe several challenges unique to ab initio splice site discovery from RNA-Seq reads that will require further algorithm development. Availability: TopHat is free, open-source software available from http://tophat.cbcb.umd.edu Contact: cole@cs.umd.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:19289445
Ren, Xiaojun; Deng, Ruijie; Wang, Lida; Zhang, Kaixiang
2017-01-01
RNA splicing, which mainly involves two transesterification steps, is a fundamental process of gene expression and its abnormal regulation contributes to serious genetic diseases. Antisense oligonucleotides (ASOs) are genetic control tools that can be used to specifically control genes through alteration of the RNA splicing pathway. Despite intensive research, how ASOs or various other factors influence the multiple processes of RNA splicing still remains obscure. This is largely due to an inability to analyze the splicing efficiency of each step in the RNA splicing process with high sensitivity. We addressed this limitation by introducing a padlock probe-based isothermal amplification assay to achieve quantification of the specific products in different splicing steps. With this amplified assay, the roles that ASOs play in RNA splicing inhibition in the first and second steps could be distinguished. We identified that 5′-ASO could block RNA splicing by inhibiting the first step, while 3′-ASO could block RNA splicing by inhibiting the second step. This method provides a versatile tool for assisting efficient ASO design and discovering new splicing modulators and therapeutic drugs. PMID:28989608
Genomic overview of mRNA 5′-leader trans-splicing in the ascidian Ciona intestinalis
Satou, Yutaka; Hamaguchi, Makoto; Takeuchi, Keisuke; Hastings, Kenneth E. M.; Satoh, Nori
2006-01-01
Although spliced leader (SL) trans-splicing in the chordates was discovered in the tunicate Ciona intestinalis there has been no genomic overview analysis of the extent of trans-splicing or the make-up of the trans-spliced and non-trans-spliced gene populations of this model organism. Here we report such an analysis for Ciona based on the oligo-capping full-length cDNA approach. We randomly sampled 2078 5′-full-length ESTs representing 668 genes, or 4.2% of the entire genome. Our results indicate that Ciona contains a single major SL, which is efficiently trans-spliced to mRNAs transcribed from a specific set of genes representing ∼50% of the total number of expressed genes, and that individual trans-spliced mRNA species are, on average, 2–3-fold less abundant than non-trans-spliced mRNA species. Our results also identify a relationship between trans-splicing status and gene functional classification; ribosomal protein genes fall predominantly into the non-trans-spliced category. In addition, our data provide the first evidence for the occurrence of polycistronic transcription in Ciona. An interesting feature of the Ciona polycistronic transcription units is that the great majority entirely lack intercistronic sequences. PMID:16822859
Zeng, Weihong; Liu, Xinmei; Liu, Zhicui; Zheng, Ying; Yu, Tiantian; Fu, Shaliu; Li, Xiao; Zhang, Jing; Zhang, Siming; Ma, Xiaoling; Liu, Xiao-Rui; Qin, Xiaoli; Khanniche, Asma; Zhang, Yan; Tian, Fuju; Lin, Yi
2018-01-01
Decidual CD8 + (dCD8) T cells have been proposed to play important roles in immune protection against the invading pathogens and in tolerance toward the growing semi-allogeneic fetus during early pregnancy. However, their phenotypic and functional characteristics remain poorly defined. Here, we performed the first analysis of the transcriptional and alternative splicing (AS) signatures for human first-trimester dCD8 T cells using high-throughput mRNA sequencing. Our data revealed that dCD8 T cells have distinct transcriptional and AS landscapes when compared with their autologous peripheral blood CD8 + (pCD8) T counterparts. Furthermore, human dCD8 T cells were observed to contain CD8-Treg and effector-memory T-cell subsets, and display enhanced functionality in terms of degranulation and cytokine production on a per-cell basis. Additionally, we have identified the novel splice junctions that use a high ratio of the non-canonical splicing motif GC-AG and found that AS is not a major contributor to the gene expression-level changes between paired pCD8 and dCD8 T cells. Together, our findings not only provide a comprehensive framework of the transcriptional and AS landscapes but also reveal the functional feature of human dCD8 T cells, which are of great importance in understanding the biology of these cells and the physiology of human healthy pregnancy.
Matsumoto, Jun; Dewar, Ken; Wasserscheid, Jessica; Wiley, Graham B; Macmil, Simone L; Roe, Bruce A; Zeller, Robert W; Satou, Yutaka; Hastings, Kenneth E M
2010-05-01
Pre-mRNA 5' spliced-leader (SL) trans-splicing occurs in some metazoan groups but not in others. Genome-wide characterization of the trans-spliced mRNA subpopulation has not yet been reported for any metazoan. We carried out a high-throughput analysis of the SL trans-spliced mRNA population of the ascidian tunicate Ciona intestinalis by 454 Life Sciences (Roche) pyrosequencing of SL-PCR-amplified random-primed reverse transcripts of tailbud embryo RNA. We obtained approximately 250,000 high-quality reads corresponding to 8790 genes, approximately 58% of the Ciona total gene number. The great depth of this data revealed new aspects of trans-splicing, including the existence of a significant class of "infrequently trans-spliced" genes, accounting for approximately 28% of represented genes, that generate largely non-trans-spliced mRNAs, but also produce trans-spliced mRNAs, in part through alternative promoter use. Thus, the conventional qualitative dichotomy of trans-spliced versus non-trans-spliced genes should be supplanted by a more accurate quantitative view recognizing frequently and infrequently trans-spliced gene categories. Our data include reads representing approximately 80% of Ciona frequently trans-spliced genes. Our analysis also revealed significant use of closely spaced alternative trans-splice acceptor sites which further underscores the mechanistic similarity of cis- and trans-splicing and indicates that the prevalence of +/-3-nt alternative splicing events at tandem acceptor sites, NAGNAG, is driven by spliceosomal mechanisms, and not nonsense-mediated decay, or selection at the protein level. The breadth of gene representation data enabled us to find new correlations between trans-splicing status and gene function, namely the overrepresentation in the frequently trans-spliced gene class of genes associated with plasma/endomembrane system, Ca(2+) homeostasis, and actin cytoskeleton.
Coordinated tissue-specific regulation of adjacent alternative 3′ splice sites in C. elegans
Ragle, James Matthew; Katzman, Sol; Akers, Taylor F.; Barberan-Soler, Sergio; Zahler, Alan M.
2015-01-01
Adjacent alternative 3′ splice sites, those separated by ≤18 nucleotides, provide a unique problem in the study of alternative splicing regulation; there is overlap of the cis-elements that define the adjacent sites. Identification of the intron's 3′ end depends upon sequence elements that define the branchpoint, polypyrimidine tract, and terminal AG dinucleotide. Starting with RNA-seq data from germline-enriched and somatic cell-enriched Caenorhabditis elegans samples, we identify hundreds of introns with adjacent alternative 3′ splice sites. We identify 203 events that undergo tissue-specific alternative splicing. For these, the regulation is monodirectional, with somatic cells preferring to splice at the distal 3′ splice site (furthest from the 5′ end of the intron) and germline cells showing a distinct shift toward usage of the adjacent proximal 3′ splice site (closer to the 5′ end of the intron). Splicing patterns in somatic cells follow C. elegans consensus rules of 3′ splice site definition; a short stretch of pyrimidines preceding an AG dinucleotide. Splicing in germline cells occurs at proximal 3′ splice sites that lack a preceding polypyrimidine tract, and in three instances the germline-specific site lacks the AG dinucleotide. We provide evidence that use of germline-specific proximal 3′ splice sites is conserved across Caenorhabditis species. We propose that there are differences between germline and somatic cells in the way that the basal splicing machinery functions to determine the intron terminus. PMID:25922281
The effect of the common c.2299delG mutation in USH2A on RNA splicing.
Lenassi, Eva; Saihan, Zubin; Bitner-Glindzicz, Maria; Webster, Andrew R
2014-05-01
Recessive variants in the USH2A gene are an important cause of both Usher syndrome and nonsyndromic retinitis pigmentosa. A single base-pair deletion in exon 13 (c.2299delG, p.Glu767Serfs*21) is considered the most frequent mutation of USH2A. It is predicted to generate a premature termination codon and is presumed to lead to nonsense mediated decay. However the effect of this variant on RNA has not been formally investigated. It is not uncommon for exonic sequence alterations to cause aberrant splicing and the aim of the present report is to evaluate the effect of c.2299delG on USH2A transcripts. Nasal cells represent the simplest available tissue to study splicing defects in USH2A. Nasal brushing, RNA extraction from nasal epithelial cells and reverse transcription PCR were performed in five Usher syndrome patients who were homozygous for c.2299delG, two unaffected c.2299delG heterozygotes and seven control individuals. Primers to amplify between exons 12 and 15 and exons 10 and 14 were utilised. Significant variability was observed between different RT-PCR experiments. Importantly, in controls, PCR product of the expected size were amplified on all occasions (13/13 experiments); for patients this was true in only 4/14 experiments (Fisher exact test p = 0.0002). Bioinformatics tools predict the c.2299delG change to disrupt an exonic splicing enhancer and to create an exonic splicing silencer within exon 13. Here, we report an effect of the common c.2299delG mutation on splicing of exons 12 and 13 of USH2A. Future studies are expected to provide important insights into the contribution of this effect on the phenotype. Copyright © 2014 Elsevier Ltd. All rights reserved.
Suzuki, Masataka G; Tochigi, Mayuko; Sakaguchi, Honami; Aoki, Fugaku; Miyamoto, Norio
2015-06-01
The transformer (tra) gene is an intermediate component of the sex determination hierarchy in many insect species. The homolog of tra is also found in two branchiopod crustacean species but is not known outside arthropods. We have isolated a tra homolog in the acorn worm, Saccoglossus kowalevskii, which is a hemichordate belonging to the deuterostome superphylum. The full-length complementary DNA (cDNA) of the S. kowalevskii tra homolog (Sktra) has a 3786-bp open reading frame that encodes a 1261-amino acid sequence including a TRA-CAM domain and an arginine/serine (RS)-rich domain, both of which are characteristic of TRA orthologs. Reverse transcription PCR (RT-PCR) analyses demonstrated that Sktra showed no differences in expression patterns between testes and ovaries, but its expression level was approximately 7.5-fold higher in the testes than in the ovaries. TRA, together with the protein product of the transformer-2 (tra-2) gene, assembles on doublesex (dsx) pre-messenger RNA (mRNA) via the cis-regulatory element, enhancing female-specific splicing of dsx in Drosophila. To understand functional conservation of the SkTRA protein as a dsx-splicing activator, we investigated whether SkTRA is capable of inducing female-specific splicing of the Drosophila dsx. Ectopic expression of Sktra cDNA in insect cultured cells did not induce the female-specific splicing of dsx. On the other hand, forced expression of Sktra-2 (a tra-2 homolog of S. kowalevskii) was able to induce the female-specific dsx splicing. These results demonstrate that the function as a dsx-splicing activator is not conserved in SkTRA even though SkTRA-2 is capable of functionally replacing the Drosophila TRA-2. We have also found a tra homolog in an echinoderm genome. This study provides the first evidence that that tra is conserved not only in arthropods but also in basal species of deuterostoms.
Graveley, Brenton R.
2008-01-01
Summary Drosophila Dscam encodes 38,016 distinct axon guidance receptors through the mutually exclusive alternative splicing of 95 variable exons. Importantly, known mechanisms that ensure the mutually exclusive splicing of pairs of exons cannot explain this phenomenon in Dscam. I have identified two classes of conserved elements in the Dscam exon 6 cluster, which contains 48 alternative exons—the docking site, located in the intron downstream of constitutive exon 5, and the selector sequences, which are located upstream of each exon 6 variant. Strikingly, each selector sequence is complementary to a portion of the docking site, and this pairing juxtaposes one, and only one, alternative exon to the upstream constitutive exon. The mutually exclusive nature of the docking site:selector sequence interactions suggests that the formation of these competing RNA structures is a central component of the mechanism guaranteeing that only one exon 6 variant is included in each Dscam mRNA. PMID:16213213
Short intronic repeat sequences facilitate circular RNA production.
Liang, Dongming; Wilusz, Jeremy E
2014-10-15
Recent deep sequencing studies have revealed thousands of circular noncoding RNAs generated from protein-coding genes. These RNAs are produced when the precursor messenger RNA (pre-mRNA) splicing machinery "backsplices" and covalently joins, for example, the two ends of a single exon. However, the mechanism by which the spliceosome selects only certain exons to circularize is largely unknown. Using extensive mutagenesis of expression plasmids, we show that miniature introns containing the splice sites along with short (∼ 30- to 40-nucleotide) inverted repeats, such as Alu elements, are sufficient to allow the intervening exons to circularize in cells. The intronic repeats must base-pair to one another, thereby bringing the splice sites into close proximity to each other. More than simple thermodynamics is clearly at play, however, as not all repeats support circularization, and increasing the stability of the hairpin between the repeats can sometimes inhibit circular RNA biogenesis. The intronic repeats and exonic sequences must collaborate with one another, and a functional 3' end processing signal is required, suggesting that circularization may occur post-transcriptionally. These results suggest detailed and generalizable models that explain how the splicing machinery determines whether to produce a circular noncoding RNA or a linear mRNA. © 2014 Liang and Wilusz; Published by Cold Spring Harbor Laboratory Press.
RNA splicing. The human splicing code reveals new insights into the genetic determinants of disease.
Xiong, Hui Y; Alipanahi, Babak; Lee, Leo J; Bretschneider, Hannes; Merico, Daniele; Yuen, Ryan K C; Hua, Yimin; Gueroussov, Serge; Najafabadi, Hamed S; Hughes, Timothy R; Morris, Quaid; Barash, Yoseph; Krainer, Adrian R; Jojic, Nebojsa; Scherer, Stephen W; Blencowe, Benjamin J; Frey, Brendan J
2015-01-09
To facilitate precision medicine and whole-genome annotation, we developed a machine-learning technique that scores how strongly genetic variants affect RNA splicing, whose alteration contributes to many diseases. Analysis of more than 650,000 intronic and exonic variants revealed widespread patterns of mutation-driven aberrant splicing. Intronic disease mutations that are more than 30 nucleotides from any splice site alter splicing nine times as often as common variants, and missense exonic disease mutations that have the least impact on protein function are five times as likely as others to alter splicing. We detected tens of thousands of disease-causing mutations, including those involved in cancers and spinal muscular atrophy. Examination of intronic and exonic variants found using whole-genome sequencing of individuals with autism revealed misspliced genes with neurodevelopmental phenotypes. Our approach provides evidence for causal variants and should enable new discoveries in precision medicine. Copyright © 2015, American Association for the Advancement of Science.
[Alternative splicing regulation: implications in cancer diagnosis and treatment].
Martínez-Montiel, Nancy; Rosas-Murrieta, Nora; Martínez-Contreras, Rebeca
2015-04-08
The accurate expression of the genetic information is regulated by processes like mRNA splicing, proposed after the discoveries of Phil Sharp and Richard Roberts, who demonstrated the existence of intronic sequences, present in almost every structural eukaryotic gene, which should be precisely removed. This intron removal is called "splicing", which generates different proteins from a single mRNA, with different or even antagonistic functions. We currently know that alternative splicing is the most important source of protein diversity, given that 70% of the human genes undergo splicing and that mutations causing defects in this process could originate up to 50% of genetic diseases, including cancer. When these defects occur in genes involved in cell adhesion, proliferation and cell cycle regulation, there is an impact on cancer progression, rising the opportunity to diagnose and treat some types of cancer according to a particular splicing profile. Copyright © 2013 Elsevier España, S.L.U. All rights reserved.
A Predictive Model of Intein Insertion Site for Use in the Engineering of Molecular Switches
Apgar, James; Ross, Mary; Zuo, Xiao; Dohle, Sarah; Sturtevant, Derek; Shen, Binzhang; de la Vega, Humberto; Lessard, Philip; Lazar, Gabor; Raab, R. Michael
2012-01-01
Inteins are intervening protein domains with self-splicing ability that can be used as molecular switches to control activity of their host protein. Successfully engineering an intein into a host protein requires identifying an insertion site that permits intein insertion and splicing while allowing for proper folding of the mature protein post-splicing. By analyzing sequence and structure based properties of native intein insertion sites we have identified four features that showed significant correlation with the location of the intein insertion sites, and therefore may be useful in predicting insertion sites in other proteins that provide native-like intein function. Three of these properties, the distance to the active site and dimer interface site, the SVM score of the splice site cassette, and the sequence conservation of the site showed statistically significant correlation and strong predictive power, with area under the curve (AUC) values of 0.79, 0.76, and 0.73 respectively, while the distance to secondary structure/loop junction showed significance but with less predictive power (AUC of 0.54). In a case study of 20 insertion sites in the XynB xylanase, two features of native insertion sites showed correlation with the splice sites and demonstrated predictive value in selecting non-native splice sites. Structural modeling of intein insertions at two sites highlighted the role that the insertion site location could play on the ability of the intein to modulate activity of the host protein. These findings can be used to enrich the selection of insertion sites capable of supporting intein splicing and hosting an intein switch. PMID:22649521
Evolution of a tissue-specific splicing network
Taliaferro, J. Matthew; Alvarez, Nehemiah; Green, Richard E.; Blanchette, Marco; Rio, Donald C.
2011-01-01
Alternative splicing of precursor mRNA (pre-mRNA) is a strategy employed by most eukaryotes to increase transcript and proteomic diversity. Many metazoan splicing factors are members of multigene families, with each member having different functions. How these highly related proteins evolve unique properties has been unclear. Here we characterize the evolution and function of a new Drosophila splicing factor, termed LS2 (Large Subunit 2), that arose from a gene duplication event of dU2AF50, the large subunit of the highly conserved heterodimeric general splicing factor U2AF (U2-associated factor). The quickly evolving LS2 gene has diverged from the splicing-promoting, ubiquitously expressed dU2AF50 such that it binds a markedly different RNA sequence, acts as a splicing repressor, and is preferentially expressed in testes. Target transcripts of LS2 are also enriched for performing testes-related functions. We therefore propose a path for the evolution of a new splicing factor in Drosophila that regulates specific pre-mRNAs and contributes to transcript diversity in a tissue-specific manner. PMID:21406555
Informational structure of genetic sequences and nature of gene splicing
NASA Astrophysics Data System (ADS)
Trifonov, E. N.
1991-10-01
Only about 1/20 of DNA of higher organisms codes for proteins, by means of classical triplet code. The rest of DNA sequences is largely silent, with unclear functions, if any. The triplet code is not the only code (message) carried by the sequences. There are three levels of molecular communication, where the same sequence ``talks'' to various bimolecules, while having, respectively, three different appearances: DNA, RNA and protein. Since the molecular structures and, hence, sequence specific preferences of these are substantially different, the original DNA sequence has to carry simultaneously three types of sequence patterns (codes, messages), thus, being a composite structure in which one had the same letter (nucleotide) is frequently involved in several overlapping codes of different nature. This multiplicity and overlapping of the codes is a unique feature of the Gnomic, language of genetic sequences. The coexisting codes have to be degenerate in various degrees to allow an optimal and concerted performance of all the encoded functions. There is an obvious conflict between the best possible performance of a given function and necessity to compromise the quality of a given sequence pattern in favor of other patterns. It appears that the major role of various changes in the sequences on their ``ontogenetic'' way from DNA to RNA to protein, like RNA editing and splicing, or protein post-translational modifications is to resolve such conflicts. New data are presented strongly indicating that the gene splicing is such a device to resolve the conflict between the code of DNA folding in chromatin and the triplet code for protein synthesis.
Haas, Brian J; Salzberg, Steven L; Zhu, Wei; Pertea, Mihaela; Allen, Jonathan E; Orvis, Joshua; White, Owen; Buell, C Robin; Wortman, Jennifer R
2008-01-01
EVidenceModeler (EVM) is presented as an automated eukaryotic gene structure annotation tool that reports eukaryotic gene structures as a weighted consensus of all available evidence. EVM, when combined with the Program to Assemble Spliced Alignments (PASA), yields a comprehensive, configurable annotation system that predicts protein-coding genes and alternatively spliced isoforms. Our experiments on both rice and human genome sequences demonstrate that EVM produces automated gene structure annotation approaching the quality of manual curation. PMID:18190707
Reichert, Vienna; Moore, Melissa J.
2000-01-01
We demonstrate here that replacing potassium chloride (KCl) with potassium acetate (KAc) or potassium glutamate (KGlu) routinely enhances the yield of RNA intermediates and products obtained from in vitro splicing reactions performed in HeLa cell nuclear extract. This effect was reproducibly observed with multiple splicing substrates. The enhanced yields are at least partially due to stabilization of splicing precursors and products in the KAc and KGlu reactions. This stabilization relative to KCl reactions was greatest with KGlu and was observed over an extended potassium concentration range. The RNA stability differences could not be attributed to heavy metal contamination of the KCl, since ultrapure preparations of this salt yielded similar results. After testing various methods for altering the salts, we found that substitution of KAc or KGlu for KCl and MgAc2 for MgCl2 in splicing reactions is the simplest and most effective. Since the conditions defined here more closely mimic in vivo ionic concentrations, they may permit the study of more weakly spliced substrates, as well as facilitate more detailed analyses of spliceosome structure and function. PMID:10606638
Identifying RNA splicing factors using IFT genes in Chlamydomonas reinhardtii.
Lin, Huawen; Zhang, Zhengyan; Iomini, Carlo; Dutcher, Susan K
2018-03-01
Intraflagellar transport moves proteins in and out of flagella/cilia and it is essential for the assembly of these organelles. Using whole-genome sequencing, we identified splice site mutations in two IFT genes, IFT81 ( fla9 ) and IFT121 ( ift121-2 ), which lead to flagellar assembly defects in the unicellular green alga Chlamydomonas reinhardtii The splicing defects in these ift mutants are partially corrected by mutations in two conserved spliceosome proteins, DGR14 and FRA10. We identified a dgr14 deletion mutant, which suppresses the 3' splice site mutation in IFT81 , and a frameshift mutant of FRA10 , which suppresses the 5' splice site mutation in IFT121 Surprisingly, we found dgr14-1 and fra10 mutations suppress both splice site mutations. We suggest these two proteins are involved in facilitating splice site recognition/interaction; in their absence some splice site mutations are tolerated. Nonsense mutations in SMG1 , which is involved in nonsense-mediated decay, lead to accumulation of aberrant transcripts and partial restoration of flagellar assembly in the ift mutants. The high density of introns and the conservation of noncore splicing factors, together with the ease of scoring the ift mutant phenotype, make Chlamydomonas an attractive organism to identify new proteins involved in splicing through suppressor screening. © 2018 The Authors.
Mis-Spliced Lr34 Transcript Events in Winter Wheat.
Fang, Tilin; Carver, Brett F; Hunger, Robert M; Yan, Liuling
2017-01-01
Lr34 in wheat is a non-race-specific gene that confers resistance against multiple fungal pathogens. The resistant allele Lr34 and the susceptible allele Lr34s can be distinguished by three polymorphisms that cause alternation of deduced amino acid sequences of Lr34 at the protein level. In seedlings of a cultivar carrying the resistant Lr34r allele, only a portion (35%) of its transcripts was correctly spliced and the majority (65%) of its transcripts were incorrectly spliced due to multiple mis-splicing events. Lr34 mis-splicing events were also observed at adult plant age when this gene exerts its function. All of the mis-spliced Lr34r cDNA transcripts observed in this study resulted in a premature stop codon due to a shift of the open reading frame; hence, the mis-spliced Lr34r cDNAs were deduced to encode incomplete proteins. Even if a cultivar has a functional Lr34 gene, its transcripts might not completely splice in a correct pattern. These findings suggested that the partial resistance conferred by a quantitative gene might be due to mis-splicing events in its transcripts; hence, the resistance of the gene could be increased by eliminating or mutating regulators that cause mis-splicing events in wheat.
Carter, James R; Keith, James H; Fraser, Tresa S; Dawson, James L; Kucharski, Cheryl A; Horne, Kate M; Higgs, Stephen; Fraser, Malcolm J
2014-06-13
Approximately 100 million confirmed infections and 20,000 deaths are caused by Dengue virus (DENV) outbreaks annually. Global warming and rapid dispersal have resulted in DENV epidemics in formally non-endemic regions. Currently no consistently effective preventive measures for DENV exist, prompting development of transgenic and paratransgenic vector control approaches. Production of transgenic mosquitoes refractory for virus infection and/or transmission is contingent upon defining antiviral genes that have low probability for allowing escape mutations, and are equally effective against multiple serotypes. Previously we demonstrated the effectiveness of an anti-viral group I intron targeting U143 of the DENV genome in mediating trans-splicing and expression of a marker gene with the capsid coding domain. In this report we examine the effectiveness of coupling expression of ΔN Bax to trans-splicing U143 intron activity as a means of suppressing DENV infection of mosquito cells. Targeting the conserved DENV circularization sequence (CS) by U143 intron trans-splicing activity appends a 3' exon RNA encoding ΔN Bax to the capsid coding region of the genomic RNA, resulting in a chimeric protein that induces premature cell death upon infection. TCID50-IFA analyses demonstrate an enhancement of DENV suppression for all DENV serotypes tested over the identical group I intron coupled with the non-apoptotic inducing firefly luciferase as the 3' exon. These cumulative results confirm the increased effectiveness of this αDENV-U143-ΔN Bax group I intron as a sequence specific antiviral that should be useful for suppression of DENV in transgenic mosquitoes. Annexin V staining, caspase 3 assays, and DNA ladder observations confirm DCA-ΔN Bax fusion protein expression induces apoptotic cell death. This report confirms the relative effectiveness of an anti-DENV group I intron coupled to an apoptosis-inducing ΔN Bax 3' exon that trans-splices conserved sequences of the 5' CS region of all DENV serotypes and induces apoptotic cell death upon infection. Our results confirm coupling the targeted ribozyme capabilities of the group I intron with the generation of an apoptosis-inducing transcript increases the effectiveness of infection suppression, improving the prospects of this unique approach as a means of inducing transgenic refractoriness in mosquitoes for all serotypes of this important disease.
In silico study of breast cancer associated gene 3 using LION Target Engine and other tools.
León, Darryl A; Cànaves, Jaume M
2003-12-01
Sequence analysis of individual targets is an important step in annotation and validation. As a test case, we investigated human breast cancer associated gene 3 (BCA3) with LION Target Engine and with other bioinformatics tools. LION Target Engine confirmed that the BCA3 gene is located on 11p15.4 and that the two most likely splice variants (lacking exon 3 and exons 3 and 5, respectively) exist. Based on our manual curation of sequence data, it is proposed that an additional variant (missing only exon 5) published in a public sequence repository, is a prediction artifact. A significant number of new orthologs were also identified, and these were the basis for a high-quality protein secondary structure prediction. Moreover, our research confirmed several distinct functional domains as described in earlier reports. Sequence conservation from multiple sequence alignments, splice variant identification, secondary structure predictions, and predicted phosphorylation sites suggest that the removal of interaction sites through alternative splicing might play a modulatory role in BCA3. This in silico approach shows the depth and relevance of an analysis that can be accomplished by including a variety of publicly available tools with an integrated and customizable life science informatics platform.
Betz-Stablein, B. D.; Töpfer, A.; Littlejohn, M.; Yuen, L.; Colledge, D.; Sozzi, V.; Angus, P.; Thompson, A.; Revill, P.; Beerenwinkel, N.; Warner, N.
2016-01-01
ABSTRACT Chronic hepatitis B (CHB) is prevalent worldwide. The infectious agent, hepatitis B virus (HBV), replicates via an RNA intermediate and is error prone, leading to the rapid generation of closely related but not identical viral variants, including those that can escape host immune responses and antiviral treatments. The complexity of CHB can be further enhanced by the presence of HBV variants with large deletions in the genome generated via splicing (spHBV variants). Although spHBV variants are incapable of autonomous replication, their replication is rescued by wild-type HBV. spHBV variants have been shown to enhance wild-type virus replication, and their prevalence increases with liver disease progression. Single-molecule deep sequencing was performed on whole HBV genomes extracted from samples, including the liver explant, longitudinally collected from a subject with CHB over a 15-year period after liver transplantation. By employing novel bioinformatics methods, this analysis showed that the dynamics of the viral population across a period of changing treatment regimens was complex. The spHBV variants detected in the liver explant remained present posttransplantation, and a highly diverse novel spHBV population as well as variants with multiple deletions in the pre-S genes emerged. The identification of novel mutations outside the HBV reverse transcriptase gene that co-occurred with known drug resistance-associated mutations highlights the relevance of using full-genome deep sequencing and supports the hypothesis that drug resistance involves interactions across the full length of the HBV genome. IMPORTANCE Single-molecule sequencing allowed the characterization, in unprecedented detail, of the evolution of HBV populations and offered unique insights into the dynamics of defective and spHBV variants following liver transplantation and complex treatment regimens. This analysis also showed the rapid adaptation of HBV populations to treatment regimens with evolving drug resistance phenotypes and evidence of purifying selection across the whole genome. Finally, the new open-source bioinformatics tools with the capacity to easily identify potential spliced variants from deep sequencing data are freely available. PMID:27252524
Novel C8orf37 mutations cause retinitis pigmentosa in consanguineous families of Pakistani origin
Ravesh, Zeinab; El Asrag, Mohammed E.; Weisschuh, Nicole; McKibbin, Martin; Reuter, Peggy; Watson, Christopher M.; Baumann, Britta; Poulter, James A.; Sajid, Sundus; Panagiotou, Evangelia S.; O’Sullivan, James; Abdelhamed, Zakia; Bonin, Michael; Soltanifar, Mehdi; Black, Graeme C.M.; Din, Muhammad Amin-ud; Toomes, Carmel; Ansar, Muhammad; Inglehearn, Chris F.; Wissinger, Bernd
2015-01-01
Purpose To investigate the molecular basis of retinitis pigmentosa in two consanguineous families of Pakistani origin with multiple affected members. Methods Homozygosity mapping and Sanger sequencing of candidate genes were performed in one family while the other was analyzed with whole exome next-generation sequencing. A minigene splicing assay was used to confirm the splicing defects. Results In family MA48, a novel homozygous nucleotide substitution in C8orf37, c.244–2A>C, that disrupted the consensus splice acceptor site of exon 3 was found. The minigene splicing assay revealed that this mutation activated a cryptic splice site within exon 3, causing a 22 bp deletion in the transcript that is predicted to lead to a frameshift followed by premature protein truncation. In family MA13, a novel homozygous null mutation in C8orf37, c.555G>A, p.W185*, was identified. Both mutations segregated with the disease phenotype as expected in a recessive manner and were absent in 8,244 unrelated individuals of South Asian origin. Conclusions In this report, we describe C8orf37 mutations that cause retinal dystrophy in two families of Pakistani origin, contributing further data on the phenotype and the spectrum of mutations in this form of retinitis pigmentosa. PMID:25802487
U2AF1 mutations alter splice site recognition in hematological malignancies.
Ilagan, Janine O; Ramakrishnan, Aravind; Hayes, Brian; Murphy, Michele E; Zebari, Ahmad S; Bradley, Philip; Bradley, Robert K
2015-01-01
Whole-exome sequencing studies have identified common mutations affecting genes encoding components of the RNA splicing machinery in hematological malignancies. Here, we sought to determine how mutations affecting the 3' splice site recognition factor U2AF1 alter its normal role in RNA splicing. We find that U2AF1 mutations influence the similarity of splicing programs in leukemias, but do not give rise to widespread splicing failure. U2AF1 mutations cause differential splicing of hundreds of genes, affecting biological pathways such as DNA methylation (DNMT3B), X chromosome inactivation (H2AFY), the DNA damage response (ATR, FANCA), and apoptosis (CASP8). We show that U2AF1 mutations alter the preferred 3' splice site motif in patients, in cell culture, and in vitro. Mutations affecting the first and second zinc fingers give rise to different alterations in splice site preference and largely distinct downstream splicing programs. These allele-specific effects are consistent with a computationally predicted model of U2AF1 in complex with RNA. Our findings suggest that U2AF1 mutations contribute to pathogenesis by causing quantitative changes in splicing that affect diverse cellular pathways, and give insight into the normal function of U2AF1's zinc finger domains. © 2015 Ilagan et al.; Published by Cold Spring Harbor Laboratory Press.
Brown, Roger B; Madrid, Nathaniel J; Suzuki, Hideaki; Ness, Scott A
2017-01-01
RNA-sequencing (RNA-seq) has become the standard method for unbiased analysis of gene expression but also provides access to more complex transcriptome features, including alternative RNA splicing, RNA editing, and even detection of fusion transcripts formed through chromosomal translocations. However, differences in library methods can adversely affect the ability to recover these different types of transcriptome data. For example, some methods have bias for one end of transcripts or rely on low-efficiency steps that limit the complexity of the resulting library, making detection of rare transcripts less likely. We tested several commonly used methods of RNA-seq library preparation and found vast differences in the detection of advanced transcriptome features, such as alternatively spliced isoforms and RNA editing sites. By comparing several different protocols available for the Ion Proton sequencer and by utilizing detailed bioinformatics analysis tools, we were able to develop an optimized random primer based RNA-seq technique that is reliable at uncovering rare transcript isoforms and RNA editing features, as well as fusion reads from oncogenic chromosome rearrangements. The combination of optimized libraries and rapid Ion Proton sequencing provides a powerful platform for the transcriptome analysis of research and clinical samples.
Method of artificial DNA splicing by directed ligation (SDL).
Lebedenko, E N; Birikh, K R; Plutalov, O V; Berlin YuA
1991-01-01
An approach to directed genetic recombination in vitro has been devised, which allows for joining together, in a predetermined way, a series of DNA segments to give a precisely spliced polynucleotide sequence (DNA splicing by directed ligation, SDL). The approach makes use of amplification, by means of several polymerase chain reactions (PCR), of a chosen set of DNA segments. Primers for the amplifications contain recognition sites of the class IIS restriction endonucleases, which transform blunt ends of the amplification products into protruding ends of unique primary structures, the ends to be used for joining segments together being mutually complementary. Ligation of the mixture of the segments so synthesized gives the desired sequence in an unambiguous way. The suggested approach has been exemplified by the synthesis of a totally processed (intronless) gene encoding human mature interleukin-1 alpha. Images PMID:1662363
ABMapper: a suffix array-based tool for multi-location searching and splice-junction mapping.
Lou, Shao-Ke; Ni, Bing; Lo, Leung-Yau; Tsui, Stephen Kwok-Wing; Chan, Ting-Fung; Leung, Kwong-Sak
2011-02-01
Sequencing reads generated by RNA-sequencing (RNA-seq) must first be mapped back to the genome through alignment before they can be further analyzed. Current fast and memory-saving short-read mappers could give us a quick view of the transcriptome. However, they are neither designed for reads that span across splice junctions nor for repetitive reads, which can be mapped to multiple locations in the genome (multi-reads). Here, we describe a new software package: ABMapper, which is specifically designed for exploring all putative locations of reads that are mapped to splice junctions or repetitive in nature. The software is freely available at: http://abmapper.sourceforge.net/. The software is written in C++ and PERL. It runs on all major platforms and operating systems including Windows, Mac OS X and LINUX.
Gao, Panke; Jin, Zhen; Cheng, Yingying; Cao, Xiangshan
2014-10-01
Aberrant splicing events play important roles in the pathogenesis of acute myeloid leukemia (AML). To investigate the aberrant splicing events in AML during treatment, we carried out RNA sequencing in peripheral mononuclear cell samples from a patient with complete remission. In addition to the sequencing samples, selected splicing events were confirmed and validated with real-time quantitative RT-PCR in another seven pairs of samples. A total of 4.05 and 3.39 GB clean data of the AML and remission sample were generated, respectively, and 2,223 differentially expressed genes (DEGs) were identified. Integrated with gene expression profiling on T cells from AML patients compared with healthy donors, 82 DEGs were also differentially expressed in AML CD4 T cells and CD8 T cells. Twenty-three alternative splicing events were considered to be confidential, and they were involved in many biological processes, such as RNA processing, cellular macromolecule catabolic process, and DNA binding process. An exon3-skipping event in TRIP12 was detected in patients at remission and further validated in another three independent samples. TRIP12 is an ubiquitin ligase of ARF, which suppresses aberrant cell growth by activating p53 responses. The exon3-skipping isoform of TRIP12 increased significantly after treatment. Our results may provide new understanding of AML, and the confirmed alternative splicing event of TRIP12 may be used as potential target for future investigations.
HALBERT, CHRISTINE L.; LAM, SIU-LING; MILLER, A. DUSTY
2014-01-01
The transduction efficiency of adeno-associated virus (AAV) vectors in various somatic tissues has been shown to depend heavily on the AAV type from which the vector capsid proteins are derived. Among the AAV types studied, AAV6 efficiently transduces cells of the airway epithelium, making it a good candidate for the treatment of lung diseases such as cystic fibrosis. Here we have evaluated the effects of various promoter sequences on transduction rates and gene expression levels in the lung. Of the strong viral promoters examined, the Rous sarcoma virus (RSV) promoter performed significantly better than a human cytomegalovirus (CMV) promoter in the airway epithelium. However, a hybrid promoter consisting of a CMV enhancer, β-actin promoter and splice donor, and a β-globin splice acceptor (CAG promoter) exhibited even higher expression than either of the strong viral promoters alone, showing a 38-fold increase in protein expression over the RSV promoter. In addition, we show that vectors containing either the RSV or CAG promoter expressed well in the nasal and tracheal epithelium. Transduction rates in the 90% range were achieved in many airways with the CAG promoter, showing that with the proper AAV capsid proteins and promoter sequences, highly efficient transduction can be achieved. PMID:17430088
Concurrent and Accurate Short Read Mapping on Multicore Processors.
Martínez, Héctor; Tárraga, Joaquín; Medina, Ignacio; Barrachina, Sergio; Castillo, Maribel; Dopazo, Joaquín; Quintana-Ortí, Enrique S
2015-01-01
We introduce a parallel aligner with a work-flow organization for fast and accurate mapping of RNA sequences on servers equipped with multicore processors. Our software, HPG Aligner SA (HPG Aligner SA is an open-source application. The software is available at http://www.opencb.org, exploits a suffix array to rapidly map a large fraction of the RNA fragments (reads), as well as leverages the accuracy of the Smith-Waterman algorithm to deal with conflictive reads. The aligner is enhanced with a careful strategy to detect splice junctions based on an adaptive division of RNA reads into small segments (or seeds), which are then mapped onto a number of candidate alignment locations, providing crucial information for the successful alignment of the complete reads. The experimental results on a platform with Intel multicore technology report the parallel performance of HPG Aligner SA, on RNA reads of 100-400 nucleotides, which excels in execution time/sensitivity to state-of-the-art aligners such as TopHat 2+Bowtie 2, MapSplice, and STAR.
MYCN controls an alternative RNA splicing program in high-risk metastatic neuroblastoma
Zhang, Shile; Wei, Jun S.; Li, Samuel Q.; Badgett, Tom C.; Song, Young K.; Agarwal, Saurabh; Coarfa, Cristian; Tolman, Catherine; Hurd, Laura; Liao, Hongling; He, Jianbin; Wen, Xinyu; Liu, Zhihui; Thiele, Carol J.; Westermann, Frank; Asgharzadeh, Shahab; Seeger, Robert C.; Maris, John M.; Auvil, Jamie M Guidry; Smith, Malcolm A; Kolaczyk, Eric D; Shohet, Jason; Khan, Javed
2016-01-01
The molecular mechanisms underlying the aggressive behavior of MYCN driven neuroblastoma (NBL) is under intense investigation; however, little is known about the impact of this family of transcription factors on the splicing program. Here we used high-throughput RNA sequencing to systematically study the expression of RNA isoforms in stage 4 MYCN-amplified NBL, an aggressive subtype of metastatic NBL. We show that MYCN-amplified NBL tumors display a distinct gene splicing pattern affecting multiple cancer hallmark functions. Six splicing factors displayed unique differential expression patterns in MYCN-amplified tumors and cell lines, and the binding motifs for some of these splicing factors are significantly enriched in differentially-spliced genes. Direct binding of MYCN to promoter regions of the splicing factors PTBP1 and HNRNPA1 detected by ChIP-seq demonstrates MYCN controls the splicing pattern by direct regulation of the expression of these key splicing factors. Furthermore, high expression of PTBP1 and HNRNPA1 was significantly associated with poor overall survival of stage4 NBL patients (p≤0.05). Knocking down PTBP1, HNRNPA1 and their downstream target PKM2, an isoform of pro-tumor-growth, result in repressed growth of NBL cells. Therefore, our study reveals a novel role of MYCN in controlling global splicing program through regulation of splicing factors in addition to its well-known role in the transcription program. These findings suggest a therapeutically potential to target the key splicing factors or gene isoforms in high-risk NBL with MYCN-amplification. PMID:26683771
A mutational analysis of U12-dependent splice site dinucleotides
DIETRICH, ROSEMARY C.; FULLER, JOHN D.; PADGETT, RICHARD A.
2005-01-01
Introns spliced by the U12-dependent minor spliceosome are divided into two classes based on their splice site dinucleotides. The /AU-AC/ class accounts for about one-third of U12-dependent introns in humans, while the /GU-AG/ class accounts for the other two-thirds. We have investigated the in vivo and in vitro splicing phenotypes of mutations in these dinucleotide sequences. A 5′ A residue can splice to any 3′ residue, although C is preferred. A 5′ G residue can splice to 3′ G or U residues with a preference for G. Little or no splicing was observed to 3′ A or C residues. A 5′ U or C residue is highly deleterious for U12-dependent splicing, although some combinations, notably 5′ U to 3′ U produced detectable spliced products. The dependence of 3′ splice site activity on the identity of the 5′ residue provides evidence for communication between the first and last nucleotides of the intron. Most mutants in the second position of the 5′ splice site and the next to last position of the 3′ splice site were defective for splicing. Double mutants of these residues showed no evidence of communication between these nucleotides. Varying the distance between the branch site and the 3′ splice site dinucleotide in the /GU-AG/ class showed that a somewhat larger range of distances was functional than for the /AU-AC/ class. The optimum branch site to 3′ splice site distance of 11–12 nucleotides appears to be the same for both classes. PMID:16043500
New Splice Site Acceptor Mutation in AIRE Gene in Autoimmune Polyendocrine Syndrome Type 1
Mora, Mireia; Hanzu, Felicia A.; Pradas-Juni, Marta; Aranda, Gloria B.; Halperin, Irene; Puig-Domingo, Manuel; Aguiló, Sira; Fernández-Rebollo, Eduardo
2014-01-01
Autoimmune polyglandular syndrome type 1 (APS-1, OMIM 240300) is a rare autosomal recessive disorder, characterized by the presence of at least two of three major diseases: hypoparathyroidism, Addison’s disease, and chronic mucocutaneous candidiasis. We aim to identify the molecular defects and investigate the clinical and mutational characteristics in an index case and other members of a consanguineous family. We identified a novel homozygous mutation in the splice site acceptor (SSA) of intron 5 (c.653-1G>A) in two siblings with different clinical outcomes of APS-1. Coding DNA sequencing revealed that this AIRE mutation potentially compromised the recognition of the constitutive SSA of intron 5, splicing upstream onto a nearby cryptic SSA in intron 5. Surprisingly, the use of an alternative SSA entails the uncovering of a cryptic donor splice site in exon 5. This new transcript generates a truncated protein (p.A214fs67X) containing the first 213 amino acids and followed by 68 aberrant amino acids. The mutation affects the proper splicing, not only at the acceptor but also at the donor splice site, highlighting the complexity of recognizing suitable splicing sites and the importance of sequencing the intron-exon junctions for a more precise molecular diagnosis and correct genetic counseling. As both siblings were carrying the same mutation but exhibited a different APS-1 onset, and one of the brothers was not clinically diagnosed, our finding highlights the possibility to suspect mutations in the AIRE gene in cases of childhood chronic candidiasis and/or hypoparathyroidism otherwise unexplained, especially when the phenotype is associated with other autoimmune diseases. PMID:24988226
Mootz, Henning D; Blum, Elyse S; Tyszkiewicz, Amy B; Muir, Tom W
2003-09-03
Protein splicing is a naturally occurring process in which an intervening intein domain excises itself out of a precursor polypeptide in an autocatalytic fashion with concomitant linkage of the two flanking extein sequences by a native peptide bond. We have recently reported an engineered split VMA intein whose splicing activity in trans between two polypeptides can be triggered by the small molecule rapamycin. In this report, we show that this conditional protein splicing (CPS) system can be used in mammalian cells. Two model constructs harboring maltose-binding protein (MBP) and a His-tag as exteins were expressed from a constitutive promoter after transient transfection. The splicing product MBP-His was detected by Western blotting and immunoprecipitation in cells treated with rapamycin or a nontoxic analogue thereof. No background splicing in the absence of the small-molecule inducer was observed over a 24-h time course. Product formation could be detected within 10 min of addition of rapamycin, indicating the advantage of the posttranslational nature of CPS for quick responses. The level of protein splicing was dose dependent and could be competitively attenuated with the small molecule ascomycin. In related studies, the geometric flexibility of the CPS components was investigated with a series of purified proteins. The FKBP and FRB domains, which are dimerized by rapamycin and thereby induce the reconstitution of the split intein, were fused to the extein sequences of the split intein halves. CPS was still triggered by rapamycin when FKBP and FRB occupied one or both of the extein positions. This finding suggests yet further applications of CPS in the area of proteomics. In summary, CPS holds great promise to become a powerful new tool to control protein structure and function in vitro and in living cells.
RAP: RNA-Seq Analysis Pipeline, a new cloud-based NGS web application
2015-01-01
Background The study of RNA has been dramatically improved by the introduction of Next Generation Sequencing platforms allowing massive and cheap sequencing of selected RNA fractions, also providing information on strand orientation (RNA-Seq). The complexity of transcriptomes and of their regulative pathways make RNA-Seq one of most complex field of NGS applications, addressing several aspects of the expression process (e.g. identification and quantification of expressed genes and transcripts, alternative splicing and polyadenylation, fusion genes and trans-splicing, post-transcriptional events, etc.). Moreover, the huge volume of data generated by NGS platforms introduces unprecedented computational and technological challenges to efficiently analyze and store sequence data and results. Methods In order to provide researchers with an effective and friendly resource for analyzing RNA-Seq data, we present here RAP (RNA-Seq Analysis Pipeline), a cloud computing web application implementing a complete but modular analysis workflow. This pipeline integrates both state-of-the-art bioinformatics tools for RNA-Seq analysis and in-house developed scripts to offer to the user a comprehensive strategy for data analysis. RAP is able to perform quality checks (adopting FastQC and NGS QC Toolkit), identify and quantify expressed genes and transcripts (with Tophat, Cufflinks and HTSeq), detect alternative splicing events (using SpliceTrap) and chimeric transcripts (with ChimeraScan). This pipeline is also able to identify splicing junctions and constitutive or alternative polyadenylation sites (implementing custom analysis modules) and call for statistically significant differences in genes and transcripts expression, splicing pattern and polyadenylation site usage (using Cuffdiff2 and DESeq). Results Through a user friendly web interface, the RAP workflow can be suitably customized by the user and it is automatically executed on our cloud computing environment. This strategy allows to access to bioinformatics tools and computational resources without specific bioinformatics and IT skills. RAP provides a set of tabular and graphical results that can be helpful to browse, filter and export analyzed data, according to the user needs. PMID:26046471
NASA Astrophysics Data System (ADS)
Schmidt, J. B.
1985-09-01
This thesis investigates ways of improving the real-time performance of the Stockpoint Logistics Integrated Communication Environment (SPLICE). Performance evaluation through continuous monitoring activities and performance studies are the principle vehicles discussed. The method for implementing this performance evaluation process is the measurement of predefined performance indexes. Performance indexes for SPLICE are offered that would measure these areas. Existing SPLICE capability to carry out performance evaluation is explored, and recommendations are made to enhance that capability.
Both sides of the same coin: Rac1 splicing regulating by EGF signaling.
Fu, Xiang-Dong
2017-04-01
EGF, a well-studied mitogen for cancer cells, is revealed to induce an E3 ubiquitin ligase adaptor SPSB1, which recruits the Elongin B/C-Collin complex to trigger ubiquitylation of the negative splicing regulator hnRNP A1. This event is synergized with EGF-activated SR proteins to alter alternative splicing of a key small GTPase Rac1 to enhance cell migration, highlighting converging EGF signals on both negative and positive splicing regulators to jointly promote a key cancer pathway.
Niimi, Hideki; Ogawa, Tomomi; Note, Rhougou; Hayashi, Shirou; Ueno, Tomohiro; Harada, Kenu; Uji, Yoshinori; Kitajima, Isao
2010-12-01
In recent years, genetic diagnostics of pathogenic splicing abnormalities are increasingly recognized as critically important in the clinical genetic diagnostics. It is reported that approximately 10% of pathogenic mutations causing human inherited diseases are splicing mutations. Nonetheless, it is still difficult to identify splicing abnormalities in routine genetic diagnostic settings. Here, we studied two different kinds of cases with splicing abnormalities. The first case is a protein S deficiency. Nucleotide analyses revealed that the proband had a previously reported G to C substitution in the invariant AG dinucleotide at the splicing acceptor site of intronl/exon2, which produces multiple splicing abnormalities resulting in protein S deficiency. The second case is an antithrombin (AT) deficiency. This proband had a previously reported G to A substitution, at nucleotide position 9788 in intron 4, 14 bp in front of exon 5, which created a de novo exon 5 splice site and resulted in AT deficiency. From a practical standpoint, we discussed the pitfalls, attentions, and screening approaches in genetic diagnostics of pathogenic splicing abnormalities. Due to the difficulty with full-length sequence analysis of introns, and the lack of RNA samples, splicing mutations may escape identification. Although current genetic testing remains to be improved, to screen for splicing abnormalities more efficiently, it is significant to use an appropriate combination of various approaches such as DNA and/or RNA samples, splicing mutation databases, bioinformatic tools to detect splice sites and cis-regulatory elements, and in vitro and/or in vivo experimentally methods as needed.
Wang, Xinye; Xu, Xindong; Lu, Xingyu; Zhang, Yuanbin; Pan, Weiqing
2015-01-01
Alternative splicing is a molecular process that contributes greatly to the diversification of proteome and to gene functions. Understanding the mechanisms of stage-specific alternative splicing can provide a better understanding of the development of eukaryotes and the functions of different genes. Schistosoma japonicum is an infectious blood-dwelling trematode with a complex lifecycle that causes the tropical disease schistosomiasis. In this study, we analyzed the transcriptome of Schistosoma japonicum to discover alternative splicing events in this parasite, by applying RNA-seq to cDNA library of adults and schistosomula. Results were validated by RT-PCR and sequencing. We found 11,623 alternative splicing events among 7,099 protein encoding genes and average proportion of alternative splicing events per gene was 42.14%. We showed that exon skip is the most common type of alternative splicing events as found in high eukaryotes, whereas intron retention is the least common alternative splicing type. According to intron boundary analysis, the parasite possesses same intron boundaries as other organisms, namely the classic “GT-AG” rule. And in alternative spliced introns or exons, this rule is less strict. And we have attempted to detect alternative splicing events in genes encoding proteins with signal peptides and transmembrane helices, suggesting that alternative splicing could change subcellular locations of specific gene products. Our results indicate that alternative splicing is prevalent in this parasitic worm, and that the worm is close to its hosts. The revealed secretome involved in alternative splicing implies new perspective into understanding interaction between the parasite and its host. PMID:26407301
Chee, Gab-Joo; Takami, Hideto
2011-01-01
Group II introns inserted into genes often undergo splicing at unexpected sites, and participate in the transcription of host genes. We identified five copies of a group II intron, designated Oi.Int, in the genome of an extremely halotolerant and alkaliphilic bacillus, Oceanobacillus iheyensis. The Oi.Int4 differs from the Oi.Int3 at four bases. The ligated exons of the Oi.Int4 could not be detected by RT-PCR assays in vivo or in vitro although group II introns can generally self-splice in vitro without the involvement of an intron-encoded open reading frame (ORF). In the Oi.Int4 mutants with base substitutions within the ORF, ligated exons were detected by in vitro self-splicing. It was clear that the ligation of exons during splicing is affected by the sequence of the intron-encoded ORF since the splice sites corresponded to the joining sites of the intron. In addition, the mutant introns showed unexpected multiple products with alternative 5' splice sites. These findings imply that alternative 5' splicing which causes a functional change of ligated exons presumably has influenced past adaptations of O. iheyensis to various environmental changes.
Shults, Cody L; Dingwall, Caitlin B; Kim, Chun K; Pinceti, Elena; Rao, Yathindar S; Pak, Toni R
2018-01-01
Alternative RNA splicing results in the translation of diverse protein products arising from a common nucleotide sequence. These alternative protein products are often functional and can have widely divergent actions from the canonical protein. Studies in humans and other vertebrate animals have demonstrated that alternative splicing events increase with advanced age, sometimes resulting in pathological consequences. Menopause represents a critical transition for women, where the beneficial effects of estrogens are no longer evident; therefore, factors underlying increased pathological conditions in women are confounded by the dual factors of aging and declining estrogens. Estrogen receptors (ERs) are subject to alternative splicing, the spliced variants increase following menopause, and they fail to efficiently activate estrogen-dependent signaling pathways. However, the factors that regulate the alternative splicing of ERs remain unknown. We demonstrate novel evidence supporting a potential biological feedback loop where 17β-estradiol regulates the RNA-binding protein Nova1, which, in turn, regulates the alternative splicing of ERβ. These data increase our understanding of ER alternative splicing and could have potential implications for women taking hormone replacement therapy after menopause. Copyright © 2017 Elsevier Inc. All rights reserved.
Viglianti, G A; Rubinstein, E P; Graves, K L
1992-01-01
The untranslated leader sequences of rhesus macaque simian immunodeficiency virus mRNAs form a stable secondary structure, TAR. This structure can be modified by RNA splicing. In this study, the role of TAR splicing in virus replication was investigated. The proportion of viral RNAs containing a spliced TAR structure is high early after infection and decreases at later times. Moreover, proviruses containing mutations which prevent TAR splicing are significantly delayed in replication. These mutant viruses require approximately 20 days to achieve half-maximal virus production, in contrast to wild-type viruses, which require approximately 8 days. We attribute this delay to the inefficient translation of unspliced-TAR-containing mRNAs. The molecular basis for this translational effect was examined in in vitro assays. We found that spliced-TAR-containing mRNAs were translated up to 8.5 times more efficiently than were similar mRNAs containing an unspliced TAR leader. Furthermore, these spliced-TAR-containing mRNAs were more efficiently associated with ribosomes. We postulate that the level of TAR splicing provides a balance for the optimal expression of both viral proteins and genomic RNA and therefore ultimately controls the production of infectious virions. Images PMID:1629957
Lu, Zhi-xiang; Peng, Jia; Su, Bing
2007-10-01
Neuropsin (kallikrein 8, KLK8) is a secreted-type serine protease preferentially expressed in the central nervous system and involved in learning and memory. Its splicing pattern is different in human and mouse, with the longer form (type II) only expressed in human. Sequence analysis suggested a recent origin of type II during primate evolution. Here we demonstrate that the type II form is absent in nonhuman primates, and is thus a human-specific splice form. With the use of an in vitro splicing assay, we show that a human-specific T to A mutation (c.71-127T>A) triggers the change of splicing pattern, leading to the origin of a novel splice form in the human brain. Using mutation assay, we prove that this mutation is not only necessary but also sufficient for type II expression. Our results demonstrate a molecular mechanism for the creation of novel proteins through alternative splicing in the central nervous system during human evolution. Copyright 2007 Wiley-Liss, Inc.
Lehmann, Kjong-Van; Kahles, André; Kandoth, Cyriac; Lee, William; Schultz, Nikolaus; Stegle, Oliver; Rätsch, Gunnar
2015-01-01
We present a genome-wide analysis of splicing patterns of 282 kidney renal clear cell carcinoma patients in which we integrate data from whole-exome sequencing of tumor and normal samples, RNA-seq and copy number variation. We proposed a scoring mechanism to compare splicing patterns in tumor samples to normal samples in order to rank and detect tumor-specific isoforms that have a potential for new biomarkers. We identified a subset of genes that show introns only observable in tumor but not in normal samples, ENCODE and GEUVADIS samples. In order to improve our understanding of the underlying genetic mechanisms of splicing variation we performed a large-scale association analysis to find links between somatic or germline variants with alternative splicing events. We identified 915 cis- and trans-splicing quantitative trait loci (sQTL) associated with changes in splicing patterns. Some of these sQTL have previously been associated with being susceptibility loci for cancer and other diseases. Our analysis also allowed us to identify the function of several COSMIC variants showing significant association with changes in alternative splicing. This demonstrates the potential significance of variants affecting alternative splicing events and yields insights into the mechanisms related to an array of disease phenotypes.
Wistow, Graeme; Bernstein, Steven L; Wyatt, M Keith; Fariss, Robert N; Behal, Amita; Touchman, Jeffrey W; Bouffard, Gerald; Smith, Don; Peterson, Katherine
2002-06-15
The retinal pigment epithelium (RPE) and choroid comprise a functional unit of the eye that is essential to normal retinal health and function. Here we describe expressed sequence tag (EST) analysis of human RPE/choroid as part of a project for ocular bioinformatics. A cDNA library (cs) was made from human RPE/choroid and sequenced. Data were analyzed and assembled using the program GRIST (GRouping and Identification of Sequence Tags). Complete sequencing, Northern and Western blots, RH mapping, peptide antibody synthesis and immunofluorescence (IF) have been used to examine expression patterns and genome location for selected transcripts and proteins. Ten thousand individual sequence reads yield over 6300 unique gene clusters of which almost half have no matches with named genes. One of the most abundant transcripts is from a gene (named "alpha") that maps to the BBS1 region of chromosome 11. A number of tissue preferred transcripts are common to both RPE/choroid and iris. These include oculoglycan/opticin, for which an alternative splice form is detected in RPE/choroid, and "oculospanin" (Ocsp), a novel tetraspanin that maps to chromosome 17q. Antiserum to Ocsp detects expression in RPE, iris, ciliary body, and retinal ganglion cells by IF. A newly identified gene for a zinc-finger protein (TIRC) maps to 19q13.4. Variant transcripts of several genes were also detected. Most notably, the predominant form of Bestrophin represented in cs contains a longer open reading frame as a result of splice junction skipping. The unamplified cs library gives a view of the transcriptional repertoire of the adult RPE/choroid. A large number of potentially novel genes and splice forms and candidates for genetic diseases are revealed. Clones from this collection are being included in a large, nonredundant set for cDNA microarray construction.
Qiu, Guangrong; Liu, Caixia; Zhou, Jingyi; Liu, Peiyan; Wang, Jun; Jiang, Hongkun; Hou, Zhiyan; Zhao, Yanyan; Sun, Kailai; Li-Ling, Jesse
2010-06-01
X-linked hypophosphatemia (XLH) is the most common form of heritable rickets characterized by X-linked dominant inheritance, renal phosphate wasting, hypophosphatemia, and defective bone mineralization. Inactivating mutations of the PHEX gene located at Xp22.1 have been linked with this disease. Ethnic distribution of such mutations seems widespread but only a few mutations in the Chinese population have been reported to date. We report on a large Han Chinese family affected with XLH rickets, which included 13 patients from four generations. Polymerase chain reaction and direct sequencing were performed for all exons and intron-exon boundaries of the PHEX gene. The effect of nucleotide changes was analyzed using bioinformatic software. Prenatal diagnosis was performed on umbilical cord blood at the 20th gestational week. A novel G-->A splice mutation in intron 7 (c.849+1G>A) was identified in all patients from the family. As confirmed by reverse-transcription (RT)-polymerase chain reaction (PCR), the mutation has rendered loss of a normal splice donor site (c.849+1G) while activating a cryptic one at c.849+519G, which resulted in addition of 518 nucleotides to the mature RNA. Prenatal diagnosis had excluded the fetus for carrying the same mutation. A healthy boy was born later. A novel splice mutation c.849+1G>A in the PHEX gene is responsible for XLH in the studied family. Further studies may enhance our understanding of the role of this mutation in the pathogenesis of XLH.
Verbeeren, Jens; Verma, Bhupendra; Niemelä, Elina H; Yap, Karen; Makeyev, Eugene V; Frilander, Mikko J
2017-05-01
Cellular homeostasis of the minor spliceosome is regulated by a negative feed-back loop that targets U11-48K and U11/U12-65K mRNAs encoding essential components of the U12-type intron-specific U11/U12 di-snRNP. This involves interaction of the U11 snRNP with an evolutionarily conserved splicing enhancer giving rise to unproductive mRNA isoforms. In the case of U11/U12-65K, this mechanism controls the length of the 3' untranslated region (3'UTR). We show that this process is dynamically regulated in developing neurons and some other cell types, and involves a binary switch between translation-competent mRNAs with a short 3'UTR to non-productive isoforms with a long 3'UTR that are retained in the nucleus or/and spliced to the downstream amylase locus. Importantly, the choice between these alternatives is determined by alternative terminal exon definition events regulated by conserved U12- and U2-type 5' splice sites as well as sequence signals used for pre-mRNA cleavage and polyadenylation. We additionally show that U11 snRNP binding to the U11/U12-65K mRNA species with a long 3'UTR is required for their nuclear retention. Together, our studies uncover an intricate molecular circuitry regulating the abundance of a key spliceosomal protein and shed new light on the mechanisms limiting the export of non-productively spliced mRNAs from the nucleus to the cytoplasm.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pham-Dinh, D.; Gaspera, D.B.; Dautigny, A.
1995-09-20
Myelin/oligodendrocyte glycoprotein (MOG), a special component of the central nervous system localization on the outermost lamellae of mature myelin, is a member of the immunoglobulin superfamily. We report here the organization of the human MOG gene, which spans approximately 17 kb, and the characterization of six MOG mRNA splicing variants. The intron/exon structure of the human MOG gene confirmed the splicing pattern, supporting the hypothesis that mRNA isoforms could arise by alternative splicing of a single gene. In addition to the eight exons coding for the major MOG isoform, the human MOG gene also contains 3` region, a previously unknownmore » alternatively spliced coding exon, VIA. Alternative utilization of two acceptor splicing sites for exon VIII could produce two different C-termini. The nucleotide sequences presented here may be a useful tool to study further possible involvement if the MOG gene in hereditary neurological disorders. 23 refs., 5 figs.« less
Quantitative imaging of single mRNA splice variants in living cells
NASA Astrophysics Data System (ADS)
Lee, Kyuwan; Cui, Yi; Lee, Luke P.; Irudayaraj, Joseph
2014-06-01
Alternative messenger RNA (mRNA) splicing is a fundamental process of gene regulation, and errors in RNA splicing are known to be associated with a variety of different diseases. However, there is currently a lack of quantitative technologies for monitoring mRNA splice variants in cells. Here, we show that a combination of plasmonic dimer probes and hyperspectral imaging can be used to detect and quantify mRNA splice variants in living cells. The probes are made from gold nanoparticles functionalized with oligonucleotides and can hybridize to specific mRNA sequences, forming nanoparticle dimers that exhibit distinct spectral shifts due to plasmonic coupling. With this approach, we show that the spatial and temporal distribution of three selected splice variants of the breast cancer susceptibility gene, BRCA1, can be monitored at single-copy resolution by measuring the hybridization dynamics of the nanoplasmonic dimers. Our study provides insights into RNA and its transport in living cells, which could improve our understanding of cellular protein complexes, pharmacogenomics, genetic diagnosis and gene therapies.
Global impact of RNA splicing on transcriptome remodeling in the heart.
Gao, Chen; Wang, Yibin
2012-08-01
In the eukaryotic transcriptome, both the numbers of genes and different RNA species produced by each gene contribute to the overall complexity. These RNA species are generated by the utilization of different transcriptional initiation or termination sites, or more commonly, from different messenger RNA (mRNA) splicing events. Among the 30,000+ genes in human genome, it is estimated that more than 95% of them can generate more than one gene product via alternative RNA splicing. The protein products generated from different RNA splicing variants can have different intracellular localization, activity, or tissue-distribution. Therefore, alternative RNA splicing is an important molecular process that contributes to the overall complexity of the genome and the functional specificity and diversity among different cell types. In this review, we will discuss current efforts to unravel the full complexity of the cardiac transcriptome using a deep-sequencing approach, and highlight the potential of this technology to uncover the global impact of RNA splicing on the transcriptome during development and diseases of the heart.
Alternative splicing of the tyrosinase gene transcript in normal human melanocytes and lymphocytes.
Fryer, J P; Oetting, W S; Brott, M J; King, R A
2001-11-01
We have identified and isolated ectopically expressed tyrosinase transcripts in normal human melanocytes and lymphocytes and in a human melanoma (MNT-1) cell line to establish a baseline for the expression pattern of this gene in normal tissue. Tyrosinase mRNA from human lymphoblastoid cell lines was reverse transcribed and amplified using specific "nested" primers. This amplification yielded eight identifiable transcripts; five that resulted from alternative splicing patterns arising from the utilization of normal and alternative splice sequences. Identical splicing patterns were found in transcripts from human primary melanocytes in culture and a melanoma cell line, indicating that lymphoblastoid cell lines provide an accurate reflection of transcript processing in melanocytes. Similar splicing patterns have also been found with murine melanocyte tyrosinase transcripts. Our results demonstrate that alternative splicing of human tyrosinase gene transcript produces a number of predictable and identifiable transcripts, and that human lymphoblastoid cell lines provide a source of ectopically expressed transcripts that can be used to study the biology of tyrosinase gene expression in humans.
Rare splicing defects of FAS underly severe recessive autoimmune lymphoproliferative syndrome.
Agrebi, N; Ben-Mustapha, I; Matoussi, N; Dhouib, N; Ben-Ali, M; Mekki, N; Ben-Ahmed, M; Larguèche, B; Ben Becher, S; Béjaoui, M; Barbouche, M R
2017-10-01
Autoimmune lymphoproliferative syndrome (ALPS) is a prototypic disorder of impaired apoptosis characterized by autoimmune features and lymphoproliferation. Heterozygous germline or somatic FAS mutations associated with preserved protein expression have been described. Very rare cases of homozygous germline FAS mutations causing severe autosomal recessive form of ALPS with a complete defect of Fas expression have been reported. We report two unrelated patients from highly inbred North African population showing a severe ALPS phenotype and an undetectable Fas surface expression. Two novel homozygous mutations have been identified underlying rare splicing defects mechanisms. The first mutation breaks a branch point sequence and the second alters a regulatory exonic splicing site. These splicing defects induce the skipping of exon 6 encoding the transmembrane domain of CD95. Our findings highlight the requirement of tight regulation of FAS exon 6 splicing for balanced alternative splicing and illustrate the importance of such studies in highly consanguineous populations. Copyright © 2017 Elsevier Inc. All rights reserved.
Adamia, Sophia; Haibe-Kains, Benjamin; Pilarski, Patrick M; Bar-Natan, Michal; Pevzner, Samuel; Avet-Loiseau, Herve; Lode, Laurence; Verselis, Sigitas; Fox, Edward A; Burke, John; Galinsky, Ilene; Dagogo-Jack, Ibiayi; Wadleigh, Martha; Steensma, David P; Motyckova, Gabriela; Deangelo, Daniel J; Quackenbush, John; Stone, Richard; Griffin, James D
2014-03-01
Despite new treatments, acute myeloid leukemia (AML) remains an incurable disease. More effective drug design requires an expanded view of the molecular complexity that underlies AML. Alternative splicing of RNA is used by normal cells to generate protein diversity. Growing evidence indicates that aberrant splicing of genes plays a key role in cancer. We investigated genome-wide splicing abnormalities in AML and based on these abnormalities, we aimed to identify novel potential biomarkers and therapeutic targets. We used genome-wide alternative splicing screening to investigate alternative splicing abnormalities in two independent AML patient cohorts [Dana-Farber Cancer Institute (DFCI) (Boston, MA) and University Hospital de Nantes (UHN) (Nantes, France)] and normal donors. Selected splicing events were confirmed through cloning and sequencing analysis, and than validated in 193 patients with AML. Our results show that approximately 29% of expressed genes genome-wide were differentially and recurrently spliced in patients with AML compared with normal donors bone marrow CD34(+) cells. Results were reproducible in two independent AML cohorts. In both cohorts, annotation analyses indicated similar proportions of differentially spliced genes encoding several oncogenes, tumor suppressor proteins, splicing factors, and heterogeneous-nuclear-ribonucleoproteins, proteins involved in apoptosis, cell proliferation, and spliceosome assembly. Our findings are consistent with reports for other malignances and indicate that AML-specific aberrations in splicing mechanisms are a hallmark of AML pathogenesis. Overall, our results suggest that aberrant splicing is a common characteristic for AML. Our findings also suggest that splice variant transcripts that are the result of splicing aberrations create novel disease markers and provide potential targets for small molecules or antibody therapeutics for this disease. ©2013 AACR
Comprehensive Characterization of Swine Cardiac Troponin T Proteoforms by Top-Down Mass Spectrometry
NASA Astrophysics Data System (ADS)
Lin, Ziqing; Guo, Fang; Gregorich, Zachery R.; Sun, Ruixiang; Zhang, Han; Hu, Yang; Shanmuganayagam, Dhanansayan; Ge, Ying
2018-04-01
Cardiac troponin T (cTnT) regulates the Ca2+-mediated interaction between myosin thick filaments and actin thin filaments during cardiac contraction and relaxation. cTnT is released into the blood following injury, and increased serum levels of the protein are used clinically as a biomarker for myocardial infarction. Moreover, mutations in cTnT are causative in a number of familial cardiomyopathies. With the increasing use of large animal (swine) model to recapitulate human diseases, it is essential to characterize species-dependent protein sequence variants, alternative RNA splicing, and post-translational modifications (PTMs), but challenges remain due to the incomplete database and lack of validation of the predicted splicing isoforms. Herein, we integrated top-down mass spectrometry (MS) with online liquid chromatography (LC) and immunoaffinity purification to comprehensively characterize miniature swine cTnT proteoforms, including those arising from alternative RNA splicing and PTMs. A total of seven alternative splicing isoforms of cTnT were identified by LC/MS from swine left ventricular tissue, with each isoform containing un-phosphorylated and mono-phosphorylated proteoforms. The phosphorylation site was localized to Ser1 for the mono-phosphorylated proteoforms of cTnT1, 3, 4, and 6 by online MS/MS combining collisionally activated dissociation (CAD) and electron transfer dissociation (ETD). Offline MS/MS on Fourier-transform ion cyclotron resonance (FT-ICR) mass spectrometer with CAD and electron capture dissociation (ECD) was then utilized to achieve deep sequencing of mono-phosphorylated cTnT1 (35.2 kDa) with a high sequence coverage of 87%. Taken together, this study demonstrated the unique advantage of top-down MS in the comprehensive characterization of protein alternative splicing isoforms together with PTMs. [Figure not available: see fulltext.
Xue, Yuan; Schoser, Benedikt; Rao, Aliz R; Quadrelli, Roberto; Vaglio, Alicia; Rupp, Verena; Beichler, Christine; Nelson, Stanley F; Schapacher-Tilp, Gudrun; Windpassinger, Christian; Wilcox, William R
2016-04-01
Previously, we reported a rare X-linked disorder, Uruguay syndrome in a single family. The main features are pugilistic facies, skeletal deformities, and muscular hypertrophy despite a lack of exercise and cardiac ventricular hypertrophy leading to premature death. An ≈19 Mb critical region on X chromosome was identified through identity-by-descent analysis of 3 affected males. Exome sequencing was conducted on one affected male to identify the disease-causing gene and variant. A splice site variant (c.502-2A>G) in the FHL1 gene was highly suspicious among other candidate genes and variants. FHL1A is the predominant isoform of FHL1 in cardiac and skeletal muscle. Sequencing cDNA showed the splice site variant led to skipping of exons 6 of the FHL1A isoform, equivalent to the FHL1C isoform. Targeted analysis showed that this splice site variant cosegregated with disease in the family. Western blot and immunohistochemical analysis of muscle from the proband showed a significant decrease in protein expression of FHL1A. Real-time polymerase chain reaction analysis of different isoforms of FHL1 demonstrated that the FHL1C is markedly increased. Mutations in the FHL1 gene have been reported in disorders with skeletal and cardiac myopathy but none has the skeletal or facial phenotype seen in patients with Uruguay syndrome. Our data suggest that a novel FHL1 splice site variant results in the absence of FHL1A and the abundance of FHL1C, which may contribute to the complex and severe phenotype. Mutation screening of the FHL1 gene should be considered for patients with uncharacterized myopathies and cardiomyopathies. © 2016 American Heart Association, Inc.
Potentially pathogenic germline CHEK2 c.319+2T>A among multiple early-onset cancer families.
Dominguez-Valentin, Mev; Nakken, Sigve; Tubeuf, Hélène; Vodak, Daniel; Ekstrøm, Per Olaf; Nissen, Anke M; Morak, Monika; Holinski-Feder, Elke; Martins, Alexandra; Møller, Pål; Hovig, Eivind
2018-01-01
To study the potential contribution of genes other than BRCA1/2, PTEN, and TP53 to the biological and clinical characteristics of multiple early-onset cancers in Norwegian families, including early-onset breast cancer, Cowden-like and Li-Fraumeni-like syndromes (BC, CSL and LFL, respectively). The Hereditary Cancer Biobank from the Norwegian Radium Hospital was used to identify early-onset BC, CSL or LFL for whom no pathogenic variants in BRCA1/2, PTEN, or TP53 had been found in routine diagnostic DNA sequencing. Forty-four cancer susceptibility genes were selected and analyzed by our in-house designed TruSeq amplicon-based assay for targeted sequencing. Protein- and RNA splicing-dedicated in silico analyses were performed for all variants of unknown significance (VUS). Variants predicted as the more likely to affect splicing were experimentally analyzed by minigene assay. We identified a CSL individual carrying a variant in CHEK2 (c.319+2T>A, IVS2), here considered as likely pathogenic. Out of the five VUS (BRCA2, CDH1, CHEK2, MAP3K1, NOTCH3) tested in the minigene splicing assay, only NOTCH3 c.14090C>T (p.Ser497Leu) showed a significant effect on RNA splicing, notably by inducing partial skipping of exon 9. Among 13 early-onset BC, CSL and LFL patients, gene panel sequencing identified a potentially pathogenic variant in CHEK2 that affects a canonical RNA splicing signal. Our study provides new information on genetic loci that may affect the risk of developing cancer in these patients and their families, demonstrating that genes presently not routinely tested in molecular diagnostic settings may be important for capturing cancer predisposition in these families.
Widespread Use of Non-productive Alternative Splice Sites in Saccharomyces cerevisiae
Kawashima, Tadashi; Douglass, Stephen; Gabunilas, Jason; Pellegrini, Matteo; Chanfreau, Guillaume F.
2014-01-01
Saccharomyces cerevisiae has been used as a model system to investigate the mechanisms of pre-mRNA splicing but only a few examples of alternative splice site usage have been described in this organism. Using RNA-Seq analysis of nonsense-mediated mRNA decay (NMD) mutant strains, we show that many S. cerevisiae intron-containing genes exhibit usage of alternative splice sites, but many transcripts generated by splicing at these sites are non-functional because they introduce premature termination codons, leading to degradation by NMD. Analysis of splicing mutants combined with NMD inactivation revealed the role of specific splicing factors in governing the use of these alternative splice sites and identified novel functions for Prp17p in enhancing the use of branchpoint-proximal upstream 3′ splice sites and for Prp18p in suppressing the usage of a non-canonical AUG 3′-splice site in GCR1. The use of non-productive alternative splice sites can be increased in stress conditions in a promoter-dependent manner, contributing to the down-regulation of genes during stress. These results show that alternative splicing is frequent in S. cerevisiae but masked by RNA degradation and that the use of alternative splice sites in this organism is mostly aimed at controlling transcript levels rather than increasing proteome diversity. PMID:24722551
Developmental expression of a regulatory gene is programmed at the level of splicing.
Chou, T B; Zachar, Z; Bingham, P M
1987-01-01
We report sequence and transcript structures for a 6191-base chromosomal segment containing the presumptive regulatory gene from Drosophila, suppressor-of-white-apricot [su(wa)]. Our results indicate that su(wa) expression is controlled by regulating occurrence of specific splices. Seven introns are removed from the su(wa) primary transcript during precellular blastoderm development. The sequence of this mature RNA indicates that it is a conventional messenger RNA. In contrast, after cellular blastoderm the first two of these introns cease to be efficiently removed. The mature RNAs resulting from this failure to remove the first two introns have structures quite unexpected of mRNAs. We propose that postcellular blastoderm su(wa) expression is repressed by preventing splices necessary to produce a functional mRNA. Implications and mechanisms are discussed. Images Fig. 2. Fig. 3. Fig. 4. Fig. 5. PMID:2832151
MYCN controls an alternative RNA splicing program in high-risk metastatic neuroblastoma.
Zhang, Shile; Wei, Jun S; Li, Samuel Q; Badgett, Tom C; Song, Young K; Agarwal, Saurabh; Coarfa, Cristian; Tolman, Catherine; Hurd, Laura; Liao, Hongling; He, Jianbin; Wen, Xinyu; Liu, Zhihui; Thiele, Carol J; Westermann, Frank; Asgharzadeh, Shahab; Seeger, Robert C; Maris, John M; Guidry Auvil, Jamie M; Smith, Malcolm A; Kolaczyk, Eric D; Shohet, Jason; Khan, Javed
2016-02-28
The molecular mechanisms underlying the aggressive behavior of MYCN driven neuroblastoma (NBL) is under intense investigation; however, little is known about the impact of this family of transcription factors on the splicing program. Here we used high-throughput RNA sequencing to systematically study the expression of RNA isoforms in stage 4 MYCN-amplified NBL, an aggressive subtype of metastatic NBL. We show that MYCN-amplified NBL tumors display a distinct gene splicing pattern affecting multiple cancer hallmark functions. Six splicing factors displayed unique differential expression patterns in MYCN-amplified tumors and cell lines, and the binding motifs for some of these splicing factors are significantly enriched in differentially-spliced genes. Direct binding of MYCN to promoter regions of the splicing factors PTBP1 and HNRNPA1 detected by ChIP-seq demonstrates that MYCN controls the splicing pattern by direct regulation of the expression of these key splicing factors. Furthermore, high expression of PTBP1 and HNRNPA1 was significantly associated with poor overall survival of stage4 NBL patients (p ≤ 0.05). Knocking down PTBP1, HNRNPA1 and their downstream target PKM2, an isoform of pro-tumor-growth, result in repressed growth of NBL cells. Therefore, our study reveals a novel role of MYCN in controlling global splicing program through regulation of splicing factors in addition to its well-known role in the transcription program. These findings suggest a therapeutically potential to target the key splicing factors or gene isoforms in high-risk NBL with MYCN-amplification. Published by Elsevier Ireland Ltd.
Nanoplasmonic probes of RNA folding and assembly during pre-mRNA splicing
NASA Astrophysics Data System (ADS)
Nguyen, Anh H.; Lee, Jong Uk; Sim, Sang Jun
2016-02-01
RNA splicing plays important roles in transcriptome and proteome diversity. Herein, we describe the use of a nanoplasmonic system that unveils RNA folding and assembly during pre-mRNA splicing wherein the quantification of mRNA splice variants is not taken into account. With a couple of SERS-probes and plasmonic probes binding at the boundary sites of exon-2/intron-2 and intron-2/exon-3 of the pre-mature RNA of the β-globin gene, the splicing process brings the probes into the plasmonic bands. For plasmonic probes, a plasmon shift increase of ~29 nm, corresponding to intron removal and exon-2 and exon-3 connection to form the mRNA molecule, is measured by plasmonic coupling. The increased scattering intensity and surface-enhanced Raman scattering (SERS) fingerprinting reveal the clear dynamics of pre-mRNA splicing. Moreover, a time-resolved experiment of individual RNA molecules exhibited a successful splicing and an inhibited splicing event by 33 μM biflavonoid isoginkgetin, a general inhibitor of RNA splicing. The results suggest that the RNA splicing is successfully monitored with the nanoplasmonic system. Thus, this platform can be useful for studying RNA nanotechnology, biomolecular folding, alternative splicing, and maturation of microRNA.
Novel BRCA1 splice-site mutation in ovarian cancer patients of Slavic origin.
Krivokuca, Ana; Dragos, Vita Setrajcic; Stamatovic, Ljiljana; Blatnik, Ana; Boljevic, Ivana; Stegel, Vida; Rakobradovic, Jelena; Skerl, Petra; Jovandic, Stevo; Krajc, Mateja; Magic, Mirjana Brankovic; Novakovic, Srdjan
2018-04-01
Mutations in breast cancer susceptibility gene 1 (BRCA1) lead to defects in a number of cellular pathways including DNA damage repair and transcriptional regulation, resulting in the elevated genome instability and predisposing to breast and ovarian cancers. We report a novel mutation LRG_292t1:c.4356delA,p.(Ala1453Glnfs*3) in the 12th exon of BRCA1, in the splice site region near the donor site of intron 12. It is a frameshift mutation with the termination codon generated on the third amino acid position from the site of deletion. Human Splice Finder 3.0 and MutationTaster have assessed this variation as disease causing, based on the alteration of splicing, creation of premature stop codon and other potential alterations initiated by nucleotide deletion. Among the most important alterations are frameshift and splice site changes (score of the newly created donor splice site: 0.82). c.4356delA was associated with two ovarian cancer cases in two families of Slavic origin. It was detected by next generation sequencing, and confirmed with Sanger sequencing in both cases. Because of the fact that it changes the reading frame of the protein, novel mutation c.4356delA p.(Ala1453Glnfs*3) in BRCA1 gene might be of clinical significance for hereditary ovarian cancer. Further functional as well as segregation analyses within the families are necessary for appropriate clinical classification of this variant. Since it has been detected in two ovarian cancer patients of Slavic origin, it is worth investigating founder effect of this mutation in Slavic populations.
Leveraging transcript quantification for fast computation of alternative splicing profiles.
Alamancos, Gael P; Pagès, Amadís; Trincado, Juan L; Bellora, Nicolás; Eyras, Eduardo
2015-09-01
Alternative splicing plays an essential role in many cellular processes and bears major relevance in the understanding of multiple diseases, including cancer. High-throughput RNA sequencing allows genome-wide analyses of splicing across multiple conditions. However, the increasing number of available data sets represents a major challenge in terms of computation time and storage requirements. We describe SUPPA, a computational tool to calculate relative inclusion values of alternative splicing events, exploiting fast transcript quantification. SUPPA accuracy is comparable and sometimes superior to standard methods using simulated as well as real RNA-sequencing data compared with experimentally validated events. We assess the variability in terms of the choice of annotation and provide evidence that using complete transcripts rather than more transcripts per gene provides better estimates. Moreover, SUPPA coupled with de novo transcript reconstruction methods does not achieve accuracies as high as using quantification of known transcripts, but remains comparable to existing methods. Finally, we show that SUPPA is more than 1000 times faster than standard methods. Coupled with fast transcript quantification, SUPPA provides inclusion values at a much higher speed than existing methods without compromising accuracy, thereby facilitating the systematic splicing analysis of large data sets with limited computational resources. The software is implemented in Python 2.7 and is available under the MIT license at https://bitbucket.org/regulatorygenomicsupf/suppa. © 2015 Alamancos et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Chen, Kenian; Sloan, Steven A.; Bennett, Mariko L.; Scholze, Anja R.; O'Keeffe, Sean; Phatnani, Hemali P.; Guarnieri, Paolo; Caneda, Christine; Ruderisch, Nadine; Deng, Shuyun; Liddelow, Shane A.; Zhang, Chaolin; Daneman, Richard; Maniatis, Tom; Barres, Ben A.
2014-01-01
The major cell classes of the brain differ in their developmental processes, metabolism, signaling, and function. To better understand the functions and interactions of the cell types that comprise these classes, we acutely purified representative populations of neurons, astrocytes, oligodendrocyte precursor cells, newly formed oligodendrocytes, myelinating oligodendrocytes, microglia, endothelial cells, and pericytes from mouse cerebral cortex. We generated a transcriptome database for these eight cell types by RNA sequencing and used a sensitive algorithm to detect alternative splicing events in each cell type. Bioinformatic analyses identified thousands of new cell type-enriched genes and splicing isoforms that will provide novel markers for cell identification, tools for genetic manipulation, and insights into the biology of the brain. For example, our data provide clues as to how neurons and astrocytes differ in their ability to dynamically regulate glycolytic flux and lactate generation attributable to unique splicing of PKM2, the gene encoding the glycolytic enzyme pyruvate kinase. This dataset will provide a powerful new resource for understanding the development and function of the brain. To ensure the widespread distribution of these datasets, we have created a user-friendly website (http://web.stanford.edu/group/barres_lab/brain_rnaseq.html) that provides a platform for analyzing and comparing transciption and alternative splicing profiles for various cell classes in the brain. PMID:25186741
[Analysis of USH2A gene mutation in a Chinese family affected with Usher syndrome].
Li, Pengcheng; Liu, Fei; Zhang, Mingchang; Wang, Qiufen; Liu, Mugen
2015-08-01
To investigate the disease-causing mutation in a Chinese family affected with Usher syndrome type II. All of the 11 members from the family underwent comprehensive ophthalmologic examination and hearing test, and their genomic DNA were isolated from venous leukocytes. PCR and direct sequencing of USH2A gene were performed for the proband. Wild type and mutant type minigene vectors containing exon 42, intron 42 and exon 43 of the USH2A gene were constructed and transfected into Hela cells by lipofectamine reagent. Reverse transcription (RT)-PCR was carried out to verify the splicing of the minigenes. Pedigree analysis and clinical diagnosis indicated that the patients have suffered from autosomal recessive Usher syndrome type II. DNA sequencing has detected a homozygous c.8559-2A>G mutation of the USH2A gene in the proband, which has co-segregated with the disease in the family. The mutation has affected a conserved splice site in intron 42, which has led to inactivation of the splice site. Minigene experiment has confirmed the retaining of intron 42 in mature mRNA. The c.8559-2A>G mutation in the USH2A gene probably underlies the Usher syndrome type II in this family. The splice site mutation has resulted in abnormal splicing of USH2A pre-mRNA.
Capturing novel mouse genes encoding chromosomal and other nuclear proteins.
Tate, P; Lee, M; Tweedie, S; Skarnes, W C; Bickmore, W A
1998-09-01
The burgeoning wealth of gene sequences contrasts with our ignorance of gene function. One route to assigning function is by determining the sub-cellular location of proteins. We describe the identification of mouse genes encoding proteins that are confined to nuclear compartments by splicing endogeneous gene sequences to a promoterless betageo reporter, using a gene trap approach. Mouse ES (embryonic stem) cell lines were identified that express betageo fusions located within sub-nuclear compartments, including chromosomes, the nucleolus and foci containing splicing factors. The sequences of 11 trapped genes were ascertained, and characterisation of endogenous protein distribution in two cases confirmed the validity of the approach. Three novel proteins concentrated within distinct chromosomal domains were identified, one of which appears to be a serine/threonine kinase. The sequence of a gene whose product co-localises with splicesome components suggests that this protein may be an E3 ubiquitin-protein ligase. The majority of the other genes isolated represent novel genes. This approach is shown to be a powerful tool for identifying genes encoding novel proteins with specific sub-nuclear localisations and exposes our ignorance of the protein composition of the nucleus. Motifs in two of the isolated genes suggest new links between cellular regulatory mechanisms (ubiquitination and phosphorylation) and mRNA splicing and chromosome structure/function.
The Nucleotide Sequence and Spliced pol mRNA Levels of the Nonprimate Spumavirus Bovine Foamy Virus
Holzschu, Donald L.; Delaney, Mari A.; Renshaw, Randall W.; Casey, James W.
1998-01-01
We have determined the complete nucleotide sequence of a replication-competent clone of bovine foamy virus (BFV) and have quantitated the amount of splice pol mRNA processed early in infection. The 544-amino-acid Gag protein precursor has little sequence similarity with its primate foamy virus homologs, but the putative nucleocapsid (NC) protein, like the primate NCs, contains the three glycine-arginine-rich regions that are postulated to bind genomic RNA during virion assembly. The BFV gag and pol open reading frames overlap, with pro and pol in the same translational frame. As with the human foamy virus (HFV) and feline foamy virus, we have detected a spliced pol mRNA by PCR. Quantitatively, this mRNA approximates the level of full-length genomic RNA early in infection. The integrase (IN) domain of reverse transcriptase does not contain the canonical HH-CC zinc finger motif present in all characterized retroviral INs, but it does contain a nearby histidine residue that could conceivably participate as a member of the zinc finger. The env gene encodes a protein that is over 40% identical in sequence to the HFV Env. By comparison, the Gag precursor of BFV is predicted to be only 28% identical to the HFV protein. PMID:9499074
An RRM–ZnF RNA recognition module targets RBM10 to exonic sequences to promote exon exclusion
Collins, Katherine M.; Kainov, Yaroslav A.; Christodolou, Evangelos; Ray, Debashish; Morris, Quaid; Hughes, Timothy; Taylor, Ian A.
2017-01-01
Abstract RBM10 is an RNA-binding protein that plays an essential role in development and is frequently mutated in the context of human disease. RBM10 recognizes a diverse set of RNA motifs in introns and exons and regulates alternative splicing. However, the molecular mechanisms underlying this seemingly relaxed sequence specificity are not understood and functional studies have focused on 3΄ intronic sites only. Here, we dissect the RNA code recognized by RBM10 and relate it to the splicing regulatory function of this protein. We show that a two-domain RRM1–ZnF unit recognizes a GGA-centered motif enriched in RBM10 exonic sites with high affinity and specificity and test that the interaction with these exonic sequences promotes exon skipping. Importantly, a second RRM domain (RRM2) of RBM10 recognizes a C-rich sequence, which explains its known interaction with the intronic 3΄ site of NUMB exon 9 contributing to regulation of the Notch pathway in cancer. Together, these findings explain RBM10's broad RNA specificity and suggest that RBM10 functions as a splicing regulator using two RNA-binding units with different specificities to promote exon skipping. PMID:28379442
An RRM-ZnF RNA recognition module targets RBM10 to exonic sequences to promote exon exclusion.
Collins, Katherine M; Kainov, Yaroslav A; Christodolou, Evangelos; Ray, Debashish; Morris, Quaid; Hughes, Timothy; Taylor, Ian A; Makeyev, Eugene V; Ramos, Andres
2017-06-20
RBM10 is an RNA-binding protein that plays an essential role in development and is frequently mutated in the context of human disease. RBM10 recognizes a diverse set of RNA motifs in introns and exons and regulates alternative splicing. However, the molecular mechanisms underlying this seemingly relaxed sequence specificity are not understood and functional studies have focused on 3΄ intronic sites only. Here, we dissect the RNA code recognized by RBM10 and relate it to the splicing regulatory function of this protein. We show that a two-domain RRM1-ZnF unit recognizes a GGA-centered motif enriched in RBM10 exonic sites with high affinity and specificity and test that the interaction with these exonic sequences promotes exon skipping. Importantly, a second RRM domain (RRM2) of RBM10 recognizes a C-rich sequence, which explains its known interaction with the intronic 3΄ site of NUMB exon 9 contributing to regulation of the Notch pathway in cancer. Together, these findings explain RBM10's broad RNA specificity and suggest that RBM10 functions as a splicing regulator using two RNA-binding units with different specificities to promote exon skipping. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Lineage-specific splicing of a brain-enriched alternative exon promotes glioblastoma progression
Ferrarese, Roberto; Harsh, Griffith R.; Yadav, Ajay K.; Bug, Eva; Maticzka, Daniel; Reichardt, Wilfried; Dombrowski, Stephen M.; Miller, Tyler E.; Masilamani, Anie P.; Dai, Fangping; Kim, Hyunsoo; Hadler, Michael; Scholtens, Denise M.; Yu, Irene L.Y.; Beck, Jürgen; Srinivasasainagendra, Vinodh; Costa, Fabrizio; Baxan, Nicoleta; Pfeifer, Dietmar; von Elverfeldt, Dominik; Backofen, Rolf; Weyerbrock, Astrid; Duarte, Christine W.; He, Xiaolin; Prinz, Marco; Chandler, James P.; Vogel, Hannes; Chakravarti, Arnab; Rich, Jeremy N.; Carro, Maria S.; Bredel, Markus
2014-01-01
Tissue-specific alternative splicing is critical for the emergence of tissue identity during development, yet the role of this process in malignant transformation is undefined. Tissue-specific splicing involves evolutionarily conserved, alternative exons that represent only a minority of the total alternative exons identified. Many of these conserved exons have functional features that influence signaling pathways to profound biological effect. Here, we determined that lineage-specific splicing of a brain-enriched cassette exon in the membrane-binding tumor suppressor annexin A7 (ANXA7) diminishes endosomal targeting of the EGFR oncoprotein, consequently enhancing EGFR signaling during brain tumor progression. ANXA7 exon splicing was mediated by the ribonucleoprotein PTBP1, which is normally repressed during neuronal development. PTBP1 was highly expressed in glioblastomas due to loss of a brain-enriched microRNA (miR-124) and to PTBP1 amplification. The alternative ANXA7 splicing trait was present in precursor cells, suggesting that glioblastoma cells inherit the trait from a potential tumor-initiating ancestor and that these cells exploit this trait through accumulation of mutations that enhance EGFR signaling. Our data illustrate that lineage-specific splicing of a tissue-regulated alternative exon in a constituent of an oncogenic pathway eliminates tumor suppressor functions and promotes glioblastoma progression. This paradigm may offer a general model as to how tissue-specific regulatory mechanisms can reprogram normal developmental processes into oncogenic ones. PMID:24865424
Intragenic motifs regulate the transcriptional complexity of Pkhd1/PKHD1
Boddu, Ravindra; Yang, Chaozhe; O’Connor, Amber K.; Hendrickson, Robert Curtis; Boone, Braden; Cui, Xiangqin; Garcia-Gonzalez, Miguel; Igarashi, Peter; Onuchic, Luiz F.; Germino, Gregory G.
2014-01-01
Autosomal recessive polycystic kidney disease (ARPKD) results from mutations in the human PKHD1 gene. Both this gene, and its mouse ortholog, Pkhd1, are primarily expressed in renal and biliary ductal structures. The mouse protein product, fibrocystin/polyductin complex (FPC), is a 445-kDa protein encoded by a 67-exon transcript that spans >500 kb of genomic DNA. In the current study, we observed multiple alternatively spliced Pkhd1 transcripts that varied in size and exon composition in embryonic mouse kidney, liver, and placenta samples, as well as among adult mouse pancreas, brain, heart, lung, testes, liver, and kidney. Using reverse transcription PCR and RNASeq, we identified 22 novel Pkhd1 kidney transcripts with unique exon junctions. Various mechanisms of alternative splicing were observed, including exon skipping, use of alternate acceptor/donor splice sites, and inclusion of novel exons. Bioinformatic analyses identified, and exon-trapping minigene experiments validated, consensus binding sites for serine/arginine-rich proteins that modulate alternative splicing. Using site-directed mutagenesis, we examined the functional importance of selected splice enhancers. In addition, we demonstrated that many of the novel transcripts were polysome bound, thus likely translated. Finally, we determined that the human PKHD1 R760H missense variant alters a splice enhancer motif that disrupts exon splicing in vitro and is predicted to truncate the protein. Taken together, these data provide evidence of the complex transcriptional regulation of Pkhd1/PKHD1 and identified motifs that regulate its splicing. Our studies indicate that Pkhd1/PKHD1 transcription is modulated, in part by intragenic factors, suggesting that aberrant PKHD1 splicing represents an unappreciated pathogenic mechanism in ARPKD. PMID:24984783
Characterization of variegate porphyria mutations using a minigene approach.
Granata, Barbara Xoana; Baralle, Marco; De Conti, Laura; Parera, Victoria; Rossetti, Maria Victoria
2015-01-01
Porphyrias are a group of metabolic diseases that affect the skin and/or nervous system. In 2008, three unrelated patients were diagnosed with variegate porphyria at the CIPYP (Centro de Investigaciones sobre Porfirinas y Porfirias). Sequencing of the protoporphyrinogen oxidase gene, the gene altered in this type of porphyria, revealed three previously undescribed mutations: c.338+3insT, c.807G>A, and c.808-1G>C. As these mutations do not affect the protein sequence, we hypothesized that they might be splicing mutations. RT-PCRs performed on the patient's mRNAs showed normal mRNA or no amplification at all. This result indicated that the aberrant spliced transcript is possibly being degraded. In order to establish whether they were responsible or not for the patient's disease by causing aberrant splicing, we utilized a minigene approach. We found that the three mutations lead to exon skipping; therefore, the abnormal mRNAs are most likely degraded by a mechanism such as nonsense-mediated decay. In conclusion, these mutations are responsible for the disease because they alter the normal splicing pathway, thus providing a functional explanation for the appearance of disease and highlighting the use of minigene assays to complement transcript analysis.
Involvement of Alternative Splicing in Barley Seed Germination
Zhang, Qisen; Zhang, Xiaoqi; Wang, Songbo; Tan, Cong; Zhou, Gaofeng; Li, Chengdao
2016-01-01
Seed germination activates many new biological processes including DNA, membrane and mitochondrial repairs and requires active protein synthesis and sufficient energy supply. Alternative splicing (AS) regulates many cellular processes including cell differentiation and environmental adaptations. However, limited information is available on the regulation of seed germination at post-transcriptional levels. We have conducted RNA-sequencing experiments to dissect AS events in barley seed germination. We identified between 552 and 669 common AS transcripts in germinating barley embryos from four barley varieties (Hordeum vulgare L. Bass, Baudin, Harrington and Stirling). Alternative 3’ splicing (34%-45%), intron retention (32%-34%) and alternative 5’ splicing (16%-21%) were three major AS events in germinating embryos. The AS transcripts were predominantly mapped onto ribosome, RNA transport machineries, spliceosome, plant hormone signal transduction, glycolysis, sugar and carbon metabolism pathways. Transcripts of these genes were also very abundant in the early stage of seed germination. Correlation analysis of gene expression showed that AS hormone responsive transcripts could also be co-expressed with genes responsible for protein biosynthesis and sugar metabolisms. Our RNA-sequencing data revealed that AS could play important roles in barley seed germination. PMID:27031341
IRAS: High-Throughput Identification of Novel Alternative Splicing Regulators.
Zheng, S
2016-01-01
Alternative splicing is a fundamental regulatory process of gene expression. Defects in alternative splicing can lead to various diseases, and modification of disease-causing splicing events presents great therapeutic promise. Splicing outcome is commonly affected by extracellular stimuli and signaling cascades that converge on RNA-binding splicing regulators. These trans-acting factors recognize cis-elements in pre-mRNA transcripts to affect spliceosome assembly and splice site choices. Identification of these splicing regulators and/or upstream modulators has been difficult and traditionally done by piecemeal. High-throughput screening strategies to find multiple regulators of exon splicing have great potential to accelerate the discovery process, but typically confront low sensitivity and low specificity of screening assays. Here we describe a unique screening strategy, IRAS (identifying regulators of alternative splicing), using a pair of dual-output minigene reporters to allow for sensitive detection of exon splicing changes. Each dual-output reporter produces green fluorescent protein (GFP) and red fluorescent protein (RFP) fluorescent signals to assay the two spliced isoforms exclusively. The two complementary minigene reporters alter GFP/RFP output ratios in the opposite direction in response to splicing change. Applying IRAS in cell-based high-throughput screens allows sensitive and specific identification of splicing regulators and modulators for any alternative exons of interest. In comparison to previous high-throughput screening methods, IRAS substantially enhances the specificity of the screening assay. This strategy significantly eliminates false positives without sacrificing sensitive identification of true regulators of splicing. © 2016 Elsevier Inc. All rights reserved.
Organellar maturases: A window into the evolution of the spliceosome.
Schmitz-Linneweber, Christian; Lampe, Marie-Kristin; Sultan, Laure D; Ostersetzer-Biran, Oren
2015-09-01
During the evolution of eukaryotic genomes, many genes have been interrupted by intervening sequences (introns) that must be removed post-transcriptionally from RNA precursors to form mRNAs ready for translation. The origin of nuclear introns is still under debate, but one hypothesis is that the spliceosome and the intron-exon structure of genes have evolved from bacterial-type group II introns that invaded the eukaryotic genomes. The group II introns were most likely introduced into the eukaryotic genome from an α-proteobacterial predecessor of mitochondria early during the endosymbiosis event. These self-splicing and mobile introns spread through the eukaryotic genome and later degenerated. Pieces of introns became part of the general splicing machinery we know today as the spliceosome. In addition, group II introns likely brought intron maturases with them to the nucleus. Maturases are found in most bacterial introns, where they act as highly specific splicing factors for group II introns. In the spliceosome, the core protein Prp8 shows homology to group II intron-encoded maturases. While maturases are entirely intron specific, their descendant of the spliceosomal machinery, the Prp8 protein, is an extremely versatile splicing factor with multiple interacting proteins and RNAs. How could such a general player in spliceosomal splicing evolve from the monospecific bacterial maturases? Analysis of the organellar splicing machinery in plants may give clues on the evolution of nuclear splicing. Plants encode various proteins which are closely related to bacterial maturases. The organellar genomes contain one maturase each, named MatK in chloroplasts and MatR in mitochondria. In addition, several maturase genes have been found in the nucleus as well, which are acting on mitochondrial pre-RNAs. All plant maturases show sequence deviation from their progenitor bacterial maturases, and interestingly are all acting on multiple organellar group II intron targets. Moreover, they seem to function in the splicing of group II introns together with a number of additional nuclear-encoded splicing factors, possibly acting as an organellar proto-spliceosome. Together, this makes them interesting models for the early evolution of nuclear spliceosomal splicing. In this review, we summarize recent advances in our understanding of the role of plant maturases and their accessory factors in plants. This article is part of a Special Issue entitled: Chloroplast Biogenesis. Copyright © 2015 Elsevier B.V. All rights reserved.
Lakshmi, G. Girija; Ghosh, Sushmita; Jones, Gabriel P.; Parikh, Roshni; Rawlins, Bridgette A.; Vaughn, Jack C.
2014-01-01
Alternative splicing greatly enhances the diversity of proteins encoded by eukaryotic genomes, and is also important in gene expression control. In contrast to the great depth of knowledge as to molecular mechanisms in the splicing pathway itself, relatively little is known about the regulatory events behind this process. The 5′-UTR and 3′-UTR in pre-mRNAs play a variety of roles in controlling eukaryotic gene expression, including translational modulation, and nearly 4,000 of the roughly 14,000 protein coding genes in Drosophila contain introns of unknown functional significance in their 5′-UTR. Here we report the results of an RNA electrophoretic mobility shift analysis of Drosophila rnp-4f 5′-UTR intron 0 splicing regulatory proteins. The pre-mRNA potential regulatory element consists of an evolutionarily-conserved 177-nt stem-loop arising from pairing of intron 0 with part of adjacent exon 2. Incubation of in vitro transcribed probe with embryo protein extract is shown to result in two shifted RNA-protein bands, and protein extract from a dADAR null mutant fly line results in only one shifted band. A mutated stem-loop in which the conserved exon 2 primary sequence is changed but secondary structure maintained by introducing compensatory base changes results in diminished band shifts. To test the hypothesis that dADAR plays a role in intron splicing regulation in vivo, levels of unspliced rnp-4f mRNA in dADAR mutant were compared to wild-type via real-time qRT-PCR. The results show that during embryogenesis unspliced rnp-4f mRNA levels fall by up to 85% in the mutant, in support of the hypothesis. Taken together, these results demonstrate a novel role for dADAR protein in rnp-4f 5′-UTR alternative intron splicing regulation which is consistent with a previously proposed model. PMID:23026215
Diversification of the muscle proteome through alternative splicing.
Nakka, Kiran; Ghigna, Claudia; Gabellini, Davide; Dilworth, F Jeffrey
2018-03-06
Skeletal muscles express a highly specialized proteome that allows the metabolism of energy sources to mediate myofiber contraction. This muscle-specific proteome is partially derived through the muscle-specific transcription of a subset of genes. Surprisingly, RNA sequencing technologies have also revealed a significant role for muscle-specific alternative splicing in generating protein isoforms that give specialized function to the muscle proteome. In this review, we discuss the current knowledge with respect to the mechanisms that allow pre-mRNA transcripts to undergo muscle-specific alternative splicing while identifying some of the key trans-acting splicing factors essential to the process. The importance of specific splicing events to specialized muscle function is presented along with examples in which dysregulated splicing contributes to myopathies. Though there is now an appreciation that alternative splicing is a major contributor to proteome diversification, the emergence of improved "targeted" proteomic methodologies for detection of specific protein isoforms will soon allow us to better appreciate the extent to which alternative splicing modifies the activity of proteins (and their ability to interact with other proteins) in the skeletal muscle. In addition, we highlight a continued need to better explore the signaling pathways that contribute to the temporal control of trans-acting splicing factor activity to ensure specific protein isoforms are expressed in the proper cellular context. An understanding of the signal-dependent and signal-independent events driving muscle-specific alternative splicing has the potential to provide us with novel therapeutic strategies to treat different myopathies.
Jin, Lirong; Li, Guanglin; Yu, Dazhao; Huang, Wei; Cheng, Chao; Liao, Shengjie; Wu, Qijia; Zhang, Yi
2017-02-06
Alternative splicing (AS) regulation is extensive and shapes the functional complexity of higher organisms. However, the contribution of alternative splicing to fungal biology is not well studied. This study provides sequences of the transcriptomes of the plant wilt pathogen Verticillium dahliae, using two different strains and multiple methods for cDNA library preparations. We identified alternatively spliced mRNA isoforms in over a half of the multi-exonic fungal genes. Over one-thousand isoforms involve TopHat novel splice junction; multiple types of combinatory alternative splicing patterns were identified. We showed that one Verticillium gene could use four different 5' splice sites and two different 3' donor sites to produce up to five mature mRNAs, representing one of the most sophisticated alternative splicing model in eukaryotes other than animals. Hundreds of novel intron types involving a pair of new splice sites were identified in the V. dahliae genome. All the types of AS events were validated by using RT-PCR. Functional enrichment analysis showed that AS genes are involved in most known biological functions and enriched in ATP biosynthesis, sexual/asexual reproduction, morphogenesis, signal transduction etc., predicting that the AS regulation modulates mRNA isoform output and shapes the V. dahliae proteome plasticity of the pathogen in response to the environmental and developmental changes. These findings demonstrate the comprehensive alternative splicing mechanisms in a fungal plant pathogen, which argues the importance of this fungus in developing complicate genome regulation strategies in eukaryotes.
Yu, Xie; Hai-Yan, Liao; Shu-Jie, Chen; Ling-Yu, Shi; Li-Yan, Ou; Ping-Ying, Teng; Dan, Xia; Qi-Wei, Chen; Sinan, Zheng; Xiao-Hong, Zhou
2016-07-12
To clone and identify the heat shock factors (HSFs) of Schistosoma japonicum and analyze its molecular structure and alternative splicing pattern. The New Zealand rabbits were infected with the cercariae of Schistosoma japonicum and were killed and dissected 42 days post-infection, and the adult worms of S. japonicum and the livers of the rabbits were harvested. Then, the total RNA was extracted by using Trizol reagent. The Sj-hsf open reading frame (ORF) and the alternative splicing fragments were amplified by RT-PCR from the female, male and egg samples, then cloned and verified by enzyme digestion and sequencing. DNAMAN 8.0, InterPro, Mega 6 combined with the Internet databases were utilized to clarify the gene structure, functional domains, alternative splicing pattern, and the homology and phylogenetic tree of HSFs. Sj-hsf ORF and the alternative splicing fragments were amplified from the female, male and egg samples of S. japonicum by RT-PCR. After cloning, the positive recombinant plasmids pB Sj HSFf-F, pB Sj HSFf-M, pB Sj HSFf-E containing Sj-hsf ORF, pB Sj HSFs-F, pB Sj HSFs-M, pB Sj HSFs-E with Sj-hsf alternative splicing fragments were identified by enzyme digestion and sequencing. Three alternative splicing Sj-hsf isoforms were observed through sequence analysis: Sj-hsf -isoform1 (2 050 bp), Sj-hsf -isoform2 (2 086 bp) and Sj - hsf -isoform3 (2 111 bp); the GenBank accession numbers were KU954546, KX119143 and KX119144, respectively. All the three isoforms located in the same Contig SJC_S000780 of S. japonicum genome and all expressed at female, male and egg stages, but Sj-hsf -isoform1 with a high-level expression. Sj -HSF-isoform1 (671 aa) and Sj -HSF-isoform2 (683 aa) had DBD (DNA binding domain), HR-A/B and HR-C domains, while Sj -HSF-isoform3 (282 aa) stopped in advance without HR-C domain. Phylogenetic tree analysis of HSFs illustrated that Sj - HSFs belonged to HSF1 family, with a close phylogenetic relationship to Sm -HSFs. There are three alternative splicing isoforms of Sj -HSF existing in the female, male and egg stages of S. japonicum , but Sj -HSF-isoform1 expresses in a high-level. This study lays the foundation for further study on molecular mechanisms of Sj- HSFs in regulating the heat shock response system.
NASA Astrophysics Data System (ADS)
Hsu, Justin Bo-Kai; Huang, Kai-Yao; Weng, Tzu-Ya; Huang, Chien-Hsun; Lee, Tzong-Yi
2014-01-01
Machinery of pre-mRNA splicing is carried out through the interaction of RNA sequence elements and a variety of RNA splicing-related proteins (SRPs) (e.g. spliceosome and splicing factors). Alternative splicing, which is an important post-transcriptional regulation in eukaryotes, gives rise to multiple mature mRNA isoforms, which encodes proteins with functional diversities. However, the regulation of RNA splicing is not yet fully elucidated, partly because SRPs have not yet been exhaustively identified and the experimental identification is labor-intensive. Therefore, we are motivated to design a new method for identifying SRPs with their functional roles in the regulation of RNA splicing. The experimentally verified SRPs were manually curated from research articles. According to the functional annotation of Splicing Related Gene Database, the collected SRPs were further categorized into four functional groups including small nuclear Ribonucleoprotein, Splicing Factor, Splicing Regulation Factor and Novel Spliceosome Protein. The composition of amino acid pairs indicates that there are remarkable differences among four functional groups of SRPs. Then, support vector machines (SVMs) were utilized to learn the predictive models for identifying SRPs as well as their functional roles. The cross-validation evaluation presents that the SVM models trained with significant amino acid pairs and functional domains could provide a better predictive performance. In addition, the independent testing demonstrates that the proposed method could accurately identify SRPs in mammals/plants as well as effectively distinguish between SRPs and RNA-binding proteins. This investigation provides a practical means to identifying potential SRPs and a perspective for exploring the regulation of RNA splicing.
Hsu, Justin Bo-Kai; Huang, Kai-Yao; Weng, Tzu-Ya; Huang, Chien-Hsun; Lee, Tzong-Yi
2014-01-01
Machinery of pre-mRNA splicing is carried out through the interaction of RNA sequence elements and a variety of RNA splicing-related proteins (SRPs) (e.g. spliceosome and splicing factors). Alternative splicing, which is an important post-transcriptional regulation in eukaryotes, gives rise to multiple mature mRNA isoforms, which encodes proteins with functional diversities. However, the regulation of RNA splicing is not yet fully elucidated, partly because SRPs have not yet been exhaustively identified and the experimental identification is labor-intensive. Therefore, we are motivated to design a new method for identifying SRPs with their functional roles in the regulation of RNA splicing. The experimentally verified SRPs were manually curated from research articles. According to the functional annotation of Splicing Related Gene Database, the collected SRPs were further categorized into four functional groups including small nuclear Ribonucleoprotein, Splicing Factor, Splicing Regulation Factor and Novel Spliceosome Protein. The composition of amino acid pairs indicates that there are remarkable differences among four functional groups of SRPs. Then, support vector machines (SVMs) were utilized to learn the predictive models for identifying SRPs as well as their functional roles. The cross-validation evaluation presents that the SVM models trained with significant amino acid pairs and functional domains could provide a better predictive performance. In addition, the independent testing demonstrates that the proposed method could accurately identify SRPs in mammals/plants as well as effectively distinguish between SRPs and RNA-binding proteins. This investigation provides a practical means to identifying potential SRPs and a perspective for exploring the regulation of RNA splicing.
Zhao, Shan; Lu, Xin; Zhang, Yueling; Zhao, Xianliang; Zhong, Mingqi; Li, Shengkang; Lun, Jingsheng
2013-01-01
Recent evidences suggest that invertebrates express families of immune molecules with high levels of sequence diversity. Hemocyanin is an important non-specific immune molecule present in the hemolymph of both mollusks and arthropods. In the present study, we characterized a novel alternative splicing variant of hemocyanin (cHE1) from Litopenaeus vannamei that produced mRNA transcript of 2579 bp in length. The isoform contained two additional sequences of 296 and 267 bp in the 5'- and 3'-terminus respectively, in comparison to that of wild type hemocyanin (cHE). Sequence of cHE1 shows 100% identity to that of hemocyanin genomic DNA (HE, which does not form an open reading frame), suggesting that cHE1 might be an alternative splicing variant due to intron retention. Moreover, cHE1 could be detected by RT-PCR from five tissues (heart, gill, stomach, intestine and brain), and from shrimps at stages from nauplius to mysis larva. Further, cHE1 mRNA transcripts were significantly increased in hearts after 12h of infection with Vibrio parahemolyticus or poly I: C, while no significant difference in the transcript levels of hepatopancreas cHE was detected in the pathogen-treated shrimps during the period. In summary, these studies suggested a novel splicing variant of hemocyanin in shrimp, which might be involved in shrimp resistance to pathogenic infection. Copyright © 2013 Elsevier B.V. All rights reserved.
Congenital analbuminemia caused by a novel aberrant splicing in the albumin gene.
Caridi, Gianluca; Dagnino, Monica; Erdeve, Omer; Di Duca, Marco; Yildiz, Duran; Alan, Serdar; Atasay, Begum; Arsan, Saadet; Campagnoli, Monica; Galliano, Monica; Minchiotti, Lorenzo
2014-01-01
Congenital analbuminemia is a rare autosomal recessive disorder manifested by the presence of a very low amount of circulating serum albumin. It is an allelic heterogeneous defect, caused by variety of mutations within the albumin gene in homozygous or compound heterozygous state. Herein we report the clinical and molecular characterization of a new case of congenital analbuminemia diagnosed in a female newborn of consanguineous (first degree cousins) parents from Ankara, Turkey, who presented with a low albumin concentration (< 8 g/L) and severe clinical symptoms. The albumin gene of the index case was screened by single-strand conformation polymorphism, heteroduplex analysis, and direct DNA sequencing. The effect of the splicing mutation was evaluated by examining the cDNA obtained by reverse transcriptase - polymerase chain reaction (RT-PCR) from the albumin mRNA extracted from proband's leukocytes. DNA sequencing revealed that the proband is homozygous, and both parents are heterozygous, for a novel G>A transition at position c.1652+1, the first base of intron 12, which inactivates the strongly conserved GT dinucleotide at the 5' splice site consensus sequence of this intron. The splicing defect results in the complete skipping of the preceding exon (exon 12) and in a frame-shift within exon 13 with a premature stop codon after the translation of three mutant amino acid residues. Our results confirm the clinical diagnosis of congenital analbuminemia in the proband and the inheritance of the trait and contribute to shed light on the molecular genetics of analbuminemia.
DeVry, C G; Tsai, W; Clarke, S
1996-11-15
The protein L-isoaspartyl/D-aspartyl O-methyltransferase (EC 2.1.1.77) catalyzes the first step in the repair of proteins damaged in the aging process by isomerization or racemization reactions at aspartyl and asparaginyl residues. A single gene has been localized to human chromosome 6 and multiple transcripts arising through alternative splicing have been identified. Restriction enzyme mapping, subcloning, and DNA sequence analysis of three overlapping clones from a human genomic library in bacteriophage P1 indicate that the gene spans approximately 60 kb and is composed of 8 exons interrupted by 7 introns. Analysis of intron/exon splice junctions reveals that all of the donor and acceptor splice sites are in agreement with the mammalian consensus splicing sequence. Determination of transcription initiation sites by primer extension analysis of poly(A)+ mRNA from human brain identifies multiple start sites, with a major site 159 nucleotides upstream from the ATG start codon. Sequence analysis of the 5'-untranslated region demonstrates several potential cis-acting DNA elements including SP1, ETF, AP1, AP2, ARE, XRE, CREB, MED-1, and half-palindromic ERE motifs. The promoter of this methyltransferase gene lacks an identifiable TATA box but is characterized by a CpG island which begins approximately 723 nucleotides upstream of the major transcriptional start site and extends through exon 1 and into the first intron. These features are characteristic of housekeeping genes and are consistent with the wide tissue distribution observed for this methyltransferase activity.
Expanding the action of duplex RNAs into the nucleus: redirecting alternative splicing
Liu, Jing; Hu, Jiaxin; Corey, David R.
2012-01-01
Double-stranded RNAs are powerful agents for silencing gene expression in the cytoplasm of mammalian cells. The potential for duplex RNAs to control expression in the nucleus has received less attention. Here, we investigate the ability of small RNAs to redirect splicing. We identify RNAs targeting an aberrant splice site that restore splicing and production of functional protein. RNAs can target sequences within exons or introns and affect the inclusion of exons within SMN2 and dystrophin, genes responsible for spinal muscular atrophy and Duchenne muscular dystrophy, respectively. Duplex RNAs recruit argonaute 2 (AGO2) to pre-mRNA transcripts and altered splicing requires AGO2 expression. AGO2 promotes transcript cleavage in the cytoplasm, but recruitment of AGO2 to pre-mRNAs does not reduce transcript levels, exposing a difference between cytoplasmic and nuclear pathways. Involvement of AGO2 in splicing, a classical nuclear process, reinforces the conclusion from studies of RNA-mediated transcriptional silencing that RNAi pathways can be adapted to function in the mammalian nucleus. These data provide a new strategy for controlling splicing and expand the reach of small RNAs within the nucleus of mammalian cells. PMID:21948593
Shabanpoor, Fazel; McClorey, Graham; Saleh, Amer F.; Järver, Peter; Wood, Matthew J.A.; Gait, Michael J.
2015-01-01
The potential for therapeutic application of splice-switching oligonucleotides (SSOs) to modulate pre-mRNA splicing is increasingly evident in a number of diseases. However, the primary drawback of this approach is poor cell and in vivo oligonucleotide uptake efficacy. Biological activities can be significantly enhanced through the use of synthetically conjugated cationic cell penetrating peptides (CPPs). Studies to date have focused on the delivery of a single SSO conjugated to a CPP, but here we describe the conjugation of two phosphorodiamidate morpholino oligonucleotide (PMO) SSOs to a single CPP for simultaneous delivery and pre-mRNA targeting of two separate genes, exon 23 of the Dmd gene and exon 5 of the Acvr2b gene, in a mouse model of Duchenne muscular dystrophy. Conjugations of PMOs to a single CPP were carried out through an amide bond in one case and through a triazole linkage (‘click chemistry’) in the other. The most active bi-specific CPP–PMOs demonstrated comparable exon skipping levels for both pre-mRNA targets when compared to individual CPP–PMO conjugates both in cell culture and in vivo in the mdx mouse model. Thus, two SSOs with different target sequences conjugated to a single CPP are biologically effective and potentially suitable for future therapeutic exploitation. PMID:25468897
Liao, Wei; Jordaan, Gwen; Nham, Phillipp; Phan, Ryan T; Pelegrini, Matteo; Sharma, Sanjai
2015-10-16
To determine differentially expressed and spliced RNA transcripts in chronic lymphocytic leukemia specimens a high throughput RNA-sequencing (HTS RNA-seq) analysis was performed. Ten CLL specimens and five normal peripheral blood CD19+ B cells were analyzed by HTS RNA-seq. The library preparation was performed with Illumina TrueSeq RNA kit and analyzed by Illumina HiSeq 2000 sequencing system. An average of 48.5 million reads for B cells, and 50.6 million reads for CLL specimens were obtained with 10396 and 10448 assembled transcripts for normal B cells and primary CLL specimens respectively. With the Cuffdiff analysis, 2091 differentially expressed genes (DEG) between B cells and CLL specimens based on FPKM (fragments per kilobase of transcript per million reads and false discovery rate, FDR q < 0.05, fold change >2) were identified. Expression of selected DEGs (n = 32) with up regulated and down regulated expression in CLL from RNA-seq data were also analyzed by qRT-PCR in a test cohort of CLL specimens. Even though there was a variation in fold expression of DEG genes between RNA-seq and qRT-PCR; more than 90 % of analyzed genes were validated by qRT-PCR analysis. Analysis of RNA-seq data for splicing alterations in CLL and B cells was performed by Multivariate Analysis of Transcript Splicing (MATS analysis). Skipped exon was the most frequent splicing alteration in CLL specimens with 128 significant events (P-value <0.05, minimum inclusion level difference >0.1). The RNA-seq analysis of CLL specimens identifies novel DEG and alternatively spliced genes that are potential prognostic markers and therapeutic targets. High level of validation by qRT-PCR for a number of DEG genes supports the accuracy of this analysis. Global comparison of transcriptomes of B cells, IGVH non-mutated CLL (U-CLL) and mutated CLL specimens (M-CLL) with multidimensional scaling analysis was able to segregate CLL and B cell transcriptomes but the M-CLL and U-CLL transcriptomes were indistinguishable. The analysis of HTS RNA-seq data to identify alternative splicing events and other genetic abnormalities specific to CLL is an added advantage of RNA-seq that is not feasible with other genome wide analysis.
Prchalova, Darina; Havlovicova, Marketa; Sterbova, Katalin; Stranecky, Viktor; Hancarova, Miroslava; Sedlacek, Zdenek
2017-06-02
Whole exome sequencing is a powerful tool for the analysis of genetically heterogeneous conditions. The prioritization of variants identified often focuses on nonsense, frameshift and canonical splice site mutations, and highly deleterious missense variants, although other defects can also play a role. The definition of the phenotype range and course of rare genetic conditions requires long-term clinical follow-up of patients. We report an adult female patient with severe intellectual disability, severe speech delay, epilepsy, autistic features, aggressiveness, sleep problems, broad-based clumsy gait and constipation. Whole exome sequencing identified a de novo mutation in the SYNGAP1 gene. The variant was located in the broader splice donor region of intron 10 and replaced G by A at position +5 of the splice site. The variant was predicted in silico and shown experimentally to abolish the regular splice site and to activate a cryptic donor site within exon 10, causing frameshift and premature termination. The overall clinical picture of the patient corresponded well with the characteristic SYNGAP1-associated phenotype observed in previously reported patients. However, our patient was 31 years old which contrasted with most other published SYNGAP1 cases who were much younger. Our patient had a significant growth delay and microcephaly. Both features normalised later, although the head circumference stayed only slightly above the lower limit of the norm. The patient had a delayed puberty. Her cognitive and language performance remained at the level of a one-year-old child even in adulthood and showed a slow decline. Myopathic facial features and facial dysmorphism became more pronounced with age. Although the gait of the patient was unsteady in childhood, more severe gait problems developed in her teens. While the seizures remained well-controlled, her aggressive behaviour worsened with age and required extensive medication. The finding in our patient underscores the notion that the interpretation of variants identified using whole exome sequencing should focus not only on variants in the canonical splice dinucleotides GT and AG, but also on broader splice regions. The long-term clinical follow-up of our patient contributes to the knowledge of the developmental trajectory in individuals with SYNGAP1 gene defects.
Global regulation of alternative RNA splicing by the SR-rich protein RBM39.
Mai, Sanyue; Qu, Xiuhua; Li, Ping; Ma, Qingjun; Cao, Cheng; Liu, Xuan
2016-08-01
RBM39 is a serine/arginine-rich RNA-binding protein that is highly homologous to the splicing factor U2AF65. However, the role of RBM39 in alternative splicing is poorly understood. In this study, RBM39-mediated global alternative splicing was investigated using RNA-Seq and genome-wide RBM39-RNA interactions were mapped via cross-linking and immunoprecipitation coupled with deep sequencing (CLIP-Seq) in wild-type and RBM39-knockdown MCF-7 cells. RBM39 was involved in the up- or down-regulation of the transcript levels of various genes. Hundreds of alternative splicing events regulated by endogenous RBM39 were identified. The majority of these events were cassette exons. Genes containing RBM39-regulated alternative exons were found to be linked to G2/M transition, cellular response to DNA damage, adherens junctions and endocytosis. CLIP-Seq analysis showed that the binding site of RBM39 was mainly in proximity to 5' and 3' splicing sites. Considerable RBM39 binding to mRNAs encoding proteins involved in translation was observed. Of particular importance, ~20% of the alternative splicing events that were significantly regulated by RBM39 were similarly regulated by U2AF65. RBM39 is extensively involved in alternative splicing of RNA and helps regulate transcript levels. RBM39 may modulate alternative splicing similarly to U2AF65 by either directly binding to RNA or recruiting other splicing factors, such as U2AF65. The current study offers a genome-wide view of RBM39's regulatory function in alternative splicing. RBM39 may play important roles in multiple cellular processes by regulating both alternative splicing of RNA molecules and transcript levels. Copyright © 2016 Elsevier B.V. All rights reserved.
Mapping RNA-seq Reads with STAR
Dobin, Alexander; Gingeras, Thomas R.
2015-01-01
Mapping of large sets of high-throughput sequencing reads to a reference genome is one of the foundational steps in RNA-seq data analysis. The STAR software package performs this task with high levels of accuracy and speed. In addition to detecting annotated and novel splice junctions, STAR is capable of discovering more complex RNA sequence arrangements, such as chimeric and circular RNA. STAR can align spliced sequences of any length with moderate error rates providing scalability for emerging sequencing technologies. STAR generates output files that can be used for many downstream analyses such as transcript/gene expression quantification, differential gene expression, novel isoform reconstruction, signal visualization, and so forth. In this unit we describe computational protocols that produce various output files, use different RNA-seq datatypes, and utilize different mapping strategies. STAR is Open Source software that can be run on Unix, Linux or Mac OS X systems. PMID:26334920
Mapping RNA-seq Reads with STAR.
Dobin, Alexander; Gingeras, Thomas R
2015-09-03
Mapping of large sets of high-throughput sequencing reads to a reference genome is one of the foundational steps in RNA-seq data analysis. The STAR software package performs this task with high levels of accuracy and speed. In addition to detecting annotated and novel splice junctions, STAR is capable of discovering more complex RNA sequence arrangements, such as chimeric and circular RNA. STAR can align spliced sequences of any length with moderate error rates, providing scalability for emerging sequencing technologies. STAR generates output files that can be used for many downstream analyses such as transcript/gene expression quantification, differential gene expression, novel isoform reconstruction, and signal visualization. In this unit, we describe computational protocols that produce various output files, use different RNA-seq datatypes, and utilize different mapping strategies. STAR is open source software that can be run on Unix, Linux, or Mac OS X systems. Copyright © 2015 John Wiley & Sons, Inc.
Goyal, K; Browne, J A; Burnell, A M; Tunnacliffe, A
2005-06-01
Accumulation of the non-reducing disaccharide trehalose is associated with desiccation tolerance during anhydrobiosis in a number of invertebrates, but there is little information on trehalose biosynthetic genes in these organisms. We have identified two trehalose-6-phosphate synthase (tps) genes in the anhydrobiotic nematode Aphelenchus avenae and determined full length cDNA sequences for both; for comparison, full length tps cDNAs from the model nematode, Caenorhabditis elegans, have also been obtained. The A. avenae genes encode very similar proteins containing the catalytic domain characteristic of the GT-20 family of glycosyltransferases and are most similar to tps-2 of C. elegans; no evidence was found for a gene in A. avenae corresponding to Ce-tps-1. Analysis of A. avenae tps cDNAs revealed several features of interest, including alternative trans-splicing of spliced leader sequences in Aav-tps-1, and four different, novel SL1-related trans-spliced leaders, which were different to the canonical SL1 sequence found in all other nematodes studied. The latter observation suggests that A. avenae does not comply with the strict evolutionary conservation of SL1 sequences observed in other species. Unusual features were also noted in predicted nematode TPS proteins, which distinguish them from homologues in other higher eukaryotes (plants and insects) and in micro-organisms. Phylogenetic analysis confirmed their membership of the GT-20 glycosyltransferase family, but indicated an accelerated rate of molecular evolution. Furthermore, nematode TPS proteins possess N- and C-terminal domains, which are unrelated to those of other eukaryotes: nematode C-terminal domains, for example, do not contain trehalose-6-phosphate phosphatase-like sequences, as seen in plant and insect homologues. During onset of anhydrobiosis, both tps genes in A. avenae are upregulated, but exposure to cold or increased osmolarity also results in gene induction, although to a lesser extent. Trehalose seems likely therefore to play a role in a number of stress responses in nematodes.
Wistow, Graeme; Bernstein, Steven L; Wyatt, M Keith; Behal, Amita; Touchman, Jeffrey W; Bouffard, Gerald; Smith, Don; Peterson, Katherine
2002-06-15
To explore the expression profile of the human lens and to provide a resource for microarray studies, expressed sequence tag (EST) analysis has been performed on cDNA libraries from adult lenses. A cDNA library was constructed from two adult (40 year old) human lenses. Over two thousand clones were sequenced from the unamplified, un-normalized library. The library was then normalized and a further 2200 sequences were obtained. All the data were analyzed using GRIST (GRouping and Identification of Sequence Tags), a procedure for gene identification and clustering. The lens library (by) contains a low percentage of non-mRNA contaminants and a high fraction (over 75%) of apparently full length cDNA clones. Approximately 2000 reads from the unamplified library yields 810 clusters, potentially representing individual genes expressed in the lens. After normalization, the content of crystallins and other abundant cDNAs is markedly reduced and a similar number of reads from this library (fs) yields 1455 unique groups of which only two thirds correspond to named genes in GenBank. Among the most abundant cDNAs is one for a novel gene related to glutamine synthetase, which was designated "lengsin" (LGS). Analyses of ESTs also reveal examples of alternative transcripts, including a major alternative splice form for the lens specific membrane protein MP19. Variant forms for other transcripts, including those encoding the apoptosis inhibitor Livin and the armadillo repeat protein ARVCF, are also described. The lens cDNA libraries are a resource for gene discovery, full length cDNAs for functional studies and microarrays. The discovery of an abundant, novel transcript, lengsin, and a major novel splice form of MP19 reflect the utility of unamplified libraries constructed from dissected tissue. Many novel transcripts and splice forms are represented, some of which may be candidates for genetic diseases.
Evolution of Nova-Dependent Splicing Regulation in the Brain
Živin, Marko; Darnell, Robert B
2007-01-01
A large number of alternative exons are spliced with tissue-specific patterns, but little is known about how such patterns have evolved. Here, we study the conservation of the neuron-specific splicing factors Nova1 and Nova2 and of the alternatively spliced exons they regulate in mouse brain. Whereas Nova RNA binding domains are 94% identical across vertebrate species, Nova-dependent splicing silencer and enhancer elements (YCAY clusters) show much greater divergence, as less than 50% of mouse YCAY clusters are conserved at orthologous positions in the zebrafish genome. To study the relation between the evolution of tissue-specific splicing and YCAY clusters, we compared the brain-specific splicing of Nova-regulated exons in zebrafish, chicken, and mouse. The presence of YCAY clusters in lower vertebrates invariably predicted conservation of brain-specific splicing across species, whereas their absence in lower vertebrates correlated with a loss of alternative splicing. We hypothesize that evolution of Nova-regulated splicing in higher vertebrates proceeds mainly through changes in cis-acting elements, that tissue-specific splicing might in some cases evolve in a single step corresponding to evolution of a YCAY cluster, and that the conservation level of YCAY clusters relates to the functions encoded by the regulated RNAs. PMID:17937501
Pre-mRNA splicing in cancer: the relevance in oncogenesis, treatment and drug resistance.
Wojtuszkiewicz, Anna; Assaraf, Yehuda G; Maas, Marielle J P; Kaspers, Gertjan J L; Jansen, Gerrit; Cloos, Jacqueline
2015-05-01
Aberrant pre-mRNA splicing in cancer is emerging as an important determinant of oncogenesis, response to treatment and anticancer drug resistance. At the same time, the spliceosome has become a target for a novel class of pre-clinical chemotherapeutics with a potential future application in cancer treatment. Taken together, these findings offer novel opportunities for the enhancement of the efficacy of cancer therapy. This review presents a comprehensive overview of the molecular mechanisms involved in splicing and current developments regarding splicing aberrations in relation to several aspects of cancer formation and therapy. Identified mutations in the various components of the spliceosome and their implications for cancer prognosis are delineated. Moreover, the contribution of abnormal splicing patterns as well as deregulated splicing factors to chemoresistance is discussed, along with novel splicing-based therapeutic approaches. Significant progress has been made in deciphering the role of splicing factors in cancer including carcinogenesis and drug resistance. Splicing-based prognostic tools as well as therapeutic options hold great potential towards improvements in cancer therapy. However, gaining more in-depth molecular insight into the consequences of mutations in various components of the splicing machinery as well as of cellular effects of spliceosome inhibition is a prerequisite to establish the role of splicing in tumor progression and treatment options, respectively.
Global Profiling of the Cellular Alternative RNA Splicing Landscape during Virus-Host Interactions
Boudreault, Simon; Martenon-Brodeur, Camille; Caron, Marie; Garant, Jean-Michel; Tremblay, Marie-Pier; Armero, Victoria E. S.; Durand, Mathieu; Lapointe, Elvy; Thibault, Philippe; Tremblay-Létourneau, Maude; Perreault, Jean-Pierre; Scott, Michelle S.; Lemay, Guy; Bisaillon, Martin
2016-01-01
Alternative splicing (AS) is a central mechanism of genetic regulation which modifies the sequence of RNA transcripts in higher eukaryotes. AS has been shown to increase both the variability and diversity of the cellular proteome by changing the composition of resulting proteins through differential choice of exons to be included in mature mRNAs. In the present study, alterations to the global RNA splicing landscape of cellular genes upon viral infection were investigated using mammalian reovirus as a model. Our study provides the first comprehensive portrait of global changes in the RNA splicing signatures that occur in eukaryotic cells following infection with a human virus. We identify 240 modified alternative splicing events upon infection which belong to transcripts frequently involved in the regulation of gene expression and RNA metabolism. Using mass spectrometry, we also confirm modifications to transcript-specific peptides resulting from AS in virus-infected cells. These findings provide additional insights into the complexity of virus-host interactions as these splice variants expand proteome diversity and function during viral infection. PMID:27598998
Global Profiling of the Cellular Alternative RNA Splicing Landscape during Virus-Host Interactions.
Boudreault, Simon; Martenon-Brodeur, Camille; Caron, Marie; Garant, Jean-Michel; Tremblay, Marie-Pier; Armero, Victoria E S; Durand, Mathieu; Lapointe, Elvy; Thibault, Philippe; Tremblay-Létourneau, Maude; Perreault, Jean-Pierre; Scott, Michelle S; Lemay, Guy; Bisaillon, Martin
2016-01-01
Alternative splicing (AS) is a central mechanism of genetic regulation which modifies the sequence of RNA transcripts in higher eukaryotes. AS has been shown to increase both the variability and diversity of the cellular proteome by changing the composition of resulting proteins through differential choice of exons to be included in mature mRNAs. In the present study, alterations to the global RNA splicing landscape of cellular genes upon viral infection were investigated using mammalian reovirus as a model. Our study provides the first comprehensive portrait of global changes in the RNA splicing signatures that occur in eukaryotic cells following infection with a human virus. We identify 240 modified alternative splicing events upon infection which belong to transcripts frequently involved in the regulation of gene expression and RNA metabolism. Using mass spectrometry, we also confirm modifications to transcript-specific peptides resulting from AS in virus-infected cells. These findings provide additional insights into the complexity of virus-host interactions as these splice variants expand proteome diversity and function during viral infection.
Martínez-Montiel, Nancy; Rosas-Murrieta, Nora Hilda; Martínez-Montiel, Mónica; Gaspariano-Cholula, Mayra Patricia; Martínez-Contreras, Rebeca D
2016-01-01
In eukaryotes, genes are frequently interrupted with noncoding sequences named introns. Alternative splicing is a nuclear mechanism by which these introns are removed and flanking coding regions named exons are joined together to generate a message that will be translated in the cytoplasm. This mechanism is catalyzed by a complex machinery known as the spliceosome, which is conformed by more than 300 proteins and ribonucleoproteins that activate and regulate the precision of gene expression when assembled. It has been proposed that several genetic diseases are related to defects in the splicing process, including cancer. For this reason, natural products that show the ability to regulate splicing have attracted enormous attention due to its potential use for cancer treatment. Some microbial metabolites have shown the ability to inhibit gene splicing and the molecular mechanism responsible for this inhibition is being studied for future applications. Here, we summarize the main types of natural products that have been characterized as splicing inhibitors, the recent advances regarding molecular and cellular effects related to these molecules, and the applications reported so far in cancer therapeutics.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Solera, J.; Magallon, M.; Martin-Villar, J.
1992-02-01
DNA from a patient with severe hemophilia B was evaluated by RFLP analysis, producing results which suggested the existence of a partial deletion within the factor IX gene. The deletion was further localized and characterized by PCR amplification and sequencing. The altered allele has a 4,442-bp deletion which removes both the donor splice site located at the 5[prime] end of intron d and the two last coding nucleotides located at the 3[prime] end of exon IV in the normal factor IX gene; this fragment has been inserted in inverted orientation. Two homologous sequences have been discovered at the ends ofmore » the deleted DNA fragment.« less
Becker, Y; Asher, Y; Tabor, E; Davidson, I; Malkinson, M
1994-01-01
A DNA segment of the MDV-1 BamHI-D fragment was sequenced, and the open reading frames (ORFs) present in the 4556 nucleotide fragment were analyzed by computer programs. Computer analysis identified 19 putative ORFs in the sequence ranging from a coding capacity of 37 amino acids (aa) (ORF-1a) to 684aa (ORF-1). The special properties of four ORFs (1a, 1, 2, and 3) were investigated. Two adjacent ORFs, ORF-1a and ORF-1, were found by computer analysis to have the properties of two introns encoding a glycoprotein: ORF-1a encodes an aa sequence with the properties of a signal peptide, and ORF-1 encodes a polypeptide with a membrane anchor domain and putative N-glycosylation sites in the aa sequence. ORF-1a and ORF-1 were found to be transcribed in MDV-1-infected cells. Two RNA transcripts were detected: a precursor RNA and its spliced form. Both are transcribed from a promoter located 5' to ORF-1a, and splice donor and acceptor sites are used to splice the mRNA after cleavage of a 71-nucleotide sequence. This finding suggest that ORF-1a and ORF-1 are two introns of a new MDV-1 glycoprotein gene. The DNA sequence containing ORF-1 was transiently expressed in COS-1 cells, and the viral protein produced in these cells was found to react with anti-MDV serotype-1 Antigen B-specific monoclonal antibodies. These studies indicate that the protein encoded by ORF-1 has antigenic properties resembling Antigen B of MDV-1. A gene homologous to ORF-1 was detected in the genome of both MDV-2(SB1) and MDV-3(HVT), which serve as commercial vaccine strains. Two additional ORFs were noted in the 4556 nucleotide sequence: ORF-2, which encodes a 333 aa polypeptide initiating in the UL and terminating in the TRL prior to the putative origin of replication, and ORF-3, which encodes a 155 aa polypeptide that is partly homologous to the phosphoprotein pp38 encoded by the BamHI-H sequence. The 65 N-terminal aa of the two gene products are identical, both being derived from the nucleotide sequences in the TRL and IRL, respectively. Additional homologous aa sequences are the hydrophobic aa domain in the middle of both proteins. The functions of ORF-2, ORF-3, and additional ORFs are under study.
The carnegie protein trap library: a versatile tool for Drosophila developmental studies.
Buszczak, Michael; Paterno, Shelley; Lighthouse, Daniel; Bachman, Julia; Planck, Jamie; Owen, Stephenie; Skora, Andrew D; Nystul, Todd G; Ohlstein, Benjamin; Allen, Anna; Wilhelm, James E; Murphy, Terence D; Levis, Robert W; Matunis, Erika; Srivali, Nahathai; Hoskins, Roger A; Spradling, Allan C
2007-03-01
Metazoan physiology depends on intricate patterns of gene expression that remain poorly known. Using transposon mutagenesis in Drosophila, we constructed a library of 7404 protein trap and enhancer trap lines, the Carnegie collection, to facilitate gene expression mapping at single-cell resolution. By sequencing the genomic insertion sites, determining splicing patterns downstream of the enhanced green fluorescent protein (EGFP) exon, and analyzing expression patterns in the ovary and salivary gland, we found that 600-900 different genes are trapped in our collection. A core set of 244 lines trapped different identifiable protein isoforms, while insertions likely to act as GFP-enhancer traps were found in 256 additional genes. At least 8 novel genes were also identified. Our results demonstrate that the Carnegie collection will be useful as a discovery tool in diverse areas of cell and developmental biology and suggest new strategies for greatly increasing the coverage of the Drosophila proteome with protein trap insertions.
2014-01-01
Background Alternative splicing is an important process in higher eukaryotes that allows obtaining several transcripts from one gene. A specific case of alternative splicing is mutually exclusive splicing, in which exactly one exon out of a cluster of neighbouring exons is spliced into the mature transcript. Recently, a new algorithm for the prediction of these exons has been developed based on the preconditions that the exons of the cluster have similar lengths, sequence homology, and conserved splice sites, and that they are translated in the same reading frame. Description In this contribution we introduce Kassiopeia, a database and web application for the generation, storage, and presentation of genome-wide analyses of mutually exclusive exomes. Currently, Kassiopeia provides access to the mutually exclusive exomes of twelve Drosophila species, the thale cress Arabidopsis thaliana, the flatworm Caenorhabditis elegans, and human. Mutually exclusive spliced exons (MXEs) were predicted based on gene reconstructions from Scipio. Based on the standard prediction values, with which 83.5% of the annotated MXEs of Drosophila melanogaster were reconstructed, the exomes contain surprisingly more MXEs than previously supposed and identified. The user can search Kassiopeia using BLAST or browse the genes of each species optionally adjusting the parameters used for the prediction to reveal more divergent or only very similar exon candidates. Conclusions We developed a pipeline to predict MXEs in the genomes of several model organisms and a web interface, Kassiopeia, for their visualization. For each gene Kassiopeia provides a comprehensive gene structure scheme, the sequences and predicted secondary structures of the MXEs, and, if available, further evidence for MXE candidates from cDNA/EST data, predictions of MXEs in homologous genes of closely related species, and RNA secondary structure predictions. Kassiopeia can be accessed at http://www.motorprotein.de/kassiopeia. PMID:24507667
Iqbal, Muhammad; Hayat, Maqsood
2016-05-01
Gene splicing is a vital source of protein diversity. Perfectly eradication of introns and joining exons is the prominent task in eukaryotic gene expression, as exons are usually interrupted by introns. Identification of splicing sites through experimental techniques is complicated and time-consuming task. With the avalanche of genome sequences generated in the post genomic age, it remains a complicated and challenging task to develop an automatic, robust and reliable computational method for fast and effective identification of splicing sites. In this study, a hybrid model "iSS-Hyb-mRMR" is proposed for quickly and accurately identification of splicing sites. Two sample representation methods namely; pseudo trinucleotide composition (PseTNC) and pseudo tetranucleotide composition (PseTetraNC) were used to extract numerical descriptors from DNA sequences. Hybrid model was developed by concatenating PseTNC and PseTetraNC. In order to select high discriminative features, minimum redundancy maximum relevance algorithm was applied on the hybrid feature space. The performance of these feature representation methods was tested using various classification algorithms including K-nearest neighbor, probabilistic neural network, general regression neural network, and fitting network. Jackknife test was used for evaluation of its performance on two benchmark datasets S1 and S2, respectively. The predictor, proposed in the current study achieved an accuracy of 93.26%, sensitivity of 88.77%, and specificity of 97.78% for S1, and the accuracy of 94.12%, sensitivity of 87.14%, and specificity of 98.64% for S2, respectively. It is observed, that the performance of proposed model is higher than the existing methods in the literature so for; and will be fruitful in the mechanism of RNA splicing, and other research academia. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
RAP: RNA-Seq Analysis Pipeline, a new cloud-based NGS web application.
D'Antonio, Mattia; D'Onorio De Meo, Paolo; Pallocca, Matteo; Picardi, Ernesto; D'Erchia, Anna Maria; Calogero, Raffaele A; Castrignanò, Tiziana; Pesole, Graziano
2015-01-01
The study of RNA has been dramatically improved by the introduction of Next Generation Sequencing platforms allowing massive and cheap sequencing of selected RNA fractions, also providing information on strand orientation (RNA-Seq). The complexity of transcriptomes and of their regulative pathways make RNA-Seq one of most complex field of NGS applications, addressing several aspects of the expression process (e.g. identification and quantification of expressed genes and transcripts, alternative splicing and polyadenylation, fusion genes and trans-splicing, post-transcriptional events, etc.). In order to provide researchers with an effective and friendly resource for analyzing RNA-Seq data, we present here RAP (RNA-Seq Analysis Pipeline), a cloud computing web application implementing a complete but modular analysis workflow. This pipeline integrates both state-of-the-art bioinformatics tools for RNA-Seq analysis and in-house developed scripts to offer to the user a comprehensive strategy for data analysis. RAP is able to perform quality checks (adopting FastQC and NGS QC Toolkit), identify and quantify expressed genes and transcripts (with Tophat, Cufflinks and HTSeq), detect alternative splicing events (using SpliceTrap) and chimeric transcripts (with ChimeraScan). This pipeline is also able to identify splicing junctions and constitutive or alternative polyadenylation sites (implementing custom analysis modules) and call for statistically significant differences in genes and transcripts expression, splicing pattern and polyadenylation site usage (using Cuffdiff2 and DESeq). Through a user friendly web interface, the RAP workflow can be suitably customized by the user and it is automatically executed on our cloud computing environment. This strategy allows to access to bioinformatics tools and computational resources without specific bioinformatics and IT skills. RAP provides a set of tabular and graphical results that can be helpful to browse, filter and export analyzed data, according to the user needs.
mRNA trans-splicing in gene therapy for genetic diseases.
Berger, Adeline; Maire, Séverine; Gaillard, Marie-Claude; Sahel, José-Alain; Hantraye, Philippe; Bemelmans, Alexis-Pierre
2016-07-01
Spliceosome-mediated RNA trans-splicing, or SMaRT, is a promising strategy to design innovative gene therapy solutions for currently intractable genetic diseases. SMaRT relies on the correction of mutations at the post-transcriptional level by modifying the mRNA sequence. To achieve this, an exogenous RNA is introduced into the target cell, usually by means of gene transfer, to induce a splice event in trans between the exogenous RNA and the target endogenous pre-mRNA. This produces a chimeric mRNA composed partly of exons of the latter, and partly of exons of the former, encoding a sequence free of mutations. The principal challenge of SMaRT technology is to achieve a reaction as complete as possible, i.e., resulting in 100% repairing of the endogenous mRNA target. The proof of concept of SMaRT feasibility has already been established in several models of genetic diseases caused by recessive mutations. In such cases, in fact, the repair of only a portion of the mutant mRNA pool may be sufficient to obtain a significant therapeutic effect. However in the case of dominant mutations, the target cell must be freed from the majority of mutant mRNA copies, requiring a highly efficient trans-splicing reaction. This likely explains why only a few examples of SMaRT approaches targeting dominant mutations are reported in the literature. In this review, we explain in details the mechanism of trans-splicing, review the different strategies that are under evaluation to lead to efficient trans-splicing, and discuss the advantages and limitations of SMaRT. WIREs RNA 2016, 7:487-498. doi: 10.1002/wrna.1347 For further resources related to this article, please visit the WIREs website. © 2016 The Authors. WIREs RNA published by Wiley Periodicals, Inc.
Co-evolution of SNF spliceosomal proteins with their RNA targets in trans-splicing nematodes.
Strange, Rex Meade; Russelburg, L Peyton; Delaney, Kimberly J
2016-08-01
Although the mechanism of pre-mRNA splicing has been well characterized, the evolution of spliceosomal proteins is poorly understood. The U1A/U2B″/SNF family (hereafter referred to as the SNF family) of RNA binding spliceosomal proteins participates in both the U1 and U2 small interacting nuclear ribonucleoproteins (snRNPs). The highly constrained nature of this system has inhibited an analysis of co-evolutionary trends between the proteins and their RNA binding targets. Here we report accelerated sequence evolution in the SNF protein family in Phylum Nematoda, which has allowed an analysis of protein:RNA co-evolution. In a comparison of SNF genes from ecdysozoan species, we found a correlation between trans-splicing species (nematodes) and increased phylogenetic branch lengths of the SNF protein family, with respect to their sister clade Arthropoda. In particular, we found that nematodes (~70-80 % of pre-mRNAs are trans-spliced) have experienced higher rates of SNF sequence evolution than arthropods (predominantly cis-spliced) at both the nucleotide and amino acid levels. Interestingly, this increased evolutionary rate correlates with the reliance on trans-splicing by nematodes, which would alter the role of the SNF family of spliceosomal proteins. We mapped amino acid substitutions to functionally important regions of the SNF protein, specifically to sites that are predicted to disrupt protein:RNA and protein:protein interactions. Finally, we investigated SNF's RNA targets: the U1 and U2 snRNAs. Both are more divergent in nematodes than arthropods, suggesting the RNAs have co-evolved with SNF in order to maintain the necessarily high affinity interaction that has been characterized in other species.
Tan, Wei; Dean, Michael; Law, Amanda J.
2010-01-01
ErbB4 is a growth factor receptor tyrosine kinase essential for neurodevelopment. Genetic variation in ErbB4 is associated with schizophrenia and risk-associated polymorphisms predict overexpression of ErbB4 CYT-1 isoforms in the brain in the disorder. The molecular mechanism of association is unclear because the polymorphisms flank exon 3 of the gene and reside 700 kb distal to the CYT-1 defining exon. We hypothesized that the polymorphisms are indirectly associated with ErbB4 CYT-1 via splicing of exon 3 on the CYT-1 background. We report via cloning and sequencing of adult and fetal human brain cDNA libraries the identification of novel splice isoforms of ErbB4, whereby exon 3 is skipped (del.3). ErbB4 del.3 transcripts exist as CYT-2 isoforms and are predicted to produce truncated proteins. Furthermore, our data refine the structure of the human ErbB4 gene, clarify that juxtamembrane (JM) splice variants of ErbB4, JM-a and JM-b respectively, are characterized by the replacement of a 75 nucleotide (nt) sequence with a 45-nt insertion, and demonstrate that there are four alternative exons in the gene. Our analyses reveal that novel splice variants of ErbB4 exist in the developing and adult human brain and, given the failure to identify ErbB4 del.3 CYT-1 transcripts, suggest that the association of risk polymorphisms in the ErbB4 gene with CYT-1 transcript levels is not mediated via an exon 3 splicing event. PMID:20886074
Verbeeren, Jens; Verma, Bhupendra
2017-01-01
Cellular homeostasis of the minor spliceosome is regulated by a negative feed-back loop that targets U11-48K and U11/U12-65K mRNAs encoding essential components of the U12-type intron-specific U11/U12 di-snRNP. This involves interaction of the U11 snRNP with an evolutionarily conserved splicing enhancer giving rise to unproductive mRNA isoforms. In the case of U11/U12-65K, this mechanism controls the length of the 3′ untranslated region (3′UTR). We show that this process is dynamically regulated in developing neurons and some other cell types, and involves a binary switch between translation-competent mRNAs with a short 3′UTR to non-productive isoforms with a long 3′UTR that are retained in the nucleus or/and spliced to the downstream amylase locus. Importantly, the choice between these alternatives is determined by alternative terminal exon definition events regulated by conserved U12- and U2-type 5′ splice sites as well as sequence signals used for pre-mRNA cleavage and polyadenylation. We additionally show that U11 snRNP binding to the U11/U12-65K mRNA species with a long 3′UTR is required for their nuclear retention. Together, our studies uncover an intricate molecular circuitry regulating the abundance of a key spliceosomal protein and shed new light on the mechanisms limiting the export of non-productively spliced mRNAs from the nucleus to the cytoplasm. PMID:28549066
Antisense Oligonucleotides for the Treatment of Spinal Muscular Atrophy
Porensky, Paul N.
2013-01-01
Abstract Spinal muscular atrophy (SMA) is an autosomal recessive disease affecting ∼1 in 10,000 live births. The most striking component is the loss of α-motor neurons in the ventral horn of the spinal cord, resulting in progressive paralysis and eventually premature death. There is no current treatment paradigm other than supportive care, though the past 15 years has seen a striking advancement in understanding of both SMA genetics and molecular mechanisms. A variety of disease-modifying interventions are rapidly bridging the translational gap from the laboratory to clinical trials, including the application of antisense oligonucleotide (ASO) therapy for the correction of aberrant RNA splicing characteristic of SMA. Survival motor neuron (SMN) is a ubiquitously expressed 38-kD protein. Humans have two genes that produce SMN, SMN1 and SMN2, the former of which is deleted or nonfunctional in the majority of patients with SMA. These two genes are nearly identical with one exception, a C to T transition (C6T) within exon 7 of SMN2. C6T disrupts a modulator of splicing, leading to the exclusion of exon 7 from ∼90% of the mRNA transcript. The resultant truncated Δ7SMN protein does not oligomerize efficiently and is rapidly degraded. SMA can therefore be considered a disease of too little SMN protein. A number of cis-acting splice modifiers have been identified in the region of exon 7, the steric block of which enhances the retention of the exon and a resultant full-length mRNA sequence. ASOs targeted to these splice motifs have shown impressive phenotype rescue in multiple SMA mouse models. PMID:23544870
Mechanism for DNA transposons to generate introns on genomic scales
Huff, Jason T.; Zilberman, Daniel; Roy, Scott W.
2017-01-01
Discovered four decades ago, the existence of introns was one of the most unexpected findings in molecular biology1. Introns are sequences interrupting genes that must be removed as part of mRNA production. Genome sequencing projects have documented that most eukaryotic genes contain at least one and frequently many introns2,3. Comparison of these genomes reveals a history of long evolutionary periods with little intron gain punctuated by episodes of rapid, extensive gain2,3. However, no detailed mechanism for such episodic intron generation has been empirically supported on a sufficient scale, despite several proposals4–8. Here we show how short non-autonomous DNA transposons independently generated hundreds to thousands of introns in the prasinophyte Micromonas pusilla and the pelagophyte Aureococcus anophagefferens. Each transposon carries one splice site. The other splice site is co-opted from gene sequence duplicated upon transposon insertion, allowing perfect splicing out of RNA. The distributions of sequences that can be co-opted are biased with respect to codons, and phasing of transposon-generated introns is similarly biased. These transposons insert between preexisting nucleosomes, so that multiple nearby insertions generate nucleosome-sized intervening segments. Thus, transposon insertion and sequence co-option may explain the intron phase biases2 and prevalence of nucleosome-sized exons9 observed in eukaryotes. Overall, the two independent examples of proliferating elements illustrate a general DNA transposon mechanism plausibly accounting for episodes of rapid, extensive intron gain during eukaryotic evolution2,3. PMID:27760113
Ryu, Nari; Lee, Seokwon; Park, Hong-Joon; Lee, Byeonghyeon; Kwon, Tae-Jun; Bok, Jinwoong; Park, Chan Ik; Lee, Kyu-Yup; Baek, Jeong-In; Kim, Un-Kyung
2017-09-05
Hereditary hearing loss (HHL) is a common genetically heterogeneous disorder, which follows Mendelian inheritance in humans. Because of this heterogeneity, the identification of the causative gene of HHL by linkage analysis or Sanger sequencing have shown economic and temporal limitations. With recent advances in next-generation sequencing (NGS) techniques, rapid identification of a causative gene via massively parallel sequencing is now possible. We recruited a Korean family with three generations exhibiting autosomal dominant inheritance of hearing loss (HL), and the clinical information about this family revealed that there are no other symptoms accompanied with HL. To identify a causative mutation of HL in this family, we performed whole-exome sequencing of 4 family members, 3 affected and an unaffected. As the result, A novel splicing mutation, c.763+1G>T, in the solute carrier family 17, member 8 (SLC17A8) gene was identified in the patients, and the genotypes of the mutation were co-segregated with the phenotype of HL. Additionally, this mutation was not detected in 100 Koreans with normal hearing. Via NGS, we detected a novel splicing mutation that might influence the hearing ability within the patients with autosomal dominant non-syndromic HL. Our data suggests that this technique is a powerful tool to discover causative genetic factors of HL and facilitate diagnoses of the primary cause of HHL. Copyright © 2017 Elsevier B.V. All rights reserved.
Mammalian splicing factor SF1 interacts with SURP domains of U2 snRNP-associated proteins
Crisci, Angela; Raleff, Flore; Bagdiul, Ivona; Raabe, Monika; Urlaub, Henning; Rain, Jean-Christophe; Krämer, Angela
2015-01-01
Splicing factor 1 (SF1) recognizes the branch point sequence (BPS) at the 3′ splice site during the formation of early complex E, thereby pre-bulging the BPS adenosine, thought to facilitate subsequent base-pairing of the U2 snRNA with the BPS. The 65-kDa subunit of U2 snRNP auxiliary factor (U2AF65) interacts with SF1 and was shown to recruit the U2 snRNP to the spliceosome. Co-immunoprecipitation experiments of SF1-interacting proteins from HeLa cell extracts shown here are consistent with the presence of SF1 in early splicing complexes. Surprisingly almost all U2 snRNP proteins were found associated with SF1. Yeast two-hybrid screens identified two SURP domain-containing U2 snRNP proteins as partners of SF1. A short, evolutionarily conserved region of SF1 interacts with the SURP domains, stressing their role in protein–protein interactions. A reduction of A complex formation in SF1-depleted extracts could be rescued with recombinant SF1 containing the SURP-interaction domain, but only partial rescue was observed with SF1 lacking this sequence. Thus, SF1 can initially recruit the U2 snRNP to the spliceosome during E complex formation, whereas U2AF65 may stabilize the association of the U2 snRNP with the spliceosome at later times. In addition, these findings may have implications for alternative splicing decisions. PMID:26420826
NASA Astrophysics Data System (ADS)
Shih, Shin-Ru; Nemeroff, Martin E.; Krug, Robert M.
1995-07-01
The influenza virus M1 mRNA has two alternative 5' splice sites: a distal 5' splice site producing mRNA_3 that has the coding potential for 9 amino acids and a proximal 5' splice site producing M2 mRNA encoding the essential M2 ion-channel protein. Only mRNA_3 was made in uninfected cells transfected with DNA expressing M1 mRNA. Similarly, using nuclear extracts from uninfected cells, in vitro splicing of M1 mRNA yielded only mRNA_3. Only when the mRNA_3 5' splice site was inactivated by mutation was M2 mRNA made in uninfected cells and in uninfected cell extracts. In influenza virus-infected cells, M2 mRNA was made, but only after a delay, suggesting that newly synthesized viral gene product(s) were needed to activate the M2 5' splice site. We present strong evidence that these gene products are the complex of the three polymerase proteins, the same complex that functions in the transcription and replication of the viral genome. Gel shift experiments showed that the viral polymerase complex bound to the 5' end of the viral M1 mRNA in a sequence-specific and cap-dependent manner. During in vitro splicing catalyzed by uninfected cell extracts, the binding of the viral polymerase complex blocked the mRNA_3 5' splice site, resulting in the switch to the M2 mRNA 5' splice site and the production of M2 mRNA.
Targeted Single-Shot Methods for Diffusion-Weighted Imaging in the Kidneys
Jin, Ning; Deng, Jie; Zhang, Longjiang; Zhang, Zhuoli; Lu, Guangming; Omary, Reed A.; Larson, Andrew C.
2011-01-01
Purpose To investigate the feasibility of combining the inner-volume-imaging (IVI) technique with single-shot diffusion-weighted (DW) spin-echo echo-planar imaging (SE-EPI) and DW-SPLICE (split acquisition of fast spin-echo) sequences for renal DW imaging. Materials and Methods Renal DW imaging was performed in 10 healthy volunteers using single-shot DW-SE-EPI, DW-SPLICE, targeted-DW-SE-EPI and targeted-DW-SPLICE. We compared the quantitative diffusion measurement accuracy and image quality of these targeted-DW-SE-EPI and targeted DW-SPLICE methods with conventional full FOV DW-SE-EPI and DW-SPLICE measurements in phantoms and normal volunteers. Results Compared with full FOV DW-SE-EPI and DW-SPLICE methods, targeted-DW-SE-EPI and targeted-DW-SPLICE approaches produced images of superior overall quality with fewer artifacts, less distortion and reduced spatial blurring in both phantom and volunteer studies. The ADC values measured with each of the four methods were similar and in agreement with previously published data. There were no statistically significant differences between the ADC values and intra-voxel incoherent motion (IVIM) measurements in the kidney cortex and medulla using single-shot DW-SE-EPI, targeted-DW-EPI and targeted-DW-SPLICE (p > 0.05). Conclusion Compared with full-FOV DW imaging methods, targeted-DW-SE-EPI and targeted-DW-SPLICE techniques reduced image distortion and artifacts observed in the single-shot DW-SE-EPI images, reduced blurring in DW-SPLICE images and produced comparable quantitative DW and IVIM measurements to those produced with conventional full-FOV approaches. PMID:21591023
Li, An-Ming; Yu, Bing-Yun; Chen, Fu-Hua; Gan, Hui-Yan; Yuan, Jian-Gang; Qiu, Rongliang; Huang, Jun-Chao; Yang, Zhong-Yi; Xu, Zeng-Fu
2009-01-01
Phytochelatins (PCs) play an important role in detoxification of heavy metals in plants. PCs are synthesized from glutathione by phytochelatin synthase (PCS), a dipeptidyltransferase. Sesbania rostrata is a tropical legume plant that can tolerate high concentrations of Cd and Zn. In this study, the S. rostrata PCS gene (SrPCS) and cDNAs were isolated and characterized. Southern blot and sequence analysis revealed that a single copy of the SrPCS gene occurs in the S. rostrata genome, and produces four different SrPCS mRNAs and proteins, SrPCS1–SrPCS4, by alternative splicing of the SrPCS pre-mRNA. The SrPCS1 and SrPCS3 proteins conferred Cd tolerance when expressed in yeast cells, whereas the SrPCS2 and SrPCS4 proteins, which lack the catalytic triad and the N-terminal domains, did not. These results suggested that SrPCS1 and SrPCS3 have potential applications in genetic engineering of plants for enhancing heavy metal tolerance and phytoremediation of contaminated soils. PMID:20111680
Li, An-Ming; Yu, Bing-Yun; Chen, Fu-Hua; Gan, Hui-Yan; Yuan, Jian-Gang; Qiu, Rongliang; Huang, Jun-Chao; Yang, Zhong-Yi; Xu, Zeng-Fu
2009-07-24
Phytochelatins (PCs) play an important role in detoxification of heavy metals in plants. PCs are synthesized from glutathione by phytochelatin synthase (PCS), a dipeptidyltransferase. Sesbania rostrata is a tropical legume plant that can tolerate high concentrations of Cd and Zn. In this study, the S. rostrata PCS gene (SrPCS) and cDNAs were isolated and characterized. Southern blot and sequence analysis revealed that a single copy of the SrPCS gene occurs in the S. rostrata genome, and produces four different SrPCS mRNAs and proteins, SrPCS1-SrPCS4, by alternative splicing of the SrPCS pre-mRNA. The SrPCS1 and SrPCS3 proteins conferred Cd tolerance when expressed in yeast cells, whereas the SrPCS2 and SrPCS4 proteins, which lack the catalytic triad and the N-terminal domains, did not. These results suggested that SrPCS1 and SrPCS3 have potential applications in genetic engineering of plants for enhancing heavy metal tolerance and phytoremediation of contaminated soils.
Alternative Splicing of STAT3 Is Affected by RNA Editing.
Goldberg, Lior; Abutbul-Amitai, Mor; Paret, Gideon; Nevo-Caspi, Yael
2017-05-01
A-to-I RNA editing, carried out by adenosine deaminase acting on RNA (ADAR) enzymes, is an epigenetic phenomenon of posttranscriptional modifications on pre-mRNA. RNA editing in intronic sequences may influence alternative splicing of flanking exons. We have previously shown that conditions that induce editing result in elevated expression of signal transducer and activator of transcription 3 (STAT3), preferentially the alternatively-spliced STAT3β isoform. Mechanisms regulating alternative splicing of STAT3 have not been elucidated. STAT3 undergoes A-to-I RNA editing in an intron residing in proximity to the alternatively spliced exon. We hypothesized that RNA editing plays a role in regulating alternative splicing toward STAT3β. In this study we extend our observation connecting RNA editing to the preferential induction of STAT3β expression. We study the involvement of ADAR1 in STAT3 editing and reveal the connection between editing and alternative splicing of STAT3. Deferoaxamine treatment caused the induction in STAT3 RNA editing and STAT3β expression. Silencing ADAR1 caused a decrease in STAT3 editing and expression with a preferential decrease in STAT3β. Cells transfected with a mutated minigene showed preferential splicing toward the STAT3β transcript. Editing in the STAT3 intron is performed by ADAR1 and affects STAT3 alternative splicing. These results suggest that RNA editing is one of the molecular mechanisms regulating the expression of STAT3β.
Ajiro, Masahiko; Tang, Shuang; Doorbar, John; Zheng, Zhi-Ming
2016-10-15
Human papillomavirus 18 (HPV18) is the second most common oncogenic HPV type associated with cervical, anogenital, and oropharyngeal cancers. Like other oncogenic HPVs, HPV18 encodes two major (one early and one late) polycistronic pre-mRNAs that are regulated by alternative RNA splicing to produce a repertoire of viral transcripts for the expression of individual viral genes. However, RNA cis-regulatory elements and trans-acting factors contributing to HPV18 alternative RNA splicing remain unknown. In this study, an exonic splicing enhancer (ESE) in the nucleotide (nt) 3520 to 3550 region in the HPV18 genome was identified and characterized for promotion of HPV18 929^3434 splicing and E1^E4 production through interaction with SRSF3, a host oncogenic splicing factor differentially expressed in epithelial cells and keratinocytes. Introduction of point mutations in the SRSF3-binding site or knockdown of SRSF3 expression in cells reduces 929^3434 splicing and E1^E4 production but activates other, minor 929^3465 and 929^3506 splicing. Knockdown of SRSF3 expression also enhances the expression of E2 and L1 mRNAs. An exonic splicing silencer (ESS) in the HPV18 nt 612 to 639 region was identified as being inhibitory to the 233^416 splicing of HPV18 E6E7 pre-mRNAs via binding to hnRNP A1, a well-characterized, abundantly and ubiquitously expressed RNA-binding protein. Introduction of point mutations into the hnRNP A1-binding site or knockdown of hnRNP A1 expression promoted 233^416 splicing and reduced E6 expression. These data provide the first evidence that the alternative RNA splicing of HPV18 pre-mRNAs is subject to regulation by viral RNA cis elements and host trans-acting splicing factors. Expression of HPV18 genes is regulated by alternative RNA splicing of viral polycistronic pre-mRNAs to produce a repertoire of viral early and late transcripts. RNA cis elements and trans-acting factors contributing to HPV18 alternative RNA splicing have been discovered in this study for the first time. The identified ESS at the E7 open reading frame (ORF) prevents HPV18 233^416 splicing in the E6 ORF through interaction with a host splicing factor, hnRNP A1, and regulates E6 and E7 expression of the early E6E7 polycistronic pre-mRNA. The identified ESE at the E1^E4 ORF promotes HPV18 929^3434 splicing of both viral early and late pre-mRNAs and E1^E4 production through interaction with SRSF3. This study provides important observations on how alternative RNA splicing of HPV18 pre-mRNAs is subject to regulation by viral RNA cis elements and host splicing factors and offers potential therapeutic targets to overcome HPV-related cancer. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Ajiro, Masahiko; Tang, Shuang; Doorbar, John
2016-01-01
ABSTRACT Human papillomavirus 18 (HPV18) is the second most common oncogenic HPV type associated with cervical, anogenital, and oropharyngeal cancers. Like other oncogenic HPVs, HPV18 encodes two major (one early and one late) polycistronic pre-mRNAs that are regulated by alternative RNA splicing to produce a repertoire of viral transcripts for the expression of individual viral genes. However, RNA cis-regulatory elements and trans-acting factors contributing to HPV18 alternative RNA splicing remain unknown. In this study, an exonic splicing enhancer (ESE) in the nucleotide (nt) 3520 to 3550 region in the HPV18 genome was identified and characterized for promotion of HPV18 929^3434 splicing and E1^E4 production through interaction with SRSF3, a host oncogenic splicing factor differentially expressed in epithelial cells and keratinocytes. Introduction of point mutations in the SRSF3-binding site or knockdown of SRSF3 expression in cells reduces 929^3434 splicing and E1^E4 production but activates other, minor 929^3465 and 929^3506 splicing. Knockdown of SRSF3 expression also enhances the expression of E2 and L1 mRNAs. An exonic splicing silencer (ESS) in the HPV18 nt 612 to 639 region was identified as being inhibitory to the 233^416 splicing of HPV18 E6E7 pre-mRNAs via binding to hnRNP A1, a well-characterized, abundantly and ubiquitously expressed RNA-binding protein. Introduction of point mutations into the hnRNP A1-binding site or knockdown of hnRNP A1 expression promoted 233^416 splicing and reduced E6 expression. These data provide the first evidence that the alternative RNA splicing of HPV18 pre-mRNAs is subject to regulation by viral RNA cis elements and host trans-acting splicing factors. IMPORTANCE Expression of HPV18 genes is regulated by alternative RNA splicing of viral polycistronic pre-mRNAs to produce a repertoire of viral early and late transcripts. RNA cis elements and trans-acting factors contributing to HPV18 alternative RNA splicing have been discovered in this study for the first time. The identified ESS at the E7 open reading frame (ORF) prevents HPV18 233^416 splicing in the E6 ORF through interaction with a host splicing factor, hnRNP A1, and regulates E6 and E7 expression of the early E6E7 polycistronic pre-mRNA. The identified ESE at the E1^E4 ORF promotes HPV18 929^3434 splicing of both viral early and late pre-mRNAs and E1^E4 production through interaction with SRSF3. This study provides important observations on how alternative RNA splicing of HPV18 pre-mRNAs is subject to regulation by viral RNA cis elements and host splicing factors and offers potential therapeutic targets to overcome HPV-related cancer. PMID:27489271
Circular RNAs: Unexpected outputs of many protein-coding genes
Wilusz, Jeremy E.
2017-01-01
ABSTRACT Pre-mRNAs from thousands of eukaryotic genes can be non-canonically spliced to generate circular RNAs, some of which accumulate to higher levels than their associated linear mRNA. Recent work has revealed widespread mechanisms that dictate whether the spliceosome generates a linear or circular RNA. For most genes, circular RNA biogenesis via backsplicing is far less efficient than canonical splicing, but circular RNAs can accumulate due to their long half-lives. Backsplicing is often initiated when complementary sequences from different introns base pair and bring the intervening splice sites close together. This process is further regulated by the combinatorial action of RNA binding proteins, which allow circular RNAs to be expressed in unique patterns. Some genes do not require complementary sequences to generate RNA circles and instead take advantage of exon skipping events. It is still unclear what most mature circular RNAs do, but future investigations into their functions will be facilitated by recently described methods to modulate circular RNA levels. PMID:27571848
Wang, Binbin; Li, Lin; Zhu, Ying; Zhang, Wei; Wang, Xi; Chen, Beili; Li, Tengyan; Pan, Hong; Wang, Jing; Kee, Kehkooi; Cao, Yunxia
2017-10-01
Does a novel heterozygous KHDRBS1 variant, identified using whole-exome sequencing (WES) in two patients with primary ovarian insufficiency (POI) in a pedigree, cause defects in mRNA alternative splicing? The heterozygous variant of KHDRBS1 was confirmed to cause defects in alternative splicing of many genes involved in DNA replication and repair. Studies in mice revealed that Khdrbs1 deficient females are subfertile, which manifests as delayed sexual maturity and significantly reduced numbers of secondary and pre-antral follicles. No mutation of KHDRBS1, however, has been reported in patients with POI. This genetic and functional study used WES to find putative mutations in a POI pedigree. Altogether, 215 idiopathic POI patients and 400 healthy controls were screened for KHDRBS1 mutations. Two POI patients were subjected to WES to identify sequence variants. Mutational analysis of the KHDRBS1 gene in 215 idiopathic POI patients and 400 healthy controls were performed. RNA-sequencing was carried out to find the mis-regulation of gene expression due to KHDRBS1 mutation. Bioinformatics was used to analyze the change in alternative splicing events. We identified a heterozygous mutation (c.460A > G, p.M154V) in KHDRBS1 in two patients. Further mutational analysis of 215 idiopathic POI patients with the KHDRBS1 gene found one heterozygous mutation (c.263C > T, p.P88L). We failed to find these two mutations in 400 healthy control women. Using RNA-sequencing, we found that the KGN cells expressing the M154V KHDRBS1 mutant had different expression of 66 genes compared with wild-type (WT) cells. Furthermore, 145 genes were alternatively spliced in M154V cells, and these genes were enriched for DNA replication and repair function, revealing a potential underlying mechanism of the pathology that leads to POI. Although the in vitro assays demonstrated the effect of the KHDRBS1 variant on alternative splicing, further studies are needed to validate the in vivo effects on germ cell and follicle development. This finding provides researchers and clinicians a better understanding of the etiology and molecular mechanism of POI. This study was supported by the Ministry of Science and Technology of China (2012CB944704; 2012CB966702), National Research Institute for Family Planning (2017GJZ05), the National Natural Science Foundation of China (31171429) and Beijing Advanced Innovation Center for Structural Biology. The authors declare no conflict of interest. © The Author 2017. Published by Oxford University Press on behalf of the European Society of Human Reproduction and Embryology. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
Ebstein, F.; Textoris-Taube, K.; Keller, C.; Golnik, R.; Vigneron, N.; Van den Eynde, B. J.; Schuler-Thurner, B.; Schadendorf, D.; Lorenz, F. K. M.; Uckert, W.; Urban, S.; Lehmann, A.; Albrecht-Koepke, N.; Janek, K.; Henklein, P.; Niewienda, A.; Kloetzel, P. M.; Mishto, M.
2016-01-01
Proteasome-catalyzed peptide splicing represents an additional catalytic activity of proteasomes contributing to the pool of MHC-class I-presented epitopes. We here biochemically and functionally characterized a new melanoma gp100 derived spliced epitope. We demonstrate that the gp100mel47–52/40–42 antigenic peptide is generated in vitro and in cellulo by a not yet described proteasomal condensation reaction. gp100mel47–52/40–42 generation is enhanced in the presence of the β5i/LMP7 proteasome-subunit and elicits a peptide-specific CD8+ T cell response. Importantly, we demonstrate that different gp100mel-derived spliced epitopes are generated and presented to CD8+ T cells with efficacies comparable to non-spliced canonical tumor epitopes and that gp100mel-derived spliced epitopes trigger activation of CD8+ T cells found in peripheral blood of half of the melanoma patients tested. Our data suggest that both transpeptidation and condensation reactions contribute to the frequent generation of spliced epitopes also in vivo and that their immune relevance may be comparable to non-spliced epitopes. PMID:27049119
Lee, Younghee; Han, Seonggyun; Kim, Dongwook; Kim, Dokyoon; Horgousluoglu, Emrin; Risacher, Shannon L; Saykin, Andrew J; Nho, Kwangsik
2018-01-01
Genetic variation in cis-regulatory elements related to splicing machinery and splicing regulatory elements (SREs) results in exon skipping and undesired protein products. We developed a splicing decision model to identify actionable loci among common SNPs for gene regulation. The splicing decision model identified SNPs affecting exon skipping by analyzing sequence-driven alternative splicing (AS) models and by scanning the genome for the regions with putative SRE motifs. We used non-Hispanic Caucasians with neuroimaging, and fluid biomarkers for Alzheimer's disease (AD) and identified 17,088 common exonic SNPs affecting exon skipping. GWAS identified one SNP (rs1140317) in HLA-DQB1 as significantly associated with entorhinal cortical thickness, AD neuroimaging biomarker, after controlling for multiple testing. Further analysis revealed that rs1140317 was significantly associated with brain amyloid-f deposition (PET and CSF). HLA-DQB1 is an essential immune gene and may regulate AS, thereby contributing to AD pathology. SRE may hold potential as novel therapeutic targets for AD.
Di Giacomo, Daniela; Gaildrat, Pascaline; Abuli, Anna; Abdat, Julie; Frébourg, Thierry; Tosi, Mario; Martins, Alexandra
2013-11-01
Exonic variants can alter pre-mRNA splicing either by changing splice sites or by modifying splicing regulatory elements. Often these effects are difficult to predict and are only detected by performing RNA analyses. Here, we analyzed, in a minigene assay, 26 variants identified in the exon 7 of BRCA2, a cancer predisposition gene. Our results revealed eight new exon skipping mutations in this exon: one directly altering the 5' splice site and seven affecting potential regulatory elements. This brings the number of splicing regulatory mutations detected in BRCA2 exon 7 to a total of 11, a remarkably high number considering the total number of variants reported in this exon (n = 36), all tested in our minigene assay. We then exploited this large set of splicing data to test the predictive value of splicing regulator hexamers' scores recently established by Ke et al. (). Comparisons of hexamer-based predictions with our experimental data revealed high sensitivity in detecting variants that increased exon skipping, an important feature for prescreening variants before RNA analysis. In conclusion, hexamer scores represent a promising tool for predicting the biological consequences of exonic variants and may have important applications for the interpretation of variants detected by high-throughput sequencing. © 2013 WILEY PERIODICALS, INC.
ZHU, MING; CHEN, HUI-MEI; WANG, YA-PING
2013-01-01
The MLH1 and MSH2 genes in DNA mismatch repair are important in the pathogenesis of gastrointestinal cancer. Recent studies of normal and alternative splicing suggest that the deleterious effects of missense mutations may in fact be splicing-related when they are located in exonic splicing enhancers (ESEs) or exonic splicing silencers (ESSs). In this study, we used ESE-finder and FAS-ESS software to analyze the potential ESE/ESS motifs of the 114 missense mutations detected in the two genes in East Asian gastrointestinal cancer patients. In addition, we used the SIFT tool to functionally analyze these mutations. The amount of the ESE losses (68) was 51.1% higher than the ESE gains (45) of all the mutations. However, the amount of the ESS gains (27) was 107.7% higher than the ESS losses (13). In total, 56 (49.1%) mutations possessed a potential exonic splicing regulator (ESR) error. Eighty-one mutations (71.1%) were predicted to be deleterious with a lower tolerance index as detected by the Sorting Intolerant from Tolerant (SIFT) tool. Among these, 38 (33.3%) mutations were predicted to be functionally deleterious and possess one potential ESR error, while 18 (15.8%) mutations were predicted to be functionally deleterious and exhibit two potential ESR errors. These may be more likely to affect exon splicing. Our results indicated that there is a strong correlation between missense mutations in MLH1 and MSH2 genes detected in East Asian gastrointestinal cancer patients and ESR motifs. In order to correctly understand the molecular nature of mutations, splicing patterns should be compared between wild-type and mutant samples. PMID:23760103
Gatto, Alberto; Torroja-Fungairiño, Carlos; Mazzarotto, Francesco; Cook, Stuart A; Barton, Paul J R; Sánchez-Cabo, Fátima; Lara-Pezzi, Enrique
2014-04-01
Alternative splicing is the main mechanism governing protein diversity. The recent developments in RNA-Seq technology have enabled the study of the global impact and regulation of this biological process. However, the lack of standardized protocols constitutes a major bottleneck in the analysis of alternative splicing. This is particularly important for the identification of exon-exon junctions, which is a critical step in any analysis workflow. Here we performed a systematic benchmarking of alignment tools to dissect the impact of design and method on the mapping, detection and quantification of splice junctions from multi-exon reads. Accordingly, we devised a novel pipeline based on TopHat2 combined with a splice junction detection algorithm, which we have named FineSplice. FineSplice allows effective elimination of spurious junction hits arising from artefactual alignments, achieving up to 99% precision in both real and simulated data sets and yielding superior F1 scores under most tested conditions. The proposed strategy conjugates an efficient mapping solution with a semi-supervised anomaly detection scheme to filter out false positives and allows reliable estimation of expressed junctions from the alignment output. Ultimately this provides more accurate information to identify meaningful splicing patterns. FineSplice is freely available at https://sourceforge.net/p/finesplice/.
Splicing and local reinforcement of concrete filled FRP tubes.
DOT National Transportation Integrated Search
2014-01-01
This report includes fulfillment of Task 1 of a multi-task contract to further enhance concrete filled FRP tubes, or : the Bridge in a Backpack. Task 1 investigates and develops a feasible solution for splicing the concrete filled FRP : tubes. This w...
Pombert, Jean-François; Otis, Christian; Turmel, Monique; Lemieux, Claude
2013-01-01
Organelle genes are often interrupted by group I and or group II introns. Splicing of these mobile genetic occurs at the RNA level via serial transesterification steps catalyzed by the introns'own tertiary structures and, sometimes, with the help of external factors. These catalytic ribozymes can be found in cis or trans configuration, and although trans-arrayed group II introns have been known for decades, trans-spliced group I introns have been reported only recently. In the course of sequencing the complete mitochondrial genome of the prasinophyte picoplanktonic green alga Prasinoderma coloniale CCMP 1220 (Prasinococcales, clade VI), we uncovered two additional cases of trans-spliced group I introns. Here, we describe these introns and compare the 54,546 bp-long mitochondrial genome of Prasinoderma with those of four other prasinophytes (clades II, III and V). This comparison underscores the highly variable mitochondrial genome architecture in these ancient chlorophyte lineages. Both Prasinoderma trans-spliced introns reside within the large subunit rRNA gene (rnl) at positions where cis-spliced relatives, often containing homing endonuclease genes, have been found in other organelles. In contrast, all previously reported trans-spliced group I introns occur in different mitochondrial genes (rns or coxI). Each Prasinoderma intron is fragmented into two pieces, forming at the RNA level a secondary structure that resembles those of its cis-spliced counterparts. As observed for other trans-spliced group I introns, the breakpoint of the first intron maps to the variable loop L8, whereas that of the second is uniquely located downstream of P9.1. The breakpoint In each Prasinoderma intron corresponds to the same region where the open reading frame (ORF) occurs when present in cis-spliced orthologs. This correlation between the intron breakpoint and the ORF location in cis-spliced orthologs also holds for other trans-spliced introns; we discuss the possible implications of this interesting observation for trans-splicing of group I introns. PMID:24386369
Ezquerra-Inchausti, Maitane; Barandika, Olatz; Anasagasti, Ander; Irigoyen, Cristina; López de Munain, Adolfo; Ruiz-Ederra, Javier
2017-01-01
Retinitis pigmentosa is the most frequent group of inherited retinal dystrophies. It is highly heterogeneous, with more than 80 disease-causing genes 27 of which are known to cause autosomal dominant RP (adRP), having been identified. In this study a total of 29 index cases were ascertained based on a family tree compatible with adRP. A custom panel of 31 adRP genes was analysed by targeted next-generation sequencing using the Ion PGM platform in combination with Sanger sequencing. This allowed us to detect putative disease-causing mutations in 14 out of the 29 (48.28%) families analysed. Remarkably, around 38% of all adRP cases analysed showed mutations affecting the splicing process, mainly due to mutations in genes coding for spliceosome factors (SNRNP200 and PRPF8) but also due to splice-site mutations in RHO. Twelve of the 14 mutations found had been reported previously and two were novel mutations found in PRPF8 in two unrelated patients. In conclusion, our results will lead to more accurate genetic counselling and will contribute to a better characterisation of the disease. In addition, they may have a therapeutic impact in the future given the large number of studies currently underway based on targeted RNA splicing for therapeutic purposes. PMID:28045043
G-quadruplex structure at intron 2 of TFE3 and its role in Xp11.2 translocation and splicing.
Verma, Shiv Prakash; Das, Parimal
2018-03-01
Transcription Factor E3 (TFE3) translocation is found in a group of different type of cancers and most of the translocations are located in the 5' region of TFE3 which may be considered as Breakpoint Region (BR). In our In silico study by QGRS mapper and non BdB web servers we found a Potential G-quadruplex forming Sequence (PQS) in the intron 2 of TFE3 gene. In vitro G-quadruplex formation was shown by native PAGE in presence of Pyridostatin(PDS), which with inter molecular secondary structure caused reduced mobility to migrate slower. G-quadruplex formation was mapped at single base resolution by Sanger sequencing and Circular Dichroism showed the formation of parallel G-quadruplex. FRET analysis revealed increased and decreased formation of G-quadruplex in presence of PDS and antisense oligonucleotide respectively. PCR stop assay, transcriptional and translational inhibition by PQS showed stable G-quadruplex formation affecting the biological processes. TFE3 minigene splicing study showed the involvement of this G-quadruplex in TFE3 splicing too. Therefore, G-quadruplex is evident to be the reason behind TFE3 induced oncogenesis executed by translocation and also involved in the mRNA splicing. Copyright © 2017 Elsevier B.V. All rights reserved.
Mutation-adapted U1 snRNA corrects a splicing error of the dopa decarboxylase gene.
Lee, Ni-Chung; Lee, Yu-May; Chen, Pin-Wen; Byrne, Barry J; Hwu, Wuh-Liang
2016-12-01
Aromatic l-amino acid decarboxylase (AADC) deficiency is an inborn error of monoamine neurotransmitter synthesis, which results in dopamine, serotonin, epinephrine and norepinephrine deficiencies. The DDC gene founder mutation IVS6 + 4A > T is highly prevalent in Chinese patients with AADC deficiency. In this study, we designed several U1 snRNA vectors to adapt U1 snRNA binding sequences of the mutated DDC gene. We found that only the modified U1 snRNA (IVS-AAA) that completely matched both the intronic and exonic U1 binding sequences of the mutated DDC gene could correct splicing errors of either the mutated human DDC minigene or the mouse artificial splicing construct in vitro. We further injected an adeno-associated viral (AAV) vector to express IVS-AAA in the brain of a knock-in mouse model. This treatment was well tolerated and improved both the survival and brain dopamine and serotonin levels of mice with AADC deficiency. Therefore, mutation-adapted U1 snRNA gene therapy can be a promising method to treat genetic diseases caused by splicing errors, but the efficiency of such a treatment still needs improvements. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Integrating alternative splicing detection into gene prediction.
Foissac, Sylvain; Schiex, Thomas
2005-02-10
Alternative splicing (AS) is now considered as a major actor in transcriptome/proteome diversity and it cannot be neglected in the annotation process of a new genome. Despite considerable progresses in term of accuracy in computational gene prediction, the ability to reliably predict AS variants when there is local experimental evidence of it remains an open challenge for gene finders. We have used a new integrative approach that allows to incorporate AS detection into ab initio gene prediction. This method relies on the analysis of genomically aligned transcript sequences (ESTs and/or cDNAs), and has been implemented in the dynamic programming algorithm of the graph-based gene finder EuGENE. Given a genomic sequence and a set of aligned transcripts, this new version identifies the set of transcripts carrying evidence of alternative splicing events, and provides, in addition to the classical optimal gene prediction, alternative optimal predictions (among those which are consistent with the AS events detected). This allows for multiple annotations of a single gene in a way such that each predicted variant is supported by a transcript evidence (but not necessarily with a full-length coverage). This automatic combination of experimental data analysis and ab initio gene finding offers an ideal integration of alternatively spliced gene prediction inside a single annotation pipeline.
Westholm, Jakub O.; Miura, Pedro; Olson, Sara; Shenker, Sol; Joseph, Brian; Sanfilippo, Piero; Celniker, Susan E.; Graveley, Brenton R.; Lai, Eric C.
2014-01-01
Circularization was recently recognized to broadly expand transcriptome complexity. Here, we exploit massive Drosophila total RNA-sequencing data, >5 billion paired-end reads from >100 libraries covering diverse developmental stages, tissues and cultured cells, to rigorously annotate >2500 fruitfly circular RNAs. These mostly derive from back-splicing of protein-coding genes and lack poly(A) tails, and circularization of hundreds of genes is conserved across multiple Drosophila species. We elucidate structural and sequence properties of Drosophila circular RNAs, which exhibit commonalities and distinctions from mammalian circles. Notably, Drosophila circular RNAs harbor >1000 well-conserved canonical miRNA seed matches, especially within coding regions, and coding conserved miRNA sites reside preferentially within circularized exons. Finally, we analyze the developmental and tissue specificity of circular RNAs, and note their preferred derivation from neural genes and enhanced accumulation in neural tissues. Interestingly, circular isoforms increase dramatically relative to linear isoforms during CNS aging, and constitute a novel aging biomarker. PMID:25544350
Westholm, Jakub O.; Miura, Pedro; Olson, Sara; ...
2014-11-26
Circularization was recently recognized to broadly expand transcriptome complexity. Here, we exploit massive Drosophila total RNA-sequencing data, >5 billion paired-end reads from >100 libraries covering diverse developmental stages, tissues, and cultured cells, to rigorously annotate >2,500 fruit fly circular RNAs. These mostly derive from back-splicing of protein-coding genes and lack poly(A) tails, and the circularization of hundreds of genes is conserved across multiple Drosophila species. We elucidate structural and sequence properties of Drosophila circular RNAs, which exhibit commonalities and distinctions from mammalian circles. Notably, Drosophila circular RNAs harbor >1,000 well-conserved canonical miRNA seed matches, especially within coding regions, and codingmore » conserved miRNA sites reside preferentially within circularized exons. Finally, we analyze the developmental and tissue specificity of circular RNAs and note their preferred derivation from neural genes and enhanced accumulation in neural tissues. Interestingly, circular isoforms increase substantially relative to linear isoforms during CNS aging and constitute an aging biomarker.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Westholm, Jakub O.; Miura, Pedro; Olson, Sara
Circularization was recently recognized to broadly expand transcriptome complexity. Here, we exploit massive Drosophila total RNA-sequencing data, >5 billion paired-end reads from >100 libraries covering diverse developmental stages, tissues, and cultured cells, to rigorously annotate >2,500 fruit fly circular RNAs. These mostly derive from back-splicing of protein-coding genes and lack poly(A) tails, and the circularization of hundreds of genes is conserved across multiple Drosophila species. We elucidate structural and sequence properties of Drosophila circular RNAs, which exhibit commonalities and distinctions from mammalian circles. Notably, Drosophila circular RNAs harbor >1,000 well-conserved canonical miRNA seed matches, especially within coding regions, and codingmore » conserved miRNA sites reside preferentially within circularized exons. Finally, we analyze the developmental and tissue specificity of circular RNAs and note their preferred derivation from neural genes and enhanced accumulation in neural tissues. Interestingly, circular isoforms increase substantially relative to linear isoforms during CNS aging and constitute an aging biomarker.« less
Jurczyk, Barbara; Pociecha, Ewa; Grzesiak, Maciej; Kalita, Katarzyna; Rapacz, Marcin
2016-07-01
Alternative splicing of the Rubisco activase gene was shown to be a point for optimization of photosynthetic carbon assimilation. It can be expected to be a stress-regulated event that depends on plant freezing tolerance. The aim of the study was to examine the relationships among Rubisco activity, the expression of two Rubisco activase splicing variants and photoacclimation to low temperature. The experiment was performed on two Lolium perenne genotypes with contrasting levels of freezing tolerance. The study investigated the effect of pre-hardening (15°C) and cold acclimation (4°C) on net photosynthesis, photosystem II photochemical activity, Rubisco activity and the expression of two splicing variants of the Rubisco activase gene. The results showed an induction of Rubisco activity at both 15°C and 4°C only in a highly freezing-tolerant genotype. The enhanced Rubisco activity after pre-hardening corresponded to increased expression of the splicing variant representing the large isoform, while the increase in Rubisco activity during cold acclimation was due to the activation of both transcript variants. These boosts in Rubisco activity also corresponded to an activation of non-photochemical mechanism of photoacclimation induced at low temperature exclusively in the highly freezing-tolerant genotype. In conclusion, enhanced expression of Rubisco activase splicing variants caused an increase in Rubisco activity during pre-hardening and cold acclimation in the more freezing-tolerant Lolium perenne genotype. The induction of the transcript variant representing the large isoform may be an important element of increasing the carbon assimilation rate supporting the photochemical mechanism of photosynthetic acclimation to cold. Copyright © 2016 Elsevier GmbH. All rights reserved.
Identification of human short introns
Abebrese, Emmanuel L.; Arnold, Zachary R.; Armstrong, Katharine; Burns, Lindsay; Day, R. Thomas; Hsu, Daniel G.; Jarrell, Katherine; Luo, Yi; Mugayo, Daphine
2017-01-01
Canonical pre-mRNA splicing requires snRNPs and associated splicing factors to excise conserved intronic sequences, with a minimum intron length required for efficient splicing. Non-canonical splicing–intron excision without the spliceosome–has been documented; most notably, some tRNAs and the XBP1 mRNA contain short introns that are not removed by the spliceosome. There have been some efforts to identify additional short introns, but little is known about how many short introns are processed from mRNAs. Here, we report an approach to identify RNA short introns from RNA-Seq data, discriminating against small genomic deletions. We identify hundreds of short introns conserved among multiple human cell lines. These short introns are often alternatively spliced and are found in a variety of RNAs–both mRNAs and lncRNAs. Short intron splicing efficiency is increased by secondary structure, and we detect both canonical and non-canonical short introns. In many cases, splicing of these short introns from mRNAs is predicted to alter the reading frame and change protein output. Our findings imply that standard gene prediction models which often assume a lower limit for intron size fail to predict short introns effectively. We conclude that short introns are abundant in the human transcriptome, and short intron splicing represents an added layer to mRNA regulation. PMID:28520720
Armero, Victoria E. S.; Tremblay, Marie-Pier; Allaire, Andréa; Boudreault, Simon; Martenon-Brodeur, Camille; Duval, Cyntia; Durand, Mathieu; Lapointe, Elvy; Thibault, Philippe; Tremblay-Létourneau, Maude; Perreault, Jean-Pierre; Scott, Michelle S.
2017-01-01
Multiple human diseases including cancer have been associated with a dysregulation in RNA splicing patterns. In the current study, modifications to the global RNA splicing landscape of cellular genes were investigated in the context of Epstein-Barr virus-associated gastric cancer. Global alterations to the RNA splicing landscape of cellular genes was examined in a large-scale screen from 295 primary gastric adenocarcinomas using high-throughput RNA sequencing data. RT-PCR analysis, mass spectrometry, and co-immunoprecipitation studies were also used to experimentally validate and investigate the differential alternative splicing (AS) events that were observed through RNA-seq studies. Our study identifies alterations in the AS patterns of approximately 900 genes such as tumor suppressor genes, transcription factors, splicing factors, and kinases. These findings allowed the identification of unique gene signatures for which AS is misregulated in both Epstein-Barr virus-associated gastric cancer and EBV-negative gastric cancer. Moreover, we show that the expression of Epstein–Barr nuclear antigen 1 (EBNA1) leads to modifications in the AS profile of cellular genes and that the EBNA1 protein interacts with cellular splicing factors. These findings provide insights into the molecular differences between various types of gastric cancer and suggest a role for the EBNA1 protein in the dysregulation of cellular AS. PMID:28493890
Armero, Victoria E S; Tremblay, Marie-Pier; Allaire, Andréa; Boudreault, Simon; Martenon-Brodeur, Camille; Duval, Cyntia; Durand, Mathieu; Lapointe, Elvy; Thibault, Philippe; Tremblay-Létourneau, Maude; Perreault, Jean-Pierre; Scott, Michelle S; Bisaillon, Martin
2017-01-01
Multiple human diseases including cancer have been associated with a dysregulation in RNA splicing patterns. In the current study, modifications to the global RNA splicing landscape of cellular genes were investigated in the context of Epstein-Barr virus-associated gastric cancer. Global alterations to the RNA splicing landscape of cellular genes was examined in a large-scale screen from 295 primary gastric adenocarcinomas using high-throughput RNA sequencing data. RT-PCR analysis, mass spectrometry, and co-immunoprecipitation studies were also used to experimentally validate and investigate the differential alternative splicing (AS) events that were observed through RNA-seq studies. Our study identifies alterations in the AS patterns of approximately 900 genes such as tumor suppressor genes, transcription factors, splicing factors, and kinases. These findings allowed the identification of unique gene signatures for which AS is misregulated in both Epstein-Barr virus-associated gastric cancer and EBV-negative gastric cancer. Moreover, we show that the expression of Epstein-Barr nuclear antigen 1 (EBNA1) leads to modifications in the AS profile of cellular genes and that the EBNA1 protein interacts with cellular splicing factors. These findings provide insights into the molecular differences between various types of gastric cancer and suggest a role for the EBNA1 protein in the dysregulation of cellular AS.
Chen, W L; Luo, D F; Gao, C; Ding, Y; Wang, S Y
2015-07-01
The familial acute myeloid leukemia related factor gene (FAMLF) was previously identified from a familial AML subtractive cDNA library and shown to undergo alternative splicing. This study used real-time quantitative PCR to investigate the expression of the FAMLF alternative-splicing transcript consensus sequence (FAMLF-CS) in peripheral blood mononuclear cells (PBMCs) from 119 patients with de novo acute leukemia (AL) and 104 healthy controls, as well as in CD34+ cells from 12 AL patients and 10 healthy donors. A 429-bp fragment from a novel splicing variant of FAMLF was obtained, and a 363-bp consensus sequence was targeted to quantify total FAMLF expression. Kruskal-Wallis, Nemenyi, Spearman's correlation, and Mann-Whitney U-tests were used to analyze the data. FAMLF-CS expression in PBMCs from AL patients and CD34+ cells from AL patients and controls was significantly higher than in control PBMCs (P < 0.0001). Moreover, FAMLF-CS expression in PBMCs from the AML group was positively correlated with red blood cell count (rs =0.317, P=0.006), hemoglobin levels (rs = 0.210, P = 0.049), and percentage of peripheral blood blasts (rs = 0.256, P = 0.027), but inversely correlated with hemoglobin levels in the control group (rs = -0.391, P < 0.0001). AML patients with high CD34+ expression showed significantly higher FAMLF-CS expression than those with low CD34+ expression (P = 0.041). Our results showed that FAMLF is highly expressed in both normal and malignant immature hematopoietic cells, but that expression is lower in normal mature PBMCs.
Two Novel Variants Affecting CDKL5 Transcript Associated with Epileptic Encephalopathy.
Neupauerová, Jana; Štěrbová, Katalin; Vlčková, Markéta; Sebroňová, Věra; Maříková, Tat'ána; Krůtová, Marcela; David, Staněk; Kršek, Pavel; Žaliová, Markéta; Seeman, Pavel; Laššuthová, Petra
2017-10-01
Variants in the human X-linked cyclin-dependent kinase-like 5 (CDKL5) gene have been reported as being etiologically associated with early infantile epileptic encephalopathy type 2 (EIEE2). We report on two patients, a boy and a girl, with EIEE2 that present with early onset epilepsy, hypotonia, severe intellectual disability, and poor eye contact. Massively parallel sequencing (MPS) of a custom-designed gene panel for epilepsy and epileptic encephalopathy containing 112 epilepsy-related genes was performed. Sanger sequencing was used to confirm the novel variants. For confirmation of the functional consequence of an intronic CDKL5 variant in patient 2, an RNA study was done. DNA sequencing revealed de novo variants in CDKL5, a c.2578C>T (p. Gln860*) present in a hemizygous state in a 3-year-old boy, and a potential splice site variant c.463+5G>A in heterozygous state in a 5-year-old girl. Multiple in silico splicing algorithms predicted a highly reduced splice site score for c.463+5G>A. A subsequent mRNA study confirmed an aberrant shorter transcript lacking exon 7. Our data confirmed that variants in the CDKL5 are associated with EIEE2. There is credible evidence that the novel identified variants are pathogenic and, therefore, are likely the cause of the disease in the presented patients. In one of the patients a stop codon variant is predicted to produce a truncated protein, and in the other patient an intronic variant results in aberrant splicing.
Congenital analbuminemia caused by a novel aberrant splicing in the albumin gene
Caridi, Gianluca; Dagnino, Monica; Erdeve, Omer; Di Duca, Marco; Yildiz, Duran; Alan, Serdar; Atasay, Begum; Arsan, Saadet; Campagnoli, Monica; Galliano, Monica; Minchiotti, Lorenzo
2014-01-01
Introduction: Congenital analbuminemia is a rare autosomal recessive disorder manifested by the presence of a very low amount of circulating serum albumin. It is an allelic heterogeneous defect, caused by variety of mutations within the albumin gene in homozygous or compound heterozygous state. Herein we report the clinical and molecular characterization of a new case of congenital analbuminemia diagnosed in a female newborn of consanguineous (first degree cousins) parents from Ankara, Turkey, who presented with a low albumin concentration (< 8 g/L) and severe clinical symptoms. Materials and methods: The albumin gene of the index case was screened by single-strand conformation polymorphism, heteroduplex analysis, and direct DNA sequencing. The effect of the splicing mutation was evaluated by examining the cDNA obtained by reverse transcriptase - polymerase chain reaction (RT-PCR) from the albumin mRNA extracted from proband’s leukocytes. Results: DNA sequencing revealed that the proband is homozygous, and both parents are heterozygous, for a novel G>A transition at position c.1652+1, the first base of intron 12, which inactivates the strongly conserved GT dinucleotide at the 5′ splice site consensus sequence of this intron. The splicing defect results in the complete skipping of the preceding exon (exon 12) and in a frame-shift within exon 13 with a premature stop codon after the translation of three mutant amino acid residues. Conclusions: Our results confirm the clinical diagnosis of congenital analbuminemia in the proband and the inheritance of the trait and contribute to shed light on the molecular genetics of analbuminemia. PMID:24627724
Ryan, Michael C; Zeeberg, Barry R; Caplen, Natasha J; Cleland, James A; Kahn, Ari B; Liu, Hongfang; Weinstein, John N
2008-01-01
Background Over 60% of protein-coding genes in vertebrates express mRNAs that undergo alternative splicing. The resulting collection of transcript isoforms poses significant challenges for contemporary biological assays. For example, RT-PCR validation of gene expression microarray results may be unsuccessful if the two technologies target different splice variants. Effective use of sequence-based technologies requires knowledge of the specific splice variant(s) that are targeted. In addition, the critical roles of alternative splice forms in biological function and in disease suggest that assay results may be more informative if analyzed in the context of the targeted splice variant. Results A number of contemporary technologies are used for analyzing transcripts or proteins. To enable investigation of the impact of splice variation on the interpretation of data derived from those technologies, we have developed SpliceCenter. SpliceCenter is a suite of user-friendly, web-based applications that includes programs for analysis of RT-PCR primer/probe sets, effectors of RNAi, microarrays, and protein-targeting technologies. Both interactive and high-throughput implementations of the tools are provided. The interactive versions of SpliceCenter tools provide visualizations of a gene's alternative transcripts and probe target positions, enabling the user to identify which splice variants are or are not targeted. The high-throughput batch versions accept user query files and provide results in tabular form. When, for example, we used SpliceCenter's batch siRNA-Check to process the Cancer Genome Anatomy Project's large-scale shRNA library, we found that only 59% of the 50,766 shRNAs in the library target all known splice variants of the target gene, 32% target some but not all, and 9% do not target any currently annotated transcript. Conclusion SpliceCenter provides unique, user-friendly applications for assessing the impact of transcript variation on the design and interpretation of RT-PCR, RNAi, gene expression microarrays, antibody-based detection, and mass spectrometry proteomics. The tools are intended for use by bench biologists as well as bioinformaticists. PMID:18638396
Visootsat, Akasit; Payungporn, Sunchai; T-Thienprasert, Nattanan P
2015-12-01
Hepatitis B virus (HBV) infection is a primary cause of hepatocellular carcinoma and liver cirrhosis worldwide. To develop novel antiviral drugs, a better understanding of HBV gene expression regulation is vital. One important aspect is to understand how HBV hijacks the cellular machinery to export unspliced RNA from the nucleus. The HBV post-transcriptional regulatory element (HBV PRE) has been proposed to be the HBV RNA nuclear export element. However, the function remains controversial, and the core element is unclear. This study, therefore, aimed to identify functional regulatory elements within the HBV PRE and investigate their functions. Using bioinformatics programs based on sequence conservation and conserved RNA secondary structures, three regulatory elements were predicted, namely PRE 1151-1410, PRE 1520-1620 and PRE 1650-1684. PRE 1151-1410 significantly increased intronless and unspliced luciferase activity in both HepG2 and COS-7 cells. Likewise, PRE 1151-1410 significantly elevated intronless and unspliced HBV surface transcripts in liver cancer cells. Moreover, motif analysis predicted that PRE 1151-1410 contains several regulatory motifs. This study reported the roles of PRE 1151-1410 in intronless transcript nuclear export and the splicing mechanism. Additionally, these results provide knowledge in the field of HBV RNA regulation. Moreover, PRE 1151-1410 may be used to enhance the expression of other mRNAs in intronless reporter plasmids.
Unusual splice site mutations disrupt FANCA exon 8 definition.
Mattioli, Chiara; Pianigiani, Giulia; De Rocco, Daniela; Bianco, Anna Monica Rosaria; Cappelli, Enrico; Savoia, Anna; Pagani, Franco
2014-07-01
The pathological role of mutations that affect not conserved splicing regulatory sequences can be difficult to determine. In a patient with Fanconi anemia, we identified two unpredictable splicing mutations that act on either sides of FANCA exon 8. In patients-derived cells and in minigene splicing assay, we showed that both an apparently benign intronic c.710-5T>C transition and the nonsense c.790C>T substitution induce almost complete exon 8 skipping. Site-directed mutagenesis experiments indicated that the c.710-5T>C transition affects a polypyrimidine tract where most of the thymidines cannot be compensated by cytidines. The c.790C>T mutation located in position -3 relative to the donor site induce exon 8 skipping in an NMD-independent manner and complementation experiments with modified U1 snRNAs showed that U1 snRNP is only partially involved in the splicing defect. Our results highlight the importance of performing splicing functional assay for correct identification of disease-causing mechanism of genomic variants and provide mechanistic insights on how these two FANCA mutations affect exon 8 definition. Copyright © 2014 Elsevier B.V. All rights reserved.
Short, Stephen; Peterkin, Tessa; Guille, Matthew; Patient, Roger; Sharpe, Colin
2015-01-01
Vertebrate NCoR-family co-repressors play central roles in the timing of embryo and stem cell differentiation by repressing the activity of a range of transcription factors. They interact with nuclear receptors using short linear motifs (SLiMs) termed co-repressor for nuclear receptor (CoRNR) boxes. Here, we identify the pathway leading to increasing co-repressor diversity across the deuterostomes. The final complement of CoRNR boxes arose in an ancestral cephalochordate, and was encoded in one large exon; the urochordates and vertebrates then split this region between 10 and 12 exons. In Xenopus, alternative splicing is prevalent in NCoR2, but absent in NCoR1. We show for one NCoR1 exon that alternative splicing can be recovered by a single point mutation, suggesting NCoR1 lost the capacity for alternative splicing. Analyses in Xenopus and zebrafish identify that cellular context, rather than gene sequence, predominantly determines species differences in alternative splicing. We identify a pathway to diversity for the NCoR family beginning with the addition of a SLiM, followed by gene duplication, the generation of alternatively spliced isoforms and their differential deployment. PMID:26289800
Tang, Rongying; Prosser, Debra O.; Love, Donald R.
2016-01-01
The increasing diagnostic use of gene sequencing has led to an expanding dataset of novel variants that lie within consensus splice junctions. The challenge for diagnostic laboratories is the evaluation of these variants in order to determine if they affect splicing or are merely benign. A common evaluation strategy is to use in silico analysis, and it is here that a number of programmes are available online; however, currently, there are no consensus guidelines on the selection of programmes or protocols to interpret the prediction results. Using a collection of 222 pathogenic mutations and 50 benign polymorphisms, we evaluated the sensitivity and specificity of four in silico programmes in predicting the effect of each variant on splicing. The programmes comprised Human Splice Finder (HSF), Max Entropy Scan (MES), NNSplice, and ASSP. The MES and ASSP programmes gave the highest performance based on Receiver Operator Curve analysis, with an optimal cut-off of score reduction of 10%. The study also showed that the sensitivity of prediction is affected by the level of conservation of individual positions, with in silico predictions for variants at positions −4 and +7 within consensus splice sites being largely uninformative. PMID:27313609
Wang, Yanyan; Zhang, Tianbao; Song, Xiaxia; Zhang, Jianping; Dang, Zhanhai; Pei, Xinwu; Long, Yan
2018-01-01
Alternative splicing is a popular phenomenon in different types of plants. It can produce alternative spliced transcripts that encode proteins with altered functions. Previous studies have shown that one transcription factor, ABSCISIC ACID INSENSITIVE3 (ABI3), which encodes an important component in abscisic acid (ABA) signaling, is subjected to alternative splicing in both mono- and dicotyledons. In the current study, we identified two homologs of ABI3 in the genome of linseed flax. We screened two alternatively spliced flax LuABI3 transcripts, LuABI3-2 and LuABI3-3, and one normal flax LuABI3 transcript, LuABI3-1. Sequence analysis revealed that one of the alternatively spliced transcripts, LuABI3-3, retained a 6 bp intron. RNA accumulation analysis showed that all three transcripts were expressed during seed development, while subcellular localization and transgene experiments showed that LuABI3-3 had no biological function. The two normal transcripts, LuABI3-1 and LuABI3-2, are the important functional isoforms in flax and play significant roles in the ABA regulatory pathway during seed development, germination, and maturation.
Sikorav, J L; Duval, N; Anselmet, A; Bon, S; Krejci, E; Legay, C; Osterlund, M; Reimund, B; Massoulié, J
1988-01-01
In this paper, we show the existence of alternative splicing in the 3' region of the coding sequence of Torpedo acetylcholinesterase (AChE). We describe two cDNA structures which both diverge from the previously described coding sequence of the catalytic subunit of asymmetric (A) forms (Schumacher et al., 1986; Sikorav et al., 1987). They both contain a coding sequence followed by a non-coding sequence and a poly(A) stretch. Both of these structures were shown to exist in poly(A)+ RNAs, by S1 mapping experiments. The divergent region encoded by the first sequence corresponds to the precursor of the globular dimeric form (G2a), since it contains the expected C-terminal amino acids, Ala-Cys. These amino acids are followed by a 29 amino acid extension which contains a hydrophobic segment and must be replaced by a glycolipid in the mature protein. Analyses of intact G2a AChE showed that the common domain of the protein contains intersubunit disulphide bonds. The divergent region of the second type of cDNA consists of an adjacent genomic sequence, which is removed as an intron in A and Ga mRNAs, but may encode a distinct, less abundant catalytic subunit. The structures of the cDNA clones indicate that they are derived from minor mRNAs, shorter than the three major transcripts which have been described previously (14.5, 10.5 and 5.5 kb). Oligonucleotide probes specific for the asymmetric and globular terminal regions hybridize with the three major transcripts, indicating that their size is determined by 3'-untranslated regions which are not related to the differential splicing leading to A and Ga forms. Images PMID:3181125
Regulation of insulin preRNA splicing by glucose
Wang, Juehu; Shen, Luping; Najafi, Habiba; Kolberg, Janice; Matschinsky, Franz M.; Urdea, Mickey; German, Michael
1997-01-01
Glucose tightly regulates the synthesis and secretion of insulin by β cells in the pancreatic islets of Langerhans. To investigate whether glucose regulates insulin synthesis at the level of insulin RNA splicing, we developed a method to detect and quantify a small amount of RNA by using the branched DNA (bDNA) signal-amplification technique. This assay is both sensitive and highly specific: mouse insulin II mRNA can be detected from a single β cell (βTC3 cells or mouse islets), whereas 1 million non-insulin-producing α cells (αTC1.6 cells) give no signal. By using intron and exon sequences, oligonucleotide probes were designed to distinguish the various unspliced and partially spliced insulin preRNAs from mature insulin mRNA. Insulin RNA splicing rates were estimated from the rate of disappearance of insulin preRNA signal from β cells treated with actinomycin D to block transcription. We found that the two introns in mouse insulin II are not spliced with the same efficiency. Intron 2 is spliced out more efficiently than intron 1. As a result, some mRNA retaining intron 1 enters the cytoplasm, making up ≈2-10% of insulin mRNA in the cell. This partially spliced cytoplasmic mRNA is quite stable, with a half-life similar to the completely spliced form. When islets grown in high glucose are shifted to low glucose medium, the level of insulin preRNA and the rate of splicing fall significantly. We conclude that glucose stimulates insulin gene transcription and insulin preRNA splicing. Previous estimates of insulin transcription rates based on insulin preRNA levels that did not consider the rate of splicing may have underestimated the effect of glucose on insulin gene transcription. PMID:9113994
RNA-Seq of Arabidopsis Pollen Uncovers Novel Transcription and Alternative Splicing1[C][W][OA
Loraine, Ann E.; McCormick, Sheila; Estrada, April; Patel, Ketan; Qin, Peng
2013-01-01
Pollen grains of Arabidopsis (Arabidopsis thaliana) contain two haploid sperm cells enclosed in a haploid vegetative cell. Upon germination, the vegetative cell extrudes a pollen tube that carries the sperm to an ovule for fertilization. Knowing the identity, relative abundance, and splicing patterns of pollen transcripts will improve our understanding of pollen and allow investigation of tissue-specific splicing in plants. Most Arabidopsis pollen transcriptome studies have used the ATH1 microarray, which does not assay splice variants and lacks specific probe sets for many genes. To investigate the pollen transcriptome, we performed high-throughput sequencing (RNA-Seq) of Arabidopsis pollen and seedlings for comparison. Gene expression was more diverse in seedling, and genes involved in cell wall biogenesis were highly expressed in pollen. RNA-Seq detected at least 4,172 protein-coding genes expressed in pollen, including 289 assayed only by nonspecific probe sets. Additional exons and previously unannotated 5′ and 3′ untranslated regions for pollen-expressed genes were revealed. We detected regions in the genome not previously annotated as expressed; 14 were tested and 12 were confirmed by polymerase chain reaction. Gapped read alignments revealed 1,908 high-confidence new splicing events supported by 10 or more spliced read alignments. Alternative splicing patterns in pollen and seedling were highly correlated. For most alternatively spliced genes, the ratio of variants in pollen and seedling was similar, except for some encoding proteins involved in RNA splicing. This study highlights the robustness of splicing patterns in plants and the importance of ongoing annotation and visualization of RNA-Seq data using interactive tools such as Integrated Genome Browser. PMID:23590974
Matos, Liliana; Canals, Isaac; Dridi, Larbi; Choi, Yoo; Prata, Maria João; Jordan, Peter; Desviat, Lourdes R; Pérez, Belén; Pshezhetsky, Alexey V; Grinberg, Daniel; Alves, Sandra; Vilageliu, Lluïsa
2014-12-10
Mutations affecting RNA splicing represent more than 20% of the mutant alleles in Sanfilippo syndrome type C, a rare lysosomal storage disorder that causes severe neurodegeneration. Many of these mutations are localized in the conserved donor or acceptor splice sites, while few are found in the nearby nucleotides. In this study we tested several therapeutic approaches specifically designed for different splicing mutations depending on how the mutations affect mRNA processing. For three mutations that affect the donor site (c.234 + 1G > A, c.633 + 1G > A and c.1542 + 4dupA), different modified U1 snRNAs recognizing the mutated donor sites, have been developed in an attempt to rescue the normal splicing process. For another mutation that affects an acceptor splice site (c.372-2A > G) and gives rise to a protein lacking four amino acids, a competitive inhibitor of the HGSNAT protein, glucosamine, was tested as a pharmacological chaperone to correct the aberrant folding and to restore the normal trafficking of the protein to the lysosome. Partial correction of c.234 + 1G > A mutation was achieved with a modified U1 snRNA that completely matches the splice donor site suggesting that these molecules may have a therapeutic potential for some splicing mutations. Furthermore, the importance of the splice site sequence context is highlighted as a key factor in the success of this type of therapy. Additionally, glucosamine treatment resulted in an increase in the enzymatic activity, indicating a partial recovery of the correct folding. We have assayed two therapeutic strategies for different splicing mutations with promising results for the future applications.
Margaglione, M; Santacroce, R; Colaizzo, D; Seripa, D; Vecchione, G; Lupone, M R; De Lucia, D; Fortina, P; Grandone, E; Perricone, C; Di Minno, G
2000-10-01
Congenital afibrinogenemia is a rare autosomal recessive disorder characterized by a hemorrhagic diathesis of variable severity. Although more than 100 families with this disorder have been described, genetic defects have been characterized in few cases. An investigation of a young propositus, offspring of a consanguineous marriage, with undetectable levels of functional and quantitative fibrinogen, was conducted. Sequence analysis of the fibrinogen genes showed a homozygous G-to-A mutation at the fifth nucleotide (nt 2395) of the third intervening sequence (IVS) of the gamma-chain gene. Her first-degree relatives, who had approximately half the normal fibrinogen values and showed concordance between functional and immunologic levels, were heterozygtes. The G-to-A change predicts the disappearance of a donor splice site. After transfection with a construct, containing either the wild-type or the mutated sequence, cells with the mutant construct showed an aberrant messenger RNA (mRNA), consistent with skipping of exon 3, but not the expected mRNA. Sequencing of the abnormal mRNA showed the complete absence of exon 3. Skipping of exon 3 predicts the deletion of amino acid sequence from residue 16 to residue 75 and shifting of reading frame at amino acid 76 with a premature stop codon within exon 4 at position 77. Thus, the truncated gamma-chain gene product would not interact with other chains to form the mature fibrinogen molecule. The current findings show that mutations within highly conserved IVS regions of fibrinogen genes could affect the efficiency of normal splicing, giving rise to congenital afibrinogenemia.
Sequencing of mRNA identifies re-expression of fetal splice variants in cardiac hypertrophy
Ames, EG; Lawson, MJ; Mackey, AJ; Holmes, JW
2013-01-01
Cardiac hypertrophy has been well-characterized at the level of transcription. During cardiac hypertrophy, genes normally expressed primarily during fetal heart development are reexpressed, and this fetal gene program is believed to be a critical component of the hypertrophic process. Recently, alternative splicing of mRNA transcripts has been shown to be temporally regulated during heart development, leading us to consider whether fetal patterns of splicing also reappear during hypertrophy. We hypothesized that patterns of alternative splicing occurring during heart development are recapitulated during cardiac hypertrophy. Here we present a study of isoform expression during pressure-overload cardiac hypertrophy induced by 10 days of transverse aortic constriction (TAC) in rats and in developing fetal rat hearts compared to sham-operated adult rat hearts, using high-throughput sequencing of poly(A) tail mRNA. We find a striking degree of overlap between the isoforms expressed differentially in fetal and pressure-overloaded hearts compared to control: forty-four percent of the isoforms with significantly altered expression in TAC hearts are also expressed at significantly different levels in fetal hearts compared to control (P < 0.001). The isoforms that are shared between hypertrophy and fetal heart development are significantly enriched for genes involved in cytoskeletal organization, RNA processing, developmental processes, and metabolic enzymes. Our data strongly support the concept that mRNA splicing patterns normally associated with heart development recur as part of the hypertrophic response to pressure overload. These findings suggest that cardiac hypertrophy shares post-transcriptional as well as transcriptional regulatory mechanisms with fetal heart development. PMID:23688780
Spliceman2: a computational web server that predicts defects in pre-mRNA splicing.
Cygan, Kamil Jan; Sanford, Clayton Hendrick; Fairbrother, William Guy
2017-09-15
Most pre-mRNA transcripts in eukaryotic cells must undergo splicing to remove introns and join exons, and splicing elements present a large mutational target for disease-causing mutations. Splicing elements are strongly position dependent with respect to the transcript annotations. In 2012, we presented Spliceman, an online tool that used positional dependence to predict how likely distant mutations around annotated splice sites were to disrupt splicing. Here, we present an improved version of the previous tool that will be more useful for predicting the likelihood of splicing mutations. We have added industry-standard input options (i.e. Spliceman now accepts variant call format files), which allow much larger inputs than previously available. The tool also can visualize the locations-within exons and introns-of sequence variants to be analyzed and the predicted effects on splicing of the pre-mRNA transcript. In addition, Spliceman2 integrates with RNAcompete motif libraries to provide a prediction of which trans -acting factors binding sites are disrupted/created and links out to the UCSC genome browser. In summary, the new features in Spliceman2 will allow scientists and physicians to better understand the effects of single nucleotide variations on splicing. Freely available on the web at http://fairbrother.biomed.brown.edu/spliceman2 . Website implemented in PHP framework-Laravel 5, PostgreSQL, Apache, and Perl, with all major browsers supported. william_fairbrother@brown.edu. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
Changes in exon–intron structure during vertebrate evolution affect the splicing pattern of exons
Gelfman, Sahar; Burstein, David; Penn, Osnat; Savchenko, Anna; Amit, Maayan; Schwartz, Schraga; Pupko, Tal; Ast, Gil
2012-01-01
Exon–intron architecture is one of the major features directing the splicing machinery to the short exons that are located within long flanking introns. However, the evolutionary dynamics of exon–intron architecture and its impact on splicing is largely unknown. Using a comparative genomic approach, we analyzed 17 vertebrate genomes and reconstructed the ancestral motifs of both 3′ and 5′ splice sites, as also the ancestral length of exons and introns. Our analyses suggest that vertebrate introns increased in length from the shortest ancestral introns to the longest primate introns. An evolutionary analysis of splice sites revealed that weak splice sites act as a restrictive force keeping introns short. In contrast, strong splice sites allow recognition of exons flanked by long introns. Reconstruction of the ancestral state suggests these phenomena were not prevalent in the vertebrate ancestor, but appeared during vertebrate evolution. By calculating evolutionary rate shifts in exons, we identified cis-acting regulatory sequences that became fixed during the transition from early vertebrates to mammals. Experimental validations performed on a selection of these hexamers confirmed their regulatory function. We additionally revealed many features of exons that can discriminate alternative from constitutive exons. These features were integrated into a machine-learning approach to predict whether an exon is alternative. Our algorithm obtains very high predictive power (AUC of 0.91), and using these predictions we have identified and successfully validated novel alternatively spliced exons. Overall, we provide novel insights regarding the evolutionary constraints acting upon exons and their recognition by the splicing machinery. PMID:21974994
Ma, Nina S; Malloy, Peter J; Pitukcheewanont, Pisit; Dreimane, Daina; Geffner, Mitchell E; Feldman, David
2009-10-01
To study the vitamin D receptor (VDR) gene in a young girl with severe rickets and clinical features of hereditary vitamin D resistant rickets, including hypocalcemia, hypophosphatemia, partial alopecia, and elevated serum levels of 1,25-dihydroxyvitamin D. We amplified and sequenced DNA samples from blood from the patient, her mother, and the patient's two siblings. We also amplified and sequenced the VDR cDNA from RNA isolated from the patient's blood. DNA sequence analyses of the VDR gene showed that the patient was homozygous for a novel guanine to thymine substitution in the 5'-splice site in the exon 8-intron J junction. Analysis of the VDR cDNA using reverse transcriptase-polymerase chain reaction showed that exons 7 and 9 were fused, and that exon 8 was skipped. The mother was heterozygous for the mutation and the two siblings were unaffected. A novel splice site mutation was identified in the VDR gene that caused exon 8 to be skipped. The mutation deleted amino acids 303-341 in the VDR ligand-binding domain, which is expected to render the VDR non-functional. Nevertheless, successful outpatient treatment was achieved with frequent high doses of oral calcium.
Villate, Olatz; Ibarluzea, Nekane; Fraile-Bethencourt, Eugenia; Valenzuela, Alberto; Velasco, Eladio A; Grozeva, Detelina; Raymond, F L; Botella, María P; Tejada, María-Isabel
2018-01-01
Mutations in CHD7 have been shown to be a major cause of CHARGE syndrome, which presents many symptoms and features common to other syndromes making its diagnosis difficult. Next generation sequencing (NGS) of a panel of intellectual disability related genes was performed in an adult patient without molecular diagnosis. A splice donor variant in CHD7 (c.5665 + 1G > T) was identified. To study its potential pathogenicity, exons and flanking intronic sequences were amplified from patient DNA and cloned into the pSAD ® splicing vector. HeLa cells were transfected with this construct and a wild-type minigene and functional analysis were performed. The construct with the c.5665 + 1G > T variant produced an aberrant transcript with an insert of 63 nucleotides of intron 28 creating a premature termination codon (TAG) 25 nucleotides downstream. This would lead to the insertion of 8 new amino acids and therefore a truncated 1896 amino acid protein. As a result of this, the patient was diagnosed with CHARGE syndrome. Functional analyses underline their usefulness for studying the pathogenicity of variants found by NGS and therefore its application to accurately diagnose patients.
Yu, Yi; Panhuysen, Carolien; Kranzler, Henry R; Hesselbrock, Victor; Rounsaville, Bruce; Weiss, Roger; Brady, Kathleen; Farrer, Lindsay A; Gelernter, Joel
2006-07-15
We report here a study considering association of alleles and haplotypes at the DOPA decarboxylase (DDC) locus with the DSM-IV diagnosis of nicotine dependence (ND) or a quantitative measure for ND using the Fagerstrom Test for Nicotine Dependence (FTND). We genotyped 18 single nucleotide polymorphisms (SNPs) spanning a region of approximately 210 kb that includes DDC and the genes immediately flanking DDC in 1,590 individuals from 621 families of African-American (AA) or European-American (EA) ancestry. Evidence of association (family-based tests) was observed with several SNPs for both traits (0.0002
Mutations in the Promoter Region of the Aldolase B Gene that cause Hereditary Fructose Intolerance
Coffee, Erin M.; Tolan, Dean R.
2010-01-01
SUMMARY Hereditary fructose intolerance (HFI) is a potentially fatal inherited metabolic disease caused by a deficiency of aldolase B activity in the liver and kidney. Over 40 disease-causing mutations are known in the protein-coding region of ALDOB. Mutations upstream of the protein-coding portion of ALDOB are reported here for the first time. DNA sequence analysis of 61 HFI patients revealed single base mutations in the promoter, intronic enhancer, and the first exon, which is entirely untranslated. One mutation, g.–132G>A, is located within the promoter at an evolutionarily conserved nucleotide within a transcription factor-binding site. A second mutation, IVS1+1G>C, is at the donor splice site of the first exon. In vitro electrophoretic mobility shift assays show a decrease in nuclear extract-protein binding at the g.–132G>A mutant site. The promoter mutation results in decreased transcription using luciferase reporter plasmids. Analysis of cDNA from cells transfected with plasmids harboring the IVS1+1G>C mutation results in aberrant splicing leading to complete retention of the first intron (~ 5 kb). The IVS1+1G>C splicing mutation results in loss of luciferase activity from a reporter plasmid. These novel mutations in ALDOB represent 2% of alleles in American HFI patients, with IVS1+1G>C representing a significantly higher allele frequency (6%) among HFI patients of Hispanic and African-American ethnicity. PMID:20882353
Bauer, William J.; Heath, Jason; Jenkins, Jermaine L.; Kielkopf, Clara L.
2012-01-01
T-cell intracellular antigen-1 (TIA-1) regulates developmental and stress-responsive pathways through distinct activities at the levels of alternative pre-mRNA splicing and mRNA translation. The TIA-1 polypeptide contains three RNA recognition motifs (RRMs). The central RRM2 and C-terminal RRM3 associate with cellular mRNAs. The N-terminal RRM1 enhances interactions of a C-terminal Q-rich domain of TIA-1 with the U1-C splicing factor, despite linear separation of the domains in the TIA-1 sequence. Given the expanded functional repertoire of the RRM family, it was unknown whether TIA-1 RRM1 contributes to RNA binding as well as documented protein interactions. To address this question, we used isothermal titration calorimetry and small-angle X-ray scattering (SAXS) to dissect the roles of the TIA-1 RRMs in RNA recognition. Notably, the fas RNA exhibited two binding sites with indistinguishable affinities for TIA-1. Analyses of TIA-1 variants established that RRM1 was dispensable for binding AU-rich fas sites, yet all three RRMs were required to bind a polyU RNA with high affinity. SAXS analyses demonstrated a `V' shape for a TIA-1 construct comprising the three RRMs, and revealed that its dimensions became more compact in the RNA-bound state. The sequence-selective involvement of TIA-1 RRM1 in RNA recognition suggests a possible role for RNA sequences in regulating the distinct functions of TIA-1. Further implications for U1-C recruitment by the adjacent TIA-1 binding sites of the fas pre-mRNA and the bent TIA-1 shape, which organizes the N- and C-termini on the same side of the protein, are discussed. PMID:22154808
Kuroyanagi, Hidehito; Watanabe, Yohei; Suzuki, Yutaka; Hagiwara, Masatoshi
2013-01-01
A large fraction of protein-coding genes in metazoans undergo alternative pre-mRNA splicing in tissue- or cell-type-specific manners. Recent genome-wide approaches have identified many putative-binding sites for some of tissue-specific trans-acting splicing regulators. However, the mechanisms of splicing regulation in vivo remain largely unknown. To elucidate the modes of splicing regulation by the neuron-specific CELF family RNA-binding protein UNC-75 in Caenorhabditis elegans, we performed deep sequencing of poly(A)+ RNAs from the unc-75(+)- and unc-75-mutant worms and identified more than 20 cassette and mutually exclusive exons repressed or activated by UNC-75. Motif searches revealed that (G/U)UGUUGUG stretches are enriched in the upstream and downstream introns of the UNC-75-repressed and -activated exons, respectively. Recombinant UNC-75 protein specifically binds to RNA fragments carrying the (G/U)UGUUGUG stretches in vitro. Bi-chromatic fluorescence alternative splicing reporters revealed that the UNC-75-target exons are regulated in tissue-specific and (G/U)UGUUGUG element-dependent manners in vivo. The unc-75 mutation affected the splicing reporter expression specifically in the nervous system. These results indicate that UNC-75 regulates alternative splicing of its target exons in neuron-specific and position-dependent manners through the (G/U)UGUUGUG elements in C. elegans. This study thus reveals the repertoire of target events for the CELF family in the living organism. PMID:23416545
Misregulation of Alternative Splicing in a Mouse Model of Rett Syndrome
Li, Ronghui; Dong, Qiping; Yuan, Xinni; Zeng, Xin; Gao, Yu; Li, Hongda; Keles, Sunduz; Wang, Zefeng; Chang, Qiang
2016-01-01
Mutations in the human MECP2 gene cause Rett syndrome (RTT), a severe neurodevelopmental disorder that predominantly affects girls. Despite decades of work, the molecular function of MeCP2 is not fully understood. Here we report a systematic identification of MeCP2-interacting proteins in the mouse brain. In addition to transcription regulators, we found that MeCP2 physically interacts with several modulators of RNA splicing, including LEDGF and DHX9. These interactions are disrupted by RTT causing mutations, suggesting that they may play a role in RTT pathogenesis. Consistent with the idea, deep RNA sequencing revealed misregulation of hundreds of splicing events in the cortex of Mecp2 knockout mice. To reveal the functional consequence of altered RNA splicing due to the loss of MeCP2, we focused on the regulation of the splicing of the flip/flop exon of Gria2 and other AMPAR genes. We found a significant splicing shift in the flip/flop exon toward the flop inclusion, leading to a faster decay in the AMPAR gated current and altered synaptic transmission. In summary, our study identified direct physical interaction between MeCP2 and splicing factors, a novel MeCP2 target gene, and established functional connection between a specific RNA splicing change and synaptic phenotypes in RTT mice. These results not only help our understanding of the molecular function of MeCP2, but also reveal potential drug targets for future therapies. PMID:27352031
Zhang, Yanju; Lameijer, Eric-Wubbo; 't Hoen, Peter A C; Ning, Zemin; Slagboom, P Eline; Ye, Kai
2012-02-15
RNA-seq is a powerful technology for the study of transcriptome profiles that uses deep-sequencing technologies. Moreover, it may be used for cellular phenotyping and help establishing the etiology of diseases characterized by abnormal splicing patterns. In RNA-Seq, the exact nature of splicing events is buried in the reads that span exon-exon boundaries. The accurate and efficient mapping of these reads to the reference genome is a major challenge. We developed PASSion, a pattern growth algorithm-based pipeline for splice site detection in paired-end RNA-Seq reads. Comparing the performance of PASSion to three existing RNA-Seq analysis pipelines, TopHat, MapSplice and HMMSplicer, revealed that PASSion is competitive with these packages. Moreover, the performance of PASSion is not affected by read length and coverage. It performs better than the other three approaches when detecting junctions in highly abundant transcripts. PASSion has the ability to detect junctions that do not have known splicing motifs, which cannot be found by the other tools. Of the two public RNA-Seq datasets, PASSion predicted ≈ 137,000 and 173,000 splicing events, of which on average 82 are known junctions annotated in the Ensembl transcript database and 18% are novel. In addition, our package can discover differential and shared splicing patterns among multiple samples. The code and utilities can be freely downloaded from https://trac.nbic.nl/passion and ftp://ftp.sanger.ac.uk/pub/zn1/passion.
A Presumptive Developmental Role for a Sea Urchin Cyclin B Splice Variant
Lozano, Jean-Claude; Schatt, Philippe; Marquès, François; Peaucellier, Gérard; Fort, Philippe; Féral, Jean-Pierre; Genevière, Anne-Marie; Picard, André
1998-01-01
We show that a splice variant–derived cyclin B is produced in sea urchin oocytes and embryos. This splice variant protein lacks highly conserved sequences in the COOH terminus of the protein. It is found strikingly abundant in growing oocytes and cells committed to differentiation during embryogenesis. Cyclin B splice variant (CBsv) protein associates weakly in the cell with Xenopus cdc2 and with budding yeast CDC28p. In contrast to classical cyclin B, CBsv very poorly complements a triple CLN deletion in budding yeast, and its microinjection prevents an initial step in MPF activation, leading to an important delay in oocyte meiosis reinitiation. CBsv microinjection in fertilized eggs induces cell cycle delay and abnormal development. We assume that CBsv is produced in growing oocytes to keep them in prophase, and during embryogenesis to slow down cell cycle in cells that will be committed to differentiation. PMID:9442104
Inference of alternative splicing from RNA-Seq data with probabilistic splice graphs
LeGault, Laura H.; Dewey, Colin N.
2013-01-01
Motivation: Alternative splicing and other processes that allow for different transcripts to be derived from the same gene are significant forces in the eukaryotic cell. RNA-Seq is a promising technology for analyzing alternative transcripts, as it does not require prior knowledge of transcript structures or genome sequences. However, analysis of RNA-Seq data in the presence of genes with large numbers of alternative transcripts is currently challenging due to efficiency, identifiability and representation issues. Results: We present RNA-Seq models and associated inference algorithms based on the concept of probabilistic splice graphs, which alleviate these issues. We prove that our models are often identifiable and demonstrate that our inference methods for quantification and differential processing detection are efficient and accurate. Availability: Software implementing our methods is available at http://deweylab.biostat.wisc.edu/psginfer. Contact: cdewey@biostat.wisc.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:23846746
Barta, Andrea; Kalyna, Maria; Reddy, Anireddy S N
2010-09-01
Growing interest in alternative splicing in plants and the extensive sequencing of new plant genomes necessitate more precise definition and classification of genes coding for splicing factors. SR proteins are a family of RNA binding proteins, which function as essential factors for constitutive and alternative splicing. We propose a unified nomenclature for plant SR proteins, taking into account the newly revised nomenclature of the mammalian SR proteins and a number of plant-specific properties of the plant proteins. We identify six subfamilies of SR proteins in Arabidopsis thaliana and rice (Oryza sativa), three of which are plant specific. The proposed subdivision of plant SR proteins into different subfamilies will allow grouping of paralogous proteins and simple assignment of newly discovered SR orthologs from other plant species and will promote functional comparisons in diverse plant species.
DIEGO: detection of differential alternative splicing using Aitchison's geometry.
Doose, Gero; Bernhart, Stephan H; Wagener, Rabea; Hoffmann, Steve
2018-03-15
Alternative splicing is a biological process of fundamental importance in most eukaryotes. It plays a pivotal role in cell differentiation and gene regulation and has been associated with a number of different diseases. The widespread availability of RNA-Sequencing capacities allows an ever closer investigation of differentially expressed isoforms. However, most tools for differential alternative splicing (DAS) analysis do not take split reads, i.e. the most direct evidence for a splice event, into account. Here, we present DIEGO, a compositional data analysis method able to detect DAS between two sets of RNA-Seq samples based on split reads. The python tool DIEGO works without isoform annotations and is fast enough to analyze large experiments while being robust and accurate. We provide python and perl parsers for common formats. The software is available at: www.bioinf.uni-leipzig.de/Software/DIEGO. steve@bioinf.uni-leipzig.de. Supplementary data are available at Bioinformatics online.
Zhu, Yi; Zhang, Jing-Jing; Xie, Kun-Ling; Tang, Jie; Liang, Wen-Biao; Zhu, Rong; Zhu, Yan; Wang, Bin; Tao, Jin-Qiu; Zhi, Xiao-Fei; Li, Zheng; Gao, Wen-Tao; Jiang, Kui-Rong; Miao, Yi; Xu, Ze-Kuan
2014-11-04
MUC4 plays important roles in the malignant progression of human pancreatic cancer. But the huge length of MUC4 gene fragment restricts its functional and mechanism research. As one of its splice variants, MUC4/Y with coding sequence is most similar to that of the full-length MUC4 (FL-MUC4), together with alternative splicing of the MUC4 transcript has been observed in pancreatic carcinomas but not in normal pancreas. So we speculated that MUC4/Y might be involved in malignant progression similarly to FL-MUC4, and as a research model of MUC4 in pancreatic cancer. The conjecture was confirmed in the present study. MUC4/Y expression was detected by real-time quantitative reverse transcription polymerase chain reaction (qRT-PCR) using gene-specific probe in the clinic samples. The effects of MUC4/Y were observed by serial in vitro and in vivo experiments based on stable over-expressed cell model. The underlying mechanisms were investigated by sequence-based transcriptome analysis and verified by qRT-PCR, Western blot and enzyme-linked immunosorbent assays. The detection of clinical samples indicates that MUC4/Y is significantly positive-correlated with tumor invasion and distant metastases. Based on stable forced-expressed pancreatic cancer PANC-1 cell model, functional studies show that MUC4/Y enhances malignant activity in vitro and in vivo, including proliferation under low-nutritional-pressure, resistance to apoptosis, motility, invasiveness, angiogenesis, and distant metastasis. Mechanism studies indicate the novel finding that MUC4/Y triggers malignancy-related positive feedback loops for concomitantly up-regulating the expression of survival factors to resist adverse microenvironment and increasing the expression of an array of cytokines and adhesion molecules to affect the tumor milieu. In light of the enormity of the potential regulatory circuitry in cancer afforded by MUC4 and/or MUC4/Y, repressing MUC4 transcription, inhibiting post-transcriptional regulation, including alternative splicing, or blocking various pathways simultaneously may be helpful for controlling malignant progression. MUC4/Y- expression model is proven to a valuable tool for the further dissection of MUC4-mediated functions and mechanisms.
Targeting RNA structure in SMN2 reverses spinal muscular atrophy molecular phenotypes.
Garcia-Lopez, Amparo; Tessaro, Francesca; Jonker, Hendrik R A; Wacker, Anna; Richter, Christian; Comte, Arnaud; Berntenis, Nikolaos; Schmucki, Roland; Hatje, Klas; Petermann, Olivier; Chiriano, Gianpaolo; Perozzo, Remo; Sciarra, Daniel; Konieczny, Piotr; Faustino, Ignacio; Fournet, Guy; Orozco, Modesto; Artero, Ruben; Metzger, Friedrich; Ebeling, Martin; Goekjian, Peter; Joseph, Benoît; Schwalbe, Harald; Scapozza, Leonardo
2018-05-23
Modification of SMN2 exon 7 (E7) splicing is a validated therapeutic strategy against spinal muscular atrophy (SMA). However, a target-based approach to identify small-molecule E7 splicing modifiers has not been attempted, which could reveal novel therapies with improved mechanistic insight. Here, we chose as a target the stem-loop RNA structure TSL2, which overlaps with the 5' splicing site of E7. A small-molecule TSL2-binding compound, homocarbonyltopsentin (PK4C9), was identified that increases E7 splicing to therapeutic levels and rescues downstream molecular alterations in SMA cells. High-resolution NMR combined with molecular modelling revealed that PK4C9 binds to pentaloop conformations of TSL2 and promotes a shift to triloop conformations that display enhanced E7 splicing. Collectively, our study validates TSL2 as a target for small-molecule drug discovery in SMA, identifies a novel mechanism of action for an E7 splicing modifier, and sets a precedent for other splicing-mediated diseases where RNA structure could be similarly targeted.
Kandpal, Raj P; Rajasimha, Harsha K; Brooks, Matthew J; Nellissery, Jacob; Wan, Jun; Qian, Jiang; Kern, Timothy S; Swaroop, Anand
2012-01-01
To define gene expression changes associated with diabetic retinopathy in a mouse model using next generation sequencing, and to utilize transcriptome signatures to assess molecular pathways by which pharmacological agents inhibit diabetic retinopathy. We applied a high throughput RNA sequencing (RNA-seq) strategy using Illumina GAIIx to characterize the entire retinal transcriptome from nondiabetic and from streptozotocin-treated mice 32 weeks after induction of diabetes. Some of the diabetic mice were treated with inhibitors of receptor for advanced glycation endproducts (RAGE) and p38 mitogen activated protein (MAP) kinase, which have previously been shown to inhibit diabetic retinopathy in rodent models. The transcripts and alternatively spliced variants were determined in all experimental groups. Next generation sequencing-based RNA-seq profiles provided comprehensive signatures of transcripts that are altered in early stages of diabetic retinopathy. These transcripts encoded proteins involved in distinct yet physiologically relevant disease-associated pathways such as inflammation, microvasculature formation, apoptosis, glucose metabolism, Wnt signaling, xenobiotic metabolism, and photoreceptor biology. Significant upregulation of crystallin transcripts was observed in diabetic animals, and the diabetes-induced upregulation of these transcripts was inhibited in diabetic animals treated with inhibitors of either RAGE or p38 MAP kinase. These two therapies also showed dissimilar regulation of some subsets of transcripts that included alternatively spliced versions of arrestin, neutral sphingomyelinase activation associated factor (Nsmaf), SH3-domain GRB2-like interacting protein 1 (Sgip1), and axin. Diabetes alters many transcripts in the retina, and two therapies that inhibit the vascular pathology similarly inhibit a portion of these changes, pointing to possible molecular mechanisms for their beneficial effects. These therapies also changed the abundance of various alternatively spliced versions of signaling transcripts, suggesting a possible role of alternative splicing in disease etiology. Our studies clearly demonstrate RNA-seq as a comprehensive strategy for identifying disease-specific transcripts, and for determining comparative profiles of molecular changes mediated by candidate drugs.
Permanent Neonatal Diabetes Caused by Creation of an Ectopic Splice Site within the INS Gene
Gastaldo, Elena; Harries, Lorna W.; Rubio-Cabezas, Oscar; Castaño, Luis
2012-01-01
Background The aim of this study was to characterize the genetic etiology in a patient who presented with permanent neonatal diabetes at 2 months of age. Methodology/Principal Findings Regulatory elements and coding exons 2 and 3 of the INS gene were amplified and sequenced from genomic and complementary DNA samples. A novel heterozygous INS mutation within the terminal intron of the gene was identified in the proband and her affected father. This mutation introduces an ectopic splice site leading to the insertion of 29 nucleotides from the intronic sequence into the mature mRNA, which results in a longer and abnormal transcript. Conclusions/Significance This study highlights the importance of routinely sequencing the exon-intron boundaries and the need to carry out additional studies to confirm the pathogenicity of any identified intronic genetic variants. PMID:22235272
GETPrime: a gene- or transcript-specific primer database for quantitative real-time PCR.
Gubelmann, Carine; Gattiker, Alexandre; Massouras, Andreas; Hens, Korneel; David, Fabrice; Decouttere, Frederik; Rougemont, Jacques; Deplancke, Bart
2011-01-01
The vast majority of genes in humans and other organisms undergo alternative splicing, yet the biological function of splice variants is still very poorly understood in large part because of the lack of simple tools that can map the expression profiles and patterns of these variants with high sensitivity. High-throughput quantitative real-time polymerase chain reaction (qPCR) is an ideal technique to accurately quantify nucleic acid sequences including splice variants. However, currently available primer design programs do not distinguish between splice variants and also differ substantially in overall quality, functionality or throughput mode. Here, we present GETPrime, a primer database supported by a novel platform that uniquely combines and automates several features critical for optimal qPCR primer design. These include the consideration of all gene splice variants to enable either gene-specific (covering the majority of splice variants) or transcript-specific (covering one splice variant) expression profiling, primer specificity validation, automated best primer pair selection according to strict criteria and graphical visualization of the latter primer pairs within their genomic context. GETPrime primers have been extensively validated experimentally, demonstrating high transcript specificity in complex samples. Thus, the free-access, user-friendly GETPrime database allows fast primer retrieval and visualization for genes or groups of genes of most common model organisms, and is available at http://updepla1srv1.epfl.ch/getprime/. Database URL: http://deplanckelab.epfl.ch.
GETPrime: a gene- or transcript-specific primer database for quantitative real-time PCR
Gubelmann, Carine; Gattiker, Alexandre; Massouras, Andreas; Hens, Korneel; David, Fabrice; Decouttere, Frederik; Rougemont, Jacques; Deplancke, Bart
2011-01-01
The vast majority of genes in humans and other organisms undergo alternative splicing, yet the biological function of splice variants is still very poorly understood in large part because of the lack of simple tools that can map the expression profiles and patterns of these variants with high sensitivity. High-throughput quantitative real-time polymerase chain reaction (qPCR) is an ideal technique to accurately quantify nucleic acid sequences including splice variants. However, currently available primer design programs do not distinguish between splice variants and also differ substantially in overall quality, functionality or throughput mode. Here, we present GETPrime, a primer database supported by a novel platform that uniquely combines and automates several features critical for optimal qPCR primer design. These include the consideration of all gene splice variants to enable either gene-specific (covering the majority of splice variants) or transcript-specific (covering one splice variant) expression profiling, primer specificity validation, automated best primer pair selection according to strict criteria and graphical visualization of the latter primer pairs within their genomic context. GETPrime primers have been extensively validated experimentally, demonstrating high transcript specificity in complex samples. Thus, the free-access, user-friendly GETPrime database allows fast primer retrieval and visualization for genes or groups of genes of most common model organisms, and is available at http://updepla1srv1.epfl.ch/getprime/. Database URL: http://deplanckelab.epfl.ch. PMID:21917859
Preußer, Christian; Rossbach, Oliver; Hung, Lee-Hsueh; Li, Dan; Bindereif, Albrecht
2014-01-01
Trans-splicing in trypanosomes adds a 39-nucleotide mini-exon from the spliced leader (SL) RNA to the 5′ end of each protein-coding sequence. On the other hand, cis-splicing of the few intron-containing genes requires the U1 small nuclear ribonucleoprotein (snRNP) particle. To search for potential new functions of the U1 snRNP in Trypanosoma brucei, we applied genome-wide individual-nucleotide resolution crosslinking-immunoprecipitation (iCLIP), focusing on the U1 snRNP-specific proteins U1C and U1-70K. Surprisingly, U1C and U1-70K interact not only with the U1, but also with U6 and SL RNAs. In addition, mapping of crosslinks to the cis-spliced PAP [poly(A) polymerase] pre-mRNA indicate an active role of these proteins in 5′ splice site recognition. In sum, our results demonstrate that the iCLIP approach provides insight into stable and transient RNA–protein contacts within the spliceosomal network. We propose that the U1 snRNP may represent an evolutionary link between the cis- and trans-splicing machineries, playing a dual role in 5′ splice site recognition on the trans-spliceosomal SL RNP as well as on pre-mRNA cis-introns. PMID:24748659
Gaur, R K; Valcárcel, J; Green, M R
1995-01-01
Splicing of pre-mRNAs occurs via a lariat intermediate in which an intronic adenosine, embedded within a branch point sequence, forms a 2',5'-phosphodiester bond (RNA branch) with the 5' end of the intron. How the branch point is recognized and activated remains largely unknown. Using site-specific photochemical cross-linking, we have identified two proteins that specifically interact with the branch point during the splicing reaction. U2AF65, an essential splicing factor that binds to the adjacent polypyrimidine tract, crosslinks to the branch point at the earliest stage of spliceosome formation in an ATP-independent manner. A novel 28-kDa protein, which is a constituent of the mature spliceosome, contacts the branch point after the first catalytic step. Our results indicate that the branch point is sequentially recognized by distinct splicing factors in the course of the splicing reaction. Images FIGURE 1 FIGURE 2 FIGURE 3 FIGURE 4 FIGURE 5 FIGURE 6 FIGURE 7 FIGURE 8 FIGURE 9 PMID:7493318
Malouf, Gabriel G; Su, Xiaoping; Yao, Hui; Gao, Jianjun; Xiong, Liangwen; He, Qiuming; Compérat, Eva; Couturier, Jérôme; Molinié, Vincent; Escudier, Bernard; Camparo, Philippe; Doss, Denaha J; Thompson, Erika J; Khayat, David; Wood, Christopher G; Yu, Willie; Teh, Bin T; Weinstein, John; Tannir, Nizar M
2014-08-01
MITF/TFE translocation renal cell carcinoma (TRCC) is a rare subtype of kidney cancer. Its incidence and the genome-wide characterization of its genetic origin have not been fully elucidated. We performed RNA and exome sequencing on an exploratory set of TRCC (n = 7), and validated our findings using The Cancer Genome Atlas (TCGA) clear-cell RCC (ccRCC) dataset (n = 460). Using the TCGA dataset, we identified seven TRCC (1.5%) cases and determined their genomic profile. We discovered three novel partners of MITF/TFE (LUC7L3, KHSRP, and KHDRBS2) that are involved in RNA splicing. TRCC displayed a unique gene expression signature as compared with other RCC types, and showed activation of MITF, the transforming growth factor β1 and the PI3K complex targets. Genes differentially spliced between TRCC and other RCC types were enriched for MITF and ID2 targets. Exome sequencing of TRCC revealed a distinct mutational spectrum as compared with ccRCC, with frequent mutations in chromatin-remodeling genes (six of eight cases, three of which were from the TCGA). In two cases, we identified mutations in INO80D, an ATP-dependent chromatin-remodeling gene, previously shown to control the amplitude of the S phase. Knockdown of INO80D decreased cell proliferation in a novel cell line bearing LUC7L3-TFE3 translocation. This genome-wide study defines the incidence of TRCC within a ccRCC-directed project and expands the genomic spectrum of TRCC by identifying novel MITF/TFE partners involved in RNA splicing and frequent mutations in chromatin-remodeling genes. ©2014 American Association for Cancer Research.
Yu, T; Wang, X; Ding, Q; Fu, Q; Dai, J; Lu, Y; Xi, X; Wang, H
2009-11-01
Factor VII deficiency which transmitted as an autosomal recessive disorder is a rare haemorrhagic condition. The aim of this study was to identify the molecular genetic defect and determine its functional consequences in a Chinese pedigree with FVII deficiency. The proband was diagnosed as inherited coagulation FVII deficiency by reduced plasma levels of FVII activity (4.4%) and antigen (38.5%). All nine exons and their flanking sequence of F7 gene were amplified by polymerase chain reaction (PCR) for the proband and the PCR products were directly sequenced. The compound heterozygous mutations of F7 (NM_000131.3) c.572-1G>A and F7 (NM_000131.3) c.1165T>G; p.Cys389Gly were identified in the proband's F7 gene. To investigate the splicing patterns associated with F7 c.572-1G>A, ectopic transcripts in leucocytes of the proband were analyzed. F7 minigenes, spanning from intron 4 to intron 7 and carrying either an A or a G at position -1 of intron 5, were constructed and transiently transfected into human embryonic kidney (HEK) 293T cells, followed by RT-PCR analysis. The aberrant transcripts from the F7 c.572-1G>A mutant allele were not detected by ectopic transcription study. Sequencing of the RT-PCR products from the mutant transfectant demonstrated the production of an erroneously spliced mRNA with exon 6 skipping, whereas a normal splicing occurred in the wide type transfectant. The aberrant mRNA produced from the F7 c.572-1G>A mutant allele is responsible for the factor VII deficiency in this pedigree.
Guerriero, Gea; Spadiut, Oliver; Kerschbamer, Christine; Giorno, Filomena; Baric, Sanja; Ezcurra, Inés
2016-01-01
Cellulose synthase (CesA) genes constitute a complex multigene family with six major phylogenetic clades in angiosperms. The recently sequenced genome of domestic apple, Malus×domestica, was mined for CesA genes, by blasting full-length cellulose synthase protein (CESA) sequences annotated in the apple genome against protein databases from the plant models Arabidopsis thaliana and Populus trichocarpa. Thirteen genes belonging to the six angiosperm CesA clades and coding for proteins with conserved residues typical of processive glycosyltransferases from family 2 were detected. Based on their phylogenetic relationship to Arabidopsis CESAs, as well as expression patterns, a nomenclature is proposed to facilitate further studies. Examination of their genomic organization revealed that MdCesA8-A is closely linked and co-oriented with WDR53, a gene coding for a WD40 repeat protein. The WDR53 and CesA8 genes display conserved collinearity in dicots and are partially co-expressed in the apple xylem. Interestingly, the presence of a bicistronic WDR53–CesA8A transcript was detected in phytoplasma-infected phloem tissues of apple. The bicistronic transcript contains a spliced intergenic sequence that is predicted to fold into hairpin structures typical of internal ribosome entry sites, suggesting its potential cap-independent translation. Surprisingly, the CesA8A cistron is alternatively spliced and lacks the zinc-binding domain. The possible roles of WDR53 and the alternatively spliced CESA8 variant during cellulose biosynthesis in M.×domestica are discussed. PMID:23048131
Kilian, A; Bowtell, D D; Abud, H E; Hime, G R; Venter, D J; Keese, P K; Duncan, E L; Reddel, R R; Jefferson, R A
1997-11-01
Telomerase is a multicomponent reverse transcriptase enzyme that adds DNA repeats to the ends of chromosomes using its RNA component as a template for synthesis. Telomerase activity is detected in the germline as well as the majority of tumors and immortal cell lines, and at low levels in several types of normal cells. We have cloned a human gene homologous to a protein from Saccharomyces cerevisiae and Euplotes aediculatus that has reverse transcriptase motifs and is thought to be the catalytic subunit of telomerase in those species. This gene is present in the human genome as a single copy sequence with a dominant transcript of approximately 4 kb in a human colon cancer cell line, LIM1215. The cDNA sequence was determined using clones from a LIM1215 cDNA library and by RT-PCR, cRACE and 3'RACE on mRNA from the same source. We show that the gene is expressed in several normal tissues, telomerase-positive post-crisis (immortal) cell lines and various tumors but is not expressed in the majority of normal tissues analyzed, pre-crisis (non-immortal) cells and telomerase-negative immortal (ALT) cell lines. Multiple products were identified by RT-PCR using primers within the reverse transcriptase domain. Sequencing of these products suggests that they arise by alternative splicing. Strikingly, various tumors, cell lines and even normal tissues (colonic crypt and testis) showed considerable differences in the splicing patterns. Alternative splicing of the telomerase catalytic subunit transcript may be important for the regulation of telomerase activity and may give rise to proteins with different biochemical functions.
Floris, Matteo; Orsini, Massimiliano; Thanaraj, Thangavel Alphonse
2008-10-02
It is often the case that mammalian genes are alternatively spliced; the resulting alternate transcripts often encode protein isoforms that differ in amino acid sequences. Changes among the protein isoforms can alter the cellular properties of proteins. The effect can range from a subtle modulation to a complete loss of function. (i) We examined human splice-mediated protein isoforms (as extracted from a manually curated data set, and from a computationally predicted data set) for differences in the annotation for protein signatures (Pfam domains and PRINTS fingerprints) and we characterized the differences & their effects on protein functionalities. An important question addressed relates to the extent of protein isoforms that may lack any known function in the cell. (ii) We present a database that reports differences in protein signatures among human splice-mediated protein isoform sequences. (i) Characterization: The work points to distinct sets of alternatively spliced genes with varying degrees of annotation for the splice-mediated protein isoforms. Protein molecular functions seen to be often affected are those that relate to: binding, catalytic, transcription regulation, structural molecule, transporter, motor, and antioxidant; and the processes that are often affected are nucleic acid binding, signal transduction, and protein-protein interactions. Signatures are often included/excluded and truncated in length among protein isoforms; truncation is seen as the predominant type of change. Analysis points to the following novel aspects: (a) Analysis using data from the manually curated Vega indicates that one in 8.9 genes can lead to a protein isoform of no "known" function; and one in 18 expressed protein isoforms can be such an "orphan" isoform; the corresponding numbers as seen with computationally predicted ASD data set are: one in 4.9 genes and one in 9.8 isoforms. (b) When swapping of signatures occurs, it is often between those of same functional classifications. (c) Pfam domains can occur in varying lengths, and PRINTS fingerprints can occur with varying number of constituent motifs among isoforms - since such a variation is seen in large number of genes, it could be a general mechanism to modulate protein function. (ii) The reported resource (at http://www.bioinformatica.crs4.org/tools/dbs/splivap/) provides the community ability to access data on splice-mediated protein isoforms (with value-added annotation such as association with diseases) through changes in protein signatures.
Shen, Yingfang; Wu, Xiaopei; Liu, Demei; Song, Shengjing; Liu, Dengcai; Wang, Haiqing
2016-05-27
Histone methylation is an epigenetic modification mechanism that regulates gene expression in eukaryotic cells. Jumonji C domain-containing demethylases are involved in removal of methyl groups at lysine or arginine residues. The JmjC domain-only member, JMJ30/JMJD5 of Arabidopsis, is a component of the plant circadian clock. Although some plant circadian clock genes undergo alternative splicing in response to external cues, there is no evidence that JMJ30/JMJD5 is regulated by alternative splicing. In this study, the expression of an Arabidopsis JMJ30/JMJD5 ortholog in Medicago truncatula, MtJMJC5, in response to circadian clock and abiotic stresses were characterized. The results showed that MtJMJC5 oscillates with a circadian rhythm, and undergoes cold specifically induced alternative splicing. The cold-induced alternative splicing could be reversed after ambient temperature returning to the normal. Sequencing results revealed four alternative splicing RNA isoforms including a full-length authentic protein encoding variant, and three premature termination condon-containing variants due to alternative 3' splice sites at the first and second intron. Under cold treatment, the variants that share a common 3' alternative splicing site at the second intron were intensively up-regulated while the authentic protein encoding variant and the premature termination condon-containing variant only undergoing a 3' alternative splicing at the first intron were down regulated. Although all the premature termination condon-harboring alternative splicing variants were sensitive to nonsense-mediated decay, the premature termination codon-harboring alternative splicing variants sharing the 3' alternative splicing site at the second intron showed less sensitivity than the one only containing the 3' alternative slicing site at the first intron under cold treatment. These results suggest that the cold-dependent alternative splicing of MtJMJC5 is likely a species or genus-specific mechanism of gene expression regulation on RNA levels, and might play a role in epigenetic regulation of the link between the circadian clock and ambient temperature fluctuation in Medicago. Copyright © 2016 Elsevier Inc. All rights reserved.
van der Woerd, Wendy L; Mulder, Johanna; Pagani, Franco; Beuers, Ulrich; Houwen, Roderick H J; van de Graaf, Stan F J
2015-04-01
ATP8B1 deficiency is a severe autosomal recessive liver disease resulting from mutations in the ATP8B1 gene characterized by a continuous phenotypical spectrum from intermittent (benign recurrent intrahepatic cholestasis; BRIC) to progressive familial intrahepatic cholestasis (PFIC). Current therapeutic options are insufficient, and elucidating the molecular consequences of mutations could lead to personalized mutation-specific therapies. We investigated the effect on pre-messenger RNA splicing of 14 ATP8B1 mutations at exon-intron boundaries using an in vitro minigene system. Eleven mutations, mostly associated with a PFIC phenotype, resulted in aberrant splicing and a complete absence of correctly spliced product. In contrast, three mutations led to partially correct splicing and were associated with a BRIC phenotype. These findings indicate an inverse correlation between the level of correctly spliced product and disease severity. Expression of modified U1 small nuclear RNAs (snRNA) complementary to the splice donor sites strongly improved or completely rescued splicing for several ATP8B1 mutations located at donor, as well as acceptor, splice sites. In one case, we also evaluated exon-specific U1 snRNAs that, by targeting nonconserved intronic sequences, might reduce possible off-target events. Although very effective in correcting exon skipping, they also induced retention of the short downstream intron. We systematically characterized the molecular consequences of 14 ATP8B1 mutations at exon-intron boundaries associated with ATP8B1 deficiency and found that the majority resulted in total exon skipping. The amount of correctly spliced product inversely correlated with disease severity. Compensatory modified U1 snRNAs, complementary to mutated donor splice sites, were able to improve exon definition very efficiently and could be a novel therapeutic strategy in ATP8B1 deficiency as well as other genetic diseases. © 2014 by the American Association for the Study of Liver Diseases.
NASA Astrophysics Data System (ADS)
Zhao, Xiaoqing; Li, Hong; Bao, Tonglaga; Ying, Zhiqiang
2012-09-01
Many experiment evidences showed that sequence structures of introns and intron loss/gain can influence gene expression, but current mechanisms did not refer to the functions of post-spliced introns directly. We propose that postspliced introns play their functions in gene expression by interacting with their mRNA sequences and the interaction is characterized by the matched segments between introns and their CDS. In this study, we investigated the interaction characters with length series by improved Smith-Waterman local alignment software for the ribosomal protein genes in C. elegans and D. melanogaster. Our results showed that RF values of five intron groups are significantly high in the central non-conserved region and very low in 5'-end and 3'-end splicing region. It is interesting that the number of the optimal matched regions gradually increases with intron length. Distributions of the optimal matched regions are different for five intron groups. Our study revealed that there are more interaction regions between longer introns and their CDS than shorter, and it provides a positive pattern for regulating the gene expression.
Selfish DNA: homing endonucleases find a home.
Edgell, David R
2009-02-10
Self-splicing group I introns come in two flavours - those with a homing endonuclease to promote mobility of the intron, and those without an endonuclease. How homing endonucleases and self-splicing introns associate to form a composite selfish genetic element is a question of long-standing interest. Recent work has revealed that a shared characteristic of both introns and endonucleases, the targeting of conserved sequences, may provide the impetus for the evolution of composite mobile genetic elements.
Barik, Sailen
2008-01-01
The significance of the intron-exon structure of genes is a mystery. As eukaryotic proteins are made up of modular functional domains, each exon was suspected to encode some form of module; however, the definition of a module remained vague. Comparison of pre-mRNA splice junctions with the three-dimensional architecture of its protein product from different eukaryotes revealed that the junctions were far less likely to occur inside the α-helices and β-strands of proteins than within the more flexible linker regions (‘turns’ and ‘loops’) connecting them. The splice junctions were equally distributed in the different types of linkers and throughout the linker sequence, although a slight preference for the central region of the linker was observed. The avoidance of the α-helix and the β-strand by splice junctions suggests the existence of a selection pressure against their disruption, perhaps underscoring the investment made by nature in building these intricate secondary structures. A corollary is that the helix and the strand are the smallest integral architectural units of a protein and represent the minimal modules in the evolution of protein structure. These results should find use in comparative genomics, designing of cloning strategies, and in the mutual verification of genome sequences with protein structures. PMID:15381847
Barik, Sailen
2004-09-01
The significance of the intron-exon structure of genes is a mystery. As eukaryotic proteins are made up of modular functional domains, each exon was suspected to encode some form of module; however, the definition of a module remained vague. Comparison of pre-mRNA splice junctions with the three-dimensional architecture of its protein product from different eukaryotes revealed that the junctions were far less likely to occur inside the alpha-helices and beta-strands of proteins than within the more flexible linker regions ('turns' and 'loops') connecting them. The splice junctions were equally distributed in the different types of linkers and throughout the linker sequence, although a slight preference for the central region of the linker was observed. The avoidance of the alpha-helix and the beta-strand by splice junctions suggests the existence of a selection pressure against their disruption, perhaps underscoring the investment made by nature in building these intricate secondary structures. A corollary is that the helix and the strand are the smallest integral architectural units of a protein and represent the minimal modules in the evolution of protein structure. These results should find use in comparative genomics, designing of cloning strategies, and in the mutual verification of genome sequences with protein structures.
Seth, Puneet; Yeowell, Heather N
2010-04-01
Scleroderma (systemic sclerosis [SSc]) is a complex connective tissue disorder characterized by hardening and thickening of the skin. One hallmark of scleroderma is excessive accumulation of collagen accompanied by increased levels of pyridinoline collagen crosslinks derived from hydroxylysine residues in the collagen telopeptide domains. Lysyl hydroxylase 2 (LH2), an important alternatively spliced enzyme in collagen biosynthesis, acts as a collagen telopeptide hydroxylase. Changes in the pattern of LH2 alternative splicing, favoring increased inclusion of the alternatively spliced LH2 exon 13A, thereby increasing the levels of the long transcript of LH2 (LH2[long]), are linked to scleroderma disease. This study was undertaken to examine the role played by RNA binding protein Fox-2 in regulating exon 13A inclusion, which leads to the generation of scleroderma-associated LH2(long) messenger RNA (mRNA). Phylogenetic sequence analysis of introns flanking exon 13A was performed. A tetracycline-inducible system in T-Rex 293 cells was used to induce Fox-2 protein, and endogenous LH2(long) mRNA was determined by reverse transcriptase-polymerase chain reaction. An LH2 minigene was designed, validated, and used in Fox-2 overexpression and mutagenesis experiments. Knockdown of Fox-2 was performed in mouse embryonic fibroblasts and in fibroblasts from SSc patients. Overexpression of Fox-2 enhanced the inclusion of exon 13A and increased the generation of LH2(long) mRNA, whereas knockdown of Fox-2 decreased LH2(long) transcripts. Mutational analysis of an LH2 minigene demonstrated that 2 of the 4 Fox binding motifs flanking LH2 exon 13A are required for inclusion of exon 13A. In early passage fibroblasts derived from patients with scleroderma, the knockdown of Fox-2 protein significantly decreased the endogenous levels of LH2(long) mRNA. Our findings indicate that Fox-2 plays an integral role in the regulation of LH2 splicing. Knockdown of Fox-2 and other methods to decrease the levels of fibrosis-associated LH2(long) mRNA in primary scleroderma cells may suggest a novel approach to strategies directed against scleroderma.
Soukarieh, Omar; Gaildrat, Pascaline; Hamieh, Mohamad; Drouet, Aurélie; Baert-Desurmont, Stéphanie; Frébourg, Thierry; Tosi, Mario; Martins, Alexandra
2016-01-01
The identification of a causal mutation is essential for molecular diagnosis and clinical management of many genetic disorders. However, even if next-generation exome sequencing has greatly improved the detection of nucleotide changes, the biological interpretation of most exonic variants remains challenging. Moreover, particular attention is typically given to protein-coding changes often neglecting the potential impact of exonic variants on RNA splicing. Here, we used the exon 10 of MLH1, a gene implicated in hereditary cancer, as a model system to assess the prevalence of RNA splicing mutations among all single-nucleotide variants identified in a given exon. We performed comprehensive minigene assays and analyzed patient’s RNA when available. Our study revealed a staggering number of splicing mutations in MLH1 exon 10 (77% of the 22 analyzed variants), including mutations directly affecting splice sites and, particularly, mutations altering potential splicing regulatory elements (ESRs). We then used this thoroughly characterized dataset, together with experimental data derived from previous studies on BRCA1, BRCA2, CFTR and NF1, to evaluate the predictive power of 3 in silico approaches recently described as promising tools for pinpointing ESR-mutations. Our results indicate that ΔtESRseq and ΔHZEI-based approaches not only discriminate which variants affect splicing, but also predict the direction and severity of the induced splicing defects. In contrast, the ΔΨ-based approach did not show a compelling predictive power. Our data indicates that exonic splicing mutations are more prevalent than currently appreciated and that they can now be predicted by using bioinformatics methods. These findings have implications for all genetically-caused diseases. PMID:26761715
USDA-ARS?s Scientific Manuscript database
Alternative splicing is a well-known phenomenon that dramatically increases eukaryotic transcriptome diversity. The extent of mRNA isoform diversity among porcine tissues was assessed using Pacific Biosciences single-molecule long-read isoform sequencing (Iso-Seq) and Illumina short read sequencing ...
Diamant, Gil; Eisenbaum, Tal; Leshkowitz, Dena; Dikstein, Rivka
2016-05-01
The proinflammatory cytokine tumor necrosis factor alpha (TNF-α) modulates the expression of many genes, primarily through activation of NF-κB. Here, we examined the global effects of the elongation factor Spt5 on nascent and mature mRNAs of TNF-α-induced cells using chromatin and cytosolic subcellular fractions. We identified several classes of TNF-α-induced genes controlled at the level of transcription, splicing, and chromatin retention. Spt5 was found to facilitate splicing and chromatin release in genes displaying high induction rates. Further analysis revealed striking effects of TNF-α on the splicing of 25% of expressed genes; the vast majority were not transcriptionally induced. Splicing enhancement of noninduced genes by TNF-α was transient and independent of NF-κB. Investigating the underlying basis, we found that Spt5 is required for the splicing facilitation of the noninduced genes. In line with this, Spt5 interacts with Sm core protein splicing factors. Furthermore, following TNF-α treatment, levels of RNA polymerase II (Pol II) but not Spt5 are reduced from the splicing-induced genes, suggesting that these genes become enriched with a Pol II-Spt5 form. Our findings revealed the Pol II-Spt5 complex as a highly competent coordinator of cotranscriptional splicing. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Rebhandl, Stefan; Huemer, Michael; Zaborsky, Nadja; Gassner, Franz Josef; Catakovic, Kemal; Felder, Thomas Klaus; Greil, Richard; Geisberger, Roland
2014-01-01
Activation-induced deaminase (AID) is a DNA-mutating enzyme that mediates class-switch recombination as well as somatic hypermutation of antibody genes in B cells. Due to off-target activity, AID is implicated in lymphoma development by introducing genome-wide DNA damage and initiating chromosomal translocations such as c-myc/IgH. Several alternative splice transcripts of AID have been reported in activated B cells as well as malignant B cells such as chronic lymphocytic leukemia (CLL). As most commercially available antibodies fail to recognize alternative splice variants, their abundance in vivo, and hence their biological significance, has not been determined. In this study, we assessed the protein levels of AID splice isoforms by introducing an AID splice reporter construct into cell lines and primary CLL cells from patients as well as from WT and TCL1tg C57BL/6 mice (where TCL1 is T-cell leukemia/lymphoma 1). The splice construct is 5′-fused to a GFP-tag, which is preserved in all splice isoforms and allows detection of translated protein. Summarizing, we show a thorough quantification of alternatively spliced AID transcripts and demonstrate that the corresponding protein abundances, especially those of splice variants AID-ivs3 and AID-ΔE4, are not stoichiometrically equivalent. Our data suggest that enhanced proteasomal degradation of low-abundance proteins might be causative for this discrepancy. PMID:24668151
Kim, Dong Seon; Hahn, Yoonsoo
2012-11-13
Evolution of splice sites is a well-known phenomenon that results in transcript diversity during human evolution. Many novel splice sites are derived from repetitive elements and may not contribute to protein products. Here, we analyzed annotated human protein-coding exons and identified human-specific splice sites that arose after the human-chimpanzee divergence. We analyzed multiple alignments of the annotated human protein-coding exons and their respective orthologous mammalian genome sequences to identify 85 novel splice sites (50 splice acceptors and 35 donors) in the human genome. The novel protein-coding exons, which are expressed either constitutively or alternatively, produce novel protein isoforms by insertion, deletion, or frameshift. We found three cases in which the human-specific isoform conferred novel molecular function in the human cells: the human-specific IMUP protein isoform induces apoptosis of the trophoblast and is implicated in pre-eclampsia; the intronization of a part of SMOX gene exon produces inactive spermine oxidase; the human-specific NUB1 isoform shows reduced interaction with ubiquitin-like proteins, possibly affecting ubiquitin pathways. Although the generation of novel protein isoforms does not equate to adaptive evolution, we propose that these cases are useful candidates for a molecular functional study to identify proteomic changes that might bring about novel phenotypes during human evolution.
Lee, Joseph C; Stiles, David; Lu, Jun; Cam, Margaret C
2007-01-01
Background Microarrays are a popular tool used in experiments to measure gene expression levels. Improving the reproducibility of microarray results produced by different chips from various manufacturers is important to create comparable and combinable experimental results. Alternative splicing has been cited as a possible cause of differences in expression measurements across platforms, though no study to this point has been conducted to show its influence in cross-platform differences. Results Using probe sequence data, a new microarray probe/transcript annotation was created based on the AceView Aug05 release that allowed for the categorization of genes based on their expression measurements' susceptibility to alternative splicing differences across microarray platforms. Examining gene expression data from multiple platforms in light of the new categorization, genes unsusceptible to alternative splicing differences showed higher signal agreement than those genes most susceptible to alternative splicing differences. The analysis gave rise to a different probe-level visualization method that can highlight probe differences according to transcript specificity. Conclusion The results highlight the need for detailed probe annotation at the transcriptome level. The presence of alternative splicing within a given sample can affect gene expression measurements and is a contributing factor to overall technical differences across platforms. PMID:17708771
Alternative splicing at the intersection of biological timing, development, and stress responses.
Staiger, Dorothee; Brown, John W S
2013-10-01
High-throughput sequencing for transcript profiling in plants has revealed that alternative splicing (AS) affects a much higher proportion of the transcriptome than was previously assumed. AS is involved in most plant processes and is particularly prevalent in plants exposed to environmental stress. The identification of mutations in predicted splicing factors and spliceosomal proteins that affect cell fate, the circadian clock, plant defense, and tolerance/sensitivity to abiotic stress all point to a fundamental role of splicing/AS in plant growth, development, and responses to external cues. Splicing factors affect the AS of multiple downstream target genes, thereby transferring signals to alter gene expression via splicing factor/AS networks. The last two to three years have seen an ever-increasing number of examples of functional AS. At a time when the identification of AS in individual genes and at a global level is exploding, this review aims to bring together such examples to illustrate the extent and importance of AS, which are not always obvious from individual publications. It also aims to ensure that plant scientists are aware that AS is likely to occur in the genes that they study and that dynamic changes in AS and its consequences need to be considered routinely.
Staiger, Dorothee; Brown, John W.S.
2013-01-01
High-throughput sequencing for transcript profiling in plants has revealed that alternative splicing (AS) affects a much higher proportion of the transcriptome than was previously assumed. AS is involved in most plant processes and is particularly prevalent in plants exposed to environmental stress. The identification of mutations in predicted splicing factors and spliceosomal proteins that affect cell fate, the circadian clock, plant defense, and tolerance/sensitivity to abiotic stress all point to a fundamental role of splicing/AS in plant growth, development, and responses to external cues. Splicing factors affect the AS of multiple downstream target genes, thereby transferring signals to alter gene expression via splicing factor/AS networks. The last two to three years have seen an ever-increasing number of examples of functional AS. At a time when the identification of AS in individual genes and at a global level is exploding, this review aims to bring together such examples to illustrate the extent and importance of AS, which are not always obvious from individual publications. It also aims to ensure that plant scientists are aware that AS is likely to occur in the genes that they study and that dynamic changes in AS and its consequences need to be considered routinely. PMID:24179132
The bromodomain protein BRD4 regulates splicing during heat shock
Hussong, Michelle; Kaehler, Christian; Kerick, Martin; Grimm, Christina; Franz, Alexandra; Timmermann, Bernd; Welzel, Franziska; Isensee, Jörg; Hucho, Tim; Krobitsch, Sylvia; Schweiger, Michal R.
2017-01-01
The cellular response to heat stress is an ancient and evolutionarily highly conserved defence mechanism characterised by the transcriptional up-regulation of cyto-protective genes and a partial inhibition of splicing. These features closely resemble the proteotoxic stress response during tumor development. The bromodomain protein BRD4 has been identified as an integral member of the oxidative stress as well as of the inflammatory response, mainly due to its role in the transcriptional regulation process. In addition, there are also several lines of evidence implicating BRD4 in the splicing process. Using RNA-sequencing we found a significant increase in splicing inhibition, in particular intron retentions (IR), following heat treatment in BRD4-depleted cells. This leads to a decrease of mRNA abundancy of the affected transcripts, most likely due to premature termination codons. Subsequent experiments revealed that BRD4 interacts with the heat shock factor 1 (HSF1) such that under heat stress BRD4 is recruited to nuclear stress bodies and non-coding SatIII RNA transcripts are up-regulated. These findings implicate BRD4 as an important regulator of splicing during heat stress. Our data which links BRD4 to the stress induced splicing process may provide novel mechanisms of BRD4 inhibitors in regard to anti-cancer therapies. PMID:27536004
SF3B1 mutations constitute a novel therapeutic target in breast cancer
Maguire, Sarah L; Leonidou, Andri; Wai, Patty; Marchiò, Caterina; Ng, Charlotte KY; Sapino, Anna; Salomon, Anne-Vincent; Reis-Filho, Jorge S; Weigelt, Britta; Natrajan, Rachael C
2015-01-01
Mutations in genes encoding proteins involved in RNA splicing have been found to occur at relatively high frequencies in several tumour types including myelodysplastic syndromes, chronic lymphocytic leukaemia, uveal melanoma, and pancreatic cancer, and at lower frequencies in breast cancer. To investigate whether dysfunction in RNA splicing is implicated in the pathogenesis of breast cancer, we performed a re-analysis of published exome and whole genome sequencing data. This analysis revealed that mutations in spliceosomal component genes occurred in 5.6% of unselected breast cancers, including hotspot mutations in the SF3B1 gene, which were found in 1.8% of unselected breast cancers. SF3B1 mutations were significantly associated with ER-positive disease, AKT1 mutations, and distinct copy number alterations. Additional profiling of hotspot mutations in a panel of special histological subtypes of breast cancer showed that 16% and 6% of papillary and mucinous carcinomas of the breast harboured the SF3B1 K700E mutation. RNA sequencing identified differentially spliced events expressed in tumours with SF3B1 mutations including the protein coding genes TMEM14C, RPL31, DYNL11, UQCC, and ABCC5, and the long non-coding RNA CRNDE. Moreover, SF3B1 mutant cell lines were found to be sensitive to the SF3b complex inhibitor spliceostatin A and treatment resulted in perturbation of the splicing signature. Albeit rare, SF3B1 mutations result in alternative splicing events, and may constitute drivers and a novel therapeutic target in a subset of breast cancers. © 2014 The Authors. The Journal of Pathology published by John Wiley & Sons Ltd on behalf of Pathological Society of Great Britain and Ireland. PMID:25424858
Piekielko-Witkowska, Agnieszka; Kedzierska, Hanna; Poplawski, Piotr; Wojcicka, Anna; Rybicka, Beata; Maksymowicz, Maria; Grajkowska, Wieslawa; Matyja, Ewa; Mandat, Tomasz; Bonicki, Wieslaw; Nauman, Pawel
2013-06-01
Pituitary tumors belong to the group of most common neoplasms of the sellar region. Iodothyronine deiodinase types 1 (DIO1) and 2 (DIO2) are enzymes contributing to the levels of locally synthesized T3, a hormone regulating key physiological processes in the pituitary, including its development, cellular proliferation, and hormone secretion. Previous studies revealed that the expression of deiodinases in pituitary tumors is variable and, moreover, there is no correlation between mRNA and protein products of the particular gene, suggesting the potential role of posttranscriptional regulatory mechanisms. In this work we hypothesized that one of such mechanisms could be the alternative splicing. Therefore, we analyzed expression and sequences of DIO1 and DIO2 splicing variants in 30 pituitary adenomas and 9 non-tumorous pituitary samples. DIO2 mRNA was expressed as only two mRNA isoforms. In contrast, nine splice variants of DIO1 were identified. Among them, five were devoid of exon 3. In silico sequence analysis of DIO1 revealed multiple putative binding sites for splicing factor SF2/ASF, of which the top-ranked sites were located in exon 3. Silencing of SF2/ASF in pituitary tumor GH3 cells resulted in change of ratio between DIO1 isoforms with or without exon 3, favoring the expression of variants without exon 3. The expression of SF2/ASF mRNA in pituitary tumors was increased when compared with non-neoplastic control samples. In conclusion, we provide a new mechanism of posttranscriptional regulation of DIO1 and show deregulation of DIO1 expression in pituitary adenoma, possibly resulting from disturbed expression of SF2/ASF. Copyright © 2013 Elsevier B.V. All rights reserved.
Genome-based identification of spliceosomal proteins in the silk moth Bombyx mori.
Somarelli, Jason A; Mesa, Annia; Fuller, Myron E; Torres, Jacqueline O; Rodriguez, Carol E; Ferrer, Christina M; Herrera, Rene J
2010-12-01
Pre-messenger RNA splicing is a highly conserved eukaryotic cellular function that takes place by way of a large, RNA-protein assembly known as the spliceosome. In the mammalian system, nearly 300 proteins associate with uridine-rich small nuclear (sn)RNAs to form this complex. Some of these splicing factors are ubiquitously present in the spliceosome, whereas others are involved only in the processing of specific transcripts. Several proteomics analyses have delineated the proteins of the spliceosome in several species. In this study, we mine multiple sequence data sets of the silk moth Bombyx mori in an attempt to identify the entire set of known spliceosomal proteins. Five data sets were utilized, including the 3X, 6X, and Build 2.0 genomic contigs as well as the expressed sequence tag and protein libraries. While homologs for 88% of vertebrate splicing factors were delineated in the Bombyx mori genome, there appear to be several spliceosomal polypeptides absent in Bombyx mori and seven additional insect species. This apparent increase in spliceosomal complexity in vertebrates may reflect the tissue-specific and developmental stage-specific alternative pre-mRNA splicing requirements in vertebrates. Phylogenetic analyses of 15 eukaryotic taxa using the core splicing factors suggest that the essential functional units of the pre-mRNA processing machinery have remained highly conserved from yeast to humans. The Sm and LSm proteins are the most conserved, whereas proteins of the U1 small nuclear ribonucleoprotein particle are the most divergent. These data highlight both the differential conservation and relative phylogenetic signals of the essential spliceosomal components throughout evolution. © 2010 Wiley Periodicals, Inc.
The Carnegie Protein Trap Library: A Versatile Tool for Drosophila Developmental Studies
Buszczak, Michael; Paterno, Shelley; Lighthouse, Daniel; Bachman, Julia; Planck, Jamie; Owen, Stephenie; Skora, Andrew D.; Nystul, Todd G.; Ohlstein, Benjamin; Allen, Anna; Wilhelm, James E.; Murphy, Terence D.; Levis, Robert W.; Matunis, Erika; Srivali, Nahathai; Hoskins, Roger A.; Spradling, Allan C.
2007-01-01
Metazoan physiology depends on intricate patterns of gene expression that remain poorly known. Using transposon mutagenesis in Drosophila, we constructed a library of 7404 protein trap and enhancer trap lines, the Carnegie collection, to facilitate gene expression mapping at single-cell resolution. By sequencing the genomic insertion sites, determining splicing patterns downstream of the enhanced green fluorescent protein (EGFP) exon, and analyzing expression patterns in the ovary and salivary gland, we found that 600–900 different genes are trapped in our collection. A core set of 244 lines trapped different identifiable protein isoforms, while insertions likely to act as GFP-enhancer traps were found in 256 additional genes. At least 8 novel genes were also identified. Our results demonstrate that the Carnegie collection will be useful as a discovery tool in diverse areas of cell and developmental biology and suggest new strategies for greatly increasing the coverage of the Drosophila proteome with protein trap insertions. PMID:17194782
Alternative RNA splicing of leucocyte tissue transglutaminase in coeliac disease.
Arbildi, P; Sóñora, C; Del Río, N; Marqués, J M; Hernández, A
2018-05-01
Tissue transglutaminase is a ubiquitous and multifunctional protein that contributes to several processes such as apoptosis/survival, efferocytosis, inflammation and tissue repairing under physiological and pathological conditions. Several activities can be associated with well-established functional domains; in addition, four RNA alternative splice variants have been described, characterized by sequence divergences and residues deletion at the C-terminal domains. Tissue transglutaminase is recognized as the central player in the physiopathology of coeliac disease (CD) mainly through calcium-dependent enzymatic activities. It can be hypothesized that differential regulation of tissue transglutaminase splice variants expression in persons with CD contributes to pathology by altering the protein functionality. We characterized the expression pattern of RNA alternative splice variants by RT-PCR in peripheral cells from patients with CD under free gluten diet adhesion; we considered inflammatory parameters and specific antibodies as markers of the stage of disease. We found significant higher expression of both the full length and the shortest C-truncated splice variants in leucocytes from patients with CD in comparison with healthy individuals. As tissue transglutaminase expression and canonical enzymatic activity are linked to inflammation, we studied the RNA expression of inflammatory cytokines in peripheral leucocytes of persons with CD in relation with splice variants expression; interestingly, we found that recently diagnosed patients showed significant correlation between both the full length and the shortest alternative spliced variants with IL-1 expression. Our results points that regulation of alternative splicing of tissue transglutaminase could account for the complex physiopathology of CD. © 2018 The Foundation for the Scandinavian Journal of Immunology.
Härter, Bettina; Fuchs, Irene; Müller, Thomas; Akbulut, Ulas Emre; Cakir, Murat; Janecke, Andreas R
2016-04-01
Autosomal recessive proprotein convertase 1/3 (PC1/3) deficiency, caused by mutations in the PCSK1 gene, is characterized by severe congenital malabsorptive diarrhea, early-onset obesity, and certain endocrine abnormalities. We suspected PC1/3 deficiency in a 4-month-old girl based on the presence of congenital diarrhea and polyuria. Sequencing the whole coding region and splice sites detected a novel homozygous PCSK1 splice-site mutation, c.544-2A>G, in the patient. The mutation resulted in the skipping of exon 5, the generation of a premature termination codon, and nonsense-mediated PCSK1 messenger ribonucleic acid decay, which was demonstrated in complementary DNA derived from fibroblasts.
cDNA sequences and organization of IgM heavy chain genes in two holostean fish.
Wilson, M R; van Ravenstein, E; Miller, N W; Clem, L W; Middleton, D L; Warr, G W
1995-01-01
Immunoglobulin M heavy chain (mu) sequences of two holostean fish, the bowfin, Amia calva, and the longnose gar, Lepisosteus osseus, were amplified from spleen mRNA by RACE-PCR, cloned, and sequenced. Each mu chain showed the conserved four constant domain structure typical of a secreted mu chain. Southern blot analyses with specific heavy chain variable (VH) and constant (CH) region probes suggest that both fish possess an IgH locus that resembles that of the teleosts, amphibians, and mammals in its organization. The overall sequence similarity of gar and bowfin mu chains was 60% and 48% at the nucleotide and amino acid levels, respectively, while similarity to the mu chains of teleosts and elasmobranchs was lower. The bowfin mu chain possesses a distinctive proline-rich sequence at the C mu 1/C mu 2 boundary; a shorter proline-rich sequence is present at this position in the gar mu chain. Both gar and bowfin show, in their C mu 4 sequences, motifs that could serve as cryptic splice donor sites for the production of mRNA encoding the membrane-bound form of the mu chains, and the bowfin also shows a potential cryptic splice donor site in the C mu 3 exon.
Two- and three-input TALE-based AND logic computation in embryonic stem cells.
Lienert, Florian; Torella, Joseph P; Chen, Jan-Hung; Norsworthy, Michael; Richardson, Ryan R; Silver, Pamela A
2013-11-01
Biological computing circuits can enhance our ability to control cellular functions and have potential applications in tissue engineering and medical treatments. Transcriptional activator-like effectors (TALEs) represent attractive components of synthetic gene regulatory circuits, as they can be designed de novo to target a given DNA sequence. We here demonstrate that TALEs can perform Boolean logic computation in mammalian cells. Using a split-intein protein-splicing strategy, we show that a functional TALE can be reconstituted from two inactive parts, thus generating two-input AND logic computation. We further demonstrate three-piece intein splicing in mammalian cells and use it to perform three-input AND computation. Using methods for random as well as targeted insertion of these relatively large genetic circuits, we show that TALE-based logic circuits are functional when integrated into the genome of mouse embryonic stem cells. Comparing construct variants in the same genomic context, we modulated the strength of the TALE-responsive promoter to improve the output of these circuits. Our work establishes split TALEs as a tool for building logic computation with the potential of controlling expression of endogenous genes or transgenes in response to a combination of cellular signals.
Arman, Ahmet; Ozon, Alev; Isguven, Pinar S; Coker, Ajda; Peker, Ismail; Yordam, Nursen
2008-01-01
Growth hormone (GH) is involved in growth, and fat and carbohydrate metabolism. Interaction of GH with the GH receptor (GHR) is necessary for systemic and local production of insulin-like growth factor-I (IGF-I) which mediates GH actions. Mutations in the GHR cause severe postnatal growth failure; the disorder is an autosomal recessive genetic disease resulting in GH insensitivity, called Laron syndrome. It is characterized by dwarfism with elevated serum GH and low levels of IGF-I. We analyzed the GHR gene for mutations and polymorphisms in eight patients with Laron-type dwarfism from six families. We found three missense mutations (S40L, V125A, I526L), one nonsense mutation (W157X), and one splice site mutation in the extracellular domain of GHR. Furthermore, G168G and exon 3 deletion polymorphisms were detected in patients with Laron syndrome. The splice site mutation, which is a novel mutation, was located at the donor splice site of exon 2/ intron 2 within GHR. Although this mutation changed the highly conserved donor splice site consensus sequence GT to GGT by insertion of a G residue, the intron splicing between exon 2 and exon 3 was detected in the patient. These results imply that the splicing occurs arthe GT site in intron 2, leaving the extra inserted G residue at the end of exon 2, thus changing the open reading frame of GHR resulting in a premature termination codon in exon 3.
CEP78 is mutated in a distinct type of Usher syndrome.
Fu, Qing; Xu, Mingchu; Chen, Xue; Sheng, Xunlun; Yuan, Zhisheng; Liu, Yani; Li, Huajin; Sun, Zixi; Li, Huiping; Yang, Lizhu; Wang, Keqing; Zhang, Fangxia; Li, Yumei; Zhao, Chen; Sui, Ruifang; Chen, Rui
2017-03-01
Usher syndrome is a genetically heterogeneous disorder featured by combined visual impairment and hearing loss. Despite a dozen of genes involved in Usher syndrome having been identified, the genetic basis remains unknown in 20-30% of patients. In this study, we aimed to identify the novel disease-causing gene of a distinct subtype of Usher syndrome. Ophthalmic examinations and hearing tests were performed on patients with Usher syndrome in two consanguineous families. Target capture sequencing was initially performed to screen causative mutations in known retinal disease-causing loci. Whole exome sequencing (WES) and whole genome sequencing (WGS) were applied for identifying novel disease-causing genes. RT-PCR and Sanger sequencing were performed to evaluate the splicing-altering effect of identified CEP78 variants. Patients from the two independent families show a mild Usher syndrome phenotype featured by juvenile or adult-onset cone-rod dystrophy and sensorineural hearing loss. WES and WGS identified two homozygous rare variants that affect mRNA splicing of a ciliary gene CEP78 . RT-PCR confirmed that the two variants indeed lead to abnormal splicing, resulting in premature stop of protein translation due to frameshift. Our results provide evidence that CEP78 is a novel disease-causing gene for Usher syndrome, demonstrating an additional link between ciliopathy and Usher protein network in photoreceptor cells and inner ear hair cells. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/.
Rath, Matthias; Jenssen, Sönke E; Schwefel, Konrad; Spiegler, Stefanie; Kleimeier, Dana; Sperling, Christian; Kaderali, Lars; Felbor, Ute
2017-09-01
Cerebral cavernous malformations (CCM) are vascular lesions of the central nervous system that can cause headaches, seizures and hemorrhagic stroke. Disease-associated mutations have been identified in three genes: CCM1/KRIT1, CCM2 and CCM3/PDCD10. The precise proportion of deep-intronic variants in these genes and their clinical relevance is yet unknown. Here, a long-range PCR (LR-PCR) approach for target enrichment of the entire genomic regions of the three genes was combined with next generation sequencing (NGS) to screen for coding and non-coding variants. NGS detected all six CCM1/KRIT1, two CCM2 and four CCM3/PDCD10 mutations that had previously been identified by Sanger sequencing. Two of the pathogenic variants presented here are novel. Additionally, 20 stringently selected CCM index cases that had remained mutation-negative after conventional sequencing and exclusion of copy number variations were screened for deep-intronic mutations. The combination of bioinformatics filtering and transcript analyses did not reveal any deep-intronic splice mutations in these cases. Our results demonstrate that target enrichment by LR-PCR combined with NGS can be used for a comprehensive analysis of the entire genomic regions of the CCM genes in a research context. However, its clinical utility is limited as deep-intronic splice mutations in CCM1/KRIT1, CCM2 and CCM3/PDCD10 seem to be rather rare. Copyright © 2017 Elsevier Masson SAS. All rights reserved.
Wu, Jing Qin; Wang, Xi; Beveridge, Natalie J.; Tooney, Paul A.; Scott, Rodney J.; Carr, Vaughan J.; Cairns, Murray J.
2012-01-01
Background While hybridization based analysis of the cortical transcriptome has provided important insight into the neuropathology of schizophrenia, it represents a restricted view of disease-associated gene activity based on predetermined probes. By contrast, sequencing technology can provide un-biased analysis of transcription at nucleotide resolution. Here we use this approach to investigate schizophrenia-associated cortical gene expression. Methodology/Principal Findings The data was generated from 76 bp reads of RNA-Seq, aligned to the reference genome and assembled into transcripts for quantification of exons, splice variants and alternative promoters in postmortem superior temporal gyrus (STG/BA22) from 9 male subjects with schizophrenia and 9 matched non-psychiatric controls. Differentially expressed genes were then subjected to further sequence and functional group analysis. The output, amounting to more than 38 Gb of sequence, revealed significant alteration of gene expression including many previously shown to be associated with schizophrenia. Gene ontology enrichment analysis followed by functional map construction identified three functional clusters highly relevant to schizophrenia including neurotransmission related functions, synaptic vesicle trafficking, and neural development. Significantly, more than 2000 genes displayed schizophrenia-associated alternative promoter usage and more than 1000 genes showed differential splicing (FDR<0.05). Both types of transcriptional isoforms were exemplified by reads aligned to the neurodevelopmentally significant doublecortin-like kinase 1 (DCLK1) gene. Conclusions This study provided the first deep and un-biased analysis of schizophrenia-associated transcriptional diversity within the STG, and revealed variants with important implications for the complex pathophysiology of schizophrenia. PMID:22558445
Validation of Splicing Events in Transcriptome Sequencing Data
Kaisers, Wolfgang; Ptok, Johannes; Schwender, Holger; Schaal, Heiner
2017-01-01
Genomic alignments of sequenced cellular messenger RNA contain gapped alignments which are interpreted as consequence of intron removal. The resulting gap-sites, genomic locations of alignment gaps, are landmarks representing potential splice-sites. As alignment algorithms report gap-sites with a considerable false discovery rate, validations are required. We describe two quality scores, gap quality score (gqs) and weighted gap information score (wgis), developed for validation of putative splicing events: While gqs solely relies on alignment data wgis additionally considers information from the genomic sequence. FASTQ files obtained from 54 human dermal fibroblast samples were aligned against the human genome (GRCh38) using TopHat and STAR aligner. Statistical properties of gap-sites validated by gqs and wgis were evaluated by their sequence similarity to known exon-intron borders. Within the 54 samples, TopHat identifies 1,000,380 and STAR reports 6,487,577 gap-sites. Due to the lack of strand information, however, the percentage of identified GT-AG gap-sites is rather low. While gap-sites from TopHat contain ≈89% GT-AG, gap-sites from STAR only contain ≈42% GT-AG dinucleotide pairs in merged data from 54 fibroblast samples. Validation with gqs yields 156,251 gap-sites from TopHat alignments and 166,294 from STAR alignments. Validation with wgis yields 770,327 gap-sites from TopHat alignments and 1,065,596 from STAR alignments. Both alignment algorithms, TopHat and STAR, report gap-sites with considerable false discovery rate, which can drastically be reduced by validation with gqs and wgis. PMID:28545234
Ruggles, Kelly V; Tang, Zuojian; Wang, Xuya; Grover, Himanshu; Askenazi, Manor; Teubl, Jennifer; Cao, Song; McLellan, Michael D; Clauser, Karl R; Tabb, David L; Mertins, Philipp; Slebos, Robbert; Erdmann-Gilmore, Petra; Li, Shunqiang; Gunawardena, Harsha P; Xie, Ling; Liu, Tao; Zhou, Jian-Ying; Sun, Shisheng; Hoadley, Katherine A; Perou, Charles M; Chen, Xian; Davies, Sherri R; Maher, Christopher A; Kinsinger, Christopher R; Rodland, Karen D; Zhang, Hui; Zhang, Zhen; Ding, Li; Townsend, R Reid; Rodriguez, Henry; Chan, Daniel; Smith, Richard D; Liebler, Daniel C; Carr, Steven A; Payne, Samuel; Ellis, Matthew J; Fenyő, David
2016-03-01
Improvements in mass spectrometry (MS)-based peptide sequencing provide a new opportunity to determine whether polymorphisms, mutations, and splice variants identified in cancer cells are translated. Herein, we apply a proteogenomic data integration tool (QUILTS) to illustrate protein variant discovery using whole genome, whole transcriptome, and global proteome datasets generated from a pair of luminal and basal-like breast-cancer-patient-derived xenografts (PDX). The sensitivity of proteogenomic analysis for singe nucleotide variant (SNV) expression and novel splice junction (NSJ) detection was probed using multiple MS/MS sample process replicates defined here as an independent tandem MS experiment using identical sample material. Despite analysis of over 30 sample process replicates, only about 10% of SNVs (somatic and germline) detected by both DNA and RNA sequencing were observed as peptides. An even smaller proportion of peptides corresponding to NSJ observed by RNA sequencing were detected (<0.1%). Peptides mapping to DNA-detected SNVs without a detectable mRNA transcript were also observed, suggesting that transcriptome coverage was incomplete (∼80%). In contrast to germline variants, somatic variants were less likely to be detected at the peptide level in the basal-like tumor than in the luminal tumor, raising the possibility of differential translation or protein degradation effects. In conclusion, this large-scale proteogenomic integration allowed us to determine the degree to which mutations are translated and identify gaps in sequence coverage, thereby benchmarking current technology and progress toward whole cancer proteome and transcriptome analysis. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
King, Benjamin L.; Shi, Ling Fang; Kao, Peter; Clusin, William T.
2015-01-01
Elasmobranchs detect small potentials using excitable cells of the ampulla of Lorenzini which have calcium-activated K+ channels, first described in l974. A distinctive feature of the outward current in voltage clamped ampullae is its apparent insensitivity to voltage. The sequence of a BK channel α isoform expressed in the ampulla of the skate was characterized. A signal peptide is present at the beginning of the gene. When compared to human isoform 1 (the canonical sequence), the largest difference was absence of a 59 amino acid region from the S8-S9 intracellular linker that contains the strex regulatory domain. The ampulla isoform was also compared with the isoform predicted˜ in late skate embryos where strex was also absent. The BK voltage sensors were conserved in both skate isoforms. Differences between the skate and human BK channel included alternative splicing. Alternative splicing occurs at seven previously defined sites that are characteristic for BK channels in general and hair cells in particular. Skate BK sequences were highly similar to the Australian ghost shark and several other vertebrate species. Based on alignment of known BK sequences with the skate genome and transcriptome, there are at least two isoforms of Kcnma1α expressed in the skate. One of the β subunits (β4), which is known to decrease voltage sensitivity, was also identified in the skate genome and transcriptome and in the ampulla. These studies advance our knowledge of BK channels and suggest further studies in the ampulla and other excitable tissues. PMID:26687710
Hartmann, Linda; Neveling, Kornelia; Borkens, Stephanie; Schneider, Hildegard; Freund, Marcel; Grassman, Elke; Theiss, Stephan; Wawer, Angela; Burdach, Stefan; Auerbach, Arleen D.; Schindler, Detlev; Hanenberg, Helmut; Schaal, Heiner
2010-01-01
The U1 small nuclear RNA (U1 snRNA) as a component of the major U2-dependent spliceosome recognizes 5′ splice sites (5′ss) containing GT as the canonical dinucleotide in the intronic positions +1 and +2. The c.165+1G>T germline mutation in the 5′ss of exon 2 of the Fanconi anemia C (FANCC) gene commonly predicted to prevent correct splicing was identified in nine FA patients from three pedigrees. RT-PCR analysis of the endogenous FANCC mRNA splicing pattern of patient-derived fibroblasts revealed aberrant mRNA processing, but surprisingly also correct splicing at the TT dinucleotide, albeit with lower efficiency. This consequently resulted in low levels of correctly spliced transcript and minute levels of normal posttranslationally processed FANCD2 protein, indicating that this naturally occurring TT splicing might contribute to the milder clinical manifestations of the disease in these patients. Functional analysis of this FANCC 5′ss within splicing reporters revealed that both the noncanonical TT dinucleotide and the genomic context of FANCC were required for the residual correct splicing at this mutant 5′ss. Finally, use of lentiviral vectors as a delivery system to introduce expression cassettes for TT-adapted U1 snRNAs into primary FANCC patient fibroblasts allowed the correction of the DNA-damage-induced G2 cell-cycle arrest in these cells, thus representing an alternative transcript-targeting approach for genetic therapy of inherited splice-site mutations. PMID:20869034
Genome-wide CRISPR screen identifies HNRNPL as a prostate cancer dependency regulating RNA splicing.
Fei, Teng; Chen, Yiwen; Xiao, Tengfei; Li, Wei; Cato, Laura; Zhang, Peng; Cotter, Maura B; Bowden, Michaela; Lis, Rosina T; Zhao, Shuang G; Wu, Qiu; Feng, Felix Y; Loda, Massimo; He, Housheng Hansen; Liu, X Shirley; Brown, Myles
2017-06-27
Alternative RNA splicing plays an important role in cancer. To determine which factors involved in RNA processing are essential in prostate cancer, we performed a genome-wide CRISPR/Cas9 knockout screen to identify the genes that are required for prostate cancer growth. Functional annotation defined a set of essential spliceosome and RNA binding protein (RBP) genes, including most notably heterogeneous nuclear ribonucleoprotein L (HNRNPL). We defined the HNRNPL-bound RNA landscape by RNA immunoprecipitation coupled with next-generation sequencing and linked these RBP-RNA interactions to changes in RNA processing. HNRNPL directly regulates the alternative splicing of a set of RNAs, including those encoding the androgen receptor, the key lineage-specific prostate cancer oncogene. HNRNPL also regulates circular RNA formation via back splicing. Importantly, both HNRNPL and its RNA targets are aberrantly expressed in human prostate tumors, supporting their clinical relevance. Collectively, our data reveal HNRNPL and its RNA clients as players in prostate cancer growth and potential therapeutic targets.
DBATE: database of alternative transcripts expression.
Bianchi, Valerio; Colantoni, Alessio; Calderone, Alberto; Ausiello, Gabriele; Ferrè, Fabrizio; Helmer-Citterich, Manuela
2013-01-01
The use of high-throughput RNA sequencing technology (RNA-seq) allows whole transcriptome analysis, providing an unbiased and unabridged view of alternative transcript expression. Coupling splicing variant-specific expression with its functional inference is still an open and difficult issue for which we created the DataBase of Alternative Transcripts Expression (DBATE), a web-based repository storing expression values and functional annotation of alternative splicing variants. We processed 13 large RNA-seq panels from human healthy tissues and in disease conditions, reporting expression levels and functional annotations gathered and integrated from different sources for each splicing variant, using a variant-specific annotation transfer pipeline. The possibility to perform complex queries by cross-referencing different functional annotations permits the retrieval of desired subsets of splicing variant expression values that can be visualized in several ways, from simple to more informative. DBATE is intended as a novel tool to help appreciate how, and possibly why, the transcriptome expression is shaped. DATABASE URL: http://bioinformatica.uniroma2.it/DBATE/.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Doerk, T.; Wulbrand, U.; Tuemmler, B.
1993-03-01
Single cases of the four novel splice site mutations 1525[minus]1 G [r arrow] A (intron 9), 3601[minus]2 A [r arrow] G (intron 18), 3850[minus]3 T [r arrow] G (intron 19), and 4374+1 G [r arrow] T (intron 23) were detected in the CFTR gene of cystic fibrosis patients of Indo-Iranian, Turkish, Polish, and Germany descent. The nucleotide substitutions at the +1, [minus]1, and [minus]2 positions all destroy splice sites and lead to severe disease alleles associated with features typical of gastrointestinal and pulmonary cystic fibrosis disease. The 3850[minus]3 T-to-G change was discovered in a very mildly affected 33-year-old [Delta]F508 compoundmore » heterozygote, suggesting that the T-to-G transversion at the less conserved [minus]3 position of the acceptor splice site may retain some wildtype function. 13 refs., 1 fig., 2 tabs.« less
APPRIS: annotation of principal and alternative splice isoforms
Rodriguez, Jose Manuel; Maietta, Paolo; Ezkurdia, Iakes; Pietrelli, Alessandro; Wesselink, Jan-Jaap; Lopez, Gonzalo; Valencia, Alfonso; Tress, Michael L.
2013-01-01
Here, we present APPRIS (http://appris.bioinfo.cnio.es), a database that houses annotations of human splice isoforms. APPRIS has been designed to provide value to manual annotations of the human genome by adding reliable protein structural and functional data and information from cross-species conservation. The visual representation of the annotations provided by APPRIS for each gene allows annotators and researchers alike to easily identify functional changes brought about by splicing events. In addition to collecting, integrating and analyzing reliable predictions of the effect of splicing events, APPRIS also selects a single reference sequence for each gene, here termed the principal isoform, based on the annotations of structure, function and conservation for each transcript. APPRIS identifies a principal isoform for 85% of the protein-coding genes in the GENCODE 7 release for ENSEMBL. Analysis of the APPRIS data shows that at least 70% of the alternative (non-principal) variants would lose important functional or structural information relative to the principal isoform. PMID:23161672
Martinez, Fernando; Lee, Jeong Ho; Lee, Ji Eun; Blanco, Sandra; Nickerson, Elizabeth; Gabriel, Stacey; Frye, Michaela; Al-Gazali, Lihadh; Gleeson, Joseph G.
2016-01-01
Dubowitz Syndrome is an autosomal recessive disorder characterized by the constellation of mild microcephaly, growth and mental retardation, eczema and peculiar facies, but causes are still unknown. We studied a multiplex consanguineous family with many features of Dubowitz syndrome using whole exome sequencing and identified a splice mutation in NSUN2, encoding a conserved RNA methyltransferase. NSUN2 has been implicated in Myc-induced cell proliferation and mitotic spindle stability, which might help explain the varied clinical presentations that can include chromosomal instability and immunological defects. Patient cells displayed loss of NSUN2-specific methylation at two residues of the aspartate tRNA. Our findings establish NSUN2 as the first causal gene with relationship to the Dubowitz syndrome spectrum phenotype. PMID:22577224
RNA splicing, cell signaling, and response to therapies.
Abou Faycal, Cherine; Gazzeri, Sylvie; Eymin, Beatrice
2016-01-01
PremRNA alternative splicing is more a rule than an exception as it affects more than 90% of multiexons genes and plays a key role in proteome diversity. Here, we discuss some recent studies published in the extensively growing field linking RNA splicing and cancer. These last years, the development of high-throughput studies together with appropriate bioinformatic tools have led to the identification of new cancer-specific splicing patterns that allow to distinguish various cancer types, and provide new prognosis biomarkers. In addition, the functional consequences of hot spot mutations affecting various components of the spliceosome machinery in cancers have been described. As an example, missplicing of the enhancer of zeste homolog 2 histone methyltransferase premRNA in response to hot spot mutation of the splicing factor SRSF2 was found to participate to the pathogenesis of myelodysplastic syndrome. Moreover, proofs of principle that targeting the RNA splicing machinery can be used to correct aberrant missplicing, kill oncogene-driven cancer cells, or reverse resistance of tumor cells to targeted therapies have been done. As another example, the core spliceosomal function was recently found to be critical for the survival of Myc-driven breast cancer cells, rendering them hypersensitive to spliceosome inhibitors. Dysregulation of premRNA alternative splicing appears to be one of the hallmarks of cancer. The characterization of novel splicing signatures in cancer as well as the identification of original signaling networks involving RNA splicing regulators should allow to decipher novel oncogenic mechanisms and to develop new therapeutic strategies.
Horiuchi, Keiko; Perez-Cerezales, Serafín; Papasaikas, Panagiotis; Ramos-Ibeas, Priscila; López-Cardona, Angela Patricia; Laguna-Barraza, Ricardo; Fonseca Balvís, Noelia; Pericuesta, Eva; Fernández-González, Raul; Planells, Benjamín; Viera, Alberto; Suja, Jose Angel; Ross, Pablo Juan; Alén, Francisco; Orio, Laura; Rodriguez de Fonseca, Fernando; Pintado, Belén; Valcárcel, Juan; Gutiérrez-Adán, Alfonso
2018-04-03
The U2AF35-like ZRSR1 has been implicated in the recognition of 3' splice site during spliceosome assembly, but ZRSR1 knockout mice do not show abnormal phenotypes. To analyze ZRSR1 function and its precise role in RNA splicing, we generated ZRSR1 mutant mice containing truncating mutations within its RNA-recognition motif. Homozygous mutant mice exhibited severe defects in erythrocytes, muscle stretch, and spermatogenesis, along with germ cell sloughing and apoptosis, ultimately leading to azoospermia and male sterility. Testis RNA sequencing (RNA-seq) analyses revealed increased intron retention of both U2- and U12-type introns, including U12-type intron events in genes with key functions in spermatogenesis and spermatid development. Affected U2 introns were commonly found flanking U12 introns, suggesting functional cross-talk between the two spliceosomes. The splicing and tissue defects observed in mutant mice attributed to ZRSR1 loss of function suggest a physiological role for this factor in U12 intron splicing. Copyright © 2018 The Author(s). Published by Elsevier Inc. All rights reserved.
A mechanism underlying position-specific regulation of alternative splicing
Hamid, Fursham M.
2017-01-01
Abstract Many RNA-binding proteins including a master regulator of splicing in developing brain and muscle, polypyrimidine tract-binding protein 1 (PTBP1), can either activate or repress alternative exons depending on the pre-mRNA recruitment position. When bound upstream or within regulated exons PTBP1 tends to promote their skipping, whereas binding to downstream sites often stimulates inclusion. How this switch is orchestrated at the molecular level is poorly understood. Using bioinformatics and biochemical approaches we show that interaction of PTBP1 with downstream intronic sequences can activate natural cassette exons by promoting productive docking of the spliceosomal U1 snRNP to a suboptimal 5′ splice site. Strikingly, introducing upstream PTBP1 sites to this circuitry leads to a potent splicing repression accompanied by the assembly of an exonic ribonucleoprotein complex with a tightly bound U1 but not U2 snRNP. Our data suggest a molecular mechanism underlying the transition between a better-known repressive function of PTBP1 and its role as a bona fide splicing activator. More generally, we argue that the functional outcome of individual RNA contacts made by an RNA-binding protein is subject to extensive context-specific modulation.
Chuang, Trees-Juen; Wu, Chan-Shuo; Chen, Chia-Ying; Hung, Li-Yuan; Chiang, Tai-Wei; Yang, Min-Yu
2016-02-18
Analysis of RNA-seq data often detects numerous 'non-co-linear' (NCL) transcripts, which comprised sequence segments that are topologically inconsistent with their corresponding DNA sequences in the reference genome. However, detection of NCL transcripts involves two major challenges: removal of false positives arising from alignment artifacts and discrimination between different types of NCL transcripts (trans-spliced, circular or fusion transcripts). Here, we developed a new NCL-transcript-detecting method ('NCLscan'), which utilized a stepwise alignment strategy to almost completely eliminate false calls (>98% precision) without sacrificing true positives, enabling NCLscan outperform 18 other publicly-available tools (including fusion- and circular-RNA-detecting tools) in terms of sensitivity and precision, regardless of the generation strategy of simulated dataset, type of intragenic or intergenic NCL event, read depth of coverage, read length or expression level of NCL transcript. With the high accuracy, NCLscan was applied to distinguishing between trans-spliced, circular and fusion transcripts on the basis of poly(A)- and nonpoly(A)-selected RNA-seq data. We showed that circular RNAs were expressed more ubiquitously, more abundantly and less cell type-specifically than trans-spliced and fusion transcripts. Our study thus describes a robust pipeline for the discovery of NCL transcripts, and sheds light on the fundamental biology of these non-canonical RNA events in human transcriptome. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Choi, Hyun-Woo; Kim, Hye-Ran; Baek, Hee-Jo; Kook, Hoon; Cho, Duck; Shin, Jong-Hee; Suh, Soon-Pal; Ryang, Dong-Wook; Shin, Myung-Geun
2015-01-01
Recurrent somatic SET-binding protein 1 (SETBP1) and splicing pathway gene mutations have recently been found in atypical chronic myeloid leukemia and other hematologic malignancies. These mutations have been comprehensively analyzed in adult AML, but not in childhood AML. We investigated possible alteration of the SETBP1, splicing factor 3B subunit 1 (SF3B1), U2 small nuclear RNA auxiliary factor 1 (U2AF1), and serine/arginine-rich splicing factor 2 (SRSF2) genes in childhood AML. Cytogenetic and molecular analyses were performed to reveal chromosomal and genetic alterations. Sequence alterations in the SETBP1, SF3B1, U2AF1, and SRSF2 genes were examined by using direct sequencing in a cohort of 53 childhood AML patients. Childhood AML patients did not harbor any recurrent SETBP1 gene mutations, although our study did identify a synonymous mutation in one patient. None of the previously reported aberrations in the mutational hotspot of SF3B1, U2AF1, and SRSF2 were identified in any of the 53 patients. Alterations of the SETBP1 gene or SF3B1, U2AF1, and SRSF2 genes are not common genetic events in childhood AML, implying that the mutations are unlikely to exert a driver effect in myeloid leukemogenesis during childhood.
Nasser, Nicola J.; Avivi, Aaron; Shafat, Itay; Edovitsky, Evgeny; Zcharia, Eyal; Ilan, Neta; Vlodavsky, Israel; Nevo, Eviatar
2009-01-01
Heparanase is an endoglycosidase that degrades heparan sulfate (HS) at the cell surface and in the extracellular matrix. Heparanase is expressed mainly by cancer cells, and its expression is correlated with increased tumor aggressiveness, metastasis, and angiogenesis. Here, we report the cloning of a unique splice variant (splice 36) of heparanase from the subterranean blind mole rat (Spalax). This splice variant results from skipping part of exon 3, exons 4 and 5, and part of exon 6 and functions as a dominant negative to the wild-type enzyme. It inhibits HS degradation, suppresses glioma tumor growth, and decreases experimental B16–BL6 lung colonization in a mouse model. Intriguingly, Spalax splice variant 7 of heparanase (which results from skipping of exon 7) is devoid of enzymatic activity, but unlike splice 36 it enhances tumor growth. Our results demonstrate that alternative splicing of heparanase regulates its enzymatic activity and might adapt the heparanase function to the fluctuating normoxic–hypoxic subterranean environment that Spalax experiences. Development of anticancer drugs designed to suppress tumor growth, angiogenesis, and metastasis is a major challenge, of which heparanase inhibition is a promising approach. We anticipate that the heparanase splicing model, evolved during 40 million years of Spalacid adaptation to underground life, would pave the way for the development of heparanase-based therapeutic modalities directed against angiogenesis, tumor growth, and metastasis. PMID:19164514
Shen, Shihao; Park, Juw Won; Lu, Zhi-xiang; Lin, Lan; Henry, Michael D; Wu, Ying Nian; Zhou, Qing; Xing, Yi
2014-12-23
Ultra-deep RNA sequencing (RNA-Seq) has become a powerful approach for genome-wide analysis of pre-mRNA alternative splicing. We previously developed multivariate analysis of transcript splicing (MATS), a statistical method for detecting differential alternative splicing between two RNA-Seq samples. Here we describe a new statistical model and computer program, replicate MATS (rMATS), designed for detection of differential alternative splicing from replicate RNA-Seq data. rMATS uses a hierarchical model to simultaneously account for sampling uncertainty in individual replicates and variability among replicates. In addition to the analysis of unpaired replicates, rMATS also includes a model specifically designed for paired replicates between sample groups. The hypothesis-testing framework of rMATS is flexible and can assess the statistical significance over any user-defined magnitude of splicing change. The performance of rMATS is evaluated by the analysis of simulated and real RNA-Seq data. rMATS outperformed two existing methods for replicate RNA-Seq data in all simulation settings, and RT-PCR yielded a high validation rate (94%) in an RNA-Seq dataset of prostate cancer cell lines. Our data also provide guiding principles for designing RNA-Seq studies of alternative splicing. We demonstrate that it is essential to incorporate biological replicates in the study design. Of note, pooling RNAs or merging RNA-Seq data from multiple replicates is not an effective approach to account for variability, and the result is particularly sensitive to outliers. The rMATS source code is freely available at rnaseq-mats.sourceforge.net/. As the popularity of RNA-Seq continues to grow, we expect rMATS will be useful for studies of alternative splicing in diverse RNA-Seq projects.
Li, X-y; Yao, X; Li, S-n; Suo, A-l; Ruan, Z-p; Liang, X; Kong, Y; Zhang, W-g; Yao, Y
2014-01-01
Multiple genetic alterations that affect the process of acute myeloid leukemia (AML) have been discovered, and more evidence also indicates that aberrant splicing plays an important role in cancer. We present a RNA-Seq profiling of an AML patient with complete remission after treatment, to analyze the aberrant splicing of genes during treatment. We sequenced 3.97 and 3.32 Gbp clean data of the AML and remission sample, respectively. Firstly, by analyzing biomarkers associated with AML, to assist normal clinical tests, we confirmed that the patient was anormal karyo type, with NPM1 and IDH2 mutations and deregulation patterns of related genes, such as BAALC, ERG, MN1 and HOX family. Then, we performed alternative splicing detection of the AML and remission sample. We detected 91 differentially splicing events in 68 differentially splicing genes (DSGs) by mixture of isoforms (MISO). Considering Psi values (Ψ) and confidence intervals, 25 differentially expressed isoforms were identified as more confident isoforms, which were associated with RNA processing, cellular macromolecule catabolic process and DNA binding according to GO enrichment analysis. An exon2-skipping event in oncogene FOS (FBJ murine osteosarcoma viral oncogene homolog) were detected and validated in this study. FOS has a critical function in regulating cell proliferation, differentiation and transformation. The exon2-skipping isoform of FOS was increased significantly after treatment. All the data and information of RNA-Seq provides highly accurate and comprehensive supplements to conventional clinical tests of AML. Moreover, the splicing aberrations would be another source for biomarker and even therapeutic target discovery. More information of splicing may also assist the better understanding of leukemogenesis.
Vitamin D and alternative splicing of RNA
Zhou, Rui; Chun, Rene F.; Lisse, Thomas S.; Garcia, Alejandro J.; Xu, Jianzhong; Adams, John S.; Hewison, Martin
2014-01-01
The active form of vitamin D (1α,25-dihydroxyvitamin D, 1,25(OH)2D) exerts its genomic effects via binding to a nuclear high-affinity vitamin D receptor (VDR). Recent deep sequencing analysis of VDR binding locations across the complete genome has significantly expanded our understanding of the actions of vitamin D and VDR on gene transcription. However, these studies have also promoted appreciation of the extra-transcriptional impact of vitamin D on gene expression. It is now clear that vitamin D interacts with the epigenome via effects on DNA methylation, histone acetylation, and microRNA generation to maintain normal biological functions. There is also increasing evidence that vitamin D can influence pre-mRNA constitutive splicing and alternative splicing, although the mechanism for this remains unclear. Pre-mRNA splicing has long been thought to be a post-transcription RNA processing event, but current data indicate that this occurs co-transcriptionally. Several steroid hormones have been recognized to coordinately control gene transcription and pre-mRNA splicing through the recruitment of nuclear receptor co-regulators that can both control gene transcription and splicing. The current review will discuss this concept with specific reference to vitamin D, and the potential role of heterogeneous nuclear ribonucleoprotein C (hnRNPC), a nuclear factor with an established function in RNA splicing. hnRNPC, has been shown to be involved in the VDR transcriptional complex as a vitamin D-response element-binding protein (VDRE-BP), and may act as a coupling factor linking VDR-directed gene transcription with RNA splicing. In this way hnRNPC may provide an additional mechanism for the fine-tuning of vitamin D-regulated target gene expression. PMID:25447737
Zhang, Yanju; Lameijer, Eric-Wubbo; 't Hoen, Peter A. C.; Ning, Zemin; Slagboom, P. Eline; Ye, Kai
2012-01-01
Motivation: RNA-seq is a powerful technology for the study of transcriptome profiles that uses deep-sequencing technologies. Moreover, it may be used for cellular phenotyping and help establishing the etiology of diseases characterized by abnormal splicing patterns. In RNA-Seq, the exact nature of splicing events is buried in the reads that span exon–exon boundaries. The accurate and efficient mapping of these reads to the reference genome is a major challenge. Results: We developed PASSion, a pattern growth algorithm-based pipeline for splice site detection in paired-end RNA-Seq reads. Comparing the performance of PASSion to three existing RNA-Seq analysis pipelines, TopHat, MapSplice and HMMSplicer, revealed that PASSion is competitive with these packages. Moreover, the performance of PASSion is not affected by read length and coverage. It performs better than the other three approaches when detecting junctions in highly abundant transcripts. PASSion has the ability to detect junctions that do not have known splicing motifs, which cannot be found by the other tools. Of the two public RNA-Seq datasets, PASSion predicted ∼ 137 000 and 173 000 splicing events, of which on average 82 are known junctions annotated in the Ensembl transcript database and 18% are novel. In addition, our package can discover differential and shared splicing patterns among multiple samples. Availability: The code and utilities can be freely downloaded from https://trac.nbic.nl/passion and ftp://ftp.sanger.ac.uk/pub/zn1/passion Contact: y.zhang@lumc.nl; k.ye@lumc.nl Supplementary information: Supplementary data are available at Bioinformatics online. PMID:22219203
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kaler, S.G.; Gahl, W.A.
1994-09-01
Menkes disease is an X linked recessive disorder of copper metabolism produced by abnormalities in a gene that encodes a copper transporting ATPase. The clinical spectrum of Menkes disease includes a range of neurological severity from the classical type to the occipital horn syndrome (OHS) in which slightly subnormal intelligence or signs of autonomic dysfunction are the only neurologic abnormalities. We previously documented a distinctive, less severe Menkes phenotype associated with a +3 intronic splice donor mutation at the 3{prime} end of the gene in which exon skipping occurred but some normally spliced message was also detectable. We now reportmore » a similar splicing mutation in a patient with a typical OHS phenotype an A to G transition at the 2 exonic position of a splice donor site in the middle of the Menkes coding sequence. Some normally sized transcripts are evident by RT-PCR of lymphoblast mRNA from this individual, as well as 2 truncated fragments generated by exon skipping and activation of a cryptic splice acceptor site, respectively. The predicted effect of the mutation on the gene product involves a serine to glycine substitution in a noncritical region of the Menkes ATPase from the patient`s normally sized message, and premature termination due to translational frameshift in both truncated transcripts. The mutation eliminates a Dde 1 restriction site in the gene which provided a method to rapidly screen other family members, and revealed that the patient`s mother is a non-carrier. The mutational base change was not present in 25 normal X chromosomes studied. Preliminary analysis of the Menkes locus in 5 other Menkes disease families indicates aberrant mRNA splicing in 2. Our findings confirm allelism at the Menkes locus, indicate that splice mutations are relatively common mutational event in Menkes disease, and suggest that splice mutations in which some normal splicing is preserved may underlie milder Menkes disease variants, including OHS.« less
2012-01-01
Background Evolution of splice sites is a well-known phenomenon that results in transcript diversity during human evolution. Many novel splice sites are derived from repetitive elements and may not contribute to protein products. Here, we analyzed annotated human protein-coding exons and identified human-specific splice sites that arose after the human-chimpanzee divergence. Results We analyzed multiple alignments of the annotated human protein-coding exons and their respective orthologous mammalian genome sequences to identify 85 novel splice sites (50 splice acceptors and 35 donors) in the human genome. The novel protein-coding exons, which are expressed either constitutively or alternatively, produce novel protein isoforms by insertion, deletion, or frameshift. We found three cases in which the human-specific isoform conferred novel molecular function in the human cells: the human-specific IMUP protein isoform induces apoptosis of the trophoblast and is implicated in pre-eclampsia; the intronization of a part of SMOX gene exon produces inactive spermine oxidase; the human-specific NUB1 isoform shows reduced interaction with ubiquitin-like proteins, possibly affecting ubiquitin pathways. Conclusions Although the generation of novel protein isoforms does not equate to adaptive evolution, we propose that these cases are useful candidates for a molecular functional study to identify proteomic changes that might bring about novel phenotypes during human evolution. PMID:23148531
Splicing factor SFRS1 recognizes a functionally diverse landscape of RNA transcripts.
Sanford, Jeremy R; Wang, Xin; Mort, Matthew; Vanduyn, Natalia; Cooper, David N; Mooney, Sean D; Edenberg, Howard J; Liu, Yunlong
2009-03-01
Metazoan genes are encrypted with at least two superimposed codes: the genetic code to specify the primary structure of proteins and the splicing code to expand their proteomic output via alternative splicing. Here, we define the specificity of a central regulator of pre-mRNA splicing, the conserved, essential splicing factor SFRS1. Cross-linking immunoprecipitation and high-throughput sequencing (CLIP-seq) identified 23,632 binding sites for SFRS1 in the transcriptome of cultured human embryonic kidney cells. SFRS1 was found to engage many different classes of functionally distinct transcripts including mRNA, miRNA, snoRNAs, ncRNAs, and conserved intergenic transcripts of unknown function. The majority of these diverse transcripts share a purine-rich consensus motif corresponding to the canonical SFRS1 binding site. The consensus site was not only enriched in exons cross-linked to SFRS1 in vivo, but was also enriched in close proximity to splice sites. mRNAs encoding RNA processing factors were significantly overrepresented, suggesting that SFRS1 may broadly influence the post-transcriptional control of gene expression in vivo. Finally, a search for the SFRS1 consensus motif within the Human Gene Mutation Database identified 181 mutations in 82 different genes that disrupt predicted SFRS1 binding sites. This comprehensive analysis substantially expands the known roles of human SR proteins in the regulation of a diverse array of RNA transcripts.
RNA Splicing in a New Rhabdovirus from Culex Mosquitoes▿†
Kuwata, Ryusei; Isawa, Haruhiko; Hoshino, Keita; Tsuda, Yoshio; Yanase, Tohru; Sasaki, Toshinori; Kobayashi, Mutsuo; Sawabe, Kyoko
2011-01-01
Among members of the order Mononegavirales, RNA splicing events have been found only in the family Bornaviridae. Here, we report that a new rhabdovirus isolated from the mosquito Culex tritaeniorhynchus replicates in the nuclei of infected cells and requires RNA splicing for viral mRNA maturation. The virus, designated Culex tritaeniorhynchus rhabdovirus (CTRV), shares a similar genome organization with other rhabdoviruses, except for the presence of a putative intron in the coding region for the L protein. Molecular phylogenetic studies indicated that CTRV belongs to the family Rhabdoviridae, but it is yet to be assigned a genus. Electron microscopic analysis revealed that the CTRV virion is extremely elongated, unlike virions of rhabdoviruses, which are generally bullet shaped. Northern hybridization confirmed that a large transcript (approximately 6,500 nucleotides [nt]) from the CTRV L gene was present in the infected cells. Strand-specific reverse transcription-PCR (RT-PCR) analyses identified the intron-exon boundaries and the 76-nt intron sequence, which contains the typical motif for eukaryotic spliceosomal intron-splice donor/acceptor sites (GU-AG), a predicted branch point, and a polypyrimidine tract. In situ hybridization exhibited that viral RNAs are primarily localized in the nucleus of infected cells, indicating that CTRV replicates in the nucleus and is allowed to utilize the host's nuclear splicing machinery. This is the first report of RNA splicing among the members of the family Rhabdoviridae. PMID:21507977
RNA splicing in a new rhabdovirus from Culex mosquitoes.
Kuwata, Ryusei; Isawa, Haruhiko; Hoshino, Keita; Tsuda, Yoshio; Yanase, Tohru; Sasaki, Toshinori; Kobayashi, Mutsuo; Sawabe, Kyoko
2011-07-01
Among members of the order Mononegavirales, RNA splicing events have been found only in the family Bornaviridae. Here, we report that a new rhabdovirus isolated from the mosquito Culex tritaeniorhynchus replicates in the nuclei of infected cells and requires RNA splicing for viral mRNA maturation. The virus, designated Culex tritaeniorhynchus rhabdovirus (CTRV), shares a similar genome organization with other rhabdoviruses, except for the presence of a putative intron in the coding region for the L protein. Molecular phylogenetic studies indicated that CTRV belongs to the family Rhabdoviridae, but it is yet to be assigned a genus. Electron microscopic analysis revealed that the CTRV virion is extremely elongated, unlike virions of rhabdoviruses, which are generally bullet shaped. Northern hybridization confirmed that a large transcript (approximately 6,500 nucleotides [nt]) from the CTRV L gene was present in the infected cells. Strand-specific reverse transcription-PCR (RT-PCR) analyses identified the intron-exon boundaries and the 76-nt intron sequence, which contains the typical motif for eukaryotic spliceosomal intron-splice donor/acceptor sites (GU-AG), a predicted branch point, and a polypyrimidine tract. In situ hybridization exhibited that viral RNAs are primarily localized in the nucleus of infected cells, indicating that CTRV replicates in the nucleus and is allowed to utilize the host's nuclear splicing machinery. This is the first report of RNA splicing among the members of the family Rhabdoviridae.
Xiong, Wei; Gao, Depei; Li, Yunfeng; Liu, Xin; Dai, Peiling; Qin, Jiyong; Wang, Guanshun; Li, Kangming; Bai, Han; Li, Wenhui
2016-10-01
Alternative splicing is a key mechanism that regulates protein diversity and has been found to be associated with colon cancer progression and metastasis. However, the function of alternative splicing in chemoradiation‑resistant colon cancer remains elusive. In this study, we constructed a chemoradiation‑resistant colon cancer cell line. Through RNA-sequencing of normal and chemoradiation‑resistant colon cancer cells (HCT116), we found 818 genes that were highly expressed in the normal HCT116 cells, whereas 285 genes were highly expressed in the chemoradiation-resistant HCT116 (RCR-HCT116) cells. Gene ontology (GO) analysis showed that genes that were highly expressed in the HCT116 cells were enriched in GO categories related to cell cycle and cell division, whereas genes that were highly expressed in the RCR-HCT116 cells were associated with regulation of system processes and response to wounding. Analysis of alternative splicing events revealed that exon skipping was significantly increased in the chemoradiation‑resistant colon cancer cells. Moreover, we identified 323 alternative splicing events in 293 genes that were significantly different between the two different HCT116 cell types. These alternative splicing‑related genes were clustered functionally into several groups related with DNA replication, such as deoxyribonucleotide metabolic/catabolic processes, response to DNA damage stimulus and helicase activity. These findings enriched our knowledge by elucidating the function of alternative splicing in chemoradiation-resistant colon cancer.
Diabetes-induced changes in the alternative splicing of the slo gene in corporal tissue.
Davies, Kelvin P; Zhao, Weixin; Tar, Moses; Figueroa, Johanna C; Desai, Pratik; Verselis, Vytas K; Kronengold, Jack; Wang, Hong-Zhan; Melman, Arnold; Christ, George J
2007-10-01
Erectile dysfunction is a common diabetic complication. Preclinical studies have documented that the Slo gene (encoding the BK or Maxi-K channel alpha-subunit) plays a critical role in erectile function. Therefore, we determined whether diabetes induces changes in the splicing of the Slo gene relevant to erectile function. Reverse transcriptase-polymerase chain reaction was used to compare Slo splice variant expression in corporal tissue excised from control and streptozotocin (STZ)-induced diabetic Fischer F-344 rats. Splice variants were sequenced, characterized by patch clamping, and fused to green fluorescent protein to determine cellular localization. The impact of altered Slo expression on erectile function was further evaluated in vivo. A novel Slo splice variant (SVcyt, with a cytoplasmic location) was predominantly expressed in corporal tissue from control rats. STZ-diabetes caused upregulation of a channel-forming transcript SV0. Preliminary results suggest that SV0 was also more prevalent in the corporal tissue of human diabetic compared with nondiabetic patients. The change in isoform expression in STZ-treated rats was partially reversed by insulin treatment. Intracorporal injection of a plasmid expressing the SV0 transcript, but not SVcyt, restored erectile function in STZ-diabetic rats. Alternative splicing of the Slo transcript may represent an important compensatory mechanism to increase the ease with which relaxation of corporal tissue may be triggered as a result of a diabetes-related decline in erectile capacity.
Saturation mutagenesis reveals manifold determinants of exon definition.
Ke, Shengdong; Anquetil, Vincent; Zamalloa, Jorge Rojas; Maity, Alisha; Yang, Anthony; Arias, Mauricio A; Kalachikov, Sergey; Russo, James J; Ju, Jingyue; Chasin, Lawrence A
2018-01-01
To illuminate the extent and roles of exonic sequences in the splicing of human RNA transcripts, we conducted saturation mutagenesis of a 51-nt internal exon in a three-exon minigene. All possible single and tandem dinucleotide substitutions were surveyed. Using high-throughput genetics, 5560 minigene molecules were assayed for splicing in human HEK293 cells. Up to 70% of mutations produced substantial (greater than twofold) phenotypes of either increased or decreased splicing. Of all predicted secondary structural elements, only a single 15-nt stem-loop showed a strong correlation with splicing, acting negatively. The in vitro formation of exon-protein complexes between the mutant molecules and proteins associated with spliceosome formation (U2AF35, U2AF65, U1A, and U1-70K) correlated with splicing efficiencies, suggesting exon definition as the step affected by most mutations. The measured relative binding affinities of dozens of human RNA binding protein domains as reported in the CISBP-RNA database were found to correlate either positively or negatively with splicing efficiency, more than could fit on the 51-nt test exon simultaneously. The large number of these functional protein binding correlations point to a dynamic and heterogeneous population of pre-mRNA molecules, each responding to a particular collection of binding proteins. © 2018 Ke et al.; Published by Cold Spring Harbor Laboratory Press.
Martínez-Salazar, Martha; López-Urrutia, Eduardo; Arechaga-Ocampo, Elena; Bonilla-Moreno, Raul; Martínez-Castillo, Macario; Díaz-Hernández, Job; Del Moral-Hernández, Oscar; Cedillo-Barrón, Leticia; Martines-Juarez, Víctor; De Nova-Ocampo, Monica; Valdes, Jesús; Berumen, Jaime; Villegas-Sepúlveda, Nicolás
2014-12-05
The human papillomavirus type 16 (HPV-16) E6/E7 spliced transcripts are heterogeneously expressed in cervical carcinoma. The heterogeneity of the E6/E7 splicing profile might be in part due to the intrinsic variation of splicing factors in tumor cells. However, the splicing factors that bind the E6/E7 intron 1 (In-1) have not been defined. Therefore, we aimed to identify these factors; we used HeLa nuclear extracts (NE) for in vitro spliceosome assembly. The proteins were allowed to bind to an RNA/DNA hybrid formed by the In-1 transcript and a 5'-biotinylated DNA oligonucleotide complementary to the upstream exon sequence, which prevented interference in protein binding to the intron. The hybrid probes bound with the nuclear proteins were coupled to streptavidin magnetic beads for chromatography affinity purification. Proteins were eluted and identified by mass spectrometry (MS). Approximately 170 proteins were identified by MS, 80% of which were RNA binding proteins, including canonical spliceosome core components, helicases and regulatory splicing factors. The canonical factors were identified as components of the spliceosomal B-complex. Although 35-40 of the identified factors were cognate splicing factors or helicases, they have not been previously detected in spliceosome complexes that were assembled using in vivo or in vitro models. Copyright © 2014 Elsevier B.V. All rights reserved.
The bromodomain protein BRD4 regulates splicing during heat shock.
Hussong, Michelle; Kaehler, Christian; Kerick, Martin; Grimm, Christina; Franz, Alexandra; Timmermann, Bernd; Welzel, Franziska; Isensee, Jörg; Hucho, Tim; Krobitsch, Sylvia; Schweiger, Michal R
2017-01-09
The cellular response to heat stress is an ancient and evolutionarily highly conserved defence mechanism characterised by the transcriptional up-regulation of cyto-protective genes and a partial inhibition of splicing. These features closely resemble the proteotoxic stress response during tumor development. The bromodomain protein BRD4 has been identified as an integral member of the oxidative stress as well as of the inflammatory response, mainly due to its role in the transcriptional regulation process. In addition, there are also several lines of evidence implicating BRD4 in the splicing process. Using RNA-sequencing we found a significant increase in splicing inhibition, in particular intron retentions (IR), following heat treatment in BRD4-depleted cells. This leads to a decrease of mRNA abundancy of the affected transcripts, most likely due to premature termination codons. Subsequent experiments revealed that BRD4 interacts with the heat shock factor 1 (HSF1) such that under heat stress BRD4 is recruited to nuclear stress bodies and non-coding SatIII RNA transcripts are up-regulated. These findings implicate BRD4 as an important regulator of splicing during heat stress. Our data which links BRD4 to the stress induced splicing process may provide novel mechanisms of BRD4 inhibitors in regard to anti-cancer therapies. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Schultz, Kris Ann; Harris, Anne; Messinger, Yoav; Sencer, Susan; Baldinger, Shari; Dehner, Louis P.; Hill, D. Ashley
2015-01-01
Germline DICER1 mutations have been described in individuals with pleuropulmonary blastoma (PPB), ovarian Sertoli-Leydig cell tumor (SLCT), sarcomas, multinodular goiter, thyroid carcinoma, cystic nephroma and other neoplastic conditions. Early results from the International Ovarian and Testicular Stromal Tumor Registry show germline DICER1 mutations in 48% of girls and women with SLCT. In this report, a young woman presented with ovarian undifferentiated sarcoma. Four years later, she presented with SLCT. She was successfully treated for both malignancies. Sequence results showed a germline intronic mutation in DICER1. This mutation results in an exact duplication of the six bases at the splice site at the intron 23 and exon 24 junction. Predicted improper splicing leads to inclusion of 10 bases of intronic sequence, frameshift and premature truncation of the protein disrupting the RNase IIIb domain. A second individual with SLCT was found to have an identical germline mutation. In each of the ovarian tumors, an additional somatic mutation in the RNase IIIb domain of DICER1 was found. In rare patients, germline intronic mutations in DICER1 that are predicted to cause incorrect splicing can also contribute to the pathogenesis of SLCT. PMID:26289771
A SIGMAR1 splice-site mutation causes distal hereditary motor neuropathy.
Li, Xiaobo; Hu, Zhengmao; Liu, Lei; Xie, Yongzhi; Zhan, Yajing; Zi, Xiaohong; Wang, Junling; Wu, Lixiang; Xia, Kun; Tang, Beisha; Zhang, Ruxu
2015-06-16
To identify the underlying genetic cause in a consanguineous Chinese family segregating distal hereditary motor neuropathy (dHMN) in an autosomal recessive pattern. We used whole-exome sequencing and homozygosity mapping to detect the genetic variant in 2 affected individuals of the consanguineous Chinese family with dHMN. RNA analysis of peripheral blood leukocytes and immunofluorescence and immunoblotting of stable cell lines were performed to support the pathogenicity of the identified mutation. We identified 3 shared novel homozygous variants in 3 shared homozygous regions of the affected individuals. Sequencing of these 3 variants in family members revealed the c.151+1G>T mutation in SIGMAR1 gene, which located in homozygous region spanning approximately 5.3 Mb at chromosome 9p13.1-p13.3, segregated with the dHMN phenotype. The mutation causes an alternative splicing event and generates a transcript variant with an in-frame deletion of 60 base pairs in exon 1 (c.92_151del), and results in an internally shortened protein σ1R(31_50del). The proteasomal inhibitor treatment increased the intracellular amount of σ1R(31_50del) and led to the formation of nuclear aggregates. Stable expressing σ1R(31_50del) induced endoplasmic reticulum stress and enhanced apoptosis. The homozygous c.151+1G>T mutation in SIGMAR1 caused a novel form of autosomal recessive dHMN in a Chinese consanguineous family. Endoplasmic reticulum stress may have a role in the pathogenesis of dHMN. © 2015 American Academy of Neurology.
Comprehensive splicing functional analysis of DNA variants of the BRCA2 gene by hybrid minigenes
2012-01-01
Introduction The underlying pathogenic mechanism of a large fraction of DNA variants of disease-causing genes is the disruption of the splicing process. We aimed to investigate the effect on splicing of the BRCA2 variants c.8488-1G > A (exon 20) and c.9026_9030del (exon 23), as well as 41 BRCA2 variants reported in the Breast Cancer Information Core (BIC) mutation database. Methods DNA variants were analyzed with the splicing prediction programs NNSPLICE and Human Splicing Finder. Functional analyses of candidate variants were performed by lymphocyte RT-PCR and/or hybrid minigene assays. Forty-one BIC variants of exons 19, 20, 23 and 24 were bioinformatically selected and generated by PCR-mutagenesis of the wild type minigenes. Results Lymphocyte RT-PCR of c.8488-1G > A showed intron 19 retention and a 12-nucleotide deletion in exon 20, whereas c.9026_9030del did not show any splicing anomaly. Minigene analysis of c.8488-1G > A displayed the aforementioned aberrant isoforms but also exon 20 skipping. We further evaluated the splicing outcomes of 41 variants of four BRCA2 exons by minigene analysis. Eighteen variants presented splicing aberrations. Most variants (78.9%) disrupted the natural splice sites, whereas four altered putative enhancers/silencers and had a weak effect. Fluorescent RT-PCR of minigenes accurately detected 14 RNA isoforms generated by cryptic site usage, exon skipping and intron retention events. Fourteen variants showed total splicing disruptions and were predicted to truncate or eliminate essential domains of BRCA2. Conclusions A relevant proportion of BRCA2 variants are correlated with splicing disruptions, indicating that RNA analysis is a valuable tool to assess the pathogenicity of a particular DNA change. The minigene system is a straightforward and robust approach to detect variants with an impact on splicing and contributes to a better knowledge of this gene expression step. PMID:22632462
Oliveira, Jorge; Negrão, Luís; Fineza, Isabel; Taipa, Ricardo; Melo-Pires, Manuel; Fortuna, Ana Maria; Gonçalves, Ana Rita; Froufe, Hugo; Egas, Conceição; Santos, Rosário; Sousa, Mário
2015-06-01
Muscular dystrophies (MDs) are a group of hereditary muscle disorders that include two particularly heterogeneous subgroups: limb-girdle MD and congenital MD, linked to 52 different genes (seven common to both subgroups). Massive parallel sequencing technology may avoid the usual stepwise gene-by-gene analysis. We report the whole-exome sequencing (WES) analysis of a patient with childhood-onset progressive MD, also presenting mental retardation and dilated cardiomyopathy. Conventional sequencing had excluded eight candidate genes. WES of the trio (patient and parents) was performed using the ion proton sequencing system. Data analysis resorted to filtering steps using the GEMINI software revealed a novel silent variant in the choline kinase beta (CHKB) gene. Inspection of sequence alignments ultimately identified the causal variant (CHKB:c.1031+3G>C). This splice site mutation was confirmed using Sanger sequencing and its effect was further evaluated with gene expression analysis. On reassessment of the muscle biopsy, typical abnormal mitochondrial oxidative changes were observed. Mutations in CHKB have been shown to cause phosphatidylcholine deficiency in myofibers, causing a rare form of CMD (only 21 patients reported). Notwithstanding interpretative difficulties that need to be overcome before the integration of WES in the diagnostic workflow, this work corroborates its utility in solving cases from highly heterogeneous groups of diseases, in which conventional diagnostic approaches fail to provide a definitive diagnosis.
Scholthof, Karen-Beth G.
2015-01-01
In eukaryotes, alternative splicing (AS) promotes transcriptome and proteome diversity. The extent of genome-wide AS changes occurring during a plant-microbe interaction is largely unknown. Here, using high-throughput, paired-end RNA sequencing, we generated an isoform-level spliceome map of Brachypodium distachyon infected with Panicum mosaic virus and its satellite virus. Overall, we detected ∼44,443 transcripts in B. distachyon, ∼30% more than those annotated in the reference genome. Expression of ∼28,900 transcripts was ≥2 fragments per kilobase of transcript per million mapped fragments, and ∼42% of multi-exonic genes were alternatively spliced. Comparative analysis of AS patterns in B. distachyon, rice (Oryza sativa), maize (Zea mays), sorghum (Sorghum bicolor), Arabidopsis thaliana, potato (Solanum tuberosum), Medicago truncatula, and poplar (Populus trichocarpa) revealed conserved ratios of the AS types between monocots and dicots. Virus infection quantitatively altered AS events in Brachypodium with little effect on the AS ratios. We discovered AS events for >100 immune-related genes encoding receptor-like kinases, NB-LRR resistance proteins, transcription factors, RNA silencing, and splicing-associated proteins. Cloning and molecular characterization of SCL33, a serine/arginine-rich splicing factor, identified multiple novel intron-retaining splice variants that are developmentally regulated and modulated during virus infection. B. distachyon SCL33 splicing patterns are also strikingly conserved compared with a distant Arabidopsis SCL33 ortholog. This analysis provides new insights into AS landscapes conserved among monocots and dicots and uncovered AS events in plant defense-related genes. PMID:25634987
2009-09-01
binding ETS domain) and five type II (without ETS domain). Fusion-positive type I– and type II–containing phages were amplified with T3 and T7 primers...will be performed to identify the authentic 3’ UTRs from the mRNA pool from CaP patient specimens. Using phage excision strategy, we will use to... phage DNA sequences plasmids (cDNA) clones were generated by using phage excision strategy. Figure 1. ERG splice variants in prostate cancer
Ajiro, Masahiko; Jia, Rong; Zhang, Lifang; Liu, Xuefeng; Zheng, Zhi-Ming
2012-01-01
HPV16 E6 and E7, two viral oncogenes, are expressed from a single bicistronic pre-mRNA. In this report, we provide the evidence that the bicistronic pre-mRNA intron 1 contains three 5′ splice sites (5′ ss) and three 3′ splice sites (3′ ss) normally used in HPV16+ cervical cancer and its derived cell lines. The choice of two novel alternative 5′ ss (nt 221 5′ ss and nt 191 5′ ss) produces two novel isoforms of E6E7 mRNAs (E6*V and E6*VI). The nt 226 5′ ss and nt 409 3′ ss is preferentially selected over the other splice sites crossing over the intron to excise a minimal length of the intron in RNA splicing. We identified AACAAAC as the preferred branch point sequence (BPS) and an adenosine at nt 385 (underlined) in the BPS as a branch site to dictate the selection of the nt 409 3′ ss for E6*I splicing and E7 expression. Introduction of point mutations into the mapped BPS led to reduced U2 binding to the BPS and thereby inhibition of the second step of E6E7 splicing at the nt 409 3′ ss. Importantly, the E6E7 bicistronic RNA with a mutant BPS and inefficient splicing makes little or no E7 and the resulted E6 with mutations of 91QYNK94 to 91PSFW94 displays attenuate activity on p53 degradation. Together, our data provide structural basis of the E6E7 intron 1 for better understanding of how viral E6 and E7 expression is regulated by alternative RNA splicing. This study elucidates for the first time a mapped branch point in HPV16 genome involved in viral oncogene expression. PMID:23056301
Friedberg, Felix
2009-05-01
In this paper we examine (restricted to homo sapiens) the products resulting from gene duplication and the subsequent alternative splicing for the members of a multidomain group of proteins which possess the evolutionary conserved calponin homology CH domain, i.e. an "actin binding domain", as a singlet and which, in addition, contain the conserved cysteine rich double Zn finger possessing Lim domain, also as a singlet. Seven genes, resulting from gene duplications, were identified that code for seven group members for which pre-mRNAs appear to have undergone multiple alternative splicing: Mical 1, 2 and 3 are located on chromosomes 6q21, 11p15 and 22q11, respectively. The LMO7 gene is present on chromosome 13q22 and the LIMCH1 gene on chromosome 4p13. Micall1 is mapped to chromosome 22q13 and Micall2 to chromosome 7p22. Translated Gen/Bank ESTs suggest the existence of multiple products alternatively spliced from the pre-mRNAs encoded by these genes. Characteristic indicators of such splicing among the proteins derived from one gene must include containment of some common extensive 100% identical regions. In some instances only one exon might be partly or completely eliminated. Sometimes alternative splicing is also associated with an increased frequency of creation of an exon or part of an exon from an intron. Not only coding regions for the body of the protein but also for its N- or -C ends could be affected by the splicing. If created forms are merely beginning at different starting points but remain identical in sequence thereafter, their existence as products of alternate splicing must be questioned. In the splicings, described in this paper, multiple isoforms rather than a single isoform appear as products during the gene expression.
Jiang, Cong; Li, Yang; Li, Chaohui; Liu, Huiquan; Kang, Zhensheng; Xu, Jin-Rong
2016-01-01
PRP4 encodes the only kinase among the spliceosome components. Although it is an essential gene in the fission yeast and other eukaryotic organisms, the Fgprp4 mutant was viable in the wheat scab fungus Fusarium graminearum. Deletion of FgPRP4 did not block intron splicing but affected intron splicing efficiency in over 60% of the F. graminearum genes. The Fgprp4 mutant had severe growth defects and produced spontaneous suppressors that were recovered in growth rate. Suppressor mutations were identified in the PRP6, PRP31, BRR2, and PRP8 orthologs in nine suppressor strains by sequencing analysis with candidate tri-snRNP component genes. The Q86K mutation in FgMSL1 was identified by whole genome sequencing in suppressor mutant S3. Whereas two of the suppressor mutations in FgBrr2 and FgPrp8 were similar to those characterized in their orthologs in yeasts, suppressor mutations in Prp6 and Prp31 orthologs or FgMSL1 have not been reported. Interestingly, four and two suppressor mutations identified in FgPrp6 and FgPrp31, respectively, all are near the conserved Prp4-phosphorylation sites, suggesting that these mutations may have similar effects with phosphorylation by Prp4 kinase. In FgPrp31, the non-sense mutation at R464 resulted in the truncation of the C-terminal 130 aa region that contains all the conserved Prp4-phosphorylation sites. Deletion analysis showed that the N-terminal 310-aa rich in SR residues plays a critical role in the localization and functions of FgPrp4. We also conducted phosphoproteomics analysis with FgPrp4 and identified S289 as the phosphorylation site that is essential for its functions. These results indicated that FgPrp4 is critical for splicing efficiency but not essential for intron splicing, and FgPrp4 may regulate pre-mRNA splicing by phosphorylation of other components of the tri-snRNP although itself may be activated by phosphorylation at S289. PMID:27058959
PVT: An Efficient Computational Procedure to Speed up Next-generation Sequence Analysis
2014-01-01
Background High-throughput Next-Generation Sequencing (NGS) techniques are advancing genomics and molecular biology research. This technology generates substantially large data which puts up a major challenge to the scientists for an efficient, cost and time effective solution to analyse such data. Further, for the different types of NGS data, there are certain common challenging steps involved in analysing those data. Spliced alignment is one such fundamental step in NGS data analysis which is extremely computational intensive as well as time consuming. There exists serious problem even with the most widely used spliced alignment tools. TopHat is one such widely used spliced alignment tools which although supports multithreading, does not efficiently utilize computational resources in terms of CPU utilization and memory. Here we have introduced PVT (Pipelined Version of TopHat) where we take up a modular approach by breaking TopHat’s serial execution into a pipeline of multiple stages, thereby increasing the degree of parallelization and computational resource utilization. Thus we address the discrepancies in TopHat so as to analyze large NGS data efficiently. Results We analysed the SRA dataset (SRX026839 and SRX026838) consisting of single end reads and SRA data SRR1027730 consisting of paired-end reads. We used TopHat v2.0.8 to analyse these datasets and noted the CPU usage, memory footprint and execution time during spliced alignment. With this basic information, we designed PVT, a pipelined version of TopHat that removes the redundant computational steps during ‘spliced alignment’ and breaks the job into a pipeline of multiple stages (each comprising of different step(s)) to improve its resource utilization, thus reducing the execution time. Conclusions PVT provides an improvement over TopHat for spliced alignment of NGS data analysis. PVT thus resulted in the reduction of the execution time to ~23% for the single end read dataset. Further, PVT designed for paired end reads showed an improved performance of ~41% over TopHat (for the chosen data) with respect to execution time. Moreover we propose PVT-Cloud which implements PVT pipeline in cloud computing system. PMID:24894600
PVT: an efficient computational procedure to speed up next-generation sequence analysis.
Maji, Ranjan Kumar; Sarkar, Arijita; Khatua, Sunirmal; Dasgupta, Subhasis; Ghosh, Zhumur
2014-06-04
High-throughput Next-Generation Sequencing (NGS) techniques are advancing genomics and molecular biology research. This technology generates substantially large data which puts up a major challenge to the scientists for an efficient, cost and time effective solution to analyse such data. Further, for the different types of NGS data, there are certain common challenging steps involved in analysing those data. Spliced alignment is one such fundamental step in NGS data analysis which is extremely computational intensive as well as time consuming. There exists serious problem even with the most widely used spliced alignment tools. TopHat is one such widely used spliced alignment tools which although supports multithreading, does not efficiently utilize computational resources in terms of CPU utilization and memory. Here we have introduced PVT (Pipelined Version of TopHat) where we take up a modular approach by breaking TopHat's serial execution into a pipeline of multiple stages, thereby increasing the degree of parallelization and computational resource utilization. Thus we address the discrepancies in TopHat so as to analyze large NGS data efficiently. We analysed the SRA dataset (SRX026839 and SRX026838) consisting of single end reads and SRA data SRR1027730 consisting of paired-end reads. We used TopHat v2.0.8 to analyse these datasets and noted the CPU usage, memory footprint and execution time during spliced alignment. With this basic information, we designed PVT, a pipelined version of TopHat that removes the redundant computational steps during 'spliced alignment' and breaks the job into a pipeline of multiple stages (each comprising of different step(s)) to improve its resource utilization, thus reducing the execution time. PVT provides an improvement over TopHat for spliced alignment of NGS data analysis. PVT thus resulted in the reduction of the execution time to ~23% for the single end read dataset. Further, PVT designed for paired end reads showed an improved performance of ~41% over TopHat (for the chosen data) with respect to execution time. Moreover we propose PVT-Cloud which implements PVT pipeline in cloud computing system.
Mubiru, James N; Yang, Alice S; Olsen, Christian; Nayak, Sudhir; Livi, Carolina B; Dick, Edward J; Owston, Michael; Garcia-Forey, Magdalena; Shade, Robert E; Rogers, Jeffrey
2014-01-01
The function of prostate-specific antigen (PSA) is to liquefy the semen coagulum so that the released sperm can fuse with the ovum. Fifteen spliced variants of the PSA gene have been reported in humans, but little is known about alternative splicing in nonhuman primates. Positive selection has been reported in sex- and reproductive-related genes from sea urchins to Drosophila to humans; however, there are few studies of adaptive evolution of the PSA gene. Here, using polymerase chain reaction (PCR) product cloning and sequencing, we study PSA transcript variant heterogeneity in the prostates of chimpanzees (Pan troglodytes), cynomolgus monkeys (Macaca fascicularis), baboons (Papio hamadryas anubis), and African green monkeys (Chlorocebus aethiops). Six PSA variants were identified in the chimpanzee prostate, but only two variants were found in cynomolgus monkeys, baboons, and African green monkeys. In the chimpanzee the full-length transcript is expressed at the same magnitude as the transcripts that retain intron 3. We have found previously unidentified splice variants of the PSA gene, some of which might be linked to disease conditions. Selection on the PSA gene was studied in 11 primate species by computational methods using the sequences reported here for African green monkey, cynomolgus monkey, baboon, and chimpanzee and other sequences available in public databases. A codon-based analysis (dN/dS) of the PSA gene identified potential adaptive evolution at five residue sites (Arg45, Lys70, Gln144, Pro189, and Thr203).
Waluk, Dominik P; Zur, Gila; Kaufmann, Ronnie; Welle, Monika M; Jagannathan, Vidhya; Drögemüller, Cord; Müller, Eliane J; Leeb, Tosso; Galichet, Arnaud
2016-09-08
X-linked hypohidrotic ectodermal dysplasia (XLHED) caused by variants in the EDA gene represents the most common ectodermal dysplasia in humans. We investigated three male mixed-breed dogs with an ectodermal dysplasia phenotype characterized by marked hypotrichosis and multifocal complete alopecia, almost complete absence of sweat and sebaceous glands, and altered dentition with missing and abnormally shaped teeth. Analysis of SNP chip genotypes and whole genome sequence data from the three affected dogs revealed that the affected dogs shared the same haplotype on a large segment of the X-chromosome, including the EDA gene. Unexpectedly, the whole genome sequence data did not reveal any nonsynonymous EDA variant in the affected dogs. We therefore performed an RNA-seq experiment on skin biopsies to search for changes in the transcriptome. This analysis revealed that the EDA transcript in the affected dogs lacked 103 nucleotides encoded by exon 2. We speculate that this exon skipping is caused by a genetic variant located in one of the large introns flanking this exon, which was missed by whole genome sequencing with the illumina short read technology. The altered EDA transcript splicing most likely causes the observed ectodermal dysplasia in the affected dogs. These dogs thus offer an excellent opportunity to gain insights into the complex splicing processes required for expression of the EDA gene, and other genes with large introns. Copyright © 2016 Waluk et al.
Novel variants in PAX6 gene caused congenital aniridia in two Chinese families.
Zhang, R; Linpeng, S; Wei, X; Li, H; Huang, Y; Guo, J; Wu, Q; Liang, D; Wu, L
2017-06-01
PurposeTo reveal the underlying genetic defect in two four-generation Chinese families with aniridia and explore the pathologic mechanism.MethodsFull ophthalmic examinations were performed in two families with aniridia. The PAX6 gene was directly sequenced in patients of two families, and the detected variants were screened in unaffected family members and two hundred unrelated healthy controls. Real-time quantitative PCR was used to explore pathologic mechanisms of the two variants.ResultsAniridia, cataract, and oscillatory nystagmus were observed in patients of the two families. In addition, we observed corneal opacity and microphthalmus in family 1, and strabismus, left ectopia lentis, microphthalmus, and microcornea in family 2. Sanger sequencing detected a novel 1-bp duplication (c.50dupA) in family 1 and a novel 2-bp splice site deletion (c.765+1_765+2delGT) in family 2. Sequencing of cDNA indicated skipping of exon 9 caused by the splice site deletion, being predicted to cause a premature stop codon, as well as the duplication. The PAX6 mRNA significantly lower in patients with aniridia than in unaffected family members in both families, suggesting that the duplication and splice site deletion caused nonsense-mediated mRNA decay.ConclusionsOur study identified two novel PAX6 variants in two families with aniridia and revealed the pathogenicity of the variants; this would expand the variant spectrum of PAX6 and help us better understand the molecular basis of aniridia, thus facilitating genetic counseling.
Wang, Dan; Liang, Shengyun; Zhang, Zhao; Zhao, Guoru; Hu, Yuan; Liang, Shengran; Zhang, Xipeng; Banerjee, Santasree
2017-03-28
Familial adenomatous polyposis (FAP) is an autosomal dominant precancerous condition, clinically characterized by the presence of multiple colorectal adenomas or polyps. Patients with FAP has a high risk of developing colorectal cancer (CRC) from these colorectal adenomatous polyps by the mean age of diagnosis at 40 years. Germline mutations of the APC gene cause familial adenomatous polyposis (FAP). Colectomy has recommended for the FAP patients with significant polyposis. Here, we present a clinical molecular study of a four generation Chinese family with FAP. Clinical diagnosis of FAP has been done according to the phenotype, family history and medical records. Patient's blood samples were collected and genomic DNA was extracted. In order to identify the pathogenic mutation underlying the disease phenotype targeted next-generation sequencing and confirmatory sanger sequencing has undertaken. Targeted next generation sequencing identified a novel heterozygous splice-acceptor site mutation [c.1744-1G>A] in intron 14 of APC gene, which is co-segregated with the FAP phenotypes in the proband and amongst all the affected family members. This mutation is not present in unaffected family members and in normal healthy controls of same ethnic origin. According to the LOVD database for Chinese colorectal cancer patients, in Chinese population, 60% of the previously reported APC gene mutations causes FAP, are missense mutations. This novel splice-acceptor site mutation causing FAP in this Chinese family expands the germline mutation spectrum of the APC gene in the Chinese population.
Cai, Xiaohong; Qian, Chengrui; Wu, Wenman; Lei, Hang; Ding, Qiulan; Zou, Wei; Xiang, Dong; Wang, Xuefeng
2017-09-01
The amino acid substitutions caused by ABO gene mutations are usually predicted to impact glycosyltransferase's function or its biosynthesis. Here we report an ABO exonic missense mutation that affects B-antigen expression by decreasing the mRNA level of the ABO gene rather than the amino acid change. Serologic studies including plasma total GTB transfer capacity were performed. The exon sequences of the ABO gene were analyzed by Sanger sequencing. B 310 cDNA with c.28G>A (p.G10R) mutation was expressed in HeLa cells and total GTB transfer capacity in cell supernatant was measured. Flow cytometry was performed on these HeLa cells after transfection, and agglutination of Hela-B weak cells was also examined. The mRNA of the ABO gene was analyzed by direct sequencing and real-time reverse transcriptase-polymerase chain reaction. A minigene construct was prepared to evaluate the potential of splicing. While plasma total GTB transfer capacity was undetectable in this B 3 -like individual, the relative percentage of antigen-expressing cells and mean fluorescence index of the B weak red blood cells (RBCs) were 19 and 14% of normal B RBCs, respectively. There was no significant difference of total GTB transfer capacity in cell supernatant and B-antigen expression on cell surfaces between HeLa cells transfected with B 310 cDNA and B cDNA. The mRNA expression level of B 310 in peripheral whole blood was significantly reduced. The amount of splicing is significantly lower in c.28G>A construct compared to that in wild-type construct after transfection in K562 cells. ABO c.28G>A mutation may cause B 3 -like subgroup by affecting RNA splicing of the ABO gene. © 2017 AABB.
Ramirez, Agnese; Crisafulli, Sebastiano G.; Rizzuti, Mafalda; Bresolin, Nereo; Comi, Giacomo P.; Corti, Stefania
2018-01-01
Spinal muscular atrophy (SMA) is an autosomal-recessive childhood motor neuron disease and the main genetic cause of infant mortality. SMA is caused by deletions or mutations in the survival motor neuron 1 (SMN1) gene, which results in SMN protein deficiency. Only one approved drug has recently become available and allows for the correction of aberrant splicing of the paralogous SMN2 gene by antisense oligonucleotides (ASOs), leading to production of full-length SMN protein. We have already demonstrated that a sequence of an ASO variant, Morpholino (MO), is particularly suitable because of its safety and efficacy profile and is both able to increase SMN levels and rescue the murine SMA phenotype. Here, we optimized this strategy by testing the efficacy of four new MO sequences targeting SMN2. Two out of the four new MO sequences showed better efficacy in terms of SMN protein production both in SMA induced pluripotent stem cells (iPSCs) and SMAΔ7 mice. Further, the effect was enhanced when different MO sequences were administered in combination. Our data provide an important insight for MO-based treatment for SMA. Optimization of the target sequence and validation of a treatment based on a combination of different MO sequences could support further pre-clinical studies and the progression toward future clinical trials. PMID:29316633
Ramirez, Agnese; Crisafulli, Sebastiano G; Rizzuti, Mafalda; Bresolin, Nereo; Comi, Giacomo P; Corti, Stefania; Nizzardo, Monica
2018-01-06
Spinal muscular atrophy (SMA) is an autosomal-recessive childhood motor neuron disease and the main genetic cause of infant mortality. SMA is caused by deletions or mutations in the survival motor neuron 1 ( SMN1 ) gene, which results in SMN protein deficiency. Only one approved drug has recently become available and allows for the correction of aberrant splicing of the paralogous SMN2 gene by antisense oligonucleotides (ASOs), leading to production of full-length SMN protein. We have already demonstrated that a sequence of an ASO variant, Morpholino (MO), is particularly suitable because of its safety and efficacy profile and is both able to increase SMN levels and rescue the murine SMA phenotype. Here, we optimized this strategy by testing the efficacy of four new MO sequences targeting SMN2 . Two out of the four new MO sequences showed better efficacy in terms of SMN protein production both in SMA induced pluripotent stem cells (iPSCs) and SMAΔ7 mice. Further, the effect was enhanced when different MO sequences were administered in combination. Our data provide an important insight for MO-based treatment for SMA. Optimization of the target sequence and validation of a treatment based on a combination of different MO sequences could support further pre-clinical studies and the progression toward future clinical trials.
King, Benjamin L; Shi, Ling Fang; Kao, Peter; Clusin, William T
2016-03-01
Elasmobranchs detect small potentials using excitable cells of the ampulla of Lorenzini which have calcium-activated K(+) channels, first described in 1974. A distinctive feature of the outward current in voltage clamped ampullae is its apparent insensitivity to voltage. The sequence of a BK channel α isoform expressed in the ampulla of the skate was characterized. A signal peptide is present at the beginning of the gene. When compared to human isoform 1 (the canonical sequence), the largest difference was absence of a 59 amino acid region from the S8-S9 intra-cellular linker that contains the strex regulatory domain. The ampulla isoform was also compared with the isoform predicted in late skate embryos where strex was also absent. The BK voltage sensors were conserved in both skate isoforms. Differences between the skate and human BK channel included alternative splicing. Alternative splicing occurs at seven previously defined sites that are characteristic for BK channels in general and hair cells in particular. Skate BK sequences were highly similar to the Australian ghost shark and several other vertebrate species. Based on alignment of known BK sequences with the skate genome and transcriptome, there are at least two isoforms of Kcnma1α expressed in the skate. One of the β subunits (β4), which is known to decrease voltage sensitivity, was also identified in the skate genome and transcriptome and in the ampulla. These studies advance our knowledge of BK channels and suggest further studies in the ampulla and other excitable tissues. Copyright © 2015 Elsevier B.V. All rights reserved.
Large exon size does not limit splicing in vivo.
Chen, I T; Chasin, L A
1994-03-01
Exon sizes in vertebrate genes are, with a few exceptions, limited to less than 300 bases. It has been proposed that this limitation may derive from the exon definition model of splice site recognition. In this model, a downstream donor site enhances splicing at the upstream acceptor site of the same exon. This enhancement may require contact between factors bound to each end of the exon; an exon size limitation would promote such contact. To test the idea that proximity was required for exon definition, we inserted random DNA fragments from Escherichia coli into a central exon in a three-exon dihydrofolate reductase minigene and tested whether the expanded exons were efficiently spliced. DNA from a plasmid library of expanded minigenes was used to transfect a CHO cell deletion mutant lacking the dhfr locus. PCR analysis of DNA isolated from the pooled stable cotransfectant populations displayed a range of DNA insert sizes from 50 to 1,500 nucleotides. A parallel analysis of the RNA from this population by reverse transcription followed by PCR showed a similar size distribution. Central exons as large as 1,400 bases could be spliced into mRNA. We also tested individual plasmid clones containing exon inserts of defined sizes. The largest exon included in mRNA was 1,200 bases in length, well above the 300-base limit implied by the survey of naturally occurring exons. We conclude that a limitation in exon size is not part of the exon definition mechanism.
Kawaguchi, Risa; Kiryu, Hisanori
2016-05-06
RNA secondary structure around splice sites is known to assist normal splicing by promoting spliceosome recognition. However, analyzing the structural properties of entire intronic regions or pre-mRNA sequences has been difficult hitherto, owing to serious experimental and computational limitations, such as low read coverage and numerical problems. Our novel software, "ParasoR", is designed to run on a computer cluster and enables the exact computation of various structural features of long RNA sequences under the constraint of maximal base-pairing distance. ParasoR divides dynamic programming (DP) matrices into smaller pieces, such that each piece can be computed by a separate computer node without losing the connectivity information between the pieces. ParasoR directly computes the ratios of DP variables to avoid the reduction of numerical precision caused by the cancellation of a large number of Boltzmann factors. The structural preferences of mRNAs computed by ParasoR shows a high concordance with those determined by high-throughput sequencing analyses. Using ParasoR, we investigated the global structural preferences of transcribed regions in the human genome. A genome-wide folding simulation indicated that transcribed regions are significantly more structural than intergenic regions after removing repeat sequences and k-mer frequency bias. In particular, we observed a highly significant preference for base pairing over entire intronic regions as compared to their antisense sequences, as well as to intergenic regions. A comparison between pre-mRNAs and mRNAs showed that coding regions become more accessible after splicing, indicating constraints for translational efficiency. Such changes are correlated with gene expression levels, as well as GC content, and are enriched among genes associated with cytoskeleton and kinase functions. We have shown that ParasoR is very useful for analyzing the structural properties of long RNA sequences such as mRNAs, pre-mRNAs, and long non-coding RNAs whose lengths can be more than a million bases in the human genome. In our analyses, transcribed regions including introns are indicated to be subject to various types of structural constraints that cannot be explained from simple sequence composition biases. ParasoR is freely available at https://github.com/carushi/ParasoR .
The roles played by highly truncated splice variants of G protein-coupled receptors
2012-01-01
Alternative splicing of G protein-coupled receptor (GPCR) genes greatly increases the total number of receptor isoforms which may be expressed in a cell-dependent and time-dependent manner. This increased diversity of cell signaling options caused by the generation of splice variants is further enhanced by receptor dimerization. When alternative splicing generates highly truncated GPCRs with less than seven transmembrane (TM) domains, the predominant effect in vitro is that of a dominant-negative mutation associated with the retention of the wild-type receptor in the endoplasmic reticulum (ER). For constitutively active (agonist-independent) GPCRs, their attenuated expression on the cell surface, and consequent decreased basal activity due to the dominant-negative effect of truncated splice variants, has pathological consequences. Truncated splice variants may conversely offer protection from disease when expression of co-receptors for binding of infectious agents to cells is attenuated due to ER retention of the wild-type co-receptor. In this review, we will see that GPCRs retained in the ER can still be functionally active but also that highly truncated GPCRs may also be functionally active. Although rare, some truncated splice variants still bind ligand and activate cell signaling responses. More importantly, by forming heterodimers with full-length GPCRs, some truncated splice variants also provide opportunities to generate receptor complexes with unique pharmacological properties. So, instead of assuming that highly truncated GPCRs are associated with faulty transcription processes, it is time to reassess their potential benefit to the host organism. PMID:22938630
Rice, Michael; Gladstone, William; Weir, Michael
2004-01-01
We discuss how relational databases constitute an ideal framework for representing and analyzing large-scale genomic data sets in biology. As a case study, we describe a Drosophila splice-site database that we recently developed at Wesleyan University for use in research and teaching. The database stores data about splice sites computed by a custom algorithm using Drosophila cDNA transcripts and genomic DNA and supports a set of procedures for analyzing splice-site sequence space. A generic Web interface permits the execution of the procedures with a variety of parameter settings and also supports custom structured query language queries. Moreover, new analytical procedures can be added by updating special metatables in the database without altering the Web interface. The database provides a powerful setting for students to develop informatic thinking skills.
2004-01-01
We discuss how relational databases constitute an ideal framework for representing and analyzing large-scale genomic data sets in biology. As a case study, we describe a Drosophila splice-site database that we recently developed at Wesleyan University for use in research and teaching. The database stores data about splice sites computed by a custom algorithm using Drosophila cDNA transcripts and genomic DNA and supports a set of procedures for analyzing splice-site sequence space. A generic Web interface permits the execution of the procedures with a variety of parameter settings and also supports custom structured query language queries. Moreover, new analytical procedures can be added by updating special metatables in the database without altering the Web interface. The database provides a powerful setting for students to develop informatic thinking skills. PMID:15592597
Alternative Splicing of a Novel Inducible Exon Diversifies the CASK Guanylate Kinase Domain
Dembowski, Jill A.; An, Ping; Scoulos-Hanson, Maritsa; Yeo, Gene; Han, Joonhee; Fu, Xiang-Dong; Grabowski, Paula J.
2012-01-01
Alternative pre-mRNA splicing has a major impact on cellular functions and development with the potential to fine-tune cellular localization, posttranslational modification, interaction properties, and expression levels of cognate proteins. The plasticity of regulation sets the stage for cells to adjust the relative levels of spliced mRNA isoforms in response to stress or stimulation. As part of an exon profiling analysis of mouse cortical neurons stimulated with high KCl to induce membrane depolarization, we detected a previously unrecognized exon (E24a) of the CASK gene, which encodes for a conserved peptide insertion in the guanylate kinase interaction domain. Comparative sequence analysis shows that E24a appeared selectively in mammalian CASK genes as part of a >3,000 base pair intron insertion. We demonstrate that a combination of a naturally defective 5′ splice site and negative regulation by several splicing factors, including SC35 (SRSF2) and ASF/SF2 (SRSF1), drives E24a skipping in most cell types. However, this negative regulation is countered with an observed increase in E24a inclusion after neuronal stimulation and NMDA receptor signaling. Taken together, E24a is typically a skipped exon, which awakens during neuronal stimulation with the potential to diversify the protein interaction properties of the CASK polypeptide. PMID:23008758
Long noncoding RNA Saf and splicing factor 45 increase soluble Fas and resistance to apoptosis
Riberdy, Janice M.; Persons, Derek A.; Wilber, Andrew
2016-01-01
In multicellular organisms, cell growth and differentiation is controlled in part by programmed cell death or apoptosis. One major apoptotic pathway is triggered by Fas receptor (Fas)-Fas ligand (FasL) interaction. Neoplastic cells are frequently resistant to Fas-mediated apoptosis, evade Fas signals through down regulation of Fas and produce soluble Fas proteins that bind FasL thereby blocking apoptosis. Soluble Fas (sFas) is an alternative splice product of Fas pre-mRNA, commonly created by exclusion of transmembrane spanning sequences encoded within exon 6 (FasΔEx6). Long non-coding RNAs (lncRNAs) interact with other RNAs, DNA, and proteins to regulate gene expression. One lncRNA, Fas-antisense or Saf, was shown to participate in alternative splicing of Fas pre-mRNA through unknown mechanisms. We show that Saf is localized in the nucleus where it interacts with Fas receptor pre-mRNA and human splicing factor 45 (SPF45) to facilitate alternative splicing and exclusion of exon 6. The product is a soluble Fas protein that protects cells against FasL-induced apoptosis. Collectively, these studies reveal a novel mechanism to modulate this critical cell death program by an lncRNA and its protein partner. PMID:26885613
Succession of splicing regulatory elements determines cryptic 5΄ss functionality
Brillen, Anna-Lena; Schöneweis, Katrin; Walotka, Lara; Hartmann, Linda; Müller, Lisa; Ptok, Johannes; Kaisers, Wolfgang; Poschmann, Gereon; Stühler, Kai; Buratti, Emanuele
2017-01-01
Abstract A critical step in exon definition is the recognition of a proper splice donor (5΄ss) by the 5’ end of U1 snRNA. In the selection of appropriate 5΄ss, cis-acting splicing regulatory elements (SREs) are indispensable. As a model for 5΄ss recognition, we investigated cryptic 5΄ss selection within the human fibrinogen Bβ-chain gene (FGB) exon 7, where we identified several exonic SREs that simultaneously acted on up- and downstream cryptic 5΄ss. In the FGB exon 7 model system, 5΄ss selection iteratively proceeded along an alternating sequence of U1 snRNA binding sites and interleaved SREs which in principle supported different 3’ exon ends. Like in a relay race, SREs either suppressed a potential 5΄ss and passed the splicing baton on or splicing actually occurred. From RNA-Seq data, we systematically selected 19 genes containing exons with silent U1 snRNA binding sites competing with nearby highly used 5΄ss. Extensive SRE analysis by different algorithms found authentic 5΄ss significantly more supported by SREs than silent U1 snRNA binding sites, indicating that our concept may permit generalization to a model for 5΄ss selection and 3’ exon end definition. PMID:28039323
Spliced leader RNA trans-splicing discovered in copepods
NASA Astrophysics Data System (ADS)
Yang, Feifei; Xu, Donghui; Zhuang, Yunyun; Yi, Xiaoyan; Huang, Yousong; Chen, Hongju; Lin, Senjie; Campbell, David A.; Sturm, Nancy R.; Liu, Guangxing; Zhang, Huan
2015-12-01
Copepods are one of the most abundant metazoans in the marine ecosystem, constituting a critical link in aquatic food webs and contributing significantly to the global carbon budget, yet molecular mechanisms of their gene expression are not well understood. Here we report the detection of spliced leader (SL) trans-splicing in calanoid copepods. We have examined nine species of wild-caught copepods from Jiaozhou Bay, China that represent the major families of the calanoids. All these species contained a common 46-nt SL (CopepodSL). We further determined the size of CopepodSL precursor RNA (slRNA; 108-158 nt) through genomic analysis and 3‧-RACE technique, which was confirmed by RNA blot analysis. Structure modeling showed that the copepod slRNA folded into typical slRNA secondary structures. Using a CopepodSL-based primer set, we selectively enriched and sequenced copepod full-length cDNAs, which led to the characterization of copepod transcripts and the cataloging of the complete set of 79 eukaryotic cytoplasmic ribosomal proteins (cRPs) for a single copepod species. We uncovered the SL trans-splicing in copepod natural populations, and demonstrated that CopepodSL was a sensitive and specific tool for copepod transcriptomic studies at both the individual and population levels and that it would be useful for metatranscriptomic analysis of copepods.
Increased complexity of circRNA expression during species evolution.
Dong, Rui; Ma, Xu-Kai; Chen, Ling-Ling; Yang, Li
2017-08-03
Circular RNAs (circRNAs) are broadly identified from precursor mRNA (pre-mRNA) back-splicing across various species. Recent studies have suggested a cell-/tissue- specific manner of circRNA expression. However, the distinct expression pattern of circRNAs among species and its underlying mechanism still remain to be explored. Here, we systematically compared circRNA expression from human and mouse, and found that only a small portion of human circRNAs could be determined in parallel mouse samples. The conserved circRNA expression between human and mouse is correlated with the existence of orientation-opposite complementary sequences in introns that flank back-spliced exons in both species, but not the circRNA sequences themselves. Quantification of RNA pairing capacity of orientation-opposite complementary sequences across circRNA-flanking introns by Complementary Sequence Index (CSI) identifies that among all types of complementary sequences, SINEs, especially Alu elements in human, contribute the most for circRNA formation and that their diverse distribution across species leads to the increased complexity of circRNA expression during species evolution. Together, our integrated and comparative reference catalog of circRNAs in different species reveals a species-specific pattern of circRNA expression and suggests a previously under-appreciated impact of fast-evolved SINEs on the regulation of (circRNA) gene expression.
Discovery of a Mammalian Splice Variant of Myostatin That Stimulates Myogenesis
Jeanplong, Ferenc; Falconer, Shelley J.; Oldham, Jenny M.; Thomas, Mark; Gray, Tarra S.; Hennebry, Alex; Matthews, Kenneth G.; Kemp, Frederick C.; Patel, Ketan; Berry, Carole; Nicholas, Gina; McMahon, Christopher D.
2013-01-01
Myostatin plays a fundamental role in regulating the size of skeletal muscles. To date, only a single myostatin gene and no splice variants have been identified in mammals. Here we describe the splicing of a cryptic intron that removes the coding sequence for the receptor binding moiety of sheep myostatin. The deduced polypeptide sequence of the myostatin splice variant (MSV) contains a 256 amino acid N-terminal domain, which is common to myostatin, and a unique C-terminus of 65 amino acids. Western immunoblotting demonstrated that MSV mRNA is translated into protein, which is present in skeletal muscles. To determine the biological role of MSV, we developed an MSV over-expressing C2C12 myoblast line and showed that it proliferated faster than that of the control line in association with an increased abundance of the CDK2/Cyclin E complex in the nucleus. Recombinant protein made for the novel C-terminus of MSV also stimulated myoblast proliferation and bound to myostatin with high affinity as determined by surface plasmon resonance assay. Therefore, we postulated that MSV functions as a binding protein and antagonist of myostatin. Consistent with our postulate, myostatin protein was co-immunoprecipitated from skeletal muscle extracts with an MSV-specific antibody. MSV over-expression in C2C12 myoblasts blocked myostatin-induced Smad2/3-dependent signaling, thereby confirming that MSV antagonizes the canonical myostatin pathway. Furthermore, MSV over-expression increased the abundance of MyoD, Myogenin and MRF4 proteins (P<0.05), which indicates that MSV stimulates myogenesis through the induction of myogenic regulatory factors. To help elucidate a possible role in vivo, we observed that MSV protein was more abundant during early post-natal muscle development, while myostatin remained unchanged, which suggests that MSV may promote the growth of skeletal muscles. We conclude that MSV represents a unique example of intra-genic regulation in which a splice variant directly antagonizes the biological activity of the canonical gene product. PMID:24312578
Androgen Receptor Splice Variants and Resistance to Taxane Chemotherapy
2016-10-01
sequence (MTAS) on AR. Milestone: Identify the sequence of AR that is involved in microtubule-binding. Publish 1 peer-reviewed paper . Major Task 4...joined the project and worked on the validation of the PAXgene assay. 6. Products Publications, conference papers , and presentations...Journal publications. The following paper was published: Xichun Liu, Elisa Ledet, Dongying Li, Ary Dotiwala, Allie Steinberger, Jianzhuo
Legendre, Marine; Rodriguez-Ballesteros, Montserrat; Rossi, Massimiliano; Abadie, Véronique; Amiel, Jeanne; Revencu, Nicole; Blanchet, Patricia; Brioude, Frédéric; Delrue, Marie-Ange; Doubaj, Yassamine; Sefiani, Abdelaziz; Francannet, Christine; Holder-Espinasse, Muriel; Jouk, Pierre-Simon; Julia, Sophie; Melki, Judith; Mur, Sébastien; Naudion, Sophie; Fabre-Teste, Jennifer; Busa, Tiffany; Stamm, Stephen; Lyonnet, Stanislas; Attie-Bitach, Tania; Kitzis, Alain; Gilbert-Dussardier, Brigitte; Bilan, Frédéric
2018-02-01
CHARGE syndrome is a rare genetic disorder mainly due to de novo and private truncating mutations of CHD7 gene. Here we report an intriguing hot spot of intronic mutations (c.5405-7G > A, c.5405-13G > A, c.5405-17G > A and c.5405-18C > A) located in CHD7 IVS25. Combining computational in silico analysis, experimental branch-point determination and in vitro minigene assays, our study explains this mutation hot spot by a particular genomic context, including the weakness of the IVS25 natural acceptor-site and an unconventional lariat sequence localized outside the common 40 bp upstream the acceptor splice site. For each of the mutations reported here, bioinformatic tools indicated a newly created 3' splice site, of which the existence was confirmed using pSpliceExpress, an easy-to-use and reliable splicing reporter tool. Our study emphasizes the idea that combining these two complementary approaches could increase the efficiency of routine molecular diagnosis.
Barańska, Marta; Czerwińska-Rybak, Joanna; Gil, Lidia; Komarnicki, Mieczysław
2015-01-01
The myelodysplastic syndromes (MDS) constitute heterogeneous group of clonal disorders, characterized by ineffective hematopoiesis, peripheral cytopenia and increased risk of acute myeloid leukemia development. Molecular mechanisms behind MDS have not been fully explained, however recent studies based on new technologies confirmed that epigenetic abnormalities and somatic mutation in the spliceasome machinery are crucial in pathogenesis of these diseases. Abnormal mRNA splicing (excision of intronic sequences from mRNA) has been found in over half of all MDS patients and resulted in accumulation of cytogenetical and molecular changes. The biological impact of splicing factor genes mutations has been evaluated only in a limited extend and current studies concentrate on analysis of MDS transcriptome. Molecular characteristic of classical and alternative splicing is presented in the paper, according to current knowledge. We review the most prominent findings from recent years concerning mutation in the spliceasome machinery with respect to MDS phenotype and disease prognosis. Perspectives in applying of novel diagnostic and therapeutic possibilities for myelodysplasia, based on spliceosome mutations identification are also presented. © 2015 MEDPRESS.
MAJIQ-SPEL: Web-tool to interrogate classical and complex splicing variations from RNA-Seq data.
Green, Christopher J; Gazzara, Matthew R; Barash, Yoseph
2017-09-11
Analysis of RNA sequencing (RNA-Seq) data have highlighted the fact that most genes undergo alternative splicing (AS) and that these patterns are tightly regulated. Many of these events are complex, resulting in numerous possible isoforms that quickly become difficult to visualize, interpret, and experimentally validate. To address these challenges we developed MAJIQ-SPEL, a web-tool that takes as input local splicing variations (LSVs) quantified from RNA-Seq data and provides users with visualization and quantification of gene isoforms associated with those. Importantly, MAJIQ-SPEL is able to handle both classical (binary) and complex, non-binary, splicing variations. Using a matching primer design algorithm it also suggests users possible primers for experimental validation by RT-PCR and displays those, along with the matching protein domains affected by the LSV, on UCSC Genome Browser for further downstream analysis. Program and code will be available at http://majiq.biociphers.org/majiq-spel. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com
Generation of Chimeric RNAs by cis-splicing of adjacent genes (cis-SAGe) in mammals.
Zhuo, Jian-Shu; Jing, Xiao-Yan; Du, Xin; Yang, Xiu-Qin
2018-02-20
Chimeric RNA molecules, possessing exons from two or more independent genes, are traditionally believed to be produced by chromosome rearrangement. However, recent studies revealed that cis-splicing of adjacent genes (cis- SAGe) is one of the major mechanisms underlying the formation of chimeric RNAs. cis-SAGe refers to intergenic splicing of directly adjacent genes with the same transcriptional orientation, resulting in read-through transcripts, termed chimeric RNAs, which contain sequences from two or more parental genes. cis-SAGe was first identified in tumor cells, since then its potential in carcinogenesis has attracted extensive attention. More and more scientists are focusing on it. With the development of research, cis-SAGe was found to be ubiquitous in various normal tissues, and might make a crucial contribution to the formation of novel genes in the evolution of genomes. In this review, we summarize the splicing pattern, expression characteristics, possible mechanisms, and significance of cis-SAGe in mammals. This review will be helpful for general understanding of the current status and development tendency of cis-SAGe.
Co-expression networks reveal the tissue-specific regulation of transcription and splicing
Saha, Ashis; Kim, Yungil; Gewirtz, Ariel D.H.; Jo, Brian; Gao, Chuan; McDowell, Ian C.; Engelhardt, Barbara E.
2017-01-01
Gene co-expression networks capture biologically important patterns in gene expression data, enabling functional analyses of genes, discovery of biomarkers, and interpretation of genetic variants. Most network analyses to date have been limited to assessing correlation between total gene expression levels in a single tissue or small sets of tissues. Here, we built networks that additionally capture the regulation of relative isoform abundance and splicing, along with tissue-specific connections unique to each of a diverse set of tissues. We used the Genotype-Tissue Expression (GTEx) project v6 RNA sequencing data across 50 tissues and 449 individuals. First, we developed a framework called Transcriptome-Wide Networks (TWNs) for combining total expression and relative isoform levels into a single sparse network, capturing the interplay between the regulation of splicing and transcription. We built TWNs for 16 tissues and found that hubs in these networks were strongly enriched for splicing and RNA binding genes, demonstrating their utility in unraveling regulation of splicing in the human transcriptome. Next, we used a Bayesian biclustering model that identifies network edges unique to a single tissue to reconstruct Tissue-Specific Networks (TSNs) for 26 distinct tissues and 10 groups of related tissues. Finally, we found genetic variants associated with pairs of adjacent nodes in our networks, supporting the estimated network structures and identifying 20 genetic variants with distant regulatory impact on transcription and splicing. Our networks provide an improved understanding of the complex relationships of the human transcriptome across tissues. PMID:29021288
Alternative Splicing of NOX4 in the Failing Human Heart
Varga, Zoltán V.; Pipicz, Márton; Baán, Júlia A.; Baranyai, Tamás; Koncsos, Gábor; Leszek, Przemyslaw; Kuśmierczyk, Mariusz; Sánchez-Cabo, Fátima; García-Pavía, Pablo; Brenner, Gábor J.; Giricz, Zoltán; Csont, Tamás; Mendler, Luca; Lara-Pezzi, Enrique; Pacher, Pál; Ferdinandy, Péter
2017-01-01
Increased oxidative stress is a major contributor to the development and progression of heart failure, however, our knowledge on the role of the distinct NADPH oxidase (NOX) isoenzymes, especially on NOX4 is controversial. Therefore, we aimed to characterize NOX4 expression in human samples from healthy and failing hearts. Explanted human heart samples (left and right ventricular, and septal regions) were obtained from patients suffering from heart failure of ischemic or dilated origin. Control samples were obtained from donor hearts that were not used for transplantation. Deep RNA sequencing of the cardiac transcriptome indicated extensive alternative splicing of the NOX4 gene in heart failure as compared to samples from healthy donor hearts. Long distance PCR analysis with a universal 5′-3′ end primer pair, allowing amplification of different splice variants, confirmed the presence of the splice variants. To assess translation of the alternatively spliced transcripts we determined protein expression of NOX4 by using a specific antibody recognizing a conserved region in all variants. Western blot analysis showed up-regulation of the full-length NOX4 in ischemic cardiomyopathy samples and confirmed presence of shorter isoforms both in control and failing samples with disease-associated expression pattern. We describe here for the first time that NOX4 undergoes extensive alternative splicing in human hearts which gives rise to the expression of different enzyme isoforms. The full length NOX4 is significantly upregulated in ischemic cardiomyopathy suggesting a role for NOX4 in ROS production during heart failure. PMID:29204124
Diabetes-Induced Changes in the Alternative Splicing of the Slo Gene in Corporal Tissue
Davies, Kelvin P.; Zhao, Weixin; Tar, Moses; Figueroa, Johanna C.; Desai, Pratik; Verselis, Vytas K.; Kronengold, Jack; Wang, Hong-Zhan; Melman, Arnold; Christ, George J.
2007-01-01
Objectives Erectile dysfunction is a common diabetic complication. Preclinical studies have documented that the Slo gene (encoding the BK or Maxi-K channel α-subunit) plays a critical role in erectile function. Therefore, we determined whether diabetes induces changes in the splicing of the Slo gene relevant to erectile function. Methods Reverse transcriptase-polymerase chain reaction was used to compare Slo splice variant expression in corporal tissue excised from control and streptozotocin (STZ)-induced diabetic Fischer F-344 rats. Splice variants were sequenced, characterized by patch clamping, and fused to green fluorescent protein to determine cellular localization. The impact of altered Slo expression on erectile function was further evaluated in vivo. Results A novel Slo splice variant (SVcyt, with a cytoplasmic location) was predominantly expressed in corporal tissue from control rats. STZ-diabetes caused upregulation of a channel-forming transcript SV0. Preliminary results suggest that SV0 was also more prevalent in the corporal tissue of human diabetic compared with nondiabetic patients. The change in isoform expression in STZ-treated rats was partially reversed by insulin treatment. Intracorporal injection of a plasmid expressing the SV0 transcript, but not SVcyt, restored erectile function in STZ-diabetic rats. Conclusions Alternative splicing of the Slo transcript may represent an important compensatory mechanism to increase the ease with which relaxation of corporal tissue may be triggered as a result of a diabetes-related decline in erectile capacity. PMID:17150299
Joy, Nisha; Maimoonath Beevi, Y P; Soniya, E V
2018-05-09
The central tenet of 'genome content' has been that the 'non-coding' parts are highly enriched with 'microsatellites' or 'Simple Sequence Repeats' (SSRs). We presume that the presence and change in number of repeat unit (n) of SSRs in different genomic locations may or may not become beneficial, depending on the position of SSRs in a gene. Very few studies have looked into the existence of SSRs in the hair-pin precursors of miRNAs (pre-miRNAs). The interplay between SSRs and miRNAs is not yet clearly understood. Considering the potential significance of SSRs in pre-miRNAs, we analysed the miRNA hair-pin precursors of 171 organisms, which revealed a noticeable (29.8%) existence of SSRs in their pre-miRNAs. The maintenance of SSRs in pre-miRNAs even in the complex, highly evolved phyla like Chordata and Magnoliophyta shed light upon its diverse functions. Putative effects of SSRs in either regulating the biogenesis or function of miRNAs were more underlined based on computational and experimental analysis. A preliminary computational analysis to explore the relevance of such SSRs maintained in pre-miRNA sequences led to the detection of splicing regulatory elements (SREs) either in or near to the SSRs. The absence of SSRs correspondingly decreased the detection of SREs. The present study is the first implication for the possible involvement of SSRs in shaping the SREs to undergo Alternative Splicing events to produce miRNA isoforms in accordance with different stress environments. This part of work well demonstrates the importance of studying such consistently maintained SSRs residing in pre-miRNAs and can enhance more and more research towards deciphering the exact function of SSRs in the near future.
ACTG: novel peptide mapping onto gene models.
Choi, Seunghyuk; Kim, Hyunwoo; Paek, Eunok
2017-04-15
In many proteogenomic applications, mapping peptide sequences onto genome sequences can be very useful, because it allows us to understand origins of the gene products. Existing software tools either take the genomic position of a peptide start site as an input or assume that the peptide sequence exactly matches the coding sequence of a given gene model. In case of novel peptides resulting from genomic variations, especially structural variations such as alternative splicing, these existing tools cannot be directly applied unless users supply information about the variant, either its genomic position or its transcription model. Mapping potentially novel peptides to genome sequences, while allowing certain genomic variations, requires introducing novel gene models when aligning peptide sequences to gene structures. We have developed a new tool called ACTG (Amino aCids To Genome), which maps peptides to genome, assuming all possible single exon skipping, junction variation allowing three edit distances from the original splice sites, exon extension and frame shift. In addition, it can also consider SNVs (single nucleotide variations) during mapping phase if a user provides the VCF (variant call format) file as an input. Available at http://prix.hanyang.ac.kr/ACTG/search.jsp . eunokpaek@hanyang.ac.kr. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ruggles, Kelly V.; Tang, Zuojian; Wang, Xuya
Improvements in mass spectrometry (MS)-based peptide sequencing provide a new opportunity to determine whether polymorphisms, mutations and splice variants identified in cancer cells are translated. Herein we therefore describe a proteogenomic data integration tool (QUILTS) and illustrate its application to whole genome, transcriptome and global MS peptide sequence datasets generated from a pair of luminal and basal-like breast cancer patient derived xenografts (PDX). The sensitivity of proteogenomic analysis for singe nucleotide variant (SNV) expression and novel splice junction (NSJ) detection was probed using multiple MS/MS process replicates. Despite over thirty sample replicates, only about 10% of all SNV (somatic andmore » germline) were detected by both DNA and RNA sequencing were observed as peptides. An even smaller proportion of peptides corresponding to NSJ observed by RNA sequencing were detected (<0.1%). Peptides mapping to DNA-detected SNV without a detectable mRNA transcript were also observed demonstrating the transcriptome coverage was also incomplete (~80%). In contrast to germ-line variants, somatic variants were less likely to be detected at the peptide level in the basal-like tumor than the luminal tumor raising the possibility of differential translation or protein degradation effects. In conclusion, the QUILTS program integrates DNA, RNA and peptide sequencing to assess the degree to which somatic mutations are translated and therefore biologically active. By identifying gaps in sequence coverage QUILTS benchmarks current technology and assesses progress towards whole cancer proteome and transcriptome analysis.« less
Hefti, Marco M; Farrell, Kurt; Kim, SoongHo; Bowles, Kathryn R; Fowkes, Mary E; Raj, Towfique; Crary, John F
2018-01-01
The microtubule associated protein tau plays a critical role in the pathogenesis of neurodegenerative disease. Recent studies suggest that tau also plays a role in disorders of neuronal connectivity, including epilepsy and post-traumatic stress disorder. Animal studies have shown that the MAPT gene, which codes for the tau protein, undergoes complex pre-mRNA alternative splicing to produce multiple isoforms during brain development. Human data, particularly on temporal and regional variation in tau splicing during development are however lacking. In this study, we present the first detailed examination of the temporal and regional sequence of MAPT alternative splicing in the developing human brain. We used a novel computational analysis of large transcriptomic datasets (total n = 502 patients), quantitative polymerase chain reaction (qPCR) and western blotting to examine tau expression and splicing in post-mortem human fetal, pediatric and adult brains. We found that MAPT exons 2 and 10 undergo abrupt shifts in expression during the perinatal period that are unique in the canonical human microtubule-associated protein family, while exon 3 showed small but significant temporal variation. Tau isoform expression may be a marker of neuronal maturation, temporally correlated with the onset of axonal growth. Immature brain regions such as the ganglionic eminence and rhombic lip had very low tau expression, but within more mature regions, there was little variation in tau expression or splicing. We thus demonstrate an abrupt, evolutionarily conserved shift in tau isoform expression during the human perinatal period that may be due to tau expression in maturing neurons. Alternative splicing of the MAPT pre-mRNA may play a vital role in normal brain development across multiple species and provides a basis for future investigations into the developmental and pathological functions of the tau protein.
Terenzi, Fulvia; Ladd, Andrea N
2010-01-01
Muscleblind-like (MBNL) proteins have been shown to regulate pre-mRNA alternative splicing, and MBNL1 has been implicated in regulating fetal-to-adult transitions in alternative splicing in the heart. MBNL1 is highly conserved, exhibiting more than 95% identity at the amino acid level between birds and mammals. To investigate MBNL1 expression during embryonic heart development, we examined MBNL1 transcript and protein expression in the embryonic chicken heart from the formation of the primitive heart tube through cardiac morphogenesis (embryonic days 1.5 through 8). MBNL1 transcript levels remained steady throughout these stages, whereas MBNL1 protein levels increased and exhibited a shift in isoforms. MBNL1 has several alternatively spliced exons. Using RT-PCR, we determined that the inclusion of one of these, exon 5, decreases dramatically during cardiac morphogenesis. This developmental transition is conserved in mice. Functional analyses of MBNL1 isoforms containing or lacking exon 5-encoded sequences revealed that exon 5 is important for the regulation of the subcellular localization, RNA binding affinity, and alternative splicing activity of MBNL1 proteins. A second MBNL protein, MBNL2, is also expressed in the embryonic heart. We found that MBNL2 exon 5, which is paralogous to MBNL1 exon 5, is similarly regulated during embryonic heart development. Analysis of MBNL1 and MBNL2 transcripts in several embryonic tissues in chicken and mouse indicate that exon 5 alternative splicing is highly conserved and tissue-specific. Thus, we propose that conserved developmental stage- and tissue-specific alternative splicing of MBNL transcripts is an important mechanism by which MBNL activity is regulated during embryonic development.
Branchpoint selection in the splicing of U12-dependent introns in vitro.
McConnell, Timothy S; Cho, Soo-Jin; Frilander, Mikko J; Steitz, Joan A
2002-05-01
In metazoans, splicing of introns from pre-mRNAs can occur by two pathways: the major U2-dependent or the minor U12-dependent pathways. Whereas the U2-dependent pathway has been well characterized, much about the U12-dependent pathway remains to be discovered. Most of the information regarding U12-type introns has come from in vitro studies of a very few known introns of this class. To expand our understanding of U12-type splicing, especially to test the hypothesis that the simple base-pairing mechanism between the intron and U12 snRNA defines the branchpoint of U12-dependent introns, additional in vitro splicing substrates were created from three putative U12-type introns: the third intron of the Xenopus RPL1 a gene (XRP), the sixth intron of the Xenopus TFIIS.oA gene (XTF), and the first intron of the human Sm E gene (SME). In vitro splicing in HeLa nuclear extract confirmed U12-dependent splicing of each of these introns. Surprisingly, branchpoint mapping of the XRP splicing intermediate shows use of the upstream rather than the downstream of two consecutive adenosines within the branchpoint sequence (BPS), contrary to the prediction based on alignment with the sixth intron of human P120, a U12-dependent intron whose branch site was previously determined. Also, in the SME intron, the position of the branchpoint A residue within the region base paired with U12 differs from that in P120 and XTF. Analysis of these three additional introns therefore rules out simple models for branchpoint selection by the U12-type spliceosome.
Branchpoint selection in the splicing of U12-dependent introns in vitro.
McConnell, Timothy S; Cho, Soo-Jin; Frilander, Mikko J; Steitz, Joan A
2002-01-01
In metazoans, splicing of introns from pre-mRNAs can occur by two pathways: the major U2-dependent or the minor U12-dependent pathways. Whereas the U2-dependent pathway has been well characterized, much about the U12-dependent pathway remains to be discovered. Most of the information regarding U12-type introns has come from in vitro studies of a very few known introns of this class. To expand our understanding of U12-type splicing, especially to test the hypothesis that the simple base-pairing mechanism between the intron and U12 snRNA defines the branchpoint of U12-dependent introns, additional in vitro splicing substrates were created from three putative U12-type introns: the third intron of the Xenopus RPL1 a gene (XRP), the sixth intron of the Xenopus TFIIS.oA gene (XTF), and the first intron of the human Sm E gene (SME). In vitro splicing in HeLa nuclear extract confirmed U12-dependent splicing of each of these introns. Surprisingly, branchpoint mapping of the XRP splicing intermediate shows use of the upstream rather than the downstream of two consecutive adenosines within the branchpoint sequence (BPS), contrary to the prediction based on alignment with the sixth intron of human P120, a U12-dependent intron whose branch site was previously determined. Also, in the SME intron, the position of the branchpoint A residue within the region base paired with U12 differs from that in P120 and XTF. Analysis of these three additional introns therefore rules out simple models for branchpoint selection by the U12-type spliceosome. PMID:12022225
Hammond, Suzan M; McClorey, Graham; Nordin, Joel Z; Godfrey, Caroline; Stenler, Sofia; Lennox, Kim A; Smith, C I Edvard; Jacobi, Ashley M; Varela, Miguel A; Lee, Yi; Behlke, Mark A; Wood, Matthew J A; Andaloussi, Samir E L
2014-11-25
Splice switching oligonucleotides (SSOs) induce alternative splicing of pre-mRNA and typically employ chemical modifications to increase nuclease resistance and binding affinity to target pre-mRNA. Here we describe a new SSO non-base modifier (a naphthyl-azo group, "ZEN™") to direct exon exclusion in mutant dystrophin pre-mRNA to generate functional dystrophin protein. The ZEN modifier is placed near the ends of a 2'-O-methyl (2'OMe) oligonucleotide, increasing melting temperature and potency over unmodified 2'OMe oligonucleotides. In cultured H2K cells, a ZEN-modified 2'OMe phosphorothioate (PS) oligonucleotide delivered by lipid transfection greatly enhanced dystrophin exon skipping over the same 2'OMePS SSO lacking ZEN. However, when tested using free gymnotic uptake in vitro and following systemic delivery in vivo in dystrophin deficient mdx mice, the same ZEN-modified SSO failed to enhance potency. Importantly, we show for the first time that in vivo activity of anionic SSOs is modelled in vitro only when using gymnotic delivery. ZEN is thus a novel modifier that enhances activity of SSOs in vitro but will require improved delivery methods before its in vivo clinical potential can be realized.
Li, Niu; Song, Aiyun; Ding, Lixia; Zhu, Hua; Li, Guoqiang; Miao, Yan; Wang, Jian; Li, Benshang; Chen, Jing
2018-07-01
Fanconi anemia (FA) is a rare autosomal recessive or X-linked disorder with highly variable clinical manifestations and an incidence of ∼1 to 5 in 1 million births. To date, 15 bona fide FA genes have been reported to be responsible for the known FA complementation groups and the FANCA gene accounts for almost 60%. In the present study, we report a special Chinese family, which has 2 children with classic FA characteristics. Via 2-step analysis of the whole-exome sequencing data and verification using multiplex ligation-dependent probe amplification test, one child was found to have a novel compound heterozygous mutation of a splicing variant (c.1471-1G>A) and a large intragenic deletion (exons 23-30 del) of the FANCA gene. The other child had the same splicing variant and another novel large deletion (exons 1-18 del) in the FANCA gene. Clone sequencing showed the c.1471-1G>A variant generate an altered transcript with 1 cryptic splice site in intron 15, resulting in a premature termination codon (p.Val490HisfsX6). This study not only shows the complexity of FA molecular diagnosis via comprehensively studying the FA pathogenic genes and the mutational spectrum, but also has significant reference value for the future molecular diagnosis of FA.
Stacey, Simon N.; Kehr, Birte; Gudmundsson, Julius; Zink, Florian; Jonasdottir, Aslaug; Gudjonsson, Sigurjon A.; Sigurdsson, Asgeir; Halldorsson, Bjarni V.; Agnarsson, Bjarni A.; Benediktsdottir, Kristrun R.; Aben, Katja K.H.; Vermeulen, Sita H.; Cremers, Ruben G.; Panadero, Angeles; Helfand, Brian T.; Cooper, Phillip R.; Donovan, Jenny L.; Hamdy, Freddie C.; Jinga, Viorel; Okamoto, Ichiro; Jonasson, Jon G.; Tryggvadottir, Laufey; Johannsdottir, Hrefna; Kristinsdottir, Anna M.; Masson, Gisli; Magnusson, Olafur T.; Iordache, Paul D.; Helgason, Agnar; Helgason, Hannes; Sulem, Patrick; Gudbjartsson, Daniel F.; Kong, Augustine; Jonsson, Eirikur; Barkardottir, Rosa B.; Einarsson, Gudmundur V.; Rafnar, Thorunn; Thorsteinsdottir, Unnur; Mates, Ioan N.; Neal, David E.; Catalona, William J.; Mayordomo, José I.; Kiemeney, Lambertus A.; Thorleifsson, Gudmar; Stefansson, Kari
2016-01-01
Transcriptional and splicing anomalies have been observed in intron 8 of the CASP8 gene (encoding procaspase-8) in association with cutaneous basal-cell carcinoma (BCC) and linked to a germline SNP rs700635. Here, we show that the rs700635[C] allele, which is associated with increased risk of BCC and breast cancer, is protective against prostate cancer [odds ratio (OR) = 0.91, P = 1.0 × 10−6]. rs700635[C] is also associated with failures to correctly splice out CASP8 intron 8 in breast and prostate tumours and in corresponding normal tissues. Investigation of rs700635[C] carriers revealed that they have a human-specific short interspersed element-variable number of tandem repeat-Alu (SINE-VNTR-Alu), subfamily-E retrotransposon (SVA-E) inserted into CASP8 intron 8. The SVA-E shows evidence of prior activity, because it has transduced some CASP8 sequences during subsequent retrotransposition events. Whole-genome sequence (WGS) data were used to tag the SVA-E with a surrogate SNP rs1035142[T] (r2 = 0.999), which showed associations with both the splicing anomalies (P = 6.5 × 10−32) and with protection against prostate cancer (OR = 0.91, P = 3.8 × 10−7). PMID:26740556
Unusual molecular findings in Kindler syndrome.
Arita, K; Wessagowit, V; Inamadar, A C; Palit, A; Fassihi, H; Lai-Cheong, J E; Pourreyron, C; South, A P; McGrath, J A
2007-12-01
Kindler syndrome (KS) is a rare inherited skin disorder with blistering and poikiloderma as its main clinical features. It is caused by loss-of-function mutations in the C20orf42 (KIND1) gene which encodes kindlin-1, an actin cytoskeleton-focal contact-associated protein which is predominantly expressed in keratinocytes. We investigated the molecular basis of KS in a 16-year-old Indian boy who had additional clinical findings, including scleroatrophic changes of the hands and feet, pseudoainhum and early onset of squamous cell carcinoma on his foot. Immunostaining for kindlin-1 in the patient's skin was completely absent and sequencing of C20orf42 (KIND1) genomic DNA showed a homozygous splice-site mutation at the -6 position, IVS9-6T-->A. Amplification and sequencing of cDNA from the skin revealed aberrant splicing with either deletion of exon 10 or deletion of exons 9, 10 and 11, both of which involve loss of the pleckstrin homology domain of kindlin-1 that is thought to play a role in cytoskeletal attachment and integrin-mediated cell signalling. Pathogenic splice-site mutations at the -6 position are unusual and have rarely been reported for any genetic disorder. Collectively, these findings extend the spectrum of clinical and molecular abnormalities in this rare genodermatosis.
Dollet, M; Sturm, N R; Campbell, D A
2001-03-01
The arbitrary genus Phytomonas includes a biologically diverse group of kinetoplastids that live in a wide variety of plant environments. To understand better the subdivisions within the phytomonads and the variability within groups, the exon, intron and non-transcribed spacer sequences of the spliced leader RNA gene were compared among isolates of the phloem-restricted members. A total of 29 isolates associated with disease in coconut, oil palm and red ginger (Alpinia purpurata, Zingibreaceae) were examined, all originating from plantations in South America and the Caribbean over a 12-year period. Analysis of non-transcribed spacer sequences revealed 2 main groups, I and II; group II could be further subdivided into 2 subgroups, IIa and Ilb. Three classes of spliced leader (SL) RNA gene were seen, with SLI corresponding to group I, SLIIa to group lIa, and SLIIb to group IIb. Two isolates showed some characteristics of both major groups. Group-specific oligonucleotide probes for hybridization studies were tested, and a multiplex amplification scheme was devised to allow direct differentiation between the 2 major groups of phloem-restricted Phytomonas. These results provide tools for diagnostic and molecular epidemiology of plant trypanosomes that are pathogenic for commercially important flowers and palms.
Splendore, Alessandra; Fanganiello, Roberto D; Masotti, Cibele; Morganti, Lucas S C; Passos-Bueno, M Rita
2005-05-01
Recently, a novel exon was described in TCOF1 that, although alternatively spliced, is included in the major protein isoform. In addition, most published mutations in this gene do not conform to current mutation nomenclature guidelines. Given these observations, we developed an online database of TCOF1 mutations in which all the reported mutations are renamed according to standard recommendations and in reference to the genomic and novel cDNA reference sequences (www.genoma.ib.usp.br/TCOF1_database). We also report in this work: 1) results of the first screening for large deletions in TCOF1 by Southern blot in patients without mutation detected by direct sequencing; 2) the identification of the first pathogenic mutation in the newly described exon 6A; and 3) statistical analysis of pathogenic mutations and polymorphism distribution throughout the gene.