Sample records for splice junction sequences

  1. PASTA: splice junction identification from RNA-Sequencing data

    PubMed Central

    2013-01-01

    Background Next generation transcriptome sequencing (RNA-Seq) is emerging as a powerful experimental tool for the study of alternative splicing and its regulation, but requires ad-hoc analysis methods and tools. PASTA (Patterned Alignments for Splicing and Transcriptome Analysis) is a splice junction detection algorithm specifically designed for RNA-Seq data, relying on a highly accurate alignment strategy and on a combination of heuristic and statistical methods to identify exon-intron junctions with high accuracy. Results Comparisons against TopHat and other splice junction prediction software on real and simulated datasets show that PASTA exhibits high specificity and sensitivity, especially at lower coverage levels. Moreover, PASTA is highly configurable and flexible, and can therefore be applied in a wide range of analysis scenarios: it is able to handle both single-end and paired-end reads, it does not rely on the presence of canonical splicing signals, and it uses organism-specific regression models to accurately identify junctions. Conclusions PASTA is a highly efficient and sensitive tool to identify splicing junctions from RNA-Seq data. Compared to similar programs, it has the ability to identify a higher number of real splicing junctions, and provides highly annotated output files containing detailed information about their location and characteristics. Accurate junction data in turn facilitates the reconstruction of the splicing isoforms and the analysis of their expression levels, which will be performed by the remaining modules of the PASTA pipeline, still under development. Use of PASTA can therefore enable the large-scale investigation of transcription and alternative splicing. PMID:23557086

  2. Design of RNA splicing analysis null models for post hoc filtering of Drosophila head RNA-Seq data with the splicing analysis kit (Spanki)

    PubMed Central

    2013-01-01

    Background The production of multiple transcript isoforms from one gene is a major source of transcriptome complexity. RNA-Seq experiments, in which transcripts are converted to cDNA and sequenced, allow the resolution and quantification of alternative transcript isoforms. However, methods to analyze splicing are underdeveloped and errors resulting in incorrect splicing calls occur in every experiment. Results We used RNA-Seq data to develop sequencing and aligner error models. By applying these error models to known input from simulations, we found that errors result from false alignment to minor splice motifs and antisense strands, shifted junction positions, paralog joining, and repeat induced gaps. By using a series of quantitative and qualitative filters, we eliminated diagnosed errors in the simulation, and applied this to RNA-Seq data from Drosophila melanogaster heads. We used high-confidence junction detections to specifically interrogate local splicing differences between transcripts. This method out-performed commonly used RNA-seq methods to identify known alternative splicing events in the Drosophila sex determination pathway. We describe a flexible software package to perform these tasks called Splicing Analysis Kit (Spanki), available at http://www.cbcb.umd.edu/software/spanki. Conclusions Splice-junction centric analysis of RNA-Seq data provides advantages in specificity for detection of alternative splicing. Our software provides tools to better understand error profiles in RNA-Seq data and improve inference from this new technology. The splice-junction centric approach that this software enables will provide more accurate estimates of differentially regulated splicing than current tools. PMID:24209455
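
    The "minor splice motifs" and "antisense strands" mentioned above can be made concrete with a small motif check. The following is a minimal sketch, not Spanki's actual filter: the function names, the toy genome strings, and the 0-based half-open coordinates are all assumptions for illustration. It relies only on the well-known intron boundary dinucleotides (GT-AG as the major class, GC-AG and AT-AC as minor classes).

```python
# Illustrative sketch only; not taken from Spanki.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def intron_motif(genome, start, end, strand="+"):
    """Return the boundary dinucleotides of an intron, e.g. 'GT-AG'.

    genome: DNA string for one chromosome (forward strand).
    start, end: 0-based, half-open intron coordinates (assumed convention).
    strand: '+' or '-'; minus-strand introns are reverse-complemented first.
    """
    intron = genome[start:end].upper()
    if strand == "-":
        intron = intron.translate(_COMPLEMENT)[::-1]
    return intron[:2] + "-" + intron[-2:]


def classify_motif(motif):
    """Label a motif as canonical, minor, or non-canonical."""
    if motif == "GT-AG":
        return "canonical"
    if motif in ("GC-AG", "AT-AC"):
        return "minor"
    return "non-canonical"


# Toy sequences, made up for illustration.
plus_genome = "AAAGTCCCCAGTTT"   # intron at [3, 11) on the + strand
minus_genome = "AAACTGGGGACTTT"  # the same intron seen from the - strand

plus_motif = intron_motif(plus_genome, 3, 11, "+")
minus_motif = intron_motif(minus_genome, 3, 11, "-")
wrong_strand_motif = intron_motif(minus_genome, 3, 11, "+")
```

    Reading the minus-strand intron on the wrong strand yields a non-canonical motif (here CT-AC), which is one way an antisense alignment can be flagged.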

  3. Design of RNA splicing analysis null models for post hoc filtering of Drosophila head RNA-Seq data with the splicing analysis kit (Spanki).

    PubMed

    Sturgill, David; Malone, John H; Sun, Xia; Smith, Harold E; Rabinow, Leonard; Samson, Marie-Laure; Oliver, Brian

    2013-11-09

    The production of multiple transcript isoforms from one gene is a major source of transcriptome complexity. RNA-Seq experiments, in which transcripts are converted to cDNA and sequenced, allow the resolution and quantification of alternative transcript isoforms. However, methods to analyze splicing are underdeveloped and errors resulting in incorrect splicing calls occur in every experiment. We used RNA-Seq data to develop sequencing and aligner error models. By applying these error models to known input from simulations, we found that errors result from false alignment to minor splice motifs and antisense strands, shifted junction positions, paralog joining, and repeat induced gaps. By using a series of quantitative and qualitative filters, we eliminated diagnosed errors in the simulation, and applied this to RNA-Seq data from Drosophila melanogaster heads. We used high-confidence junction detections to specifically interrogate local splicing differences between transcripts. This method out-performed commonly used RNA-seq methods to identify known alternative splicing events in the Drosophila sex determination pathway. We describe a flexible software package to perform these tasks called Splicing Analysis Kit (Spanki), available at http://www.cbcb.umd.edu/software/spanki. Splice-junction centric analysis of RNA-Seq data provides advantages in specificity for detection of alternative splicing. Our software provides tools to better understand error profiles in RNA-Seq data and improve inference from this new technology. The splice-junction centric approach that this software enables will provide more accurate estimates of differentially regulated splicing than current tools.

  4. Trans splicing in Leishmania enriettii and identification of ribonucleoprotein complexes containing the spliced leader and U2 equivalent RNAs

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Miller, S.I.; Wirth, D.F.

    1988-06-01

    The 5' ends of Leishmania mRNAs contain an identical 35-nucleotide sequence termed the spliced leader (SL) or 5' mini-exon. The SL sequence is at the 5' end of an 85-nucleotide primary transcript that contains a consensus eucaryotic 5' intron-exon splice junction immediately 3' to the SL. The SL is added to protein-coding genes immediately 3' to a consensus eucaryotic 3' intron-exon splice junction. The authors' previous work demonstrated possible intermediates in discontinuous mRNA processing that contain the 50 nucleotides of the SL primary transcript 3' to the SL, the SL intron sequence (SLIS). These RNAs have a 5' terminus at the splice junction of the SL and the SLIS. The authors examined a Leishmania nuclear extract for these RNAs in ribonucleoprotein (RNP) particles. Density centrifugation analysis showed that the SL RNA is predominately in RNP complexes at 60S, while the SLIS-containing RNAs are in complexes at 40S. They also demonstrated that the SLIS can be released from polyadenylated RNA by incubation with a HeLa cell extract containing debranching enzymatic activity. These data suggested that Leishmania enriettii mRNAs are assembled by bimolecular or trans splicing as has been recently demonstrated for Trypanosoma brucei. Furthermore, they determined the partial sequence of the Leishmania U2 equivalent RNA and demonstrated that it cosediments with the SL RNA at 60S in a nuclear extract. These RNP particles may be analogous to so-called spliceosomes that have been demonstrated in other systems.

  5. Spliced RNA of woodchuck hepatitis virus.

    PubMed

    Ogston, C W; Razman, D G

    1992-07-01

    Polymerase chain reaction was used to investigate RNA splicing in liver of woodchucks infected with woodchuck hepatitis virus (WHV). Two spliced species were detected, and the splice junctions were sequenced. The larger spliced RNA has an intron of 1300 nucleotides, and the smaller spliced sequence shows an additional downstream intron of 1104 nucleotides. We did not detect singly spliced sequences from which the smaller intron alone was removed. Control experiments showed that spliced sequences are present in both RNA and DNA in infected liver, showing that the viral reverse transcriptase can use spliced RNA as template. Spliced sequences were detected also in virion DNA prepared from serum. The upstream intron produces a reading frame that fuses the core to the polymerase polypeptide, while the downstream intron causes an in-frame deletion in the polymerase open reading frame. Whereas the splicing patterns in WHV are superficially similar to those reported recently in hepatitis B virus, we detected no obvious homology in the coding capacity of spliced RNAs from these two viruses.

  6. TopHat: discovering splice junctions with RNA-Seq

    PubMed Central

    Trapnell, Cole; Pachter, Lior; Salzberg, Steven L.

    2009-01-01

    Motivation: A new protocol for sequencing the messenger RNA in a cell, known as RNA-Seq, generates millions of short sequence fragments in a single run. These fragments, or ‘reads’, can be used to measure levels of gene expression and to identify novel splice variants of genes. However, current software for aligning RNA-Seq data to a genome relies on known splice junctions and cannot identify novel ones. TopHat is an efficient read-mapping algorithm designed to align reads from an RNA-Seq experiment to a reference genome without relying on known splice sites. Results: We mapped the RNA-Seq reads from a recent mammalian RNA-Seq experiment and recovered more than 72% of the splice junctions reported by the annotation-based software from that study, along with nearly 20 000 previously unreported junctions. The TopHat pipeline is much faster than previous systems, mapping nearly 2.2 million reads per CPU hour, which is sufficient to process an entire RNA-Seq experiment in less than a day on a standard desktop computer. We describe several challenges unique to ab initio splice site discovery from RNA-Seq reads that will require further algorithm development. Availability: TopHat is free, open-source software available from http://tophat.cbcb.umd.edu Contact: cole@cs.umd.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:19289445
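
    The throughput claim above can be checked with simple arithmetic. The sketch below uses only the 2.2 million reads per CPU hour quoted in the abstract; the experiment size of 40 million reads is a hypothetical value chosen for illustration, since the abstract does not state the size of the dataset.

```python
READS_PER_CPU_HOUR = 2.2e6  # throughput quoted in the abstract


def cpu_hours(n_reads, reads_per_cpu_hour=READS_PER_CPU_HOUR):
    """CPU hours needed to map n_reads at a fixed throughput."""
    return n_reads / reads_per_cpu_hour


def reads_per_day(reads_per_cpu_hour=READS_PER_CPU_HOUR, cpus=1):
    """Reads that can be mapped in 24 hours on the given number of CPUs."""
    return reads_per_cpu_hour * 24 * cpus


# Hypothetical experiment size (an assumption, not from the abstract).
hypothetical_reads = 40_000_000

hours_needed = cpu_hours(hypothetical_reads)
daily_capacity = reads_per_day()
print(f"{hypothetical_reads:,} reads need about {hours_needed:.1f} CPU hours")
print(f"One CPU maps about {daily_capacity:,.0f} reads in 24 hours")
```

    Under these assumptions a single CPU handles roughly 52.8 million reads per day, so any experiment below that size fits the "less than a day" statement.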

  7. ABMapper: a suffix array-based tool for multi-location searching and splice-junction mapping.

    PubMed

    Lou, Shao-Ke; Ni, Bing; Lo, Leung-Yau; Tsui, Stephen Kwok-Wing; Chan, Ting-Fung; Leung, Kwong-Sak

    2011-02-01

    Sequencing reads generated by RNA-sequencing (RNA-seq) must first be mapped back to the genome through alignment before they can be further analyzed. Current fast and memory-saving short-read mappers could give us a quick view of the transcriptome. However, they are neither designed for reads that span across splice junctions nor for repetitive reads, which can be mapped to multiple locations in the genome (multi-reads). Here, we describe a new software package: ABMapper, which is specifically designed for exploring all putative locations of reads that are mapped to splice junctions or repetitive in nature. The software is freely available at: http://abmapper.sourceforge.net/. The software is written in C++ and PERL. It runs on all major platforms and operating systems including Windows, Mac OS X and LINUX.

  8. Using information content and base frequencies to distinguish mutations from genetic polymorphisms in splice junction recognition sites.

    PubMed

    Rogan, P K; Schneider, T D

    1995-01-01

    Predicting the effects of nucleotide substitutions in human splice sites has been based on analysis of consensus sequences. We used a graphic representation of sequence conservation and base frequency, the sequence logo, to demonstrate that a change in a splice acceptor of hMSH2 (a gene associated with familial nonpolyposis colon cancer) probably does not reduce splicing efficiency. This confirms a population genetic study that suggested that this substitution is a genetic polymorphism. The information theory-based sequence logo is quantitative and more sensitive than the corresponding splice acceptor consensus sequence for detection of true mutations. Information analysis may potentially be used to distinguish polymorphisms from mutations in other types of transcriptional, translational, or protein-coding motifs.
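
    The per-position quantity behind a sequence logo can be sketched as follows. This is a simplified illustration, not the authors' procedure: it computes 2 minus the Shannon entropy (in bits) for each column of aligned DNA sites and omits the small-sample correction used in the information-theory literature. The function names and the four toy sequences are made up for this example.

```python
import math
from collections import Counter


def position_information(column):
    """Information content in bits of one column of aligned DNA bases.

    Computed as 2 - H, where H is the Shannon entropy of the observed
    base frequencies. No small-sample correction is applied.
    """
    counts = Counter(column)
    total = sum(counts.values())
    entropy = -sum((n / total) * math.log2(n / total) for n in counts.values())
    return 2.0 - entropy


def site_information(sequences):
    """Per-position information for equal-length aligned sequences."""
    return [position_information(col) for col in zip(*sequences)]


# Toy aligned sites, invented for illustration (not real splice acceptors).
toy_sites = ["CAGG", "TAGG", "CAGA", "TAGT"]
info = site_information(toy_sites)
total_bits = sum(info)
```

    Fully conserved columns contribute 2 bits and a column with all four bases at equal frequency contributes 0, which is why a substitution at a weakly conserved position changes the total far less than one at an invariant position.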

  9. When proteome meets genome: the alpha helix and the beta strand of proteins are eschewed by mRNA splice junctions and may define the minimal indivisible modules of protein architecture

    PubMed Central

    Barik, Sailen

    2008-01-01

    The significance of the intron-exon structure of genes is a mystery. As eukaryotic proteins are made up of modular functional domains, each exon was suspected to encode some form of module; however, the definition of a module remained vague. Comparison of pre-mRNA splice junctions with the three-dimensional architecture of its protein product from different eukaryotes revealed that the junctions were far less likely to occur inside the α-helices and β-strands of proteins than within the more flexible linker regions (‘turns’ and ‘loops’) connecting them. The splice junctions were equally distributed in the different types of linkers and throughout the linker sequence, although a slight preference for the central region of the linker was observed. The avoidance of the α-helix and the β-strand by splice junctions suggests the existence of a selection pressure against their disruption, perhaps underscoring the investment made by nature in building these intricate secondary structures. A corollary is that the helix and the strand are the smallest integral architectural units of a protein and represent the minimal modules in the evolution of protein structure. These results should find use in comparative genomics, designing of cloning strategies, and in the mutual verification of genome sequences with protein structures. PMID:15381847

  10. When proteome meets genome: the alpha helix and the beta strand of proteins are eschewed by mRNA splice junctions and may define the minimal indivisible modules of protein architecture.

    PubMed

    Barik, Sailen

    2004-09-01

    The significance of the intron-exon structure of genes is a mystery. As eukaryotic proteins are made up of modular functional domains, each exon was suspected to encode some form of module; however, the definition of a module remained vague. Comparison of pre-mRNA splice junctions with the three-dimensional architecture of its protein product from different eukaryotes revealed that the junctions were far less likely to occur inside the alpha-helices and beta-strands of proteins than within the more flexible linker regions ('turns' and 'loops') connecting them. The splice junctions were equally distributed in the different types of linkers and throughout the linker sequence, although a slight preference for the central region of the linker was observed. The avoidance of the alpha-helix and the beta-strand by splice junctions suggests the existence of a selection pressure against their disruption, perhaps underscoring the investment made by nature in building these intricate secondary structures. A corollary is that the helix and the strand are the smallest integral architectural units of a protein and represent the minimal modules in the evolution of protein structure. These results should find use in comparative genomics, designing of cloning strategies, and in the mutual verification of genome sequences with protein structures.

  11. PASSion: a pattern growth algorithm-based pipeline for splice junction detection in paired-end RNA-Seq data.

    PubMed

    Zhang, Yanju; Lameijer, Eric-Wubbo; 't Hoen, Peter A C; Ning, Zemin; Slagboom, P Eline; Ye, Kai

    2012-02-15

    RNA-seq is a powerful technology for the study of transcriptome profiles that uses deep-sequencing technologies. Moreover, it may be used for cellular phenotyping and help establishing the etiology of diseases characterized by abnormal splicing patterns. In RNA-Seq, the exact nature of splicing events is buried in the reads that span exon-exon boundaries. The accurate and efficient mapping of these reads to the reference genome is a major challenge. We developed PASSion, a pattern growth algorithm-based pipeline for splice site detection in paired-end RNA-Seq reads. Comparing the performance of PASSion to three existing RNA-Seq analysis pipelines, TopHat, MapSplice and HMMSplicer, revealed that PASSion is competitive with these packages. Moreover, the performance of PASSion is not affected by read length and coverage. It performs better than the other three approaches when detecting junctions in highly abundant transcripts. PASSion has the ability to detect junctions that do not have known splicing motifs, which cannot be found by the other tools. Of the two public RNA-Seq datasets, PASSion predicted ≈ 137,000 and 173,000 splicing events, of which on average 82% are known junctions annotated in the Ensembl transcript database and 18% are novel. In addition, our package can discover differential and shared splicing patterns among multiple samples. The code and utilities can be freely downloaded from https://trac.nbic.nl/passion and ftp://ftp.sanger.ac.uk/pub/zn1/passion.

  12. PASSion: a pattern growth algorithm-based pipeline for splice junction detection in paired-end RNA-Seq data

    PubMed Central

    Zhang, Yanju; Lameijer, Eric-Wubbo; 't Hoen, Peter A. C.; Ning, Zemin; Slagboom, P. Eline; Ye, Kai

    2012-01-01

    Motivation: RNA-seq is a powerful technology for the study of transcriptome profiles that uses deep-sequencing technologies. Moreover, it may be used for cellular phenotyping and help establishing the etiology of diseases characterized by abnormal splicing patterns. In RNA-Seq, the exact nature of splicing events is buried in the reads that span exon–exon boundaries. The accurate and efficient mapping of these reads to the reference genome is a major challenge. Results: We developed PASSion, a pattern growth algorithm-based pipeline for splice site detection in paired-end RNA-Seq reads. Comparing the performance of PASSion to three existing RNA-Seq analysis pipelines, TopHat, MapSplice and HMMSplicer, revealed that PASSion is competitive with these packages. Moreover, the performance of PASSion is not affected by read length and coverage. It performs better than the other three approaches when detecting junctions in highly abundant transcripts. PASSion has the ability to detect junctions that do not have known splicing motifs, which cannot be found by the other tools. Of the two public RNA-Seq datasets, PASSion predicted ∼ 137 000 and 173 000 splicing events, of which on average 82% are known junctions annotated in the Ensembl transcript database and 18% are novel. In addition, our package can discover differential and shared splicing patterns among multiple samples. Availability: The code and utilities can be freely downloaded from https://trac.nbic.nl/passion and ftp://ftp.sanger.ac.uk/pub/zn1/passion Contact: y.zhang@lumc.nl; k.ye@lumc.nl Supplementary information: Supplementary data are available at Bioinformatics online. PMID:22219203

  13. Unusual Intron Conservation near Tissue-Regulated Exons Found by Splicing Microarrays

    PubMed Central

    Sugnet, Charles W; Srinivasan, Karpagam; Clark, Tyson A; O'Brien, Georgeann; Cline, Melissa S; Wang, Hui; Williams, Alan; Kulp, David; Blume, John E; Haussler, David; Ares, Manuel

    2006-01-01

    Alternative splicing contributes to both gene regulation and protein diversity. To discover broad relationships between regulation of alternative splicing and sequence conservation, we applied a systems approach, using oligonucleotide microarrays designed to capture splicing information across the mouse genome. In a set of 22 adult tissues, we observe differential expression of RNA containing at least two alternative splice junctions for about 40% of the 6,216 alternative events we could detect. Statistical comparisons identify 171 cassette exons whose inclusion or skipping is different in brain relative to other tissues and another 28 exons whose splicing is different in muscle. A subset of these exons is associated with unusual blocks of intron sequence whose conservation in vertebrates rivals that of protein-coding exons. By focusing on sets of exons with similar regulatory patterns, we have identified new sequence motifs implicated in brain and muscle splicing regulation. Of note is a motif that is strikingly similar to the branchpoint consensus but is located downstream of the 5′ splice site of exons included in muscle. Analysis of three paralogous membrane-associated guanylate kinase genes reveals that each contains a paralogous tissue-regulated exon with a similar tissue inclusion pattern. While the intron sequences flanking these exons remain highly conserved among mammalian orthologs, the paralogous flanking intron sequences have diverged considerably, suggesting unusually complex evolution of the regulation of alternative splicing in multigene families. PMID:16424921

  14. Is an observed non-co-linear RNA product spliced in trans, in cis or just in vitro?

    PubMed Central

    Yu, Chun-Ying; Liu, Hsiao-Jung; Hung, Li-Yuan; Kuo, Hung-Chih; Chuang, Trees-Juen

    2014-01-01

    Global transcriptome investigations often result in the detection of an enormous number of transcripts composed of non-co-linear sequence fragments. Such ‘aberrant’ transcript products may arise from post-transcriptional events or genetic rearrangements, or may otherwise be false positives (sequencing/alignment errors or in vitro artifacts). Moreover, post-transcriptionally non-co-linear (‘PtNcl’) transcripts can arise from trans-splicing or back-splicing in cis (to generate so-called ‘circular RNA’). Here, we collected previously-predicted human non-co-linear RNA candidates, and designed a validation procedure integrating in silico filters with multiple experimental validation steps to examine their authenticity. We showed that >50% of the tested candidates were in vitro artifacts, even though some had been previously validated by RT-PCR. After excluding the possibility of genetic rearrangements, we distinguished between trans-spliced and circular RNAs, and confirmed that these two splicing forms can share the same non-co-linear junction. Importantly, the experimentally-confirmed PtNcl RNA events and their corresponding PtNcl splicing types (i.e. trans-splicing, circular RNA, or both sharing the same junction) were all expressed in rhesus macaque, and some were even expressed in mouse. Our study thus describes an essential procedure for confirming PtNcl transcripts, and provides further insight into the evolutionary role of PtNcl RNA events, opening up this important, but understudied, class of post-transcriptional events for comprehensive characterization. PMID:25053845

  15. Non-exomic and synonymous variants in ABCA4 are an important cause of Stargardt disease

    PubMed Central

    Braun, Terry A.; Mullins, Robert F.; Wagner, Alex H.; Andorf, Jeaneen L.; Johnston, Rebecca M.; Bakall, Benjamin B.; Deluca, Adam P.; Fishman, Gerald A.; Lam, Byron L.; Weleber, Richard G.; Cideciyan, Artur V.; Jacobson, Samuel G.; Sheffield, Val C.; Tucker, Budd A.; Stone, Edwin M.

    2013-01-01

    Mutations in ABCA4 cause Stargardt disease and other blinding autosomal recessive retinal disorders. However, sequencing of the complete coding sequence in patients with clinical features of Stargardt disease sometimes fails to detect one or both mutations. For example, among 208 individuals with clear clinical evidence of ABCA4 disease ascertained at a single institution, 28 had only one disease-causing allele identified in the exons and splice junctions of the primary retinal transcript of the gene. Haplotype analysis of these 28 probands revealed 3 haplotypes shared among ten families, suggesting that 18 of the 28 missing alleles were rare enough to be present only once in the cohort. We hypothesized that mutations near rare alternate splice junctions in ABCA4 might cause disease by increasing the probability of mis-splicing at these sites. Next-generation sequencing of RNA extracted from human donor eyes revealed more than a dozen alternate exons that are occasionally incorporated into the ABCA4 transcript in normal human retina. We sequenced the genomic DNA containing 15 of these minor exons in the 28 one-allele subjects and observed five instances of two different variations in the splice signals of exon 36.1 that were not present in normal individuals (P < 10⁻⁶). Analysis of RNA obtained from the keratinocytes of patients with these mutations revealed the predicted alternate transcript. This study illustrates the utility of RNA sequence analysis of human donor tissue and patient-derived cell lines to identify mutations that would be undetectable by exome sequencing. PMID:23918662

  16. Identification of novel point mutations in splicing sites integrating whole-exome and RNA-seq data in myeloproliferative diseases.

    PubMed

    Spinelli, Roberta; Pirola, Alessandra; Redaelli, Sara; Sharma, Nitesh; Raman, Hima; Valletta, Simona; Magistroni, Vera; Piazza, Rocco; Gambacorti-Passerini, Carlo

    2013-11-01

    Point mutations in intronic regions near mRNA splice junctions can affect the splicing process. To identify novel splicing variants from exome sequencing data, we developed a bioinformatics splice-site prediction procedure to analyze next-generation sequencing (NGS) data (SpliceFinder). SpliceFinder integrates two functional annotation tools for NGS, ANNOVAR and MutationTaster and two canonical splice site prediction programs for single mutation analysis, SSPNN and NetGene2. By SpliceFinder, we identified somatic mutations affecting RNA splicing in a colon cancer sample, in eight atypical chronic myeloid leukemia (aCML), and eight CML patients. A novel homozygous splicing mutation was found in APC (NM_000038.4:c.1312+5G>A) and six heterozygous in GNAQ (NM_002072.2:c.735+1C>T), ABCC3 (NM_003786.3:c.1783-1G>A), KLHDC1 (NM_172193.1:c.568-2A>G), HOOK1 (NM_015888.4:c.1662-1G>A), SMAD9 (NM_001127217.2:c.1004-1C>T), and DNAH9 (NM_001372.3:c.10242+5G>A). Integrating whole-exome and RNA sequencing in aCML and CML, we assessed the phenotypic effect of mutations on mRNA splicing for GNAQ, ABCC3, HOOK1. In ABCC3 and HOOK1, RNA-Seq showed the presence of aberrant transcripts with activation of a cryptic splice site or intron retention, validated by the reverse transcription-polymerase chain reaction (RT-PCR) in the case of HOOK1. In GNAQ, RNA-Seq showed 22% of wild-type transcript and 78% of mRNA skipping exon 5, resulting in a 4-6 frameshift fusion confirmed by RT-PCR. The pipeline can be useful to identify intronic variants affecting RNA sequence by complementing conventional exome analysis.

  17. FineSplice, enhanced splice junction detection and quantification: a novel pipeline based on the assessment of diverse RNA-Seq alignment solutions.

    PubMed

    Gatto, Alberto; Torroja-Fungairiño, Carlos; Mazzarotto, Francesco; Cook, Stuart A; Barton, Paul J R; Sánchez-Cabo, Fátima; Lara-Pezzi, Enrique

    2014-04-01

    Alternative splicing is the main mechanism governing protein diversity. The recent developments in RNA-Seq technology have enabled the study of the global impact and regulation of this biological process. However, the lack of standardized protocols constitutes a major bottleneck in the analysis of alternative splicing. This is particularly important for the identification of exon-exon junctions, which is a critical step in any analysis workflow. Here we performed a systematic benchmarking of alignment tools to dissect the impact of design and method on the mapping, detection and quantification of splice junctions from multi-exon reads. Accordingly, we devised a novel pipeline based on TopHat2 combined with a splice junction detection algorithm, which we have named FineSplice. FineSplice allows effective elimination of spurious junction hits arising from artefactual alignments, achieving up to 99% precision in both real and simulated data sets and yielding superior F1 scores under most tested conditions. The proposed strategy conjugates an efficient mapping solution with a semi-supervised anomaly detection scheme to filter out false positives and allows reliable estimation of expressed junctions from the alignment output. Ultimately this provides more accurate information to identify meaningful splicing patterns. FineSplice is freely available at https://sourceforge.net/p/finesplice/.
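
    The precision and F1 figures quoted above follow from standard definitions, which can be sketched as below. This is not FineSplice code: the function name, the representation of a junction as a (chromosome, donor, acceptor) tuple, and the toy junction sets are assumptions made for illustration.

```python
def precision_recall_f1(predicted, truth):
    """Score predicted junctions against a reference set.

    Junctions are hashable items, here (chrom, donor_pos, acceptor_pos).
    Returns (precision, recall, f1); each is 0.0 when undefined.
    """
    predicted, truth = set(predicted), set(truth)
    true_positives = len(predicted & truth)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(truth) if truth else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Toy junction sets, invented for illustration.
truth = {("chr1", 100, 200), ("chr1", 300, 400),
         ("chr1", 500, 600), ("chr2", 50, 90)}
predicted = {("chr1", 100, 200), ("chr1", 300, 400),
             ("chr1", 500, 600), ("chr1", 505, 600)}  # last one is spurious

precision, recall, f1 = precision_recall_f1(predicted, truth)
```

    Filtering out the spurious junction would raise precision to 1.0 without changing recall, which is the kind of gain a false-positive filter aims for.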

  18. Novel p53 tumour suppressor mutations in cases of spindle cell sarcoma, pleomorphic sarcoma and fibrosarcoma in cats.

    PubMed

    Mayr, B; Reifinger, M; Alton, K; Schaffner, G

    1998-06-01

    Twenty feline neoplasms were sequenced in the region from exons 5 to 8 for the presence of tumour suppressor gene p53 mutations. In a spindle cell sarcoma of the bladder, a missense mutation (codon 164 AAG-->GAG, lysine-->glutamic acid) in exon 5 was detected. In a pleomorphic sarcoma, a 23 bp deletion involving the splicing junction between intron 5 and exon 6 was observed. In a fibrosarcoma, a 6 bp deletion of p53 covering 2 bp of exon 7 and 4 bp of intron 7, including the splicing junction, was found. The study demonstrates three new p53 mutations in different types of sarcomas in cats.

  19. Identification of novel point mutations in splicing sites integrating whole-exome and RNA-seq data in myeloproliferative diseases

    PubMed Central

    Spinelli, Roberta; Pirola, Alessandra; Redaelli, Sara; Sharma, Nitesh; Raman, Hima; Valletta, Simona; Magistroni, Vera; Piazza, Rocco; Gambacorti-Passerini, Carlo

    2013-01-01

    Point mutations in intronic regions near mRNA splice junctions can affect the splicing process. To identify novel splicing variants from exome sequencing data, we developed a bioinformatics splice-site prediction procedure to analyze next-generation sequencing (NGS) data (SpliceFinder). SpliceFinder integrates two functional annotation tools for NGS, ANNOVAR and MutationTaster and two canonical splice site prediction programs for single mutation analysis, SSPNN and NetGene2. By SpliceFinder, we identified somatic mutations affecting RNA splicing in a colon cancer sample, in eight atypical chronic myeloid leukemia (aCML), and eight CML patients. A novel homozygous splicing mutation was found in APC (NM_000038.4:c.1312+5G>A) and six heterozygous in GNAQ (NM_002072.2:c.735+1C>T), ABCC3 (NM_003786.3:c.1783-1G>A), KLHDC1 (NM_172193.1:c.568-2A>G), HOOK1 (NM_015888.4:c.1662-1G>A), SMAD9 (NM_001127217.2:c.1004-1C>T), and DNAH9 (NM_001372.3:c.10242+5G>A). Integrating whole-exome and RNA sequencing in aCML and CML, we assessed the phenotypic effect of mutations on mRNA splicing for GNAQ, ABCC3, HOOK1. In ABCC3 and HOOK1, RNA-Seq showed the presence of aberrant transcripts with activation of a cryptic splice site or intron retention, validated by the reverse transcription-polymerase chain reaction (RT-PCR) in the case of HOOK1. In GNAQ, RNA-Seq showed 22% of wild-type transcript and 78% of mRNA skipping exon 5, resulting in a 4–6 frameshift fusion confirmed by RT-PCR. The pipeline can be useful to identify intronic variants affecting RNA sequence by complementing conventional exome analysis. PMID:24498620
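
    The variant descriptions listed above use coding-DNA notation in which an intronic position is written as the nearest exonic position plus or minus an offset. The sketch below parses that form and reports which splice site the variant lies next to. It is an illustration only and not part of SpliceFinder: the regular expression, function name, and returned field names are assumptions, and only simple substitutions at positive coding positions are handled.

```python
import re

# Matches strings such as "NM_002072.2:c.735+1C>T" (simple substitutions only).
HGVS_INTRONIC = re.compile(
    r"^(?P<transcript>[A-Z]{2}_\d+\.\d+):c\."
    r"(?P<exon_pos>\d+)(?P<sign>[+-])(?P<offset>\d+)"
    r"(?P<ref>[ACGT])>(?P<alt>[ACGT])$"
)


def parse_intronic_variant(hgvs):
    """Split an intronic substitution into its parts.

    '+' offsets count into the intron from the last base of an exon
    (donor, 5' splice site side); '-' offsets count back from the first
    base of the next exon (acceptor, 3' splice site side). Offsets 1 and 2
    fall on the boundary dinucleotide of the intron.
    """
    match = HGVS_INTRONIC.match(hgvs)
    if match is None:
        raise ValueError(f"unsupported variant description: {hgvs!r}")
    offset = int(match["offset"])
    return {
        "transcript": match["transcript"],
        "exon_pos": int(match["exon_pos"]),
        "side": "donor" if match["sign"] == "+" else "acceptor",
        "offset": offset,
        "in_boundary_dinucleotide": offset <= 2,
        "ref": match["ref"],
        "alt": match["alt"],
    }


gnaq = parse_intronic_variant("NM_002072.2:c.735+1C>T")
apc = parse_intronic_variant("NM_000038.4:c.1312+5G>A")
klhdc1 = parse_intronic_variant("NM_172193.1:c.568-2A>G")
```

    With this split, the APC variant is seen to lie at donor position +5, outside the boundary dinucleotide, whereas the GNAQ and KLHDC1 variants fall within it.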

  20. SplicingTypesAnno: annotating and quantifying alternative splicing events for RNA-Seq data.

    PubMed

    Sun, Xiaoyong; Zuo, Fenghua; Ru, Yuanbin; Guo, Jiqiang; Yan, Xiaoyan; Sablok, Gaurav

    2015-04-01

    Alternative splicing plays a key role in the regulation of the central dogma. Four major types of alternative splicing have been classified as intron retention, exon skipping, alternative 5′ splice sites or alternative donor sites, and alternative 3′ splice sites or alternative acceptor sites. A few algorithms have been developed to detect splice junctions from RNA-Seq reads. However, there are few tools targeting the major alternative splicing types at the exon/intron level. This type of analysis may reveal subtle, yet important events of alternative splicing, and thus help gain deeper understanding of the mechanism of alternative splicing. This paper describes a user-friendly R package, extracting, annotating and analyzing alternative splicing types for sequence alignment files from RNA-Seq. SplicingTypesAnno can: (1) provide annotation for major alternative splicing at exon/intron level. By comparing the annotation from GTF/GFF file, it identifies the novel alternative splicing sites; (2) offer a convenient two-level analysis: genome-scale annotation for users with high performance computing environment, and gene-scale annotation for users with personal computers; (3) generate a user-friendly web report and additional BED files for IGV visualization. SplicingTypesAnno is a user-friendly R package for extracting, annotating and analyzing alternative splicing types at exon/intron level for sequence alignment files from RNA-Seq. It is publicly available at https://sourceforge.net/projects/splicingtypes/files/ or http://genome.sdau.edu.cn/research/software/SplicingTypesAnno.html.

  1. Snaptron: querying splicing patterns across tens of thousands of RNA-seq samples

    PubMed Central

    Wilks, Christopher; Gaddipati, Phani; Nellore, Abhinav

    2018-01-01

    Abstract Motivation As more and larger genomics studies appear, there is a growing need for comprehensive and queryable cross-study summaries. These enable researchers to leverage vast datasets that would otherwise be difficult to obtain. Results Snaptron is a search engine for summarized RNA sequencing data with a query planner that leverages R-tree, B-tree and inverted indexing strategies to rapidly execute queries over 146 million exon-exon splice junctions from over 70 000 human RNA-seq samples. Queries can be tailored by constraining which junctions and samples to consider. Snaptron can score junctions according to tissue specificity or other criteria, and can score samples according to the relative frequency of different splicing patterns. We describe the software and outline biological questions that can be explored with Snaptron queries. Availability and implementation Documentation is at http://snaptron.cs.jhu.edu. Source code is at https://github.com/ChristopherWilks/snaptron and https://github.com/ChristopherWilks/snaptron-experiments with a CC BY-NC 4.0 license. Contact chris.wilks@jhu.edu or langmea@cs.jhu.edu Supplementary information Supplementary data are available at Bioinformatics online. PMID:28968689

  2. Snaptron: querying splicing patterns across tens of thousands of RNA-seq samples.

    PubMed

    Wilks, Christopher; Gaddipati, Phani; Nellore, Abhinav; Langmead, Ben

    2018-01-01

    As more and larger genomics studies appear, there is a growing need for comprehensive and queryable cross-study summaries. These enable researchers to leverage vast datasets that would otherwise be difficult to obtain. Snaptron is a search engine for summarized RNA sequencing data with a query planner that leverages R-tree, B-tree and inverted indexing strategies to rapidly execute queries over 146 million exon-exon splice junctions from over 70 000 human RNA-seq samples. Queries can be tailored by constraining which junctions and samples to consider. Snaptron can score junctions according to tissue specificity or other criteria, and can score samples according to the relative frequency of different splicing patterns. We describe the software and outline biological questions that can be explored with Snaptron queries. Documentation is at http://snaptron.cs.jhu.edu. Source code is at https://github.com/ChristopherWilks/snaptron and https://github.com/ChristopherWilks/snaptron-experiments with a CC BY-NC 4.0 license. chris.wilks@jhu.edu or langmea@cs.jhu.edu. Supplementary data are available at Bioinformatics online. © The Author(s) 2017. Published by Oxford University Press.

  3. Factor IX Madrid 2: A deletion/insertion in Factor IX gene which abolishes the sequence of the donor junction at the exon IV-intron d splice site

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Solera, J.; Magallon, M.; Martin-Villar, J.

    1992-02-01

    DNA from a patient with severe hemophilia B was evaluated by RFLP analysis, producing results which suggested the existence of a partial deletion within the factor IX gene. The deletion was further localized and characterized by PCR amplification and sequencing. The altered allele has a 4,442-bp deletion which removes both the donor splice site located at the 5' end of intron d and the two last coding nucleotides located at the 3' end of exon IV in the normal factor IX gene; this fragment has been inserted in inverted orientation. Two homologous sequences have been discovered at the ends of the deleted DNA fragment.

  4. The Splicing History of an mRNA Affects Its Level of Translation and Sensitivity to Cleavage by the Virion Host Shutoff Endonuclease during Herpes Simplex Virus Infections

    PubMed Central

    Sadek, Jouliana

    2016-01-01

    ABSTRACT During lytic herpes simplex virus (HSV) infections, the virion host shutoff (Vhs) (UL41) endoribonuclease degrades many cellular and viral mRNAs. In uninfected cells, spliced mRNAs emerge into the cytoplasm bound by exon junction complexes (EJCs) and are translated several times more efficiently than unspliced mRNAs that have the same sequence but lack EJCs. Notably, most cellular mRNAs are spliced, whereas most HSV mRNAs are not. To examine the effect of splicing on gene expression during HSV infection, cells were transfected with plasmids harboring an unspliced renilla luciferase (RLuc) reporter mRNA or RLuc constructs with introns near the 5′ or 3′ end of the gene. After splicing of intron-containing transcripts, all three RLuc mRNAs had the same primary sequence. Upon infection in the presence of actinomycin D, spliced mRNAs were much less sensitive to degradation by copies of Vhs from infecting virions than were unspliced mRNAs. During productive infections (in the absence of drugs), RLuc was expressed at substantially higher levels from spliced than from unspliced mRNAs. Interestingly, the stimulatory effect of splicing on RLuc expression was significantly greater in infected than in uninfected cells. The translational stimulatory effect of an intron during HSV-1 infections could be replicated by artificially tethering various EJC components to an unspliced RLuc transcript. Thus, the splicing history of an mRNA, and the consequent presence or absence of EJCs, affects its level of translation and sensitivity to Vhs cleavage during lytic HSV infections. IMPORTANCE Most mammalian mRNAs are spliced. In contrast, of the more than 80 mRNAs harbored by herpes simplex virus 1 (HSV-1), only 5 are spliced. In addition, synthesis of the immediate early protein ICP27 causes partial inhibition of pre-mRNA splicing, with the resultant accumulation of both spliced and unspliced versions of some mRNAs in the cytoplasm. 
A common perception is that HSV-1 infection necessarily inhibits the expression of spliced mRNAs. In contrast, this study demonstrates two instances in which pre-mRNA splicing actually enhances the synthesis of proteins from mRNAs during HSV-1 infections. Specifically, splicing stabilized an mRNA against degradation by copies of the Vhs endoribonuclease from infecting virions and greatly enhanced the amount of protein synthesized from spliced mRNAs at late times after infection. The data suggest that splicing, and the resultant presence of exon junction complexes on an mRNA, may play an important role in gene expression during HSV-1 infections. PMID:27681125

  5. Alternative splicing regulated by butyrate in bovine epithelial cells.

    PubMed

    Wu, Sitao; Li, Congjun; Huang, Wen; Li, Weizhong; Li, Robert W

    2012-01-01

    As a signaling molecule and an inhibitor of histone deacetylases (HDACs), butyrate exerts its impact on a broad range of biological processes, such as apoptosis and cell proliferation, in addition to its critical role in energy metabolism in ruminants. This study examined the effect of butyrate on alternative splicing in bovine epithelial cells using RNA-seq technology. Junction reads accounted for 11.28% and 12.32% of total mapped reads in the butyrate-treated (BT) and control (CT) groups, respectively. A total of 201,326 potential splicing junctions were supported by ≥ 3 junction reads. Approximately 94% of these junctions conformed to the consensus sequence (GT/AG) while ~3% were GC/AG junctions. No AT/AC junctions were observed. A total of 2,834 exon skipping events, supported by a minimum of 3 junction reads, were detected. At least 7 genes whose mRNA expression was significantly affected by butyrate also had exon skipping events differentially regulated by butyrate. Furthermore, COL5A3, which was induced 310-fold by butyrate (FDR <0.001) at the gene level, had a significantly higher number of junction reads mapped to Exon#8 (Donor) and Exon#11 (Acceptor) in BT. This event had the potential to result in the formation of a COL5A3 mRNA isoform with 2 of the 69 exons missing. In addition, 216 differentially expressed transcript isoforms regulated by butyrate were detected. For example, Isoform 1 of ORC1 was strongly repressed by butyrate while Isoform 2 remained unchanged. Butyrate physically binds to and inhibits all zinc-dependent HDACs except HDAC6 and HDAC10. Our results provided evidence that butyrate also regulated deacetylase activities of classical HDACs via its transcriptional control. Moreover, thirteen gene fusion events differentially affected by butyrate were identified. Our results provided a snapshot into complex transcriptome dynamics regulated by butyrate, which will facilitate our understanding of the biological effects of butyrate and other HDAC inhibitors.
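
    The motif breakdown reported above (GT/AG, GC/AG, AT/AC among junctions with at least 3 supporting reads) can be illustrated with a small sketch. This is not the authors' pipeline: the function names are hypothetical, and each intron is assumed to be given as its sense-strand sequence paired with a junction read count.

```python
from collections import Counter

KNOWN_MOTIFS = {"GT/AG", "GC/AG", "AT/AC"}


def classify_junction(intron_seq: str) -> str:
    """Label an intron by its first two and last two bases (sense strand assumed)."""
    seq = intron_seq.upper()
    if len(seq) < 4:
        raise ValueError("intron too short to carry donor and acceptor dinucleotides")
    motif = f"{seq[:2]}/{seq[-2:]}"
    return motif if motif in KNOWN_MOTIFS else "other"


def motif_fractions(junctions, min_reads=3):
    """Fraction of each motif class among junctions with at least min_reads reads.

    junctions: iterable of (intron_sequence, supporting_read_count) pairs.
    """
    counts = Counter(
        classify_junction(seq) for seq, reads in junctions if reads >= min_reads
    )
    total = sum(counts.values())
    return {motif: n / total for motif, n in counts.items()} if total else {}
```

    Filtering on read support before counting mirrors the "supported by ≥ 3 junction reads" criterion, so poorly supported junctions do not distort the motif percentages.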

  6. Mapping RNA-seq Reads with STAR

    PubMed Central

    Dobin, Alexander; Gingeras, Thomas R.

    2015-01-01

    Mapping of large sets of high-throughput sequencing reads to a reference genome is one of the foundational steps in RNA-seq data analysis. The STAR software package performs this task with high levels of accuracy and speed. In addition to detecting annotated and novel splice junctions, STAR is capable of discovering more complex RNA sequence arrangements, such as chimeric and circular RNA. STAR can align spliced sequences of any length with moderate error rates providing scalability for emerging sequencing technologies. STAR generates output files that can be used for many downstream analyses such as transcript/gene expression quantification, differential gene expression, novel isoform reconstruction, signal visualization, and so forth. In this unit we describe computational protocols that produce various output files, use different RNA-seq datatypes, and utilize different mapping strategies. STAR is Open Source software that can be run on Unix, Linux or Mac OS X systems. PMID:26334920
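
    One of the output files mentioned above is STAR's tab-separated junction table, SJ.out.tab. The sketch below reads such a table under the assumption that it follows the nine-column layout described in the STAR manual (chromosome, intron start, intron end, strand code, motif code, annotation flag, unique reads, multi-mapping reads, maximum overhang). The function name, the read-count threshold and the example rows are illustrative only.

```python
import csv
import io

# Codes assumed from the STAR manual's description of SJ.out.tab.
MOTIFS = {0: "non-canonical", 1: "GT/AG", 2: "CT/AC", 3: "GC/AG",
          4: "CT/GC", 5: "AT/AC", 6: "GT/AT"}
STRANDS = {0: ".", 1: "+", 2: "-"}


def read_junctions(handle, min_unique=3):
    """Yield one dict per junction with at least min_unique uniquely mapped reads."""
    for row in csv.reader(handle, delimiter="\t"):
        if len(row) < 9:
            continue  # skip blank or truncated lines
        unique = int(row[6])
        if unique < min_unique:
            continue
        yield {
            "chrom": row[0],
            "start": int(row[1]),   # first intron base, 1-based
            "end": int(row[2]),     # last intron base, 1-based
            "strand": STRANDS[int(row[3])],
            "motif": MOTIFS[int(row[4])],
            "annotated": row[5] == "1",
            "unique_reads": unique,
        }


# Made-up rows in the assumed layout; only the first passes the read filter.
example = io.StringIO(
    "chr1\t14830\t14969\t2\t2\t1\t12\t4\t38\n"
    "chr1\t15039\t15795\t2\t2\t1\t1\t0\t20\n"
)
junctions = list(read_junctions(example))
```

    With a real run, the same function could be given an open file handle for the SJ.out.tab written next to the alignments.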

  7. Mapping RNA-seq Reads with STAR.

    PubMed

    Dobin, Alexander; Gingeras, Thomas R

    2015-09-03

    Mapping of large sets of high-throughput sequencing reads to a reference genome is one of the foundational steps in RNA-seq data analysis. The STAR software package performs this task with high levels of accuracy and speed. In addition to detecting annotated and novel splice junctions, STAR is capable of discovering more complex RNA sequence arrangements, such as chimeric and circular RNA. STAR can align spliced sequences of any length with moderate error rates, providing scalability for emerging sequencing technologies. STAR generates output files that can be used for many downstream analyses such as transcript/gene expression quantification, differential gene expression, novel isoform reconstruction, and signal visualization. In this unit, we describe computational protocols that produce various output files, use different RNA-seq datatypes, and utilize different mapping strategies. STAR is open source software that can be run on Unix, Linux, or Mac OS X systems. Copyright © 2015 John Wiley & Sons, Inc.

  8. PathwaySplice: An R package for unbiased pathway analysis of alternative splicing in RNA-Seq data.

    PubMed

    Yan, Aimin; Ban, Yuguang; Gao, Zhen; Chen, Xi; Wang, Lily

    2018-04-24

    Pathway analysis of alternative splicing would be biased without accounting for the different number of exons or junctions associated with each gene, because genes with a higher number of exons or junctions are more likely to be included in the "significant" gene list in alternative splicing. We present PathwaySplice, an R package that (1) Performs pathway analysis that explicitly adjusts for the number of exons or junctions associated with each gene; (2) Visualizes selection bias due to different number of exons or junctions for each gene and formally tests for presence of bias using logistic regression; (3) Supports gene sets based on the Gene Ontology terms, as well as more broadly defined gene sets (e.g. MSigDB) or user defined gene sets; (4) Identifies the significant genes driving pathway significance and (5) Organizes significant pathways with an enrichment map, where pathways with a large number of overlapping genes are grouped together in a network graph. https://bioconductor.org/packages/release/bioc/html/PathwaySplice.html. lily.wangg@gmail.com, xi.steven.chen@gmail.com.

  9. Genome-wide survey of human alternative pre-mRNA splicing with exon junction microarrays.

    PubMed

    Johnson, Jason M; Castle, John; Garrett-Engele, Philip; Kan, Zhengyan; Loerch, Patrick M; Armour, Christopher D; Santos, Ralph; Schadt, Eric E; Stoughton, Roland; Shoemaker, Daniel D

    2003-12-19

    Alternative pre-messenger RNA (pre-mRNA) splicing plays important roles in development, physiology, and disease, and more than half of human genes are alternatively spliced. To understand the biological roles and regulation of alternative splicing across different tissues and stages of development, systematic methods are needed. Here, we demonstrate the use of microarrays to monitor splicing at every exon-exon junction in more than 10,000 multi-exon human genes in 52 tissues and cell lines. These genome-wide data provide experimental evidence and tissue distributions for thousands of known and novel alternative splicing events. Adding to previous studies, the results indicate that at least 74% of human multi-exon genes are alternatively spliced.

  10. An RNAi-enhanced Logic Circuit for Cancer Specific Detection and Destruction

    DTIC Science & Technology

    2010-07-01

    Bcl-2 family: mBax (Mus musculus), hBax (Homo sapiens), and its mutant hBax-S184A [4]. A plasmid containing the tested gene was transfected into HEK...the far-red fluorescent protein mKate to express the Gata3 mStaple. Intron-feature sequences – donor site, branch point, polypyrimidine tract, and...intron-exon junction. Among the donor and acceptor sequences found in literature our intron features were chosen according to SplicePort [5], an

  11. Application of hidden Markov models to biological data mining: a case study

    NASA Astrophysics Data System (ADS)

    Yin, Michael M.; Wang, Jason T.

    2000-04-01

    In this paper we present an example of biological data mining: the detection of splicing junction acceptors in eukaryotic genes. Identification or prediction of transcribed sequences from within genomic DNA has been a major rate-limiting step in the pursuit of genes. Programs currently available are far from being powerful enough to elucidate the gene structure completely. Here we develop a hidden Markov model (HMM) to represent the degeneracy features of splicing junction acceptor sites in eukaryotic genes. The HMM system is fully trained using an expectation maximization (EM) algorithm and the system performance is evaluated using the 10-way cross-validation method. Experimental results show that our HMM system can correctly classify more than 94% of the candidate sequences (including true and false acceptor sites) into the right categories. About 90% of the true acceptor sites and 96% of the false acceptor sites in the test data are classified correctly. These results are very promising considering that only the local information in DNA is used. The proposed model will be a very important component of an effective and accurate gene structure detection system currently being developed in our lab.

  12. A Predictive Model of Intein Insertion Site for Use in the Engineering of Molecular Switches

    PubMed Central

    Apgar, James; Ross, Mary; Zuo, Xiao; Dohle, Sarah; Sturtevant, Derek; Shen, Binzhang; de la Vega, Humberto; Lessard, Philip; Lazar, Gabor; Raab, R. Michael

    2012-01-01

    Inteins are intervening protein domains with self-splicing ability that can be used as molecular switches to control activity of their host protein. Successfully engineering an intein into a host protein requires identifying an insertion site that permits intein insertion and splicing while allowing for proper folding of the mature protein post-splicing. By analyzing sequence and structure based properties of native intein insertion sites we have identified four features that showed significant correlation with the location of the intein insertion sites, and therefore may be useful in predicting insertion sites in other proteins that provide native-like intein function. Three of these properties, the distance to the active site and dimer interface site, the SVM score of the splice site cassette, and the sequence conservation of the site showed statistically significant correlation and strong predictive power, with area under the curve (AUC) values of 0.79, 0.76, and 0.73 respectively, while the distance to secondary structure/loop junction showed significance but with less predictive power (AUC of 0.54). In a case study of 20 insertion sites in the XynB xylanase, two features of native insertion sites showed correlation with the splice sites and demonstrated predictive value in selecting non-native splice sites. Structural modeling of intein insertions at two sites highlighted the role that the insertion site location could play on the ability of the intein to modulate activity of the host protein. These findings can be used to enrich the selection of insertion sites capable of supporting intein splicing and hosting an intein switch. PMID:22649521

  13. The Exon Junction Complex and Srp54 Contribute to Hedgehog Signaling via ci RNA Splicing in Drosophila melanogaster.

    PubMed

    Garcia-Garcia, Elisa; Little, Jamie C; Kalderon, Daniel

    2017-08-01

    Hedgehog (Hh) regulates the Cubitus interruptus (Ci) transcription factor in Drosophila melanogaster by activating full-length Ci-155 and blocking processing to the Ci-75 repressor. However, the interplay between the regulation of Ci-155 levels and activity, as well as processing-independent mechanisms that affect Ci-155 levels, have not been explored extensively. Here, we identified Mago Nashi (Mago) and Y14 core Exon Junction Complex (EJC) proteins, as well as the Srp54 splicing factor, as modifiers of Hh pathway activity under sensitized conditions. Mago inhibition reduced Hh pathway activity by altering the splicing pattern of ci to reduce Ci-155 levels. Srp54 inhibition also affected pathway activity by reducing ci RNA levels but additionally altered Ci-155 levels and activity independently of ci splicing. Further tests using ci transgenes and ci mutations confirmed evidence from studying the effects of Mago and Srp54 that relatively small changes in the level of Ci-155 primary translation product alter Hh pathway activity under a variety of sensitized conditions. We additionally used ci transgenes lacking intron sequences or the presumed translation initiation codon for an alternatively spliced ci RNA to provide further evidence that Mago acts principally by modulating the levels of the major ci RNA encoding Ci-155, and to show that ci introns are necessary to support the production of sufficient Ci-155 for robust Hh signaling and may also be important mediators of regulatory inputs. Copyright © 2017 by the Genetics Society of America.

  14. Four novel cystic fibrosis mutations in splice junction sequences affecting the CFTR nucleotide binding folds

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Doerk, T.; Wulbrand, U.; Tuemmler, B.

    1993-03-01

    Single cases of the four novel splice site mutations 1525−1 G→A (intron 9), 3601−2 A→G (intron 18), 3850−3 T→G (intron 19), and 4374+1 G→T (intron 23) were detected in the CFTR gene of cystic fibrosis patients of Indo-Iranian, Turkish, Polish, and German descent. The nucleotide substitutions at the +1, −1, and −2 positions all destroy splice sites and lead to severe disease alleles associated with features typical of gastrointestinal and pulmonary cystic fibrosis disease. The 3850−3 T-to-G change was discovered in a very mildly affected 33-year-old ΔF508 compound heterozygote, suggesting that the T-to-G transversion at the less conserved −3 position of the acceptor splice site may retain some wildtype function. 13 refs., 1 fig., 2 tabs.

  15. Evaluation of Bioinformatic Programmes for the Analysis of Variants within Splice Site Consensus Regions

    PubMed Central

    Tang, Rongying; Prosser, Debra O.; Love, Donald R.

    2016-01-01

    The increasing diagnostic use of gene sequencing has led to an expanding dataset of novel variants that lie within consensus splice junctions. The challenge for diagnostic laboratories is the evaluation of these variants in order to determine if they affect splicing or are merely benign. A common evaluation strategy is to use in silico analysis, and it is here that a number of programmes are available online; however, currently, there are no consensus guidelines on the selection of programmes or protocols to interpret the prediction results. Using a collection of 222 pathogenic mutations and 50 benign polymorphisms, we evaluated the sensitivity and specificity of four in silico programmes in predicting the effect of each variant on splicing. The programmes comprised Human Splice Finder (HSF), Max Entropy Scan (MES), NNSplice, and ASSP. The MES and ASSP programmes gave the highest performance based on Receiver Operator Curve analysis, with an optimal cut-off of score reduction of 10%. The study also showed that the sensitivity of prediction is affected by the level of conservation of individual positions, with in silico predictions for variants at positions −4 and +7 within consensus splice sites being largely uninformative. PMID:27313609
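
    The evaluation described above (a score-reduction cut-off applied to known pathogenic and benign variants, then summarized as sensitivity and specificity) can be sketched as follows. This is an illustration under stated assumptions, not the study's protocol: the function names and input layout are hypothetical, scores are assumed positive, and a variant is called splice-affecting when its score falls by at least the cut-off fraction relative to wild type.

```python
def score_reduction(wild_type: float, variant: float) -> float:
    """Fractional drop in the splice-site score caused by the variant."""
    if wild_type <= 0:
        raise ValueError("wild-type score must be positive for a relative reduction")
    return (wild_type - variant) / wild_type


def evaluate(cases, cutoff=0.10):
    """Return (sensitivity, specificity) for a given score-reduction cut-off.

    cases: iterable of (wild_type_score, variant_score, is_pathogenic).
    """
    tp = fn = tn = fp = 0
    for wild_type, variant, pathogenic in cases:
        predicted = score_reduction(wild_type, variant) >= cutoff
        if pathogenic and predicted:
            tp += 1
        elif pathogenic:
            fn += 1
        elif predicted:
            fp += 1
        else:
            tn += 1
    sensitivity = tp / (tp + fn) if tp + fn else float("nan")
    specificity = tn / (tn + fp) if tn + fp else float("nan")
    return sensitivity, specificity
```

    Repeating the call over a range of cut-offs gives the points of a receiver operating characteristic curve, which is how an optimal cut-off such as the 10% reported above would typically be chosen.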

  16. A Homozygous Mutation in the Tight-Junction Protein JAM3 Causes Hemorrhagic Destruction of the Brain, Subependymal Calcification, and Congenital Cataracts

    PubMed Central

    Mochida, Ganeshwaran H.; Ganesh, Vijay S.; Felie, Jillian M.; Gleason, Danielle; Hill, R. Sean; Clapham, Katie Rose; Rakiec, Daniel; Tan, Wen-Hann; Akawi, Nadia; Al-Saffar, Muna; Partlow, Jennifer N.; Tinschert, Sigrid; Barkovich, A. James; Ali, Bassam; Al-Gazali, Lihadh; Walsh, Christopher A.

    2010-01-01

    The tight junction, or zonula occludens, is a specialized cell-cell junction that regulates epithelial and endothelial permeability, and it is an essential component of the blood-brain barrier in the cerebrovascular endothelium. In addition to functioning as a diffusion barrier, tight junctions are also involved in signal transduction. In this study, we identified a homozygous mutation in the tight-junction protein gene JAM3 in a large consanguineous family from the United Arab Emirates. Some members of this family had a rare autosomal-recessive syndrome characterized by severe hemorrhagic destruction of the brain, subependymal calcification, and congenital cataracts. Their clinical presentation overlaps with some reported cases of pseudo-TORCH syndrome as well as with cases involving mutations in occludin, another component of the tight-junction complex. However, massive intracranial hemorrhage distinguishes these patients from others. Homozygosity mapping identified the disease locus in this family on chromosome 11q25 with a maximum multipoint LOD score of 6.15. Sequence analysis of genes in the candidate interval uncovered a mutation in the canonical splice-donor site of intron 5 of JAM3. RT-PCR analysis of a patient lymphoblast cell line confirmed abnormal splicing, leading to a frameshift mutation with early termination. JAM3 is known to be present in vascular endothelium, although its roles in cerebral vasculature have not been implicated. Our results suggest that JAM3 is essential for maintaining the integrity of the cerebrovascular endothelium as well as for normal lens development in humans. PMID:21109224

  17. Transcriptome analysis reveals the complexity of alternative splicing regulation in the fungus Verticillium dahliae.

    PubMed

    Jin, Lirong; Li, Guanglin; Yu, Dazhao; Huang, Wei; Cheng, Chao; Liao, Shengjie; Wu, Qijia; Zhang, Yi

    2017-02-06

    Alternative splicing (AS) regulation is extensive and shapes the functional complexity of higher organisms. However, the contribution of alternative splicing to fungal biology is not well studied. This study provides sequences of the transcriptomes of the plant wilt pathogen Verticillium dahliae, using two different strains and multiple methods for cDNA library preparations. We identified alternatively spliced mRNA isoforms in over half of the multi-exonic fungal genes. Over one thousand isoforms involve novel splice junctions identified by TopHat; multiple types of combinatory alternative splicing patterns were identified. We showed that one Verticillium gene could use four different 5' splice sites and two different 3' splice sites to produce up to five mature mRNAs, representing one of the most sophisticated alternative splicing models in eukaryotes other than animals. Hundreds of novel intron types involving a pair of new splice sites were identified in the V. dahliae genome. All the types of AS events were validated by using RT-PCR. Functional enrichment analysis showed that AS genes are involved in most known biological functions and enriched in ATP biosynthesis, sexual/asexual reproduction, morphogenesis, signal transduction etc., predicting that AS regulation modulates mRNA isoform output and shapes the proteome plasticity of the pathogen in response to environmental and developmental changes. These findings demonstrate the comprehensive alternative splicing mechanisms in a fungal plant pathogen, which argues for the importance of this fungus in developing complicated genome regulation strategies in eukaryotes.

  18. Structure of the human gene encoding the protein repair L-isoaspartyl (D-aspartyl) O-methyltransferase.

    PubMed

    DeVry, C G; Tsai, W; Clarke, S

    1996-11-15

    The protein L-isoaspartyl/D-aspartyl O-methyltransferase (EC 2.1.1.77) catalyzes the first step in the repair of proteins damaged in the aging process by isomerization or racemization reactions at aspartyl and asparaginyl residues. A single gene has been localized to human chromosome 6 and multiple transcripts arising through alternative splicing have been identified. Restriction enzyme mapping, subcloning, and DNA sequence analysis of three overlapping clones from a human genomic library in bacteriophage P1 indicate that the gene spans approximately 60 kb and is composed of 8 exons interrupted by 7 introns. Analysis of intron/exon splice junctions reveals that all of the donor and acceptor splice sites are in agreement with the mammalian consensus splicing sequence. Determination of transcription initiation sites by primer extension analysis of poly(A)+ mRNA from human brain identifies multiple start sites, with a major site 159 nucleotides upstream from the ATG start codon. Sequence analysis of the 5'-untranslated region demonstrates several potential cis-acting DNA elements including SP1, ETF, AP1, AP2, ARE, XRE, CREB, MED-1, and half-palindromic ERE motifs. The promoter of this methyltransferase gene lacks an identifiable TATA box but is characterized by a CpG island which begins approximately 723 nucleotides upstream of the major transcriptional start site and extends through exon 1 and into the first intron. These features are characteristic of housekeeping genes and are consistent with the wide tissue distribution observed for this methyltransferase activity.

  19. New Splice Site Acceptor Mutation in AIRE Gene in Autoimmune Polyendocrine Syndrome Type 1

    PubMed Central

    Mora, Mireia; Hanzu, Felicia A.; Pradas-Juni, Marta; Aranda, Gloria B.; Halperin, Irene; Puig-Domingo, Manuel; Aguiló, Sira; Fernández-Rebollo, Eduardo

    2014-01-01

    Autoimmune polyglandular syndrome type 1 (APS-1, OMIM 240300) is a rare autosomal recessive disorder, characterized by the presence of at least two of three major diseases: hypoparathyroidism, Addison’s disease, and chronic mucocutaneous candidiasis. We aim to identify the molecular defects and investigate the clinical and mutational characteristics in an index case and other members of a consanguineous family. We identified a novel homozygous mutation in the splice site acceptor (SSA) of intron 5 (c.653-1G>A) in two siblings with different clinical outcomes of APS-1. Coding DNA sequencing revealed that this AIRE mutation potentially compromised the recognition of the constitutive SSA of intron 5, splicing upstream onto a nearby cryptic SSA in intron 5. Surprisingly, the use of an alternative SSA entails the uncovering of a cryptic donor splice site in exon 5. This new transcript generates a truncated protein (p.A214fs67X) containing the first 213 amino acids and followed by 68 aberrant amino acids. The mutation affects the proper splicing, not only at the acceptor but also at the donor splice site, highlighting the complexity of recognizing suitable splicing sites and the importance of sequencing the intron-exon junctions for a more precise molecular diagnosis and correct genetic counseling. As both siblings were carrying the same mutation but exhibited a different APS-1 onset, and one of the brothers was not clinically diagnosed, our finding highlights the possibility to suspect mutations in the AIRE gene in cases of childhood chronic candidiasis and/or hypoparathyroidism otherwise unexplained, especially when the phenotype is associated with other autoimmune diseases. PMID:24988226

  20. Hereditary vitamin D resistant rickets: identification of a novel splice site mutation in the vitamin D receptor gene and successful treatment with oral calcium therapy.

    PubMed

    Ma, Nina S; Malloy, Peter J; Pitukcheewanont, Pisit; Dreimane, Daina; Geffner, Mitchell E; Feldman, David

    2009-10-01

    To study the vitamin D receptor (VDR) gene in a young girl with severe rickets and clinical features of hereditary vitamin D resistant rickets, including hypocalcemia, hypophosphatemia, partial alopecia, and elevated serum levels of 1,25-dihydroxyvitamin D. We amplified and sequenced DNA samples from blood from the patient, her mother, and the patient's two siblings. We also amplified and sequenced the VDR cDNA from RNA isolated from the patient's blood. DNA sequence analyses of the VDR gene showed that the patient was homozygous for a novel guanine to thymine substitution in the 5'-splice site in the exon 8-intron J junction. Analysis of the VDR cDNA using reverse transcriptase-polymerase chain reaction showed that exons 7 and 9 were fused, and that exon 8 was skipped. The mother was heterozygous for the mutation and the two siblings were unaffected. A novel splice site mutation was identified in the VDR gene that caused exon 8 to be skipped. The mutation deleted amino acids 303-341 in the VDR ligand-binding domain, which is expected to render the VDR non-functional. Nevertheless, successful outpatient treatment was achieved with frequent high doses of oral calcium.

  1. Deep RNA sequencing analysis of readthrough gene fusions in human prostate adenocarcinoma and reference samples

    PubMed Central

    2011-01-01

    Background Readthrough fusions across adjacent genes in the genome, or transcription-induced chimeras (TICs), have been estimated using expressed sequence tag (EST) libraries to involve 4-6% of all genes. Deep transcriptional sequencing (RNA-Seq) now makes it possible to study the occurrence and expression levels of TICs in individual samples across the genome. Methods We performed single-end RNA-Seq on three human prostate adenocarcinoma samples and their corresponding normal tissues, as well as brain and universal reference samples. We developed two bioinformatics methods to specifically identify TIC events: a targeted alignment method using artificial exon-exon junctions within 200,000 bp from adjacent genes, and genomic alignment allowing splicing within individual reads. We performed further experimental verification and characterization of selected TIC and fusion events using quantitative RT-PCR and comparative genomic hybridization microarrays. Results Targeted alignment against artificial exon-exon junctions yielded 339 distinct TIC events, including 32 gene pairs with multiple isoforms. The false discovery rate was estimated to be 1.5%. Spliced alignment to the genome was less sensitive, finding only 18% of those found by targeted alignment in 33-nt reads and 59% of those in 50-nt reads. However, spliced alignment revealed 30 cases of TICs with intervening exons, in addition to distant inversions, scrambled genes, and translocations. Our findings increase the catalog of observed TIC gene pairs by 66%. We verified 6 of 6 predicted TICs in all prostate samples, and 2 of 5 predicted novel distant gene fusions, both private events among 54 prostate tumor samples tested. Expression of TICs correlates with that of the upstream gene, which can explain the prostate-specific pattern of some TIC events and the restriction of the SLC45A3-ELK4 e4-e2 TIC to ERG-negative prostate samples, as confirmed in 20 matched prostate tumor and normal samples and 9 lung cancer cell lines. Conclusions Deep transcriptional sequencing and analysis with targeted and spliced alignment methods can effectively identify TIC events across the genome in individual tissues. Prostate and reference samples exhibit a wide range of TIC events, involving more genes than estimated previously using ESTs. Tissue specificity of TIC events is correlated with expression patterns of the upstream gene. Some TIC events, such as MSMB-NCOA4, may play functional roles in cancer. PMID:21261984
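
    The targeted alignment idea in this record (artificial exon-exon junctions between adjacent genes within 200,000 bp) can be sketched roughly as follows. This is only an illustration, not the authors' pipeline: the `Exon` type, the `junction_library` function, the `flank` length, and the plus-strand-only, 0-based coordinate convention are all assumptions made for the example.

```python
from typing import Dict, List, NamedTuple, Tuple


class Exon(NamedTuple):
    """Hypothetical exon record on the plus strand (0-based, end-exclusive)."""
    gene: str
    start: int
    end: int


def junction_library(
    genome: str,
    upstream_exons: List[Exon],
    downstream_exons: List[Exon],
    flank: int = 32,
    max_span: int = 200_000,
) -> Dict[Tuple[Exon, Exon], str]:
    """Join the 3' end of each upstream-gene exon to the 5' end of each
    downstream-gene exon, keeping pairs no more than max_span bp apart."""
    library = {}
    for up in upstream_exons:
        for down in downstream_exons:
            gap = down.start - up.end
            if 0 < gap <= max_span:
                # Take at most `flank` bases from each side of the junction.
                left = genome[max(up.start, up.end - flank):up.end]
                right = genome[down.start:min(down.end, down.start + flank)]
                library[(up, down)] = left + right
    return library


# Toy genome: exon "CCCCC" of gene g1, a 20 bp gap, exon "TTTTT" of gene g2.
genome = "A" * 10 + "CCCCC" + "G" * 20 + "TTTTT" + "A" * 10
up = Exon("g1", 10, 15)
down = Exon("g2", 35, 40)
print(junction_library(genome, [up], [down], flank=3))
```

    Reads would then be aligned against the resulting junction sequences with an ordinary short-read aligner; that step is outside the scope of this sketch.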

  2. Skipping of Exons by Premature Termination of Transcription and Alternative Splicing within Intron-5 of the Sheep SCF Gene: A Novel Splice Variant

    PubMed Central

    Saravanaperumal, Siva Arumugam; Pediconi, Dario; Renieri, Carlo; La Terza, Antonietta

    2012-01-01

    Stem cell factor (SCF) is a growth factor essential for haemopoiesis, mast cell development and melanogenesis. In the hematopoietic microenvironment (HM), SCF is produced in either a membrane-bound (−) or a soluble (+) form. Skin expression of SCF stimulates melanocyte migration, proliferation, differentiation, and survival. We report, for the first time, a novel mRNA splice variant of SCF from the skin of white merino sheep, identified via cloning and sequencing. Reverse transcriptase (RT)-PCR and molecular prediction revealed two different cDNA products of SCF. Full-length cDNA libraries were enriched by the method of rapid amplification of cDNA ends (RACE-PCR). Nucleotide sequencing and molecular prediction revealed that the primary 1519 base pair (bp) cDNA encodes a precursor protein of 274 amino acids (aa), commonly known as the ‘soluble’ isoform. In contrast, the shorter (835 and/or 725 bp) cDNA was found to be a ‘novel’ mRNA splice variant. It contains an open reading frame (ORF) corresponding to a truncated protein of 181 aa (vs 245 aa) with a unique C-terminus lacking the primary proteolytic segment (28 aa) right after the D175G site, which is necessary to produce the ‘soluble’ form of SCF. This alternative splice (AS) variant was explained by the complete nucleotide sequencing of the splice junction covering exon 5-intron (5)-exon 6 (948 bp) with a premature termination codon (PTC) whereby exons 6 to 9/10 are skipped (Cassette Exon, CE 6–9/10). We also demonstrated that the Northern blot analysis at transcript level is mediated via an intron-5 splicing event. Our data refine the structure of the SCF gene and clarify the presence (+) and/or absence (−) of primary proteolytic-cleavage site specific SCF splice variants. This work provides a basis for understanding the functional role and regulation of SCF in hair follicle melanogenesis in sheep beyond what was known in mice, humans and other mammals. PMID:22719917

  3. A compatible exon-exon junction database for the identification of exon skipping events using tandem mass spectrum data.

    PubMed

    Mo, Fan; Hong, Xu; Gao, Feng; Du, Lin; Wang, Jun; Omenn, Gilbert S; Lin, Biaoyang

    2008-12-16

    Alternative splicing is an important gene regulation mechanism. It is estimated that about 74% of multi-exon human genes have alternative splicing. High throughput tandem (MS/MS) mass spectrometry provides valuable information for rapidly identifying potentially novel alternatively-spliced protein products from experimental datasets. However, the ability to identify alternative splicing events through tandem mass spectrometry depends on the database against which the spectra are searched. We wrote scripts in perl, Bioperl, mysql and Ensembl API and built a theoretical exon-exon junction protein database to account for all possible combinations of exons for a gene while keeping the frame of translation (i.e., keeping only in-phase exon-exon combinations) from the Ensembl Core Database. Using our liver cancer MS/MS dataset, we identified a total of 488 non-redundant peptides that represent putative exon skipping events. Our exon-exon junction database provides the scientific community with an efficient means to identify novel alternatively spliced (exon skipping) protein isoforms using mass spectrometry data. This database will be useful in annotating genome structures using rapidly accumulating proteomics data.
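
    The notion of keeping only in-phase exon-exon combinations can be illustrated with a small sketch. It is not the authors' Perl/Ensembl implementation; it assumes, as a simplification, that a direct join of exon i to exon j preserves the reading frame exactly when the total coding length of the skipped exons is a multiple of three. The function name and the toy exon lengths are hypothetical.

```python
from itertools import combinations
from typing import List, Tuple


def in_phase_junctions(cds_exon_lengths: List[int]) -> List[Tuple[int, int]]:
    """Return exon index pairs (i, j), i < j, whose direct join keeps the frame.

    Assumption: the frame is preserved when the summed length of the exons
    skipped between i and j is divisible by 3 (adjacent exons skip nothing).
    """
    pairs = []
    for i, j in combinations(range(len(cds_exon_lengths)), 2):
        skipped = sum(cds_exon_lengths[i + 1:j])
        if skipped % 3 == 0:
            pairs.append((i, j))
    return pairs


# Toy gene with four coding exons of 90, 40, 50 and 60 nt.
print(in_phase_junctions([90, 40, 50, 60]))
```

    In this toy gene, skipping exon 1 alone (40 nt) or exon 2 alone (50 nt) breaks the frame, while skipping both together (90 nt) keeps it, so the pair (0, 3) is retained alongside the three adjacent joins.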

  4. A mechanism for exon skipping caused by nonsense or missense mutations in BRCA1 and other genes.

    PubMed

    Liu, H X; Cartegni, L; Zhang, M Q; Krainer, A R

    2001-01-01

    Point mutations can generate defective and sometimes harmful proteins. The nonsense-mediated mRNA decay (NMD) pathway minimizes the potential damage caused by nonsense mutations. In-frame nonsense codons located at a minimum distance upstream of the last exon-exon junction are recognized as premature termination codons (PTCs), targeting the mRNA for degradation. Some nonsense mutations cause skipping of one or more exons, presumably during pre-mRNA splicing in the nucleus; this phenomenon is termed nonsense-mediated altered splicing (NAS), and its underlying mechanism is unclear. By analyzing NAS in BRCA1, we show here that inappropriate exon skipping can be reproduced in vitro, and results from disruption of a splicing enhancer in the coding sequence. Enhancers can be disrupted by single nonsense, missense and translationally silent point mutations, without recognition of an open reading frame as such. These results argue against a nuclear reading-frame scanning mechanism for NAS. Coding-region single-nucleotide polymorphisms (cSNPs) within exonic splicing enhancers or silencers may affect the patterns or efficiency of mRNA splicing, which may in turn cause phenotypic variability and variable penetrance of mutations elsewhere in a gene.
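
    The positional rule for premature termination codons described in this record can be written as a short check. The abstract says only "a minimum distance"; the default of 50 nt used below is an assumption based on the commonly cited 50-55 nt rule, and the function name and mRNA coordinate convention (0-based position of the first base of the stop codon in the spliced transcript) are hypothetical.

```python
from typing import List


def is_premature_stop(
    stop_pos: int, exon_lengths: List[int], min_distance: int = 50
) -> bool:
    """Return True if a stop codon lies at least min_distance nt upstream
    of the last exon-exon junction of the spliced mRNA.

    stop_pos: 0-based index of the first nucleotide of the stop codon.
    exon_lengths: exon lengths in transcript order.
    min_distance: assumed threshold (not stated in the abstract).
    """
    if len(exon_lengths) < 2:
        return False  # no exon-exon junction at all
    last_junction = sum(exon_lengths[:-1])
    # Distance from the end of the stop codon to the last junction.
    distance = last_junction - (stop_pos + 3)
    return distance >= min_distance


# Three exons of 100 nt: the last junction sits at transcript position 200.
print(is_premature_stop(100, [100, 100, 100]))  # far upstream of the junction
print(is_premature_stop(180, [100, 100, 100]))  # too close to the junction
```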

  5. Splicing regulation and dysregulation of cholinergic genes expressed at the neuromuscular junction.

    PubMed

    Ohno, Kinji; Rahman, Mohammad Alinoor; Nazim, Mohammad; Nasrin, Farhana; Lin, Yingni; Takeda, Jun-Ichi; Masuda, Akio

    2017-08-01

    We humans have evolved by acquiring diversity of alternative RNA metabolisms including alternative means of splicing and transcribing non-coding genes, and not by acquiring new coding genes. Tissue-specific and developmental stage-specific alternative RNA splicing is achieved by tightly regulated spatiotemporal regulation of expressions and activations of RNA-binding proteins that recognize their cognate splicing cis-elements on nascent RNA transcripts. Genes expressed at the neuromuscular junction are also alternatively spliced. In addition, germline mutations provoke aberrant splicing by compromising binding of RNA-binding proteins, and cause congenital myasthenic syndromes (CMS). We present physiological splicing mechanisms of genes for agrin (AGRN), acetylcholinesterase (ACHE), MuSK (MUSK), acetylcholine receptor (AChR) α1 subunit (CHRNA1), and collagen Q (COLQ) in human, and their aberration in diseases. Splicing isoforms of AChE T, AChE H, and AChE R are generated by hnRNP H/F. Skipping of MUSK exon 10 makes a Wnt-insensitive MuSK isoform, which is unique to human. Skipping of exon 10 is achieved by coordinated binding of hnRNP C, YB-1, and hnRNP L to exon 10. Exon P3A of CHRNA1 is alternatively included to generate a non-functional AChR α1 subunit in human. Molecular dissection of splicing mutations in patients with CMS reveals that exon P3A is alternatively skipped by hnRNP H, polypyrimidine tract-binding protein 1, and hnRNP L. Similarly, analysis of an exonic mutation in COLQ exon 16 in a CMS patient discloses that constitutive splicing of exon 16 requires binding of serine arginine-rich splicing factor 1. Intronic and exonic splicing mutations in CMS enable us to dissect molecular mechanisms underlying alternative and constitutive splicing of genes expressed at the neuromuscular junction. This is an article for the special issue XVth International Symposium on Cholinergic Mechanisms. © 2017 International Society for Neurochemistry.

  6. RAP: RNA-Seq Analysis Pipeline, a new cloud-based NGS web application

    PubMed Central

    2015-01-01

    Background The study of RNA has been dramatically improved by the introduction of Next Generation Sequencing platforms allowing massive and cheap sequencing of selected RNA fractions, also providing information on strand orientation (RNA-Seq). The complexity of transcriptomes and of their regulative pathways makes RNA-Seq one of the most complex fields of NGS applications, addressing several aspects of the expression process (e.g. identification and quantification of expressed genes and transcripts, alternative splicing and polyadenylation, fusion genes and trans-splicing, post-transcriptional events, etc.). Moreover, the huge volume of data generated by NGS platforms introduces unprecedented computational and technological challenges to efficiently analyze and store sequence data and results. Methods In order to provide researchers with an effective and friendly resource for analyzing RNA-Seq data, we present here RAP (RNA-Seq Analysis Pipeline), a cloud computing web application implementing a complete but modular analysis workflow. This pipeline integrates both state-of-the-art bioinformatics tools for RNA-Seq analysis and in-house developed scripts to offer the user a comprehensive strategy for data analysis. RAP is able to perform quality checks (adopting FastQC and NGS QC Toolkit), identify and quantify expressed genes and transcripts (with Tophat, Cufflinks and HTSeq), detect alternative splicing events (using SpliceTrap) and chimeric transcripts (with ChimeraScan). This pipeline is also able to identify splicing junctions and constitutive or alternative polyadenylation sites (implementing custom analysis modules) and call for statistically significant differences in genes and transcripts expression, splicing pattern and polyadenylation site usage (using Cuffdiff2 and DESeq). Results Through a user friendly web interface, the RAP workflow can be suitably customized by the user and it is automatically executed on our cloud computing environment. This strategy allows access to bioinformatics tools and computational resources without specific bioinformatics and IT skills. RAP provides a set of tabular and graphical results that can be helpful to browse, filter and export analyzed data, according to the user's needs. PMID:26046471

  7. Ovarian Tumors related to Intronic Mutations in DICER1: A Report from the International Ovarian and Testicular Stromal Tumor Registry

    PubMed Central

    Schultz, Kris Ann; Harris, Anne; Messinger, Yoav; Sencer, Susan; Baldinger, Shari; Dehner, Louis P.; Hill, D. Ashley

    2015-01-01

    Germline DICER1 mutations have been described in individuals with pleuropulmonary blastoma (PPB), ovarian Sertoli-Leydig cell tumor (SLCT), sarcomas, multinodular goiter, thyroid carcinoma, cystic nephroma and other neoplastic conditions. Early results from the International Ovarian and Testicular Stromal Tumor Registry show germline DICER1 mutations in 48% of girls and women with SLCT. In this report, a young woman presented with ovarian undifferentiated sarcoma. Four years later, she presented with SLCT. She was successfully treated for both malignancies. Sequence results showed a germline intronic mutation in DICER1. This mutation results in an exact duplication of the six bases at the splice site at the intron 23 and exon 24 junction. Predicted improper splicing leads to inclusion of 10 bases of intronic sequence, frameshift and premature truncation of the protein disrupting the RNase IIIb domain. A second individual with SLCT was found to have an identical germline mutation. In each of the ovarian tumors, an additional somatic mutation in the RNase IIIb domain of DICER1 was found. In rare patients, germline intronic mutations in DICER1 that are predicted to cause incorrect splicing can also contribute to the pathogenesis of SLCT. PMID:26289771
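
    The link between the 10 included intronic bases and the predicted frameshift is simple modular arithmetic, sketched here. The function name is hypothetical, and the check ignores everything except the length change in the coding sequence.

```python
def frame_effect(length_change_nt: int) -> str:
    """Classify a coding-sequence length change (inserted or deleted bases).

    A change that is a multiple of 3 keeps the downstream reading frame;
    anything else shifts it.
    """
    return "in-frame" if length_change_nt % 3 == 0 else "frameshift"


# Inclusion of 10 intronic bases, as reported in the record above.
print(10 % 3, frame_effect(10))
```

    Because 10 leaves a remainder of 1 when divided by 3, every codon downstream of the insertion is read in a shifted frame, which is consistent with the premature truncation described in the abstract.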

  8. Global regulation of alternative RNA splicing by the SR-rich protein RBM39.

    PubMed

    Mai, Sanyue; Qu, Xiuhua; Li, Ping; Ma, Qingjun; Cao, Cheng; Liu, Xuan

    2016-08-01

    RBM39 is a serine/arginine-rich RNA-binding protein that is highly homologous to the splicing factor U2AF65. However, the role of RBM39 in alternative splicing is poorly understood. In this study, RBM39-mediated global alternative splicing was investigated using RNA-Seq and genome-wide RBM39-RNA interactions were mapped via cross-linking and immunoprecipitation coupled with deep sequencing (CLIP-Seq) in wild-type and RBM39-knockdown MCF-7 cells. RBM39 was involved in the up- or down-regulation of the transcript levels of various genes. Hundreds of alternative splicing events regulated by endogenous RBM39 were identified. The majority of these events were cassette exons. Genes containing RBM39-regulated alternative exons were found to be linked to G2/M transition, cellular response to DNA damage, adherens junctions and endocytosis. CLIP-Seq analysis showed that the binding site of RBM39 was mainly in proximity to 5' and 3' splicing sites. Considerable RBM39 binding to mRNAs encoding proteins involved in translation was observed. Of particular importance, ~20% of the alternative splicing events that were significantly regulated by RBM39 were similarly regulated by U2AF65. RBM39 is extensively involved in alternative splicing of RNA and helps regulate transcript levels. RBM39 may modulate alternative splicing similarly to U2AF65 by either directly binding to RNA or recruiting other splicing factors, such as U2AF65. The current study offers a genome-wide view of RBM39's regulatory function in alternative splicing. RBM39 may play important roles in multiple cellular processes by regulating both alternative splicing of RNA molecules and transcript levels. Copyright © 2016 Elsevier B.V. All rights reserved.

  9. Expressed sequence tag analysis of human RPE/choroid for the NEIBank Project: over 6000 non-redundant transcripts, novel genes and splice variants.

    PubMed

    Wistow, Graeme; Bernstein, Steven L; Wyatt, M Keith; Fariss, Robert N; Behal, Amita; Touchman, Jeffrey W; Bouffard, Gerald; Smith, Don; Peterson, Katherine

    2002-06-15

    The retinal pigment epithelium (RPE) and choroid comprise a functional unit of the eye that is essential to normal retinal health and function. Here we describe expressed sequence tag (EST) analysis of human RPE/choroid as part of a project for ocular bioinformatics. A cDNA library (cs) was made from human RPE/choroid and sequenced. Data were analyzed and assembled using the program GRIST (GRouping and Identification of Sequence Tags). Complete sequencing, Northern and Western blots, RH mapping, peptide antibody synthesis and immunofluorescence (IF) have been used to examine expression patterns and genome location for selected transcripts and proteins. Ten thousand individual sequence reads yield over 6300 unique gene clusters of which almost half have no matches with named genes. One of the most abundant transcripts is from a gene (named "alpha") that maps to the BBS1 region of chromosome 11. A number of tissue preferred transcripts are common to both RPE/choroid and iris. These include oculoglycan/opticin, for which an alternative splice form is detected in RPE/choroid, and "oculospanin" (Ocsp), a novel tetraspanin that maps to chromosome 17q. Antiserum to Ocsp detects expression in RPE, iris, ciliary body, and retinal ganglion cells by IF. A newly identified gene for a zinc-finger protein (TIRC) maps to 19q13.4. Variant transcripts of several genes were also detected. Most notably, the predominant form of Bestrophin represented in cs contains a longer open reading frame as a result of splice junction skipping. The unamplified cs library gives a view of the transcriptional repertoire of the adult RPE/choroid. A large number of potentially novel genes and splice forms and candidates for genetic diseases are revealed. Clones from this collection are being included in a large, nonredundant set for cDNA microarray construction.

  10. RAP: RNA-Seq Analysis Pipeline, a new cloud-based NGS web application.

    PubMed

    D'Antonio, Mattia; D'Onorio De Meo, Paolo; Pallocca, Matteo; Picardi, Ernesto; D'Erchia, Anna Maria; Calogero, Raffaele A; Castrignanò, Tiziana; Pesole, Graziano

    2015-01-01

    The study of RNA has been dramatically improved by the introduction of Next Generation Sequencing platforms allowing massive and cheap sequencing of selected RNA fractions, also providing information on strand orientation (RNA-Seq). The complexity of transcriptomes and of their regulative pathways makes RNA-Seq one of the most complex fields of NGS applications, addressing several aspects of the expression process (e.g. identification and quantification of expressed genes and transcripts, alternative splicing and polyadenylation, fusion genes and trans-splicing, post-transcriptional events, etc.). In order to provide researchers with an effective and friendly resource for analyzing RNA-Seq data, we present here RAP (RNA-Seq Analysis Pipeline), a cloud computing web application implementing a complete but modular analysis workflow. This pipeline integrates both state-of-the-art bioinformatics tools for RNA-Seq analysis and in-house developed scripts to offer the user a comprehensive strategy for data analysis. RAP is able to perform quality checks (adopting FastQC and NGS QC Toolkit), identify and quantify expressed genes and transcripts (with Tophat, Cufflinks and HTSeq), detect alternative splicing events (using SpliceTrap) and chimeric transcripts (with ChimeraScan). This pipeline is also able to identify splicing junctions and constitutive or alternative polyadenylation sites (implementing custom analysis modules) and call for statistically significant differences in genes and transcripts expression, splicing pattern and polyadenylation site usage (using Cuffdiff2 and DESeq). Through a user friendly web interface, the RAP workflow can be suitably customized by the user and it is automatically executed on our cloud computing environment. This strategy allows access to bioinformatics tools and computational resources without specific bioinformatics and IT skills. RAP provides a set of tabular and graphical results that can be helpful to browse, filter and export analyzed data, according to the user's needs.

  11. A serine–arginine-rich (SR) splicing factor modulates alternative splicing of over a thousand genes in Toxoplasma gondii

    PubMed Central

    Yeoh, Lee M.; Goodman, Christopher D.; Hall, Nathan E.; van Dooren, Giel G.; McFadden, Geoffrey I.; Ralph, Stuart A.

    2015-01-01

    Single genes are often subject to alternative splicing, which generates alternative mature mRNAs. This phenomenon is widespread in animals, and observed in over 90% of human genes. Recent data suggest it may also be common in Apicomplexa. These parasites have small genomes, and economy of DNA is evolutionarily favoured in this phylum. We investigated the mechanism of alternative splicing in Toxoplasma gondii, and have identified and localized TgSR3, a homologue of ASF/SF2 (alternative-splicing factor/splicing factor 2, a serine-arginine–rich, or SR protein) to a subnuclear compartment. In addition, we conditionally overexpressed this protein, which was deleterious to growth. qRT-PCR was used to confirm perturbation of splicing in a known alternatively-spliced gene. We performed high-throughput RNA-seq to determine the extent of splicing modulated by this protein. Current RNA-seq algorithms are poorly suited to compact parasite genomes, and hence we complemented existing tools by writing a new program, GeneGuillotine, that addresses this deficiency by segregating overlapping reads into distinct genes. In order to identify the extent of alternative splicing, we released another program, JunctionJuror, that detects changes in intron junctions. Using this program, we identified about 2000 genes that were constitutively alternatively spliced in T. gondii. Overexpressing the splice regulator TgSR3 perturbed alternative splicing in over 1000 genes. PMID:25870410

  12. Human heavy chain disease protein WIS: implications for the organization of immunoglobulin genes.

    PubMed Central

    Franklin, E C; Prelli, F; Frangione, B

    1979-01-01

    Protein WIS is a human gamma3 heavy (H) chain disease immunoglobulin variant whose amino acid sequence is most readily interpreted by postulating that three residues of the amino terminus are followed by a deletion of most of the variable (VH) domain, which ends at the variable-constant (VC) joining region. Then there is a stretch of eight residues, three of which are unusual, while the other five have striking homology to the VC junction sequence. This is followed by a second deletion, which ends at the beginning of the quadruplicated hinge region. These findings are consistent with mutations resulting in deletions of most of the gene coding for the V region and CH1 domain followed by splicing at the VC joining region and at the hinge. These structural features fit well the notion of genetic discontinuity between V and C genes and also suggest similar mechanisms of excision and splicing in the interdomain regions of the C gene of the heavy chain. PMID:106391

  13. An Analysis of the Sensitivity of Proteogenomic Mapping of Somatic Mutations and Novel Splicing Events in Cancer.

    PubMed

    Ruggles, Kelly V; Tang, Zuojian; Wang, Xuya; Grover, Himanshu; Askenazi, Manor; Teubl, Jennifer; Cao, Song; McLellan, Michael D; Clauser, Karl R; Tabb, David L; Mertins, Philipp; Slebos, Robbert; Erdmann-Gilmore, Petra; Li, Shunqiang; Gunawardena, Harsha P; Xie, Ling; Liu, Tao; Zhou, Jian-Ying; Sun, Shisheng; Hoadley, Katherine A; Perou, Charles M; Chen, Xian; Davies, Sherri R; Maher, Christopher A; Kinsinger, Christopher R; Rodland, Karen D; Zhang, Hui; Zhang, Zhen; Ding, Li; Townsend, R Reid; Rodriguez, Henry; Chan, Daniel; Smith, Richard D; Liebler, Daniel C; Carr, Steven A; Payne, Samuel; Ellis, Matthew J; Fenyő, David

    2016-03-01

    Improvements in mass spectrometry (MS)-based peptide sequencing provide a new opportunity to determine whether polymorphisms, mutations, and splice variants identified in cancer cells are translated. Herein, we apply a proteogenomic data integration tool (QUILTS) to illustrate protein variant discovery using whole genome, whole transcriptome, and global proteome datasets generated from a pair of luminal and basal-like breast-cancer-patient-derived xenografts (PDX). The sensitivity of proteogenomic analysis for single nucleotide variant (SNV) expression and novel splice junction (NSJ) detection was probed using multiple MS/MS sample process replicates defined here as an independent tandem MS experiment using identical sample material. Despite analysis of over 30 sample process replicates, only about 10% of SNVs (somatic and germline) detected by both DNA and RNA sequencing were observed as peptides. An even smaller proportion of peptides corresponding to NSJ observed by RNA sequencing were detected (<0.1%). Peptides mapping to DNA-detected SNVs without a detectable mRNA transcript were also observed, suggesting that transcriptome coverage was incomplete (∼80%). In contrast to germline variants, somatic variants were less likely to be detected at the peptide level in the basal-like tumor than in the luminal tumor, raising the possibility of differential translation or protein degradation effects. In conclusion, this large-scale proteogenomic integration allowed us to determine the degree to which mutations are translated and identify gaps in sequence coverage, thereby benchmarking current technology and progress toward whole cancer proteome and transcriptome analysis. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.

  14. Congenital amegakaryocytic thrombocytopenia in three siblings: molecular analysis of atypical clinical presentation.

    PubMed

    Gandhi, Manish J; Pendergrass, Thomas W; Cummings, Carrie C; Ihara, Kenji; Blau, C Anthony; Drachman, Jonathan G

    2005-10-01

    An 11-year-old girl, presenting with fatigue and bruising, was found to be profoundly pancytopenic. Bone marrow exam and clinical evaluation were consistent with aplastic anemia. Family members were studied as potential stem cell donors, revealing that both younger siblings displayed significant thrombocytopenia, whereas both parents had normal blood counts. We evaluated this pedigree to understand the unusually late presentation of congenital amegakaryocytic thrombocytopenia (CAMT). The coding region and the intron/exon junctions of MPL were sequenced from each family member. Vectors representing each of the mutations were constructed and tested for the ability to support growth of Baf3/Mpl(mutant) cells. All three siblings had elevated thrombopoietin levels. Analysis of genomic DNA demonstrated that each parent had mutations/polymorphisms in a single MPL allele and that each child was a compound heterozygote, having inherited both abnormal alleles. The maternal allele encoded a mutation of the donor splice-junction at the exon-3/intron-3 boundary. A mini-gene construct encoding normal vs mutant versions of the intron-3 donor-site demonstrated that physiologic splicing was significantly reduced in the mutant construct. Mutations that incompletely eliminate Mpl expression/function may result in delayed diagnosis of CAMT and confusion with aplastic anemia.

  15. ACTG: novel peptide mapping onto gene models.

    PubMed

    Choi, Seunghyuk; Kim, Hyunwoo; Paek, Eunok

    2017-04-15

    In many proteogenomic applications, mapping peptide sequences onto genome sequences can be very useful, because it allows us to understand the origins of the gene products. Existing software tools either take the genomic position of a peptide start site as an input or assume that the peptide sequence exactly matches the coding sequence of a given gene model. In the case of novel peptides resulting from genomic variations, especially structural variations such as alternative splicing, these existing tools cannot be directly applied unless users supply information about the variant, either its genomic position or its transcription model. Mapping potentially novel peptides to genome sequences, while allowing certain genomic variations, requires introducing novel gene models when aligning peptide sequences to gene structures. We have developed a new tool called ACTG (Amino aCids To Genome), which maps peptides to the genome, assuming all possible single exon skipping, junction variation allowing three edit distances from the original splice sites, exon extension and frame shift. In addition, it can also consider SNVs (single nucleotide variations) during the mapping phase if a user provides the VCF (variant call format) file as an input. Available at http://prix.hanyang.ac.kr/ACTG/search.jsp. Contact: eunokpaek@hanyang.ac.kr. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  16. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ruggles, Kelly V.; Tang, Zuojian; Wang, Xuya

    Improvements in mass spectrometry (MS)-based peptide sequencing provide a new opportunity to determine whether polymorphisms, mutations and splice variants identified in cancer cells are translated. Herein we therefore describe a proteogenomic data integration tool (QUILTS) and illustrate its application to whole genome, transcriptome and global MS peptide sequence datasets generated from a pair of luminal and basal-like breast cancer patient derived xenografts (PDX). The sensitivity of proteogenomic analysis for single nucleotide variant (SNV) expression and novel splice junction (NSJ) detection was probed using multiple MS/MS process replicates. Despite over thirty sample replicates, only about 10% of all SNV (somatic and germline) detected by both DNA and RNA sequencing were observed as peptides. An even smaller proportion of peptides corresponding to NSJ observed by RNA sequencing were detected (<0.1%). Peptides mapping to DNA-detected SNV without a detectable mRNA transcript were also observed, demonstrating that the transcriptome coverage was also incomplete (~80%). In contrast to germ-line variants, somatic variants were less likely to be detected at the peptide level in the basal-like tumor than in the luminal tumor, raising the possibility of differential translation or protein degradation effects. In conclusion, the QUILTS program integrates DNA, RNA and peptide sequencing to assess the degree to which somatic mutations are translated and therefore biologically active. By identifying gaps in sequence coverage, QUILTS benchmarks current technology and assesses progress towards whole cancer proteome and transcriptome analysis.

  17. Single Molecule Spectroscopy of Amino Acids and Peptides by Recognition Tunneling

    PubMed Central

    Zhao, Yanan; Ashcroft, Brian; Zhang, Peiming; Liu, Hao; Sen, Suman; Song, Weisi; Im, JongOne; Gyarfas, Brett; Manna, Saikat; Biswas, Sovan; Borges, Chad; Lindsay, Stuart

    2014-01-01

    The human proteome has millions of protein variants due to alternative RNA splicing and post-translational modifications, and variants that are related to diseases are frequently present in minute concentrations. For DNA and RNA, low concentrations can be amplified using the polymerase chain reaction, but there is no such reaction for proteins. Therefore, the development of single molecule protein sequencing is a critical step in the search for protein biomarkers. Here we show that single amino acids can be identified by trapping the molecules between two electrodes that are coated with a layer of recognition molecules and measuring the electron tunneling current across the junction. A given molecule can bind in more than one way in the junction, and we therefore use a machine-learning algorithm to distinguish between the sets of electronic ‘fingerprints’ associated with each binding motif. With this recognition tunneling technique, we are able to identify D, L enantiomers, a methylated amino acid, isobaric isomers, and short peptides. The results suggest that direct electronic sequencing of single proteins could be possible by sequentially measuring the products of processive exopeptidase digestion, or by using a molecular motor to pull proteins through a tunnel junction integrated with a nanopore. PMID:24705512

  18. Improving the efficiency of a user-driven learning system with reconfigurable hardware. Application to DNA splicing.

    PubMed

    Lemoine, E; Merceron, D; Sallantin, J; Nguifo, E M

    1999-01-01

    This paper describes a new approach to problem solving by splitting up problem component parts between software and hardware. Our main idea arises from the combination of two previously published works. The first one proposed a conceptual environment of concept modelling in which the machine and the human expert interact. The second one reported an algorithm based on a reconfigurable hardware system which outperforms any kind of previously published genetic data base scanning hardware or algorithms. Here we show how efficient the interaction between the machine and the expert is when the concept modelling is based on a reconfigurable hardware system. Their cooperation is thus achieved with a real-time interaction speed. The designed system has been partially applied to the recognition of primate splice junction sites in genetic sequences.

  19. The low information content of Neurospora splicing signals: implications for RNA splicing and intron origin.

    PubMed

    Collins, Richard A; Stajich, Jason E; Field, Deborah J; Olive, Joan E; DeAbreu, Diane M

    2015-05-01

    When we expressed a small (0.9 kb) nonprotein-coding transcript derived from the mitochondrial VS plasmid in the nucleus of Neurospora we found that it was efficiently spliced at one or more of eight 5' splice sites and ten 3' splice sites, which are present apparently by chance in the sequence. Further experimental and bioinformatic analyses of other mitochondrial plasmids, random sequences, and natural nuclear genes in Neurospora and other fungi indicate that fungal spliceosomes recognize a wide range of 5' splice site and branchpoint sequences and predict introns to be present at high frequency in random sequence. In contrast, analysis of intronless fungal nuclear genes indicates that branchpoint, 5' splice site and 3' splice site consensus sequences are underrepresented compared with random sequences. This underrepresentation of splicing signals is sufficient to deplete the nuclear genome of splice sites at locations that do not comprise biologically relevant introns. Thus, the splicing machinery can recognize a wide range of splicing signal sequences, but splicing still occurs with great accuracy, not because the splicing machinery distinguishes correct from incorrect introns, but because incorrect introns are substantially depleted from the genome. © 2015 Collins et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  20. ViennaNGS: A toolbox for building efficient next-generation sequencing analysis pipelines

    PubMed Central

    Wolfinger, Michael T.; Fallmann, Jörg; Eggenhofer, Florian; Amman, Fabian

    2015-01-01

    Recent achievements in next-generation sequencing (NGS) technologies lead to a high demand for reusable software components to easily compile customized analysis workflows for big genomics data. We present ViennaNGS, an integrated collection of Perl modules focused on building efficient pipelines for NGS data processing. It comes with functionality for extracting and converting features from common NGS file formats, computation and evaluation of read mapping statistics, as well as normalization of RNA abundance. Moreover, ViennaNGS provides software components for identification and characterization of splice junctions from RNA-seq data, parsing and condensing sequence motif data, automated construction of Assembly and Track Hubs for the UCSC genome browser, as well as wrapper routines for a set of commonly used NGS command line tools. PMID:26236465

  1. Multiple splicing defects in an intronic false exon.

    PubMed

    Sun, H; Chasin, L A

    2000-09-01

    Splice site consensus sequences alone are insufficient to dictate the recognition of real constitutive splice sites within the typically large transcripts of higher eukaryotes, and large numbers of pseudoexons flanked by pseudosplice sites with good matches to the consensus sequences can be easily designated. In an attempt to identify elements that prevent pseudoexon splicing, we have systematically altered known splicing signals, as well as immediately adjacent flanking sequences, of an arbitrarily chosen pseudoexon from intron 1 of the human hprt gene. The substitution of a 5' splice site that perfectly matches the 5' consensus combined with mutation to match the CAG/G sequence of the 3' consensus failed to get this model pseudoexon included as the central exon in a dhfr minigene context. Provision of a real 3' splice site and a consensus 5' splice site and removal of an upstream inhibitory sequence were necessary and sufficient to confer splicing on the pseudoexon. This activated context also supported the splicing of a second pseudoexon sequence containing no apparent enhancer. Thus, both the 5' splice site sequence and the polypyrimidine tract of the pseudoexon are defective despite their good agreement with the consensus. On the other hand, the pseudoexon body did not exert a negative influence on splicing. The introduction into the pseudoexon of a sequence selected for binding to ASF/SF2 or its replacement with beta-globin exon 2 only partially reversed the effect of the upstream negative element and the defective polypyrimidine tract. These results support the idea that exon-bridging enhancers are not a prerequisite for constitutive exon definition and suggest that intrinsically defective splice sites and negative elements play important roles in distinguishing the real splicing signal from the vast number of false splicing signals.

  2. Transcriptional diversity during lineage commitment of human blood progenitors.

    PubMed

    Chen, Lu; Kostadima, Myrto; Martens, Joost H A; Canu, Giovanni; Garcia, Sara P; Turro, Ernest; Downes, Kate; Macaulay, Iain C; Bielczyk-Maczynska, Ewa; Coe, Sophia; Farrow, Samantha; Poudel, Pawan; Burden, Frances; Jansen, Sjoert B G; Astle, William J; Attwood, Antony; Bariana, Tadbir; de Bono, Bernard; Breschi, Alessandra; Chambers, John C; Consortium, Bridge; Choudry, Fizzah A; Clarke, Laura; Coupland, Paul; van der Ent, Martijn; Erber, Wendy N; Jansen, Joop H; Favier, Rémi; Fenech, Matthew E; Foad, Nicola; Freson, Kathleen; van Geet, Chris; Gomez, Keith; Guigo, Roderic; Hampshire, Daniel; Kelly, Anne M; Kerstens, Hindrik H D; Kooner, Jaspal S; Laffan, Michael; Lentaigne, Claire; Labalette, Charlotte; Martin, Tiphaine; Meacham, Stuart; Mumford, Andrew; Nürnberg, Sylvia; Palumbo, Emilio; van der Reijden, Bert A; Richardson, David; Sammut, Stephen J; Slodkowicz, Greg; Tamuri, Asif U; Vasquez, Louella; Voss, Katrin; Watt, Stephen; Westbury, Sarah; Flicek, Paul; Loos, Remco; Goldman, Nick; Bertone, Paul; Read, Randy J; Richardson, Sylvia; Cvejic, Ana; Soranzo, Nicole; Ouwehand, Willem H; Stunnenberg, Hendrik G; Frontini, Mattia; Rendon, Augusto

    2014-09-26

    Blood cells derive from hematopoietic stem cells through stepwise fating events. To characterize gene expression programs driving lineage choice, we sequenced RNA from eight primary human hematopoietic progenitor populations representing the major myeloid commitment stages and the main lymphoid stage. We identified extensive cell type-specific expression changes: 6711 genes and 10,724 transcripts, enriched in non-protein-coding elements at early stages of differentiation. In addition, we found 7881 novel splice junctions and 2301 differentially used alternative splicing events, enriched in genes involved in regulatory processes. We demonstrated experimentally cell-specific isoform usage, identifying nuclear factor I/B (NFIB) as a regulator of maturation of the megakaryocyte, the platelet precursor. Our data highlight the complexity of fating events in closely related progenitor populations, the understanding of which is essential for the advancement of transplantation and regenerative medicine. Copyright © 2014, American Association for the Advancement of Science.

  3. Revealing the transcriptomic complexity of switchgrass by PacBio long-read sequencing.

    PubMed

    Zuo, Chunman; Blow, Matthew; Sreedasyam, Avinash; Kuo, Rita C; Ramamoorthy, Govindarajan Kunde; Torres-Jerez, Ivone; Li, Guifen; Wang, Mei; Dilworth, David; Barry, Kerrie; Udvardi, Michael; Schmutz, Jeremy; Tang, Yuhong; Xu, Ying

    2018-01-01

    Switchgrass (Panicum virgatum L.) is an important bioenergy crop widely used for lignocellulosic research. While extensive transcriptomic analyses have been conducted on this species using short read-based sequencing techniques, very little has been reliably derived regarding alternatively spliced (AS) transcripts. We present an analysis of transcriptomes of six switchgrass tissue types pooled together, sequenced using Pacific Biosciences (PacBio) single-molecular long-read technology. Our analysis identified 105,419 unique transcripts covering 43,570 known genes and 8795 previously unknown genes. 45,168 are novel transcripts of known genes. A total of 60,096 AS transcripts are identified, 45,628 being novel. We have also predicted 1549 transcripts of genes involved in cell wall construction and remodeling, 639 being novel transcripts of known cell wall genes. Most of the predicted transcripts are validated against Illumina-based short reads. Specifically, 96% of the splice junction sites in all the unique transcripts are validated by at least five Illumina reads. Comparisons between genes derived from our identified transcripts and the current genome annotation revealed that among the gene set predicted by both analyses, 16,640 have different exon-intron structures. Overall, a substantial amount of new information is derived from the PacBio RNA data regarding both the transcriptome and the genome of switchgrass.

  4. Mutation Analysis of SLC26A4 for Pendred Syndrome and Nonsyndromic Hearing Loss by High-Resolution Melting

    PubMed Central

    Chen, Neng; Tranebjærg, Lisbeth; Rendtorff, Nanna Dahl; Schrijver, Iris

    2011-01-01

    Pendred syndrome and DFNB4 (autosomal recessive nonsyndromic congenital deafness, locus 4) are associated with autosomal recessive congenital sensorineural hearing loss and mutations in the SLC26A4 gene. Extensive allelic heterogeneity, however, necessitates analysis of all exons and splice sites to identify mutations for individual patients. Although Sanger sequencing is the gold standard for mutation detection, screening methods supplemented with targeted sequencing can provide a cost-effective alternative. One such method, denaturing high-performance liquid chromatography, was developed for clinical mutation detection in SLC26A4. However, this method inherently cannot distinguish homozygous changes from wild-type sequences. High-resolution melting (HRM), on the other hand, can detect heterozygous and homozygous changes cost-effectively, without any post-PCR modifications. We developed a closed-tube HRM mutation detection method specific for SLC26A4 that can be used in the clinical diagnostic setting. Twenty-eight primer pairs were designed to cover all 21 SLC26A4 exons and splice junction sequences. Using the resulting amplicons, initial HRM analysis detected all 45 variants previously identified by sequencing. Subsequently, a 384-well plate format was designed for up to three patient samples per run. Blinded HRM testing on these plates of patient samples collected over 1 year in a clinical diagnostic laboratory accurately detected all variants identified by sequencing. In conclusion, HRM with targeted sequencing is a reliable, simple, and cost-effective method for SLC26A4 mutation screening and detection. PMID:21704276

  5. Splice junction mutations at the Menkes locus that maintain some proper splicing are associated with milder clinical phenotypes, including typical occipital horn syndrome

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kaler, S.G.; Gahl, W.A.

    1994-09-01

    Menkes disease is an X linked recessive disorder of copper metabolism produced by abnormalities in a gene that encodes a copper transporting ATPase. The clinical spectrum of Menkes disease includes a range of neurological severity from the classical type to the occipital horn syndrome (OHS) in which slightly subnormal intelligence or signs of autonomic dysfunction are the only neurologic abnormalities. We previously documented a distinctive, less severe Menkes phenotype associated with a +3 intronic splice donor mutation at the 3' end of the gene in which exon skipping occurred but some normally spliced message was also detectable. We now report a similar splicing mutation in a patient with a typical OHS phenotype: an A to G transition at the 2 exonic position of a splice donor site in the middle of the Menkes coding sequence. Some normally sized transcripts are evident by RT-PCR of lymphoblast mRNA from this individual, as well as 2 truncated fragments generated by exon skipping and activation of a cryptic splice acceptor site, respectively. The predicted effect of the mutation on the gene product involves a serine to glycine substitution in a noncritical region of the Menkes ATPase from the patient's normally sized message, and premature termination due to translational frameshift in both truncated transcripts. The mutation eliminates a Dde 1 restriction site in the gene which provided a method to rapidly screen other family members, and revealed that the patient's mother is a non-carrier. The mutational base change was not present in 25 normal X chromosomes studied. Preliminary analysis of the Menkes locus in 5 other Menkes disease families indicates aberrant mRNA splicing in 2. Our findings confirm allelism at the Menkes locus, indicate that splice mutations are a relatively common mutational event in Menkes disease, and suggest that splice mutations in which some normal splicing is preserved may underlie milder Menkes disease variants, including OHS.

  6. [Deregulation of pre-messenger RNA splicing and rare diseases].

    PubMed

    de la Grange, Pierre

    2016-12-01

    Most protein-coding human genes are subject to alternative pre-mRNA splicing. This mechanism is highly regulated to precisely modulate detection of specific splice sites. This regulation is under control of the spliceosome, and several splicing factors are also required to modulate the alternative usage of splice sites. Splicing factors and spliceosome components recognize splicing signals and regulatory sequences of the pre-mRNAs. These splicing sequences make splicing susceptible to polymorphisms and mutations. Examples of associations between human rare diseases and defects in pre-messenger RNA splicing are accumulating. Although many alterations are caused by mutations in splicing sequences (i.e., cis acting mutations), recent studies described the disruptive impact of mutations within spliceosome components or splicing factors (i.e., trans acting mutations). As knowledge of splicing regulation has grown, several approaches have been developed to compensate for the effect of deleterious mutations and to restore sufficient amounts of functional protein. © 2016 médecine/sciences – Inserm.

  7. Self-Organizing Hidden Markov Model Map (SOHMMM).

    PubMed

    Ferles, Christos; Stafylopatis, Andreas

    2013-12-01

    A hybrid approach combining the Self-Organizing Map (SOM) and the Hidden Markov Model (HMM) is presented. The Self-Organizing Hidden Markov Model Map (SOHMMM) establishes a cross-section between the theoretic foundations and algorithmic realizations of its constituents. The respective architectures and learning methodologies are fused in an attempt to meet the increasing requirements imposed by the properties of deoxyribonucleic acid (DNA), ribonucleic acid (RNA), and protein chain molecules. The fusion and synergy of the SOM unsupervised training and the HMM dynamic programming algorithms bring forth a novel on-line gradient descent unsupervised learning algorithm, which is fully integrated into the SOHMMM. Since the SOHMMM carries out probabilistic sequence analysis with little or no prior knowledge, it can have a variety of applications in clustering, dimensionality reduction and visualization of large-scale sequence spaces, and also, in sequence discrimination, search and classification. Two series of experiments based on artificial sequence data and splice junction gene sequences demonstrate the SOHMMM's characteristics and capabilities. Copyright © 2013 Elsevier Ltd. All rights reserved.

  8. Concurrent and Accurate Short Read Mapping on Multicore Processors.

    PubMed

    Martínez, Héctor; Tárraga, Joaquín; Medina, Ignacio; Barrachina, Sergio; Castillo, Maribel; Dopazo, Joaquín; Quintana-Ortí, Enrique S

    2015-01-01

    We introduce a parallel aligner with a work-flow organization for fast and accurate mapping of RNA sequences on servers equipped with multicore processors. Our software, HPG Aligner SA (an open-source application available at http://www.opencb.org), exploits a suffix array to rapidly map a large fraction of the RNA fragments (reads), and leverages the accuracy of the Smith-Waterman algorithm to deal with conflictive reads. The aligner is enhanced with a careful strategy to detect splice junctions based on an adaptive division of RNA reads into small segments (or seeds), which are then mapped onto a number of candidate alignment locations, providing crucial information for the successful alignment of the complete reads. The experimental results on a platform with Intel multicore technology report the parallel performance of HPG Aligner SA, on RNA reads of 100-400 nucleotides, which excels in execution time/sensitivity relative to state-of-the-art aligners such as TopHat 2+Bowtie 2, MapSplice, and STAR.
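
    The abstract above names the Smith-Waterman algorithm as the fallback for reads that the suffix array cannot place. As a minimal illustration of that technique only (not HPG Aligner SA's implementation), the following Python sketch computes a best local alignment with a linear gap penalty. The function name and the scoring values (match=2, mismatch=-1, gap=-2) are assumptions chosen for the example.

```python
def smith_waterman(a, b, match=2, mismatch=-1, gap=-2):
    """Return (score, aligned_a, aligned_b) for one best local alignment.

    Scoring values are illustrative assumptions, not taken from any aligner.
    """
    rows, cols = len(a) + 1, len(b) + 1
    # H[i][j] = best score of a local alignment ending at a[i-1], b[j-1]
    H = [[0] * cols for _ in range(rows)]
    best, best_pos = 0, (0, 0)
    for i in range(1, rows):
        for j in range(1, cols):
            diag = H[i - 1][j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # The floor at 0 is what makes the alignment local
            H[i][j] = max(0, diag, H[i - 1][j] + gap, H[i][j - 1] + gap)
            if H[i][j] > best:
                best, best_pos = H[i][j], (i, j)

    # Trace back from the best cell until a zero-score cell is reached
    i, j = best_pos
    out_a, out_b = [], []
    while i > 0 and j > 0 and H[i][j] > 0:
        diag = H[i - 1][j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
        if H[i][j] == diag:
            out_a.append(a[i - 1]); out_b.append(b[j - 1])
            i, j = i - 1, j - 1
        elif H[i][j] == H[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-")
            i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1])
            j -= 1
    return best, "".join(reversed(out_a)), "".join(reversed(out_b))
```

    The quadratic cost of filling the score matrix is one reason an aligner might reserve this step for the minority of reads that faster index-based mapping leaves unresolved.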

  9. Language study on Spliced Semigraph using Folding techniques

    NASA Astrophysics Data System (ADS)

    Thiagarajan, K.; Padmashree, J.

    2018-04-01

    In this paper, we propose an algorithm to identify cut vertices and cut edges for an n-Cut Spliced Semigraph, to splice the n-Cut Spliced Semigraph using cut vertices, cut edges, or a combination of a cut vertex and a cut edge, and to apply a sequence of foldings to the spliced semigraph to obtain the semigraph quadruple η(S)=(2, 1, 1, 1). We observed that splicing and folding using both cut vertices and cut edges is applicable only for an n-Cut Spliced Semigraph where n > 2. Also, we transformed the spliced semigraph into a tree structure and studied the language for the semigraph with n+2 vertices and n+1 semivertices using the Depth First Edge Sequence algorithm, obtaining the language structure as a sequence over the alphabet ‘a’ and ‘b’.

  10. A novel recessive mutation in the gene ELOVL4 causes a neuro-ichthyotic disorder with variable expressivity

    PubMed Central

    2014-01-01

    Background A rare neuro-ichthyotic disorder characterized by ichthyosis, spastic quadriplegia and intellectual disability and caused by recessive mutations in ELOVL4, encoding elongase-4 protein has recently been described. The objective of the study was to search for sequence variants in the gene ELOVL4 in three affected individuals of a consanguineous Pakistani family exhibiting features of neuro-ichthyotic disorder. Methods Linkage in the family was searched by genotyping microsatellite markers linked to the gene ELOVL4, mapped at chromosome 6p14.1. Exons and splice junction sites of the gene ELOVL4 were polymerase chain reaction amplified and sequenced in an automated DNA sequencer. Results DNA sequence analysis revealed a novel homozygous nonsense mutation (c.78C > G; p.Tyr26*). Conclusions Our report further confirms the recently described ELOVL4-related neuro-ichthyosis and shows that the neurological phenotype can be absent in some individuals. PMID:24571530

  11. Identification and analysis of pig chimeric mRNAs using RNA sequencing data

    PubMed Central

    2012-01-01

    Background Gene fusion is ubiquitous over the course of evolution. It is expected to increase the diversity and complexity of transcriptomes and proteomes through chimeric sequence segments or altered regulation. However, chimeric mRNAs in pigs remain unclear. Here we identified some chimeric mRNAs in pigs and analyzed the expression of them across individuals and breeds using RNA-sequencing data. Results The present study identified 669 putative chimeric mRNAs in pigs, of which 251 chimeric candidates were detected in a set of RNA-sequencing data. The 618 candidates had clear trans-splicing sites, 537 of which obeyed the canonical GU-AG splice rule. Only two putative pig chimera variants whose fusion junction was overlapped with that of a known human chimeric mRNA were found. A set of unique chimeric events were considered middle variances in the expression across individuals and breeds, and revealed non-significant variance between sexes. Furthermore, the genomic region of the 5′ partner gene shares a similar DNA sequence with that of the 3′ partner gene for 458 putative chimeric mRNAs. The 81 of those shared DNA sequences significantly matched the known DNA-binding motifs in the JASPAR CORE database. Four DNA motifs shared in parental genomic regions had significant similarity with known human CTCF binding sites. Conclusions The present study provided detailed information on some pig chimeric mRNAs. We proposed a model that trans-acting factors, such as CTCF, induced the spatial organisation of parental genes to the same transcriptional factory so that parental genes were coordinatively transcribed to give birth to chimeric mRNAs. PMID:22925561

  12. Deep Surveying of the Transcriptional and Alternative Splicing Signatures for Decidual CD8+ T Cells at the First Trimester of Human Healthy Pregnancy.

    PubMed

    Zeng, Weihong; Liu, Xinmei; Liu, Zhicui; Zheng, Ying; Yu, Tiantian; Fu, Shaliu; Li, Xiao; Zhang, Jing; Zhang, Siming; Ma, Xiaoling; Liu, Xiao-Rui; Qin, Xiaoli; Khanniche, Asma; Zhang, Yan; Tian, Fuju; Lin, Yi

    2018-01-01

    Decidual CD8+ (dCD8) T cells have been proposed to play important roles in immune protection against the invading pathogens and in tolerance toward the growing semi-allogeneic fetus during early pregnancy. However, their phenotypic and functional characteristics remain poorly defined. Here, we performed the first analysis of the transcriptional and alternative splicing (AS) signatures for human first-trimester dCD8 T cells using high-throughput mRNA sequencing. Our data revealed that dCD8 T cells have distinct transcriptional and AS landscapes when compared with their autologous peripheral blood CD8+ (pCD8) T counterparts. Furthermore, human dCD8 T cells were observed to contain CD8-Treg and effector-memory T-cell subsets, and display enhanced functionality in terms of degranulation and cytokine production on a per-cell basis. Additionally, we have identified the novel splice junctions that use a high ratio of the non-canonical splicing motif GC-AG and found that AS is not a major contributor to the gene expression-level changes between paired pCD8 and dCD8 T cells. Together, our findings not only provide a comprehensive framework of the transcriptional and AS landscapes but also reveal the functional feature of human dCD8 T cells, which are of great importance in understanding the biology of these cells and the physiology of human healthy pregnancy.

  13. Investigating DNA-, RNA-, and protein-based features as a means to discriminate pathogenic synonymous variants.

    PubMed

    Livingstone, Mark; Folkman, Lukas; Yang, Yuedong; Zhang, Ping; Mort, Matthew; Cooper, David N; Liu, Yunlong; Stantic, Bela; Zhou, Yaoqi

    2017-10-01

    Synonymous single-nucleotide variants (SNVs), although they do not alter the encoded protein sequences, have been implicated in many genetic diseases. Experimental studies indicate that synonymous SNVs can lead to changes in the secondary and tertiary structures of DNA and RNA, thereby affecting translational efficiency, cotranslational protein folding as well as the binding of DNA-/RNA-binding proteins. However, the importance of these various features in disease phenotypes is not clearly understood. Here, we have built a support vector machine (SVM) model (termed DDIG-SN) as a means to discriminate disease-causing synonymous variants. The model was trained and evaluated on nearly 900 disease-causing variants. The method achieves robust performance with the area under the receiver operating characteristic curve of 0.84 and 0.85 for protein-stratified 10-fold cross-validation and independent testing, respectively. We were able to show that the disease-causing effects in the immediate proximity to exon-intron junctions (1-3 bp) are driven by the loss of splicing motif strength, whereas the gain of splicing motif strength is the primary cause in regions further away from the splice site (4-69 bp). The method is available as a part of the DDIG server at http://sparks-lab.org/ddig. © 2017 Wiley Periodicals, Inc.

  14. A role for exon sequences in alternative splicing of the human fibronectin gene.

    PubMed Central

    Mardon, H J; Sebastio, G; Baralle, F E

    1987-01-01

    Exon EDIIIA of the fibronectin (Fn) gene is alternatively spliced via pathways which either skip or include the whole exon in the messenger RNA (mRNA). We have investigated the role of EDIIIA exon sequences in the human Fn gene in determining alternative splicing of this exon during transient expression of alpha globin/Fn minigene hybrids in HeLa cells. We demonstrate that a DNA sequence of 81 bp within the central region of exon EDIIIA is required for alternative splicing during processing of the primary transcript to generate both EDIIIA+ and EDIIIA- mRNAs. Furthermore, alternative splicing of EDIIIA only occurs when this sequence is present in the correct orientation, since when it is in antisense orientation splicing always occurs via exon-skipping, generating EDIIIA- mRNA. PMID:3671064

  15. Survey of gene splicing algorithms based on reads.

    PubMed

    Si, Xiuhua; Wang, Qian; Zhang, Lei; Wu, Ruo; Ma, Jiquan

    2017-11-02

    Gene splicing is the process of assembling a large number of unordered short sequence fragments to the original genome sequence as accurately as possible. Several popular splicing algorithms based on reads are reviewed in this article, including reference genome algorithms and de novo splicing algorithms (Greedy-extension, Overlap-Layout-Consensus graph, De Bruijn graph). We also discuss a new splicing method based on the MapReduce strategy and Hadoop. By comparing these algorithms, some conclusions are drawn and some suggestions on gene splicing research are made.
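
    Of the de novo approaches listed in the abstract above, the De Bruijn graph is the simplest to sketch. The Python example below is an illustrative assumption, not code from the survey: it links each (k-1)-mer to the (k-1)-mers that follow it in some read, then extends a contig while the path is unambiguous. The function names `de_bruijn_graph` and `walk_unambiguous` are hypothetical, and the sketch ignores sequencing errors, reverse complements, and coverage.

```python
from collections import defaultdict


def de_bruijn_graph(reads, k):
    """Build a De Bruijn graph: each k-mer adds an edge prefix -> suffix,
    where prefix and suffix are its two overlapping (k-1)-mers."""
    graph = defaultdict(set)
    for read in reads:
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            graph[kmer[:-1]].add(kmer[1:])
    return dict(graph)


def walk_unambiguous(graph, start):
    """Extend a contig from `start` while the current node has exactly one
    successor that has not been visited yet."""
    contig, node, seen = start, start, {start}
    while len(graph.get(node, ())) == 1:
        (nxt,) = graph[node]
        if nxt in seen:  # stop on a cycle rather than loop forever
            break
        contig += nxt[-1]  # each step contributes one new base
        seen.add(nxt)
        node = nxt
    return contig


if __name__ == "__main__":
    g = de_bruijn_graph(["ACGTT", "GTTCA"], k=3)
    print(walk_unambiguous(g, "AC"))
```

    In this toy input the two reads overlap on GTT, so the walk from AC joins them into a single contig. Real assemblers additionally have to resolve branches and repeats, which this sketch simply stops at.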

  16. Molecular Characterization of Voltage-Gated Sodium Channels and Their Relations with Paralytic Shellfish Toxin Bioaccumulation in the Pacific Oyster Crassostrea gigas

    PubMed Central

    Boullot, Floriane; Castrec, Justine; Bidault, Adeline; Dantas, Natanael; Payton, Laura; Perrigault, Mickael; Tran, Damien; Amzil, Zouher; Boudry, Pierre; Soudant, Philippe; Hégaret, Hélène; Fabioux, Caroline

    2017-01-01

    Paralytic shellfish toxins (PST) bind to voltage-gated sodium channels (Nav) and block conduction of action potential in excitable cells. This study aimed to (i) characterize Nav sequences in Crassostrea gigas and (ii) investigate a putative relation between Nav and PST-bioaccumulation in oysters. The phylogenetic analysis highlighted two types of Nav in C. gigas: a Nav1 (CgNav1) and a Nav2 (CgNav2) with sequence properties of sodium-selective and sodium/calcium-selective channels, respectively. Three alternative splice transcripts of CgNav1 named A, B and C, were characterized. The expression of CgNav1, analyzed by in situ hybridization, is specific to nervous cells and to structures corresponding to neuromuscular junctions. Real-time PCR analyses showed a strong expression of CgNav1A in the striated muscle while CgNav1B is mainly expressed in visceral ganglia. CgNav1C expression is ubiquitous. The PST binding site (domain II) of CgNav1 variants possess an amino acid Q that could potentially confer a partial saxitoxin (STX)-resistance to the channel. The CgNav1 genotype or alternative splicing would not be the key point determining PST bioaccumulation level in oysters. PMID:28106838

  17. Phosphothreonine 218 is required for the function of SR45.1 in regulating flower petal development in Arabidopsis

    USDA-ARS's Scientific Manuscript database

    RNA splicing is crucial to the production of mature messenger RNAs (mRNA). The protein Arginine/Serine-rich 45 (SR45) acts as an RNA splicing activator and initiates the spliceosome assembly. It is also a peripheral component of the exon-exon junction complex, which assures the quality and availabil...

  18. Optimization of oligonucleotide arrays and RNA amplification protocols for analysis of transcript structure and alternative splicing.

    PubMed

    Castle, John; Garrett-Engele, Phil; Armour, Christopher D; Duenwald, Sven J; Loerch, Patrick M; Meyer, Michael R; Schadt, Eric E; Stoughton, Roland; Parrish, Mark L; Shoemaker, Daniel D; Johnson, Jason M

    2003-01-01

    Microarrays offer a high-resolution means for monitoring pre-mRNA splicing on a genomic scale. We have developed a novel, unbiased amplification protocol that permits labeling of entire transcripts. Also, hybridization conditions, probe characteristics, and analysis algorithms were optimized for detection of exons, exon-intron edges, and exon junctions. These optimized protocols can be used to detect small variations and isoform mixtures, map the tissue specificity of known human alternative isoforms, and provide a robust, scalable platform for high-throughput discovery of alternative splicing.

  19. Optimization of oligonucleotide arrays and RNA amplification protocols for analysis of transcript structure and alternative splicing

    PubMed Central

    Castle, John; Garrett-Engele, Phil; Armour, Christopher D; Duenwald, Sven J; Loerch, Patrick M; Meyer, Michael R; Schadt, Eric E; Stoughton, Roland; Parrish, Mark L; Shoemaker, Daniel D; Johnson, Jason M

    2003-01-01

    Microarrays offer a high-resolution means for monitoring pre-mRNA splicing on a genomic scale. We have developed a novel, unbiased amplification protocol that permits labeling of entire transcripts. Also, hybridization conditions, probe characteristics, and analysis algorithms were optimized for detection of exons, exon-intron edges, and exon junctions. These optimized protocols can be used to detect small variations and isoform mixtures, map the tissue specificity of known human alternative isoforms, and provide a robust, scalable platform for high-throughput discovery of alternative splicing. PMID:14519201

  20. Abiotic Stresses Modulate Landscape of Poplar Transcriptome via Alternative Splicing, Differential Intron Retention, and Isoform Ratio Switching

    PubMed Central

    Filichkin, Sergei A.; Hamilton, Michael; Dharmawardhana, Palitha D.; Singh, Sunil K.; Sullivan, Christopher; Ben-Hur, Asa; Reddy, Anireddy S. N.; Jaiswal, Pankaj

    2018-01-01

    Abiotic stresses affect plant physiology, development, and growth, and alter pre-mRNA splicing. Western poplar is a model woody tree and a potential bioenergy feedstock. To investigate the extent of stress-regulated alternative splicing (AS), we conducted an in-depth survey of leaf, root, and stem xylem transcriptomes under drought, salt, or temperature stress. Analysis of approximately one billion genome-aligned RNA-Seq reads from tissue- or stress-specific libraries revealed over fifteen million novel splice junctions. Transcript models supported by both RNA-Seq and single molecule isoform sequencing (Iso-Seq) data revealed a broad array of novel stress- and/or tissue-specific isoforms. Analysis of Iso-Seq data also resulted in the discovery of 15,087 novel transcribed regions of which 164 show AS. Our findings demonstrate that abiotic stresses profoundly perturb transcript isoform profiles and trigger widespread intron retention (IR) events. Stress treatments often increased or decreased retention of specific introns – a phenomenon described here as differential intron retention (DIR). Many differentially retained introns were regulated in a stress- and/or tissue-specific manner. A subset of transcripts harboring super stress-responsive DIR events showed persisting fluctuations in the degree of IR across all treatments and tissue types. To investigate coordinated dynamics of intron-containing transcripts in the study we quantified absolute copy number of isoforms of two conserved transcription factors (TFs) using Droplet Digital PCR. This case study suggests that stress treatments can be associated with coordinated switches in relative ratios between fully spliced and intron-retaining isoforms and may play a role in adjusting transcriptome to abiotic stresses. PMID:29483921

  1. Perispeckles are major assembly sites for the exon junction core complex

    PubMed Central

    Daguenet, Elisabeth; Baguet, Aurélie; Degot, Sébastien; Schmidt, Ute; Alpy, Fabien; Wendling, Corinne; Spiegelhalter, Coralie; Kessler, Pascal; Rio, Marie-Christine; Le Hir, Hervé; Bertrand, Edouard; Tomasetto, Catherine

    2012-01-01

    The exon junction complex (EJC) is loaded onto mRNAs as a consequence of splicing and regulates multiple posttranscriptional events. MLN51, Magoh, Y14, and eIF4A3 form a highly stable EJC core, but where this tetrameric complex is assembled in the cell remains unclear. Here we show that EJC factors are enriched in domains that we term perispeckles and are visible as doughnuts around nuclear speckles. Fluorescence resonance energy transfer analyses and EJC assembly mutants show that perispeckles do not store free subunits, but instead are enriched for assembled cores. At the ultrastructural level, perispeckles are distinct from interchromatin granule clusters that may function as storage sites for splicing factors and intermingle with perichromatin fibrils, where nascent RNAs and active RNA Pol II are present. These results support a model in which perispeckles are major assembly sites for the tetrameric EJC core. This subnuclear territory thus represents an intermediate region important for mRNA maturation, between transcription sites and splicing factor reservoirs and assembly sites. PMID:22419818

  2. Alternative Splicing as a Target for Cancer Treatment.

    PubMed

    Martinez-Montiel, Nancy; Rosas-Murrieta, Nora Hilda; Anaya Ruiz, Maricruz; Monjaraz-Guzman, Eduardo; Martinez-Contreras, Rebeca

    2018-02-11

    Alternative splicing is a key determinant of gene expression in metazoans. During alternative splicing, non-coding sequences are removed to generate different mature messenger RNAs due to a combination of sequence elements and cellular factors that contribute to splicing regulation. A different combination of splicing sites, exonic or intronic sequences, mutually exclusive exons or retained introns could be selected during alternative splicing to generate different mature mRNAs that could in turn produce distinct protein products. Alternative splicing is the main source of protein diversity responsible for 90% of human gene expression, and it has recently become a hallmark of cancer with full potential as a prognostic and therapeutic tool. Currently, more than 15,000 alternative splicing events have been associated with different aspects of cancer biology, including cell proliferation and invasion, apoptosis resistance and susceptibility to different chemotherapeutic drugs. Here, we present well established and newly discovered splicing events that occur in different cancer-related genes, their modification by several approaches and the current status of key tools developed to target alternative splicing for diagnostic and therapeutic purposes.

  3. Recurrent chimeric RNAs enriched in human prostate cancer identified by deep sequencing

    PubMed Central

    Kannan, Kalpana; Wang, Liguo; Wang, Jianghua; Ittmann, Michael M.; Li, Wei; Yen, Laising

    2011-01-01

    Transcription-induced chimeric RNAs, possessing sequences from different genes, are expected to increase the proteomic diversity through chimeric proteins or altered regulation. Despite their importance, few studies have focused on chimeric RNAs, especially regarding their presence/roles in human cancers. By deep sequencing the transcriptome of 20 human prostate cancer and 10 matched benign prostate tissues, we obtained 1.3 billion sequence reads, which led to the identification of 2,369 chimeric RNA candidates. Chimeric RNAs occurred in significantly higher frequency in cancer than in matched benign samples. Experimental investigation of a selected set of 46 led to the confirmation of 32 chimeric RNAs, of which 27 were highly recurrent and previously undescribed in prostate cancer. Importantly, a subset of these chimeras was present in prostate cancer cell lines, but not detectable in primary human prostate epithelium cells, implying their associations with cancer. These chimeras contain discernable 5′ and 3′ splice sites at the RNA junction, indicating that their formation is mediated by splicing. Their presence is also largely independent of the expression of parental genes, suggesting that other factors are involved in their production and regulation. One chimera, TMEM79-SMG5, is highly differentially expressed in human cancer samples and therefore a potential biomarker. The prevalence of chimeric RNAs may allow the limited number of human genes to encode a substantially larger number of RNAs and proteins, forming an additional layer of cellular complexity. Together, our results suggest that chimeric RNAs are widespread, and increased chimeric RNA events could represent a unique class of molecular alteration in cancer. PMID:21571633

  4. Identification of a splicing enhancer in MLH1 using COMPARE a new assay for determination of relative RNA splicing efficiencies

    PubMed Central

    Xu, Dong-Qing; Mattox, William

    2006-01-01

    Exonic splicing enhancers (ESEs) are sequences that facilitate recognition of splice sites and prevent exon-skipping. Because ESEs are often embedded within protein-coding sequences, alterations in them can also often be interpreted as nonsense, missense or silent mutations. To correctly interpret exonic mutations and their roles in disease, it is important to develop strategies that identify ESE mutations. Potential ESEs can be found computationally in many exons but it has proven difficult to predict if a given mutation will have effects on splicing based on sequence alone. Here we describe a flexible in vitro method that can be used to functionally compare the effects of multiple sequence variants on ESE activity in a single in vitro splicing reaction. We have applied this method in parallel with conventional splicing assays to test for a splicing enhancer in exon 17 of the human MLH1 gene. Point mutations associated with hereditary nonpolyposis colorectal cancer (HNPCC) have previously been found to correlate with exon-skipping in both lymphocytes and tumors from patients. We show that sequences from this exon can replace an ESE from the mouse IgM gene to support RNA splicing in HeLa nuclear extracts. ESE activity was reduced by HNPCC point mutations in codon 659, indicating that their primary effect is on splicing. Surprisingly, the strongest enhancer function mapped to a different region of the exon upstream of this codon. Together our results indicate that HNPCC point mutations in codon 659 affect an auxiliary element that augments the enhancer function to ensure exon inclusion. PMID:16357104

  5. Large-scale identification and characterization of alternative splicing variants of human gene transcripts using 56 419 completely sequenced and manually annotated full-length cDNAs

    PubMed Central

    Takeda, Jun-ichi; Suzuki, Yutaka; Nakao, Mitsuteru; Barrero, Roberto A.; Koyanagi, Kanako O.; Jin, Lihua; Motono, Chie; Hata, Hiroko; Isogai, Takao; Nagai, Keiichi; Otsuki, Tetsuji; Kuryshev, Vladimir; Shionyu, Masafumi; Yura, Kei; Go, Mitiko; Thierry-Mieg, Jean; Thierry-Mieg, Danielle; Wiemann, Stefan; Nomura, Nobuo; Sugano, Sumio; Gojobori, Takashi; Imanishi, Tadashi

    2006-01-01

    We report the first genome-wide identification and characterization of alternative splicing in human gene transcripts based on analysis of the full-length cDNAs. Applying both manual and computational analyses for 56 419 completely sequenced and precisely annotated full-length cDNAs selected for the H-Invitational human transcriptome annotation meetings, we identified 6877 alternative splicing genes with 18 297 different alternative splicing variants. A total of 37 670 exons were involved in these alternative splicing events. The encoded protein sequences were affected in 6005 of the 6877 genes. Notably, alternative splicing affected protein motifs in 3015 genes, subcellular localizations in 2982 genes and transmembrane domains in 1348 genes. We also identified interesting patterns of alternative splicing, in which two distinct genes seemed to be bridged, nested or having overlapping protein coding sequences (CDSs) of different reading frames (multiple CDS). In these cases, completely unrelated proteins are encoded by a single locus. Genome-wide annotations of alternative splicing, relying on full-length cDNAs, should lay firm groundwork for exploring in detail the diversification of protein function, which is mediated by the fast expanding universe of alternative splicing variants. PMID:16914452

  6. Designing oligo libraries taking alternative splicing into account

    NASA Astrophysics Data System (ADS)

    Shoshan, Avi; Grebinskiy, Vladimir; Magen, Avner; Scolnicov, Ariel; Fink, Eyal; Lehavi, David; Wasserman, Alon

    2001-06-01

    We have designed sequences for DNA microarrays and oligo libraries, taking alternative splicing into account. Alternative splicing is a common phenomenon, occurring in more than 25% of the human genes. In many cases, different splice variants have different functions, are expressed in different tissues or may indicate different stages of disease. When designing sequences for DNA microarrays or oligo libraries, it is very important to take into account the sequence information of all the mRNA transcripts. Therefore, when a gene has more than one transcript (as a result of alternative splicing, alternative promoter sites or alternative poly-adenylation sites), it is very important to take all of them into account in the design. We have used the LEADS transcriptome prediction system to cluster and assemble the human sequences in GenBank and design optimal oligonucleotides for all the human genes with a known mRNA sequence based on the LEADS predictions.

  7. A Bioinformatics-Based Alternative mRNA Splicing Code that May Explain Some Disease Mutations Is Conserved in Animals.

    PubMed

    Qu, Wen; Cingolani, Pablo; Zeeberg, Barry R; Ruden, Douglas M

    2017-01-01

    Deep sequencing of cDNAs made from spliced mRNAs indicates that most coding genes in many animals and plants have pre-mRNA transcripts that are alternatively spliced. In pre-mRNAs, in addition to invariant exons that are present in almost all mature mRNA products, there are at least 6 additional types of exons, such as exons from alternative promoters or with alternative polyA sites, mutually exclusive exons, skipped exons, or exons with alternative 5' or 3' splice sites. Our bioinformatics-based hypothesis is that, in analogy to the genetic code, there is an "alternative-splicing code" in introns and flanking exon sequences that directs alternative splicing of many of the 36 types of introns. In humans, we identified 42 different consensus sequences that are each present in at least 100 human introns. 37 of the 42 top consensus sequences are significantly enriched or depleted in at least one of the 36 types of introns. We further supported our hypothesis by showing that 96 out of 96 analyzed human disease mutations that affect RNA splicing, and change alternative splicing from one class to another, can be partially explained by a mutation altering a consensus sequence from one type of intron to that of another type of intron. Some of the alternative splicing consensus sequences, and presumably their small-RNA or protein targets, are evolutionarily conserved from 50 plant to animal species. We also noticed that the set of introns within a gene usually shares the same splicing codes, thus arguing that one sub-type of spliceosome might process all (or most) of the introns in a given gene. Our work sheds new light on a possible mechanism for generating the tremendous diversity in protein structure by alternative splicing of pre-mRNAs.

  8. Conservation of CD44 exon v3 functional elements in mammals

    PubMed Central

    Vela, Elena; Hilari, Josep M; Delclaux, María; Fernández-Bellon, Hugo; Isamat, Marcos

    2008-01-01

    Background The human CD44 gene contains 10 variable exons (v1 to v10) that can be alternatively spliced to generate hundreds of different CD44 protein isoforms. Human CD44 variable exon v3 inclusion in the final mRNA depends on a multisite bipartite splicing enhancer located within the exon itself, which we have recently described, and provides the protein domain responsible for growth factor binding to CD44. Findings We have analyzed the sequence of CD44v3 in 95 mammalian species to report high conservation levels for both its splicing regulatory elements (the 3' splice site and the exonic splicing enhancer), and the functional glycosaminoglycan binding site coded by v3. We also report the functional expression of CD44v3 isoforms in peripheral blood cells of different mammalian taxa with both consensus and variant v3 sequences. Conclusion CD44v3 mammalian sequences maintain all functional splicing regulatory elements as well as the GAG binding site with the same relative positions and sequence identity previously described during alternative splicing of human CD44. The sequence within the GAG attachment site, which in turn contains the Y motif of the exonic splicing enhancer, is more conserved relative to the rest of the exon. Amplification of CD44v3 sequence from mammalian species but not from birds, fish or reptiles may lead to the classification of CD44v3 as an exclusively mammalian gene trait. PMID:18710510

  9. Widespread alternative and aberrant splicing revealed by lariat sequencing

    PubMed Central

    Stepankiw, Nicholas; Raghavan, Madhura; Fogarty, Elizabeth A.; Grimson, Andrew; Pleiss, Jeffrey A.

    2015-01-01

    Alternative splicing is an important and ancient feature of eukaryotic gene structure, the existence of which has likely facilitated eukaryotic proteome expansions. Here, we have used intron lariat sequencing to generate a comprehensive profile of splicing events in Schizosaccharomyces pombe, amongst the simplest organisms that possess mammalian-like splice site degeneracy. We reveal an unprecedented level of alternative splicing, including alternative splice site selection for over half of all annotated introns, hundreds of novel exon-skipping events, and thousands of novel introns. Moreover, the frequency of these events is far higher than previous estimates, with alternative splice sites on average activated at ∼3% the rate of canonical sites. Although a subset of alternative sites are conserved in related species, implying functional potential, the majority are not detectably conserved. Interestingly, the rate of aberrant splicing is inversely related to expression level, with lowly expressed genes more prone to erroneous splicing. Although we validate many events with RNAseq, the proportion of alternative splicing discovered with lariat sequencing is far greater, a difference we attribute to preferential decay of aberrantly spliced transcripts. Together, these data suggest the spliceosome possesses far lower fidelity than previously appreciated, highlighting the potential contributions of alternative splicing in generating novel gene structures. PMID:26261211

  10. Conservation and Sex-Specific Splicing of the transformer Gene in the Calliphorids Cochliomyia hominivorax, Cochliomyia macellaria and Lucilia sericata

    PubMed Central

    Li, Fang; Vensko, Steven P.; Belikoff, Esther J.; Scott, Maxwell J.

    2013-01-01

    Transformer (TRA) promotes female development in several dipteran species including the Australian sheep blowfly Lucilia cuprina, the Mediterranean fruit fly, housefly and Drosophila melanogaster. tra transcripts are sex-specifically spliced such that only the female form encodes full length functional protein. The presence of six predicted TRA/TRA2 binding sites in the sex-specific female intron of the L. cuprina gene suggested that tra splicing is auto-regulated as in medfly and housefly. With the aim of identifying conserved motifs that may play a role in tra sex-specific splicing, here we have isolated and characterized the tra gene from three additional blowfly species, L. sericata, Cochliomyia hominivorax and C. macellaria. The blowfly adult male and female transcripts differ in the choice of splice donor site in the first intron, with males using a site downstream of the site used in females. The tra genes all contain a single TRA/TRA2 site in the male exon and a cluster of four to five sites in the male intron. However, overall the sex-specific intron sequences are poorly conserved in closely related blowflies. The most conserved regions are around the exon/intron junctions, the 3′ end of the intron and near the cluster of TRA/TRA2 sites. We propose a model for sex specific regulation of tra splicing that incorporates the conserved features identified in this study. In L. sericata embryos, the male tra transcript was first detected at around the time of cellular blastoderm formation. RNAi experiments showed that tra is required for female development in L. sericata and C. macellaria. The isolation of the tra gene from the New World screwworm fly C. hominivorax, a major livestock pest, will facilitate the development of a “male-only” strain for genetic control programs. PMID:23409170

  11. Human Splicing Finder: an online bioinformatics tool to predict splicing signals.

    PubMed

    Desmet, François-Olivier; Hamroun, Dalil; Lalande, Marine; Collod-Béroud, Gwenaëlle; Claustres, Mireille; Béroud, Christophe

    2009-05-01

    Thousands of mutations are identified yearly. Although many directly affect protein expression, an increasing proportion of mutations is now believed to influence mRNA splicing. They mostly affect existing splice sites, but synonymous, non-synonymous or nonsense mutations can also create or disrupt splice sites or auxiliary cis-splicing sequences. To facilitate the analysis of the different mutations, we designed Human Splicing Finder (HSF), a tool to predict the effects of mutations on splicing signals or to identify splicing motifs in any human sequence. It contains all available matrices for auxiliary sequence prediction as well as new ones for binding sites of the 9G8 and Tra2-beta Serine-Arginine proteins and the hnRNP A1 ribonucleoprotein. We also developed new Position Weight Matrices to assess the strength of 5' and 3' splice sites and branch points. We evaluated HSF efficiency using a set of 83 intronic and 35 exonic mutations known to result in splicing defects. We showed that the mutation effect was correctly predicted in almost all cases. HSF could thus represent a valuable resource for research, diagnostic and therapeutic (e.g. therapeutic exon skipping) purposes as well as for global studies, such as the GEN2PHEN European Project or the Human Variome Project.
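
    The position weight matrix idea mentioned in this abstract can be sketched in a few lines. The example below is a minimal illustration, not the HSF matrices or scoring scheme: the five toy donor-site sequences, the uniform background, the pseudocount, and the function names are all assumptions chosen for the example.

```python
import math

BASES = "ACGT"


def build_pwm(sites, pseudocount=1.0, background=0.25):
    """Build a log-odds position weight matrix from equal-length aligned sites."""
    length = len(sites[0])
    if any(len(site) != length for site in sites):
        raise ValueError("all sites must have the same length")
    total = len(sites) + pseudocount * len(BASES)
    pwm = []
    for pos in range(length):
        column = [site[pos] for site in sites]
        pwm.append({
            base: math.log2((column.count(base) + pseudocount) / total / background)
            for base in BASES
        })
    return pwm


def score(pwm, seq):
    """Sum the per-position log-odds scores for a candidate site."""
    if len(seq) != len(pwm):
        raise ValueError("sequence length must match the matrix length")
    return sum(column[base] for column, base in zip(pwm, seq))


# Made-up 9-nt donor sites (3 exonic + 6 intronic bases), for illustration only.
TOY_DONOR_SITES = ["CAGGTAAGT", "AAGGTGAGT", "CAGGTAAGA", "GAGGTAAGT", "CAGGTGAGG"]

pwm = build_pwm(TOY_DONOR_SITES)
wild_type = score(pwm, "CAGGTAAGT")
mutant = score(pwm, "CAGATAAGT")  # G>A at the first intronic base
print(f"wild type: {wild_type:.2f}, mutant: {mutant:.2f}")
```

    In this toy matrix the mutant scores lower than the wild-type site because every training site has G at the first intronic position; a score drop of this kind is one signal that a variant may weaken a splice site.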

  12. Human Splicing Finder: an online bioinformatics tool to predict splicing signals

    PubMed Central

    Desmet, François-Olivier; Hamroun, Dalil; Lalande, Marine; Collod-Béroud, Gwenaëlle; Claustres, Mireille; Béroud, Christophe

    2009-01-01

    Thousands of mutations are identified yearly. Although many directly affect protein expression, an increasing proportion of mutations is now believed to influence mRNA splicing. They mostly affect existing splice sites, but synonymous, non-synonymous or nonsense mutations can also create or disrupt splice sites or auxiliary cis-splicing sequences. To facilitate the analysis of the different mutations, we designed Human Splicing Finder (HSF), a tool to predict the effects of mutations on splicing signals or to identify splicing motifs in any human sequence. It contains all available matrices for auxiliary sequence prediction as well as new ones for binding sites of the 9G8 and Tra2-β Serine-Arginine proteins and the hnRNP A1 ribonucleoprotein. We also developed new Position Weight Matrices to assess the strength of 5′ and 3′ splice sites and branch points. We evaluated HSF efficiency using a set of 83 intronic and 35 exonic mutations known to result in splicing defects. We showed that the mutation effect was correctly predicted in almost all cases. HSF could thus represent a valuable resource for research, diagnostic and therapeutic (e.g. therapeutic exon skipping) purposes as well as for global studies, such as the GEN2PHEN European Project or the Human Variome Project. PMID:19339519

  13. Can the HIV-1 splicing machinery be targeted for drug discovery?

    PubMed Central

    Dlamini, Zodwa; Hull, Rodney

    2017-01-01

    HIV-1 is able to express multiple protein types and isoforms from a single 9 kb mRNA transcript. These proteins are also expressed at particular stages of viral development, and this is achieved through the control of alternative splicing and the export of these transcripts from the nucleus. The nuclear export is controlled by the HIV protein Rev, which is required to transport incompletely spliced and partially spliced mRNA from the nucleus, where they are normally retained. This implies a close relationship between the control of alternative splicing and the nuclear export of mRNA in the control of HIV-1 viral proliferation. This review discusses both processes. The specificity and regulation of splicing in HIV-1 is controlled by the use of specific splice sites as well as exonic splicing enhancer and exonic splicing silencer sequences. The use of these silencer and enhancer sequences is dependent on the serine arginine family of proteins as well as the heterogeneous nuclear ribonucleoprotein family of proteins that bind to these sequences and increase or decrease splicing. Since alternative splicing is such a critical factor in viral development, it presents itself as a promising drug target. This review aims to discuss the inhibition of splicing, which would stall viral development, as an anti-HIV therapeutic strategy. In this review, the most recent knowledge of splicing in human immunodeficiency viral development and the latest therapeutic strategies targeting human immunodeficiency viral splicing are discussed. PMID:28331370

  14. Medical Sequencing at the extremes of Human Body Mass

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ahituv, Nadav; Kavaslar, Nihan; Schackwitz, Wendy

    2006-09-01

    Body weight is a quantitative trait with significant heritability in humans. To identify potential genetic contributors to this phenotype, we resequenced the coding exons and splice junctions of 58 genes in 379 obese and 378 lean individuals. Our 96 Mb survey included 21 genes associated with monogenic forms of obesity in humans or mice, as well as 37 genes that function in body weight-related pathways. We found that the monogenic obesity-associated gene group was enriched for rare nonsynonymous variants unique to the obese (n=46) versus lean (n=26) populations. Computational analysis further predicted a significantly greater fraction of deleterious variants within the obese cohort. Consistent with the complex inheritance of body weight, we did not observe obvious familial segregation in the majority of the 28 available kindreds. Taken together, these data suggest that multiple rare alleles with variable penetrance contribute to obesity in the population and provide a deep medical sequencing based approach to detect them.

  15. SplicePlot: a utility for visualizing splicing quantitative trait loci.

    PubMed

    Wu, Eric; Nance, Tracy; Montgomery, Stephen B

    2014-04-01

    RNA sequencing has provided unprecedented resolution of alternative splicing and splicing quantitative trait loci (sQTL). However, there are few tools available for visualizing the genotype-dependent effects of splicing at a population level. SplicePlot is a simple command line utility that produces intuitive visualization of sQTLs and their effects. SplicePlot takes mapped RNA sequencing reads in BAM format and genotype data in VCF format as input and outputs publication-quality Sashimi plots, hive plots and structure plots, enabling better investigation and understanding of the role of genetics on alternative splicing and transcript structure. Source code and detailed documentation are available at http://montgomerylab.stanford.edu/spliceplot/index.html under Resources and at Github. SplicePlot is implemented in Python and is supported on Linux and Mac OS. A VirtualBox virtual machine running Ubuntu with SplicePlot already installed is also available.

  16. iCLIP Predicts the Dual Splicing Effects of TIA-RNA Interactions

    PubMed Central

    Briese, Michael; Zarnack, Kathi; Luscombe, Nicholas M.; Rot, Gregor; Zupan, Blaž; Curk, Tomaž; Ule, Jernej

    2010-01-01

    The regulation of alternative splicing involves interactions between RNA-binding proteins and pre-mRNA positions close to the splice sites. T-cell intracellular antigen 1 (TIA1) and TIA1-like 1 (TIAL1) locally enhance exon inclusion by recruiting U1 snRNP to 5′ splice sites. However, effects of TIA proteins on splicing of distal exons have not yet been explored. We used UV-crosslinking and immunoprecipitation (iCLIP) to find that TIA1 and TIAL1 bind at the same positions on human RNAs. Binding downstream of 5′ splice sites was used to predict the effects of TIA proteins in enhancing inclusion of proximal exons and silencing inclusion of distal exons. The predictions were validated in an unbiased manner using splice-junction microarrays, RT-PCR, and minigene constructs, which showed that TIA proteins maintain splicing fidelity and regulate alternative splicing by binding exclusively downstream of 5′ splice sites. Surprisingly, TIA binding at 5′ splice sites silenced distal cassette and variable-length exons without binding in proximity to the regulated alternative 3′ splice sites. Using transcriptome-wide high-resolution mapping of TIA-RNA interactions we evaluated the distal splicing effects of TIA proteins. These data are consistent with a model where TIA proteins shorten the time available for definition of an alternative exon by enhancing recognition of the preceding 5′ splice site. Thus, our findings indicate that changes in splicing kinetics could mediate the distal regulation of alternative splicing. PMID:21048981

  17. Nucleotide sequence of the gag gene and gag-pol junction of feline leukemia virus.

    PubMed Central

    Laprevotte, I; Hampe, A; Sherr, C J; Galibert, F

    1984-01-01

    The nucleotide sequence of the gag gene of feline leukemia virus and its flanking sequences were determined and compared with the corresponding sequences of two strains of feline sarcoma virus and with that of the Moloney strain of murine leukemia virus. A high degree of nucleotide sequence homology between the feline leukemia virus and murine leukemia virus gag genes was observed, suggesting that retroviruses of domestic cats and laboratory mice have a common, proximal evolutionary progenitor. The predicted structure of the complete feline leukemia virus gag gene precursor suggests that the translation of nonglycosylated and glycosylated gag gene polypeptides is initiated at two different AUG codons. These initiator codons fall in the same reading frame and are separated by a 222-base-pair segment which encodes an amino terminal signal peptide. The nucleotide sequence predicts the order of amino acids in each of the individual gag-coded proteins (p15, p12, p30, p10), all of which derive from the gag gene precursor. Stable stem-and-loop secondary structures are proposed for two regions of viral RNA. The first falls within sequences at the 5' end of the viral genome, together with adjacent palindromic sequences which may play a role in dimer linkage of RNA subunits. The second includes coding sequences at the gag-pol junction and is proposed to be involved in translation of the pol gene product. Sequence analysis of the latter region shows that the gag and pol genes are translated in different reading frames. Classical consensus splice donor and acceptor sequences could not be localized to regions which would permit synthesis of the expected gag-pol precursor protein. Alternatively, we suggest that the pol gene product (RNA-dependent DNA polymerase) could be translated by a frameshift suppressing mechanism which could involve cleavage modification of stems and loops in a manner similar to that observed in tRNA processing. PMID:6328019
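
    The reading-frame statement in this abstract can be checked with simple arithmetic. The sketch below assumes that the 222-base-pair segment is the full offset between the two AUG initiator codons; that reading of the abstract, and the function name, are assumptions made for illustration.

```python
def same_reading_frame(offset_nt: int) -> bool:
    """Two start codons share a frame when their offset is a multiple of 3."""
    return offset_nt % 3 == 0


SEGMENT_BP = 222  # length of the segment between the two AUG codons, from the abstract

in_frame = same_reading_frame(SEGMENT_BP)
codons_between = SEGMENT_BP // 3  # whole codons contained in the segment

print(f"in frame: {in_frame}, codons in segment: {codons_between}")
```

    Because 222 is divisible by 3, the two initiators fall in the same frame, and the segment corresponds to 74 codons of signal peptide sequence. By contrast, an offset that is not a multiple of 3 would place the downstream sequence in a different frame, as the abstract reports for gag and pol.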

  18. Lariat sequencing in a unicellular yeast identifies regulated alternative splicing of exons that are evolutionarily conserved with humans.

    PubMed

    Awan, Ali R; Manfredo, Amanda; Pleiss, Jeffrey A

    2013-07-30

    Alternative splicing is a potent regulator of gene expression that vastly increases proteomic diversity in multicellular eukaryotes and is associated with organismal complexity. Although alternative splicing is widespread in vertebrates, little is known about the evolutionary origins of this process, in part because of the absence of phylogenetically conserved events that cross major eukaryotic clades. Here we describe a lariat-sequencing approach, which offers high sensitivity for detecting splicing events, and its application to the unicellular fungus, Schizosaccharomyces pombe, an organism that shares many of the hallmarks of alternative splicing in mammalian systems but for which no previous examples of exon-skipping had been demonstrated. Over 200 previously unannotated splicing events were identified, including examples of regulated alternative splicing. Remarkably, an evolutionary analysis of four of the exons identified here as subject to skipping in S. pombe reveals high sequence conservation and perfect length conservation with their homologs in scores of plants, animals, and fungi. Moreover, alternative splicing of two of these exons has been documented in multiple vertebrate organisms, making these the first demonstrations of identical alternative-splicing patterns in species that are separated by over 1 billion years of evolution.

  19. Base pairing between the 3' exon and an internal guide sequence increases 3' splice site specificity in the Tetrahymena self-splicing rRNA intron.

    PubMed Central

    Suh, E R; Waring, R B

    1990-01-01

    It has been proposed that recognition of the 3' splice site in many group I introns involves base pairing between the start of the 3' exon and a region of the intron known as the internal guide sequence (R. W. Davies, R. B. Waring, J. Ray, T. A. Brown, and C. Scazzocchio, Nature [London] 300:719-724, 1982). We have examined this hypothesis, using the self-splicing rRNA intron from Tetrahymena thermophila. Mutations in the 3' exon that weaken this proposed pairing increased use of a downstream cryptic 3' splice site. Compensatory mutations in the guide sequence that restore this pairing resulted in even stronger selection of the normal 3' splice site. These changes in 3' splice site usage were more pronounced in the background of a mutation (414A) which resulted in an adenine instead of a guanine being the last base of the intron. These results show that the proposed pairing (P10) plays an important role in ensuring that cryptic 3' splice sites are selected against. Surprisingly, the 414A mutation alone did not result in activation of the cryptic 3' splice site. PMID:2342465

  20. Splicing predictions reliably classify different types of alternative splicing

    PubMed Central

    Busch, Anke; Hertel, Klemens J.

    2015-01-01

    Alternative splicing is a key player in the creation of complex mammalian transcriptomes and its misregulation is associated with many human diseases. Multiple mRNA isoforms are generated from most human genes, a process mediated by the interplay of various RNA signature elements and trans-acting factors that guide spliceosomal assembly and intron removal. Here, we introduce a splicing predictor that evaluates hundreds of RNA features simultaneously to successfully differentiate between exons that are constitutively spliced, exons that undergo alternative 5′ or 3′ splice-site selection, and alternative cassette-type exons. Surprisingly, the splicing predictor did not feature strong discriminatory contributions from binding sites for known splicing regulators. Rather, the ability of an exon to be involved in one or multiple types of alternative splicing is dictated by its immediate sequence context, mainly driven by the identity of the exon's splice sites, the conservation around them, and its exon/intron architecture. Thus, the splicing behavior of human exons can be reliably predicted based on basic RNA sequence elements. PMID:25805853

  1. The power of fission: yeast as a tool for understanding complex splicing.

    PubMed

    Fair, Benjamin Jung; Pleiss, Jeffrey A

    2017-06-01

    Pre-mRNA splicing is an essential component of eukaryotic gene expression. Many metazoans, including humans, regulate alternative splicing patterns to generate expansions of their proteome from a limited number of genes. Importantly, a considerable fraction of human disease causing mutations manifest themselves through altering the sequences that shape the splicing patterns of genes. Thus, understanding the mechanistic bases of this complex pathway will be an essential component of combating these diseases. Dating almost to the initial discovery of splicing, researchers have taken advantage of the genetic tractability of budding yeast to identify the components and decipher the mechanisms of splicing. However, budding yeast lacks the complex splicing machinery and alternative splicing patterns most relevant to humans. More recently, many researchers have turned their efforts to study the fission yeast, Schizosaccharomyces pombe, which has retained many features of complex splicing, including degenerate splice site sequences, the usage of exonic splicing enhancers, and SR proteins. Here, we review recent work using fission yeast genetics to examine pre-mRNA splicing, highlighting its promise for modeling the complex splicing seen in higher eukaryotes.

  2. Effects of mutations in the human uncoupling protein 3 gene on the respiratory quotient and fat oxidation in severe obesity and type 2 diabetes.

    PubMed Central

    Argyropoulos, G; Brown, A M; Willi, S M; Zhu, J; He, Y; Reitman, M; Gevao, S M; Spruill, I; Garvey, W T

    1998-01-01

    Human uncoupling protein 3 (UCP3) is a mitochondrial transmembrane carrier that uncouples oxidative ATP phosphorylation. With the capacity to participate in thermogenesis and energy balance, UCP3 is an important obesity candidate gene. A missense polymorphism in exon 3 (V102I) was identified in an obese and diabetic proband. A mutation introducing a stop codon in exon 4 (R143X) and a terminal polymorphism in the splice donor junction of exon 6 were also identified in a compound heterozygote that was morbidly obese and diabetic. Allele frequencies of the exon 3 and exon 6 splice junction polymorphisms were determined and found to be similar in Gullah-speaking African Americans and the Mende tribe of Sierra Leone, but absent in Caucasians. Moreover, in exon 6-splice donor heterozygotes, basal fat oxidation rates were reduced by 50%, and the respiratory quotient was markedly increased compared with wild-type individuals, implicating a role for UCP3 in metabolic fuel partitioning. PMID:9769326

  3. A 5′ Splice Site-Proximal Enhancer Binds SF1 and Activates Exon Bridging of a Microexon

    PubMed Central

    Carlo, Troy; Sierra, Rebecca; Berget, Susan M.

    2000-01-01

    Internal exon size in vertebrates occurs over a narrow size range. Experimentally, exons shorter than 50 nucleotides are poorly included in mRNA unless accompanied by strengthened splice sites or accessory sequences that act as splicing enhancers, suggesting steric interference between snRNPs and other splicing factors binding simultaneously to the 3′ and 5′ splice sites of microexons. Despite these problems, very small naturally occurring exons exist. Here we studied the factors and mechanism involved in recognizing a constitutively included six-nucleotide exon from the cardiac troponin T gene. Inclusion of this exon is dependent on an enhancer located downstream of the 5′ splice site. This enhancer contains six copies of the simple sequence GGGGCUG. The enhancer activates heterologous microexons and will work when located either upstream or downstream of the target exon, suggesting an ability to bind factors that bridge splicing units. A single copy of this sequence is sufficient for in vivo exon inclusion and is the binding site for the known bridging mammalian splicing factor 1 (SF1). The enhancer and its bound SF1 act to increase recognition of the upstream exon during exon definition, such that competition of in vitro reactions with RNAs containing the GGGGCUG repeated sequence depress splicing of the upstream intron, assembly of the spliceosome on the 3′ splice site of the exon, and cross-linking of SF1. These results suggest a model in which SF1 bridges the small exon during initial assembly, thereby effectively extending the domain of the exon. PMID:10805741

  4. Systematic Analysis of Splice-Site-Creating Mutations in Cancer

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Jayasinghe, Reyka G.; Cao, Song; Gao, Qingsong

    For the past decade, cancer genomic studies have focused on mutations leading to splice-site disruption, overlooking those having splice-creating potential. Here, we applied a bioinformatic tool, MiSplice, for the large-scale discovery of splice-site-creating mutations (SCMs) across 8,656 TCGA tumors. We report 1,964 originally mis-annotated mutations having clear evidence of creating alternative splice junctions. TP53 and GATA3 have 26 and 18 SCMs, respectively, and ATRX has 5 from lower-grade gliomas. Mutations in 11 genes, including PARP1, BRCA1, and BAP1, were experimentally validated for splice-site-creating function. Notably, we found that neoantigens induced by SCMs are likely several folds more immunogenic compared to missense mutations, exemplified by the recurrent GATA3 SCM. Further, high expression of PD-1 and PD-L1 was observed in tumors with SCMs, suggesting candidates for immune blockade therapy. Finally, our work highlights the importance of integrating DNA and RNA data for understanding the functional and the clinical implications of mutations in human diseases.

  5. Systematic Analysis of Splice-Site-Creating Mutations in Cancer.

    PubMed

    Jayasinghe, Reyka G; Cao, Song; Gao, Qingsong; Wendl, Michael C; Vo, Nam Sy; Reynolds, Sheila M; Zhao, Yanyan; Climente-González, Héctor; Chai, Shengjie; Wang, Fang; Varghese, Rajees; Huang, Mo; Liang, Wen-Wei; Wyczalkowski, Matthew A; Sengupta, Sohini; Li, Zhi; Payne, Samuel H; Fenyö, David; Miner, Jeffrey H; Walter, Matthew J; Vincent, Benjamin; Eyras, Eduardo; Chen, Ken; Shmulevich, Ilya; Chen, Feng; Ding, Li

    2018-04-03

    For the past decade, cancer genomic studies have focused on mutations leading to splice-site disruption, overlooking those having splice-creating potential. Here, we applied a bioinformatic tool, MiSplice, for the large-scale discovery of splice-site-creating mutations (SCMs) across 8,656 TCGA tumors. We report 1,964 originally mis-annotated mutations having clear evidence of creating alternative splice junctions. TP53 and GATA3 have 26 and 18 SCMs, respectively, and ATRX has 5 from lower-grade gliomas. Mutations in 11 genes, including PARP1, BRCA1, and BAP1, were experimentally validated for splice-site-creating function. Notably, we found that neoantigens induced by SCMs are likely several folds more immunogenic compared to missense mutations, exemplified by the recurrent GATA3 SCM. Further, high expression of PD-1 and PD-L1 was observed in tumors with SCMs, suggesting candidates for immune blockade therapy. Our work highlights the importance of integrating DNA and RNA data for understanding the functional and the clinical implications of mutations in human diseases. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.

  6. Systematic Analysis of Splice-Site-Creating Mutations in Cancer

    DOE PAGES

    Jayasinghe, Reyka G.; Cao, Song; Gao, Qingsong; ...

    2018-04-05

    For the past decade, cancer genomic studies have focused on mutations leading to splice-site disruption, overlooking those having splice-creating potential. Here, we applied a bioinformatic tool, MiSplice, for the large-scale discovery of splice-site-creating mutations (SCMs) across 8,656 TCGA tumors. We report 1,964 originally mis-annotated mutations having clear evidence of creating alternative splice junctions. TP53 and GATA3 have 26 and 18 SCMs, respectively, and ATRX has 5 from lower-grade gliomas. Mutations in 11 genes, including PARP1, BRCA1, and BAP1, were experimentally validated for splice-site-creating function. Notably, we found that neoantigens induced by SCMs are likely several folds more immunogenic compared to missense mutations, exemplified by the recurrent GATA3 SCM. Further, high expression of PD-1 and PD-L1 was observed in tumors with SCMs, suggesting candidates for immune blockade therapy. Finally, our work highlights the importance of integrating DNA and RNA data for understanding the functional and the clinical implications of mutations in human diseases.

  7. Diverse growth hormone receptor gene mutations in Laron syndrome.

    PubMed Central

    Berg, M A; Argente, J; Chernausek, S; Gracia, R; Guevara-Aguirre, J; Hopp, M; Pérez-Jurado, L; Rosenbloom, A; Toledo, S P; Francke, U

    1993-01-01

    To better understand the molecular genetic basis and genetic epidemiology of Laron syndrome (growth-hormone insensitivity syndrome), we analyzed the growth-hormone receptor (GHR) genes of seven unrelated affected individuals from the United States, South America, Europe, and Africa. We amplified all nine GHR gene exons and splice junctions from these individuals by PCR and screened the products for mutations by using denaturing gradient gel electrophoresis (DGGE). We identified a single GHR gene fragment with abnormal DGGE results for each affected individual, sequenced this fragment, and, in each case, identified a mutation likely to cause Laron syndrome, including two nonsense mutations (R43X and R217X), two splice-junction mutations (189-1 G to T and 71+1 G to A), and two frameshift mutations (46 del TT and 230 del TA or AT). Only one of these mutations, R43X, has been previously reported. Using haplotype analysis, we determined that this mutation, which involves a CpG dinucleotide hot spot, likely arose as a separate event in this case, relative to the two prior reports of R43X. Aside from R43X, the mutations we identified are unique to patients from particular geographic regions. Ten GHR gene mutations have now been described in this disorder. We conclude that Laron syndrome is caused by diverse GHR gene mutations, including deletions, RNA processing defects, translational stop codons, and missense codons. All the identified mutations involve the extracellular domain of the receptor, and most are unique to particular families or geographic areas. PMID:8488849

  8. Genomic organization of the human heparan sulfate-N-deacetylase/N-sulfotransferase gene: Exclusion from a causative role in the pathogenesis of Treacher Collins syndrome

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Gladwin, A.J.; Dixon, J.; Loftus, S.K.

    Heparan sulfate-N-deacetylase/N-sulfotransferase (HSST) catalyzes both the N-deacetylation and the N-sulfation of heparan sulfate. Previous studies have resulted in the isolation of the human HSST gene from within the Treacher Collins syndrome locus (TCOF1) critical region on 5q. In the present study, the genomic organization of the HSST gene has been elucidated, and the 14 exons identified have been tested for TCOF1-specific mutations. As a result of these studies, mutations within the coding sequence and adjacent splice junctions of HSST can be excluded from a causative role in the pathogenesis of Treacher Collins syndrome. 13 refs., 1 fig., 2 tabs.

  9. Human Splice-Site Prediction with Deep Neural Networks.

    PubMed

    Naito, Tatsuhiko

    2018-04-18

    Accurate splice-site prediction is essential to delineate gene structures from sequence data. Several computational techniques have been applied to create a system to predict canonical splice sites. For classification tasks, deep neural networks (DNNs) have achieved record-breaking results and often outperformed other supervised learning techniques. In this study, a new method of splice-site prediction using DNNs was proposed. The proposed system receives input sequence data and returns an answer as to whether it is a splice site. The length of the input is 140 nucleotides, with the consensus sequence (i.e., "GT" and "AG" for the donor and acceptor sites, respectively) in the middle. Each input sequence is applied to the pretrained DNN model, which determines the probability that the input is a splice site. The model consists of convolutional layers and bidirectional long short-term memory network layers. The pretraining and validation were conducted using the data set tested in previously reported methods. The performance evaluation results showed that the proposed method can outperform the previous methods. In addition, the pattern learned by the DNNs was visualized as position frequency matrices (PFMs). Some of the PFMs were very similar to the consensus sequence. The trained DNN model and the brief source code for the prediction system are uploaded. Further improvement will be achieved following the further development of DNNs.
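
    The input layout described in the abstract above (a 140-nucleotide window with the consensus dinucleotide in the middle) can be sketched as follows. This is a minimal illustration, not the author's code: the exact offset of the dinucleotide (69 flanking bases on each side), the function names `candidate_windows` and `one_hot`, and the one-hot channel order A, C, G, T are all assumptions made for the example.

```python
WINDOW = 140                 # input length stated in the abstract
FLANK = (WINDOW - 2) // 2    # assumed: 69 bases on each side of the dinucleotide


def candidate_windows(seq, motif="GT"):
    """Return (position, window) pairs for every motif occurrence that has
    FLANK bases available on both sides. Use motif="AG" for acceptor sites."""
    seq = seq.upper()
    windows = []
    pos = seq.find(motif, FLANK)              # first hit with enough left flank
    while pos != -1 and pos + 2 + FLANK <= len(seq):
        windows.append((pos, seq[pos - FLANK:pos + 2 + FLANK]))
        pos = seq.find(motif, pos + 1)
    return windows


def one_hot(window):
    """Encode a window as a list of 4-tuples (A, C, G, T); unknown bases
    such as N become all zeros (an assumed convention)."""
    table = {"A": (1, 0, 0, 0), "C": (0, 1, 0, 0),
             "G": (0, 0, 1, 0), "T": (0, 0, 0, 1)}
    return [table.get(base, (0, 0, 0, 0)) for base in window]
```

    Each encoded window would then be passed to a classifier; the network itself (convolutional plus bidirectional LSTM layers, per the abstract) is outside the scope of this sketch.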

  10. Tools to covisualize and coanalyze proteomic data with genomes and transcriptomes: validation of genes and alternative mRNA splicing.

    PubMed

    Pang, Chi Nam Ignatius; Tay, Aidan P; Aya, Carlos; Twine, Natalie A; Harkness, Linda; Hart-Smith, Gene; Chia, Samantha Z; Chen, Zhiliang; Deshpande, Nandan P; Kaakoush, Nadeem O; Mitchell, Hazel M; Kassem, Moustapha; Wilkins, Marc R

    2014-01-03

    Direct links between proteomic and genomic/transcriptomic data are not frequently made, partly because of lack of appropriate bioinformatics tools. To help address this, we have developed the PG Nexus pipeline. The PG Nexus allows users to covisualize peptides in the context of genomes or genomic contigs, along with RNA-seq reads. This is done in the Integrated Genome Viewer (IGV). A Results Analyzer reports the precise base position where LC-MS/MS-derived peptides cover genes or gene isoforms, on the chromosomes or contigs where this occurs. In prokaryotes, the PG Nexus pipeline facilitates the validation of genes, where annotation or gene prediction is available, or the discovery of genes using a "virtual protein"-based unbiased approach. We illustrate this with a comprehensive proteogenomics analysis of two strains of Campylobacter concisus. For higher eukaryotes, the PG Nexus facilitates gene validation and supports the identification of mRNA splice junction boundaries and splice variants that are protein-coding. This is illustrated with an analysis of splice junctions covered by human phosphopeptides, and other examples of relevance to the Chromosome-Centric Human Proteome Project. The PG Nexus is open-source and available from https://github.com/IntersectAustralia/ap11_Samifier. It has been integrated into Galaxy and made available in the Galaxy tool shed.

  11. Genome-wide mapping of alternative splicing in Arabidopsis thaliana

    PubMed Central

    Filichkin, Sergei A.; Priest, Henry D.; Givan, Scott A.; Shen, Rongkun; Bryant, Douglas W.; Fox, Samuel E.; Wong, Weng-Keen; Mockler, Todd C.

    2010-01-01

    Alternative splicing can enhance transcriptome plasticity and proteome diversity. In plants, alternative splicing can be manifested at different developmental stages, and is frequently associated with specific tissue types or environmental conditions such as abiotic stress. We mapped the Arabidopsis transcriptome at single-base resolution using the Illumina platform for ultrahigh-throughput RNA sequencing (RNA-seq). Deep transcriptome sequencing confirmed a majority of annotated introns and identified thousands of novel alternatively spliced mRNA isoforms. Our analysis suggests that at least ∼42% of intron-containing genes in Arabidopsis are alternatively spliced; this is significantly higher than previous estimates based on cDNA/expressed sequence tag sequencing. Random validation confirmed that novel splice isoforms empirically predicted by RNA-seq can be detected in vivo. Novel introns detected by RNA-seq were substantially enriched in nonconsensus terminal dinucleotide splice signals. Alternative isoforms with premature termination codons (PTCs) comprised the majority of alternatively spliced transcripts. Using an example of an essential circadian clock gene, we show that intron retention can generate relatively abundant PTC+ isoforms and that this specific event is highly conserved among diverse plant species. Alternatively spliced PTC+ isoforms can be potentially targeted for degradation by the nonsense mediated mRNA decay (NMD) surveillance machinery or regulate the level of functional transcripts by the mechanism of regulated unproductive splicing and translation (RUST). We demonstrate that the relative ratios of the PTC+ and reference isoforms for several key regulatory genes can be considerably shifted under abiotic stress treatments. Taken together, our results suggest that like in animals, NMD and RUST may be widespread in plants and may play important roles in regulating gene expression. PMID:19858364

  12. HSA: a heuristic splice alignment tool.

    PubMed

    Bu, Jingde; Chi, Xuebin; Jin, Zhong

    2013-01-01

    RNA-Seq is a revolutionary transcriptomics sequencing technology and a representative application of next-generation sequencing (NGS). With the high throughput of RNA-Seq, much more information, such as differential expression and novel splice variants, can be obtained through deep sequence analysis and data mining. However, the short read length poses a great challenge for alignment, especially when reads span two or more exons. This study presents a two-step heuristic splice alignment tool. First, raw reads are mapped to the reference with an unspliced aligner, BWA. Second, each initially unmapped read is split into three equal short reads (seeds); each seed is aligned to the reference, the hits are filtered, possible split positions of the read are searched, and the hits are extended to a complete match. Compared with other splice alignment tools such as SOAPsplice and TopHat2, HSA performs better in call rate and efficiency, but its results are somewhat less accurate. HSA is an effective spliced aligner for RNA-Seq read mapping and is available at https://github.com/vlcc/HSA.
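
    The seed-splitting step of the second stage can be sketched as below. This is an illustrative sketch only, not HSA's implementation: the function name `split_into_seeds` is hypothetical, and the handling of read lengths that are not divisible by three (the leftover bases go to the leftmost seeds) is an assumption, since the abstract only says the read is split into three equal seeds.

```python
def split_into_seeds(read, n_seeds=3):
    """Split a read into n_seeds contiguous, near-equal seeds.

    Returns a list of (offset, seed) pairs so that a seed hit on the
    reference can later be related back to its position within the read.
    """
    if n_seeds <= 0 or len(read) < n_seeds:
        raise ValueError("read is too short to split into the requested seeds")
    base, extra = divmod(len(read), n_seeds)
    seeds = []
    start = 0
    for i in range(n_seeds):
        length = base + (1 if i < extra else 0)   # spread the remainder leftwards
        seeds.append((start, read[start:start + length]))
        start += length
    return seeds
```

    Keeping the offset with each seed is a design choice for the example: once seeds are aligned independently, the offsets are what allow hits to be extended and a candidate split position inside the read to be searched.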

  13. Spliced leader RNA of trypanosomes: in vivo mutational analysis reveals extensive and distinct requirements for trans splicing and cap4 formation.

    PubMed Central

    Lücke, S; Xu, G L; Palfi, Z; Cross, M; Bellofatto, V; Bindereif, A

    1996-01-01

    In trypanosomes mRNAs are generated through trans splicing. The spliced leader (SL) RNA, which donates the 5'-terminal mini-exon to each of the protein coding exons, plays a central role in the trans splicing process. We have established in vivo assays to study in detail trans splicing, cap4 modification, and RNP assembly of the SL RNA in the trypanosomatid species Leptomonas seymouri. First, we found that extensive sequences within the mini-exon are required for SL RNA function in vivo, although a conserved length of 39 nt is not essential. In contrast, the intron sequence appears to be surprisingly tolerant to mutation; only the stem-loop II structure is indispensable. The asymmetry of the sequence requirements in the stem I region suggests that this domain may exist in different functional conformations. Second, distinct mini-exon sequences outside the modification site are important for efficient cap4 formation. Third, all SL RNA mutations tested allowed core RNP assembly, suggesting flexible requirements for core protein binding. In sum, the results of our mutational analysis provide evidence for a discrete domain structure of the SL RNA and help to explain the strong phylogenetic conservation of the mini-exon sequence and of the overall SL RNA secondary structure; they also suggest that there may be certain differences between trans splicing in nematodes and trypanosomes. This approach provides a basis for studying RNA-RNA interactions in the trans spliceosome. PMID:8861965

  14. PRP5: a helicase-like protein required for mRNA splicing in yeast.

    PubMed Central

    Dalbadie-McFarland, G; Abelson, J

    1990-01-01

    A 96-kDa protein predicted by the DNA sequence of the Saccharomyces cerevisiae PRP5 gene contains a domain that bears a striking resemblance to a family of RNA helicases characterized by the conserved amino acid sequence Asp-Glu-Ala-Asp (D-E-A-D). Previous work indicated that the product of the PRP5 gene is required for splicing and that spliceosome assembly does not occur in its absence. However, its precise role in splicing and the nature of its biochemical activity remained unknown. To examine the role of PRP5 in splicing, we cloned the gene by complementation of a temperature-sensitive mutation and determined its DNA sequence. We discuss here the possible roles for an RNA helicase in splicing and for the activity of the PRP5 protein. PMID:2349233

  15. Rail-RNA: scalable analysis of RNA-seq splicing and coverage.

    PubMed

    Nellore, Abhinav; Collado-Torres, Leonardo; Jaffe, Andrew E; Alquicira-Hernández, José; Wilks, Christopher; Pritt, Jacob; Morton, James; Leek, Jeffrey T; Langmead, Ben

    2017-12-15

    RNA sequencing (RNA-seq) experiments now span hundreds to thousands of samples. Current spliced alignment software is designed to analyze each sample separately. Consequently, no information is gained from analyzing multiple samples together, and it requires extra work to obtain analysis products that incorporate data from across samples. We describe Rail-RNA, a cloud-enabled spliced aligner that analyzes many samples at once. Rail-RNA eliminates redundant work across samples, making it more efficient as samples are added. For many samples, Rail-RNA is more accurate than annotation-assisted aligners. We use Rail-RNA to align 667 RNA-seq samples from the GEUVADIS project on Amazon Web Services in under 16 h for US$0.91 per sample. Rail-RNA outputs alignments in SAM/BAM format; but it also outputs (i) base-level coverage bigWigs for each sample; (ii) coverage bigWigs encoding normalized mean and median coverages at each base across samples analyzed; and (iii) exon-exon splice junctions and indels (features) in columnar formats that juxtapose coverages in samples in which a given feature is found. Supplementary outputs are ready for use with downstream packages for reproducible statistical analysis. We use Rail-RNA to identify expressed regions in the GEUVADIS samples and show that both annotated and unannotated (novel) expressed regions exhibit consistent patterns of variation across populations and with respect to known confounding variables. Rail-RNA is open-source software available at http://rail.bio. Contact: anellore@gmail.com or langmea@cs.jhu.edu. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved.

  16. SpliceRover: Interpretable Convolutional Neural Networks for Improved Splice Site Prediction.

    PubMed

    Zuallaert, Jasper; Godin, Fréderic; Kim, Mijung; Soete, Arne; Saeys, Yvan; De Neve, Wesley

    2018-06-21

    During the last decade, improvements in high-throughput sequencing have generated a wealth of genomic data. Functionally interpreting these sequences and finding the biological signals that are hallmarks of gene function and regulation is currently mostly done using automated genome annotation platforms, which mainly rely on integrated machine learning frameworks to identify different functional sites of interest, including splice sites. Splicing is an essential step in the gene regulation process, and the correct identification of splice sites is a major cornerstone in a genome annotation system. In this paper, we present SpliceRover, a predictive deep learning approach that outperforms the state-of-the-art in splice site prediction. SpliceRover uses convolutional neural networks (CNNs), which have been shown to obtain cutting edge performance on a wide variety of prediction tasks. We adapted this approach to deal with genomic sequence inputs, and show it consistently outperforms already existing approaches, with relative improvements in prediction effectiveness of up to 80.9% when measured in terms of false discovery rate. However, a major criticism of CNNs concerns their "black box" nature, as mechanisms to obtain insight into their reasoning processes are limited. To facilitate interpretability of the SpliceRover models, we introduce an approach to visualize the biologically relevant information learnt. We show that our visualization approach is able to recover features known to be important for splice site prediction (binding motifs around the splice site, presence of polypyrimidine tracts and branch points), as well as reveal new features (e.g., several types of exclusion patterns near splice sites). SpliceRover is available as a web service. The prediction tool and instructions can be found at http://bioit2.irc.ugent.be/splicerover/. Supplementary materials are available at Bioinformatics online.
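
    The abstract above reports relative improvements measured in terms of false discovery rate (FDR). As a reading aid, the sketch below computes FDR as FP / (TP + FP) and the relative improvement of one predictor over a baseline. It is not taken from SpliceRover: the function names are hypothetical, and how the authors chose decision thresholds before counting true and false positives is not stated in the abstract.

```python
def false_discovery_rate(true_positives, false_positives):
    """FDR = FP / (TP + FP): the fraction of predicted sites that are wrong."""
    predicted = true_positives + false_positives
    if predicted == 0:
        raise ValueError("no positive predictions, FDR is undefined")
    return false_positives / predicted


def relative_improvement(baseline_fdr, new_fdr):
    """Relative reduction in FDR versus a baseline (0.5 means FDR was halved)."""
    if baseline_fdr == 0:
        raise ValueError("baseline FDR is zero, relative improvement is undefined")
    return (baseline_fdr - new_fdr) / baseline_fdr
```

    With hypothetical counts, a baseline with 90 true and 10 false positives has an FDR of 0.10; a model with 95 true and 5 false positives has an FDR of 0.05, which is a relative improvement of 50%.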

  17. Context-dependent control of alternative splicing by RNA-binding proteins

    PubMed Central

    Fu, Xiang-Dong; Ares, Manuel

    2015-01-01

    Sequence-specific RNA-binding proteins (RBPs) bind to pre-mRNA to control alternative splicing, but it is not yet possible to read the ‘splicing code’ that dictates splicing regulation on the basis of genome sequence. Each alternative splicing event is controlled by multiple RBPs, the combined action of which creates a distribution of alternatively spliced products in a given cell type. As each cell type expresses a distinct array of RBPs, the interpretation of regulatory information on a given RNA target is exceedingly dependent on the cell type. RBPs also control each other’s functions at many levels, including by mutual modulation of their binding activities on specific regulatory RNA elements. In this Review, we describe some of the emerging rules that govern the highly context-dependent and combinatorial nature of alternative splicing regulation. PMID:25112293

  18. Long Non-Coding RNA and Alternative Splicing Modulations in Parkinson's Leukocytes Identified by RNA Sequencing

    PubMed Central

    Soreq, Lilach; Guffanti, Alessandro; Salomonis, Nathan; Simchovitz, Alon; Israel, Zvi; Bergman, Hagai; Soreq, Hermona

    2014-01-01

    The continuously prolonged human lifespan is accompanied by an increase in the incidence of neurodegenerative diseases, calling for the development of inexpensive blood-based diagnostics. Analyzing blood cell transcripts by RNA-Seq is a robust means to identify novel biomarkers that is rapidly becoming commonplace. However, there is a lack of tools to discover novel exons, junctions and splicing events and to precisely and sensitively assess differential splicing through RNA-Seq data analysis and across RNA-Seq platforms. Here, we present a new and comprehensive computational workflow for whole-transcriptome RNA-Seq analysis, using an updated version of the software AltAnalyze, to identify both known and novel high-confidence alternative splicing events, and to integrate them with both protein-domains and microRNA binding annotations. We applied the novel workflow on RNA-Seq data from Parkinson's disease (PD) patients' leukocytes pre- and post- Deep Brain Stimulation (DBS) treatment and compared to healthy controls. Disease-mediated changes included decreased usage of alternative promoters and N-termini, 5′-end variations and mutually-exclusive exons. The PD regulated FUS and HNRNP A/B included prion-like domains regulated regions. We also present here a workflow to identify and analyze long non-coding RNAs (lncRNAs) via RNA-Seq data. We identified reduced lncRNA expression and selective PD-induced changes in 13 of over 6,000 detected leukocyte lncRNAs, four of which were inversely altered post-DBS. These included the U1 spliceosomal lncRNA and RP11-462G22.1, each entailing sequence complementarity to numerous microRNAs. Analysis of RNA-Seq from PD and unaffected control brains revealed over 7,000 brain-expressed lncRNAs, of which 3,495 were co-expressed in the leukocytes including U1, which showed both leukocyte and brain increases. Furthermore, qRT-PCR validations confirmed these co-increases in PD leukocytes and two brain regions, the amygdala and substantia-nigra, compared to controls. This novel workflow allows deep multi-level inspection of RNA-Seq datasets and provides a comprehensive new resource for understanding disease transcriptome modifications in PD and other neurodegenerative diseases. PMID:24651478

  19. Identification of Alternative Splicing and Fusion Transcripts in Non-Small Cell Lung Cancer by RNA Sequencing.

    PubMed

    Hong, Yoonki; Kim, Woo Jin; Bang, Chi Young; Lee, Jae Cheol; Oh, Yeon-Mok

    2016-04-01

    Lung cancer is the most common cause of cancer-related death. Alterations in gene sequence, structure, and expression have an important role in the pathogenesis of lung cancer. Fusion genes and alternative splicing of cancer-related genes have the potential to be oncogenic. In the current study, we performed RNA-sequencing (RNA-seq) to investigate potential fusion genes and alternative splicing in non-small cell lung cancer. RNA was isolated from lung tissues obtained from 86 subjects with lung cancer. The RNA samples from lung cancer and normal tissues were processed with RNA-seq using the HiSeq 2000 system. Fusion genes were evaluated using deFuse and ChimeraScan. Candidate fusion transcripts were validated by Sanger sequencing. Alternative splicing was analyzed using multivariate analysis of transcript sequencing and validated using quantitative real-time polymerase chain reaction. RNA-seq data identified oncogenic fusion genes EML4-ALK and SLC34A2-ROS1 in three of 86 normal-cancer paired samples. Nine distinct fusion transcripts were selected using deFuse and ChimeraScan, of which four were validated by Sanger sequencing. In 33 squamous cell carcinomas, 29 tumor-specific skipped exon events and six mutually exclusive exon events were identified. ITGB4 and PYCR1 were the top genes that showed significant tumor-specific splice variants. In conclusion, RNA-seq data identified novel potential fusion transcripts and splice variants. Further evaluation of their functional significance in the pathogenesis of lung cancer is required.

  20. Two novel mutations in the homogentisate-1,2-dioxygenase gene identified in Chinese Han Child with Alkaptonuria.

    PubMed

    Li, Hongying; Zhang, Kaihui; Xu, Qun; Ma, Lixia; Lv, Xin; Sun, Ruopeng

    2015-03-01

    Alkaptonuria (AKU) is an autosomal recessive disorder of tyrosine metabolism, which is caused by a defect in the enzyme homogentisate 1,2-dioxygenase (HGD) with subsequent accumulation of homogentisic acid. Presently, more than 100 HGD mutations have been identified as the cause of the inborn error of metabolism across different populations worldwide. However, the HGD mutation is very rarely reported in Asia, especially China. In this study, we present mutational analyses of HGD gene in one Chinese Han child with AKU, which had been identified by gas chromatography-mass spectrometry detection of organic acids in urine samples. PCR and DNA sequencing of the entire coding region as well as exon-intron boundaries of HGD have been performed. Two novel mutations were identified in the HGD gene in this AKU case, a frameshift mutation of c.115delG in exon 3 and the splicing mutation of IVS5+3 A>C, a donor splice site of the exon 5 and exon-intron junction. The identification of these mutations in this study further expands the spectrum of known HGD gene mutations and contributes to prenatal molecular diagnosis of AKU.

  1. The developmental transcriptome of Drosophila melanogaster

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    University of Connecticut; Graveley, Brenton R.; Brooks, Angela N.

    Drosophila melanogaster is one of the most well studied genetic model organisms; nonetheless, its genome still contains unannotated coding and non-coding genes, transcripts, exons and RNA editing sites. Full discovery and annotation are pre-requisites for understanding how the regulation of transcription, splicing and RNA editing directs the development of this complex organism. Here we used RNA-Seq, tiling microarrays and cDNA sequencing to explore the transcriptome in 30 distinct developmental stages. We identified 111,195 new elements, including thousands of genes, coding and non-coding transcripts, exons, splicing and editing events, and inferred protein isoforms that previously eluded discovery using established experimental, prediction and conservation-based approaches. These data substantially expand the number of known transcribed elements in the Drosophila genome and provide a high-resolution view of transcriptome dynamics throughout development. Drosophila melanogaster is an important non-mammalian model system that has had a critical role in basic biological discoveries, such as identifying chromosomes as the carriers of genetic information and uncovering the role of genes in development. Because it shares a substantial genic content with humans, Drosophila is increasingly used as a translational model for human development, homeostasis and disease. High-quality maps are needed for all functional genomic elements. Previous studies demonstrated that a rich collection of genes is deployed during the life cycle of the fly. Although expression profiling using microarrays has revealed the expression of ~13,000 annotated genes, it is difficult to map splice junctions and individual base modifications generated by RNA editing using such approaches. Single-base resolution is essential to define precisely the elements that comprise the Drosophila transcriptome.
Estimates of the number of transcript isoforms are less accurate than estimates of the number of genes. Whereas ~20% of Drosophila genes are annotated as encoding alternatively spliced pre-mRNAs, splice-junction microarray experiments indicate that this number is at least 40% (ref. 7). Determining the diversity of mRNAs generated by alternative promoters, alternative splicing and RNA editing will substantially increase the inferred protein repertoire. Non-coding RNA genes (ncRNAs) including short interfering RNAs (siRNAs) and microRNAs (miRNAs) (reviewed in ref. 10), and longer ncRNAs such as bxd (ref. 11) and rox (ref. 12), have important roles in gene regulation, whereas others such as small nucleolar RNAs (snoRNAs) and small nuclear RNAs (snRNAs) are important components of macromolecular machines such as the ribosome and spliceosome. The transcription and processing of these ncRNAs must also be fully documented and mapped. As part of the modENCODE project to annotate the functional elements of the D. melanogaster and Caenorhabditis elegans genomes, we used RNA-Seq and tiling microarrays to sample the Drosophila transcriptome at unprecedented depth throughout development from early embryo to ageing male and female adults. We report on a high-resolution view of the discovery, structure and dynamic expression of the D. melanogaster transcriptome.

  2. Definition of Proteasomal Peptide Splicing Rules for High-Efficiency Spliced Peptide Presentation by MHC Class I Molecules

    PubMed Central

    Berkers, Celia R.; de Jong, Annemieke; Schuurman, Karianne G.; Linnemann, Carsten; Meiring, Hugo D.; Janssen, Lennert; Neefjes, Jacques J.; Schumacher, Ton N. M.; Rodenko, Boris

    2015-01-01

    Peptide splicing, in which two distant parts of a protein are excised and then ligated to form a novel peptide, can generate unique MHC class I–restricted responses. Because these peptides are not genetically encoded and the rules behind proteasomal splicing are unknown, it is difficult to predict these spliced Ags. In the current study, small libraries of short peptides were used to identify amino acid sequences that affect the efficiency of this transpeptidation process. We observed that splicing does not occur at random, neither in terms of the amino acid sequences nor through random splicing of peptides from different sources. In contrast, splicing followed distinct rules that we deduced and validated both in vitro and in cells. Peptide ligation was quantified using a model peptide and demonstrated to occur with up to 30% ligation efficiency in vitro, provided that optimal structural requirements for ligation were met by both ligating partners. In addition, many splicing products could be formed from a single protein. Our splicing rules will facilitate prediction and detection of new spliced Ags to expand the peptidome presented by MHC class I Ags. PMID:26401003

  3. ASPIC: a novel method to predict the exon-intron structure of a gene that is optimally compatible to a set of transcript sequences.

    PubMed

    Bonizzoni, Paola; Rizzi, Raffaella; Pesole, Graziano

    2005-10-05

    Currently available methods to predict splice sites are mainly based on the independent and progressive alignment of transcript data (mostly ESTs) to the genomic sequence. Apart from often being computationally expensive, this approach is vulnerable to several problems--hence the need to develop novel strategies. We propose a method, based on a novel multiple genome-EST alignment algorithm, for the detection of splice sites. To avoid the limitations of splice site prediction (mainly over-prediction) that arise from independent single EST alignments to the genomic sequence, our approach performs a multiple alignment of transcript data to the genomic sequence based on the combined analysis of all available data. We recast the problem of predicting constitutive and alternative splicing as an optimization problem, where the optimal multiple transcript alignment minimizes the number of exons and hence of splice site observations. We have implemented a splice site predictor based on this algorithm in the software tool ASPIC (Alternative Splicing PredICtion). It is distinguished from other methods based on BLAST-like tools by the incorporation of entirely new ad hoc procedures for accurate and computationally efficient transcript alignment, and it adopts dynamic programming for the refinement of intron boundaries. ASPIC also provides the minimal set of non-mergeable transcript isoforms compatible with the detected splicing events. The ASPIC web resource is dynamically interconnected with the Ensembl and Unigene databases and also implements an upload facility. Extensive benchmarking shows that ASPIC outperforms other existing methods in the detection of novel splicing isoforms and in the minimization of over-predictions. ASPIC also requires a lower computation time for processing a single gene and an EST cluster. The ASPIC web resource is available at http://aspic.algo.disco.unimib.it/aspic-devel/.

  4. Spliced DNA Sequences in the Paramecium Germline: Their Properties and Evolutionary Potential

    PubMed Central

    Catania, Francesco; McGrath, Casey L.; Doak, Thomas G.; Lynch, Michael

    2013-01-01

    Despite playing a crucial role in germline-soma differentiation, the evolutionary significance of developmentally regulated genome rearrangements (DRGRs) has received scant attention. An example of DRGR is DNA splicing, a process that removes segments of DNA interrupting genic and/or intergenic sequences. Perhaps, best known for shaping immune-system genes in vertebrates, DNA splicing plays a central role in the life of ciliated protozoa, where thousands of germline DNA segments are eliminated after sexual reproduction to regenerate a functional somatic genome. Here, we identify and chronicle the properties of 5,286 sequences that putatively undergo DNA splicing (i.e., internal eliminated sequences [IESs]) across the genomes of three closely related species of the ciliate Paramecium (P. tetraurelia, P. biaurelia, and P. sexaurelia). The study reveals that these putative IESs share several physical characteristics. Although our results are consistent with excision events being largely conserved between species, episodes of differential IES retention/excision occur, may have a recent origin, and frequently involve coding regions. Our findings indicate interconversion between somatic—often coding—DNA sequences and noncoding IESs, and provide insights into the role of DNA splicing in creating potentially functional genetic innovation. PMID:23737328

  5. The intron 1 of HPV 16 has a suboptimal branch point at a guanosine.

    PubMed

    De la Rosa-Rios, Marco Antonio; Martínez-Salazar, Martha; Martínez-Garcia, Martha; González-Bonilla, César; Villegas-Sepúlveda, Nicolás

    2006-06-01

    The branch point sequence (BPS) of intron 1 of the HPV-16 was determined via RT-PCR in a cell free system, using lariat intermediates obtained by in vitro splicing reactions. We used synthetic E6/E7 transcripts and HeLa nuclear protein extracts to obtain the splicing intermediates. Then, a divergent oligonucleotide primer set, pairing on the lariat RNA that encompassed the 2'-5' phosphodiester bond formed between the 5' end of the intron and the BPS, was used for cDNA synthesis and PCR amplification. Subsequent RT-PCR assays revealed four splicing intermediates, made up of a major intermediary corresponding to the BPS and four cryptic branched sequences. Only intermediates bound at the 5' end of the intron are probably the authentic branch point sequence, and all of them branch at guanosine 328 instead of the typical adenosine. Unusually, the BPS of intron 1 of HPV-16 is a suboptimal sequence (AGUGAGU) that differs from the eukaryotic consensus BPS, which correlates with the splicing profile observed for early transcripts of HPV-16 in tumors and tumor derived cell lines. The implications of this unusual branch point sequence for splicing of the HPV-16 pre-mRNA are discussed.

  6. TSVdb: a web-tool for TCGA splicing variants analysis.

    PubMed

    Sun, Wenjie; Duan, Ting; Ye, Panmeng; Chen, Kelie; Zhang, Guanling; Lai, Maode; Zhang, Honghe

    2018-05-29

    Collaborative projects such as The Cancer Genome Atlas (TCGA) have generated various -omics and clinical data on cancer. Many computational tools have been developed to facilitate the study of the molecular characterization of tumors using data from the TCGA. Alternative splicing of a gene produces splicing variants, and accumulating evidence has revealed its essential role in cancer-related processes, implying the urgent need to discover tumor-specific isoforms and uncover their potential functions in tumorigenesis. We developed TSVdb, a web-based tool, to explore alternative splicing based on TCGA samples with 30 clinical variables from 33 tumors. TSVdb has an integrated and well-proportioned interface for visualization of the clinical data, gene expression, usage of exons/junctions and splicing patterns. Researchers can interpret the isoform expression variations between or across clinical subgroups and estimate the relationships between isoforms and patient prognosis. TSVdb is available at http://www.tsvdb.com, and the source code is available at https://github.com/wenjie1991/TSVdb. TSVdb will inspire oncologists and accelerate isoform-level advances in cancer research.

  7. Two splice variants of the bovine lactoferrin gene identified in Staphylococcus aureus isolated from mastitis in dairy cattle.

    PubMed

    Huang, J M; Wang, Z Y; Ju, Z H; Wang, C F; Li, Q L; Sun, T; Hou, Q L; Hang, S Q; Hou, M H; Zhong, J F

    2011-12-21

    Bovine lactoferrin (bLF) is a member of the transferrin family; it plays an important role in the innate immune response. We identified novel splice variants of the bLF gene in mastitis-infected and healthy cows. Reverse transcription-polymerase chain reaction (RT-PCR) and clone sequencing analysis were used to screen the splice variants of the bLF gene in the mammary gland, spleen and liver tissues. One main transcript corresponding to the bLF reference sequence was found in three tissues in both healthy and mastitis-infected cows. Quantitative real-time PCR analysis showed that the expression levels of the LF gene's main transcript were not significantly different in tissues from healthy versus mastitis-infected cows. However, the new splice variant, LF-AS2, which has the exon-skipping alternative splicing pattern, was only identified in mammary glands infected with Staphylococcus aureus. Sequencing analysis showed that the new splice variant was 251 bp in length, including exon 1, part of exon 2, part of exon 16, and exon 17. We conclude that bLF may play a role in resistance to mastitis through alternative splicing mechanisms.

  8. Identification of Alternative Splice Variants Using Unique Tryptic Peptide Sequences for Database Searches.

    PubMed

    Tran, Trung T; Bollineni, Ravi C; Strozynski, Margarita; Koehler, Christian J; Thiede, Bernd

    2017-07-07

    Alternative splicing is a mechanism in eukaryotes by which different forms of mRNAs are generated from the same gene. Identification of alternative splice variants requires the identification of peptides specific for alternative splice forms. For this purpose, we generated a human database that contains only unique tryptic peptides specific for alternative splice forms from Swiss-Prot entries. Using this database allows easy access to splice variant-specific peptide sequences that match MS data. Furthermore, we combined this database without alternative splice variant-1-specific peptides with human Swiss-Prot. This combined database can be used as a general database for searching LC-MS data. LC-MS data derived from in-solution digests of two different cell lines (LNCaP, HeLa) and phosphoproteomics studies were analyzed using these two databases. Several nonalternative splice variant-1-specific peptides were found in both cell lines, and some of them seemed to be cell-line-specific. Control and apoptotic phosphoproteomes from Jurkat T cells revealed several nonalternative splice variant-1-specific peptides, and some of them showed clear quantitative differences between the two states.
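    The idea of a database of tryptic peptides unique to individual splice forms can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names, the toy sequences, the minimum peptide length and the simplified trypsin rule (cleave after K or R unless followed by P, no missed cleavages) are all assumptions made for illustration.

```python
from collections import defaultdict


def tryptic_peptides(protein, min_len=6):
    """In silico digest: cleave after K or R unless the next residue is P.

    Simplified rule with no missed cleavages; peptides shorter than
    min_len are discarded.
    """
    peptides, start = set(), 0
    for i, aa in enumerate(protein):
        nxt = protein[i + 1] if i + 1 < len(protein) else ""
        if aa in "KR" and nxt != "P":
            peptides.add(protein[start:i + 1])
            start = i + 1
    if start < len(protein):
        peptides.add(protein[start:])  # C-terminal peptide
    return {p for p in peptides if len(p) >= min_len}


def isoform_specific_peptides(isoforms, min_len=6):
    """Map each isoform name to the peptides produced by that isoform only."""
    owners = defaultdict(set)
    for name, seq in isoforms.items():
        for pep in tryptic_peptides(seq, min_len):
            owners[pep].add(name)
    unique = {name: set() for name in isoforms}
    for pep, names in owners.items():
        if len(names) == 1:
            unique[next(iter(names))].add(pep)
    return unique


# Hypothetical toy isoforms: iso2 lacks the "GGG" segment present in iso1.
toy = {
    "iso1": "MAAAAAKLLLGGGSSSRTTTTTTK",
    "iso2": "MAAAAAKLLLSSSRTTTTTTK",
}
specific = isoform_specific_peptides(toy)
```

    In this toy case only the peptide spanning the differing segment is specific to each isoform, while the shared N- and C-terminal peptides are excluded.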

  9. A Comprehensive Analysis of Alternative Splicing in Paleopolyploid Maize.

    PubMed

    Mei, Wenbin; Liu, Sanzhen; Schnable, James C; Yeh, Cheng-Ting; Springer, Nathan M; Schnable, Patrick S; Barbazuk, William B

    2017-01-01

    Identifying and characterizing alternative splicing (AS) enables our understanding of the biological role of transcript isoform diversity. This study describes the use of publicly available RNA-Seq data to identify and characterize the global diversity of AS isoforms in maize using the inbred lines B73 and Mo17, and a related species, sorghum. Identification and characterization of AS within maize tissues revealed that genes expressed in seed exhibit the largest differential AS relative to other tissues examined. Additionally, differences in AS between the two genotypes B73 and Mo17 are greatest within genes expressed in seed. We demonstrate that changes in the level of alternatively spliced transcripts (intron retention and exon skipping) do not solely reflect differences in total transcript abundance, and we present evidence that intron retention may act to fine-tune gene expression across seed development stages. Furthermore, we have identified temperature-sensitive AS in maize and demonstrate that drought-induced changes in AS involve distinct sets of genes in reproductive and vegetative tissues. Examining our identified AS isoforms within B73 × Mo17 recombinant inbred lines (RILs) identified splicing QTL (sQTL). Of the cis-sQTL-regulated junctions, 43.3% are identified as alternatively spliced junctions in our analysis, while 10 Mb windows on each side of 48.2% of trans-sQTLs overlap with splicing-related genes. Using sorghum as an out-group enabled direct examination of loss or conservation of AS between homeologous genes representing the two subgenomes of maize. We identify several instances where AS isoforms that are conserved between one maize homeolog and its sorghum ortholog are absent from the second maize homeolog, suggesting that these AS isoforms may have been lost after the maize whole genome duplication event. This comprehensive analysis provides new insights into the complexity of AS in maize.

  10. Novel compound heterozygous Thyroglobulin mutations c.745+1G>A/c.7036+2T>A associated with congenital goiter and hypothyroidism in a Vietnamese family. Identification of a new cryptic 5' splice site in the exon 6.

    PubMed

    Citterio, Cintia E; Morales, Cecilia M; Bouhours-Nouet, Natacha; Machiavelli, Gloria A; Bueno, Elena; Gatelais, Frédérique; Coutant, Regis; González-Sarmiento, Rogelio; Rivolta, Carina M; Targovnik, Héctor M

    2015-03-15

    Several patients were identified with dyshormonogenesis caused by mutations in the thyroglobulin (TG) gene. These defects are inherited in an autosomal recessive manner and affected individuals are either homozygous or compound heterozygous for the mutations. The aim of the present study was to identify new TG mutations in a patient of Vietnamese origin affected by congenital hypothyroidism, goiter and low levels of serum TG. DNA sequencing identified the presence of compound heterozygous mutations in the TG gene: the maternal mutation consists of a novel c.745+1G>A (g.IVS6 + 1G>A), whereas the hypothetical paternal mutation consists of a novel c.7036+2T>A (g.IVS40 + 2T>A). The father was not available for segregation analysis. Ex-vivo splicing assays and subsequent RT-PCR analyses were performed on mRNA isolated from eukaryotic cells transfected with normal and mutant expression vectors. Minigene analysis of the c.745+1G>A mutant showed that exon 6 is skipped during pre-mRNA splicing or partially included by use of a cryptic 5' splice site located 55 nucleotides upstream of the authentic exon 6/intron 6 junction site. The functional analysis of the c.7036+2T>A mutation showed a complete skipping of exon 40. The theoretical consequences of the splice site mutations, predicted with the bioinformatics tools NNSplice, Fsplice, SPL, SPLM and MaxEntScan, were investigated and evaluated in relation to the experimental evidence. These analyses predicted that both mutant alleles would result in the abolition of the authentic splice donor sites. The c.745+1G>A mutation gives rise to two putative truncated proteins of 200 and 1142 amino acids, whereas the c.7036+2T>A mutation results in a putative truncated protein of 2277 amino acids. In conclusion, we show that the c.745+1G>A mutation promotes the activation of a new cryptic donor splice site in exon 6 of the TG gene. 
The functional consequences of these mutations could be structural changes in the protein molecule that alter the biosynthesis of thyroid hormones. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  11. A Targeted Oligonucleotide Enhancer of SMN2 Exon 7 Splicing Forms Competing Quadruplex and Protein Complexes in Functional Conditions

    PubMed Central

    Smith, Lindsay D.; Dickinson, Rachel L.; Lucas, Christian M.; Cousins, Alex; Malygin, Alexey A.; Weldon, Carika; Perrett, Andrew J.; Bottrill, Andrew R.; Searle, Mark S.; Burley, Glenn A.; Eperon, Ian C.

    2014-01-01

    Summary The use of oligonucleotides to activate the splicing of selected exons is limited by a poor understanding of the mechanisms affected. A targeted bifunctional oligonucleotide enhancer of splicing (TOES) anneals to SMN2 exon 7 and carries an exonic splicing enhancer (ESE) sequence. We show that it stimulates splicing specifically of intron 6 in the presence of repressing sequences in intron 7. Complementarity to the 5′ end of exon 7 increases U2AF65 binding, but the ESE sequence is required for efficient recruitment of U2 snRNP. The ESE forms at least three coexisting discrete states: a quadruplex, a complex containing only hnRNP F/H, and a complex enriched in the activator SRSF1. Neither hnRNP H nor quadruplex formation contributes to ESE activity. The results suggest that splicing limited by weak signals can be rescued by rapid exchange of TOES oligonucleotides in various complexes and raise the possibility that SR proteins associate transiently with ESEs. PMID:25263560

  12. SpliceDisease database: linking RNA splicing and disease.

    PubMed

    Wang, Juan; Zhang, Jie; Li, Kaibo; Zhao, Wei; Cui, Qinghua

    2012-01-01

    RNA splicing is an important aspect of gene regulation in many organisms. Splicing of RNA is regulated by complicated mechanisms involving numerous RNA-binding proteins and the intricate network of interactions among them. Mutations in cis-acting splicing elements or their regulatory proteins have been shown to be involved in human diseases. Defects in the pre-mRNA splicing process have emerged as a common disease-causing mechanism. Therefore, a database integrating RNA splicing and disease associations would be helpful for understanding not only RNA splicing but also its contribution to disease. In the SpliceDisease database, we manually curated 2337 splicing mutation disease entries involving 303 genes and 370 diseases, which have been supported experimentally in 898 publications. The SpliceDisease database provides information including the nucleotide change in the sequence, the location of the mutation on the gene, the reference PubMed ID and a detailed description of the relationships among gene mutations, splicing defects and diseases. We standardized the names of the diseases and genes and provided links for these genes to the NCBI and UCSC genome browsers for further annotation and genomic sequences. For the location of the mutation, we give direct links from the entry to the respective position/region in the genome browser. Users can freely browse, search and download the data in SpliceDisease at http://cmbi.bjmu.edu.cn/sdisease.

  13. cis-acting intron mutations that affect the efficiency of avian retroviral RNA splicing: implication for mechanisms of control.

    PubMed Central

    Katz, R A; Kotler, M; Skalka, A M

    1988-01-01

    The full-length retroviral RNA transcript serves as (i) mRNA for the gag and pol gene products, (ii) genomic RNA that is assembled into progeny virions, and (iii) a pre-mRNA for spliced subgenomic mRNAs. Therefore, a balance of spliced and unspliced RNA is required to generate the appropriate levels of protein and RNA products for virion production. We have introduced an insertion mutation near the avian sarcoma virus env splice acceptor site that results in a significant increase in splicing to form functional env mRNA. The mutant virus is replication defective, but phenotypic revertant viruses that have acquired second-site mutations near the splice acceptor site can be isolated readily. Detailed analysis of one of these viruses revealed that a single nucleotide change at -20 from the splice acceptor site, within the original mutagenic insert, was sufficient to restore viral growth and significantly decrease splicing efficiency compared with the original mutant and wild-type viruses. Thus, minor sequence alterations near the env splice acceptor site can produce major changes in the balance of spliced and unspliced RNAs. Our results suggest a mechanism of control in which splicing is modulated by cis-acting sequences at the env splice acceptor site. Furthermore, this retroviral system provides a powerful genetic method for selection and analysis of mutations that affect splicing control. PMID:2839694

  14. A SMN-Dependent U12 Splicing Event Essential for Motor Circuit Function

    PubMed Central

    Lotti, Francesco; Imlach, Wendy L.; Saieva, Luciano; Beck, Erin S.; Hao, Le T.; Li, Darrick K.; Jiao, Wei; Mentis, George Z.; Beattie, Christine E.; McCabe, Brian D.; Pellizzoni, Livio

    2012-01-01

    SUMMARY Spinal muscular atrophy (SMA) is a motor neuron disease caused by deficiency of the ubiquitous survival motor neuron (SMN) protein. To define the mechanisms of selective neuronal dysfunction in SMA, we investigated the role of SMN-dependent U12 splicing events in the regulation of motor circuit activity. We show that SMN deficiency perturbs splicing and decreases the expression of a subset of U12 intron-containing genes in mammalian cells and Drosophila larvae. Analysis of these SMN target genes identifies Stasimon as a novel protein required for motor circuit function. Restoration of Stasimon expression in the motor circuit corrects defects in neuromuscular junction transmission and muscle growth in Drosophila SMN mutants and aberrant motor neuron development in SMN-deficient zebrafish. These findings directly link defective splicing of critical neuronal genes induced by SMN deficiency to motor circuit dysfunction, establishing a molecular framework for the selective pathology of SMA. PMID:23063131

  15. Phenotypic and genotypic characterization of four factor VII deficiency patients from central China.

    PubMed

    Liu, Hui; Wang, Hua-Fang; Cheng, Zhi-peng; Wang, Qing-yun; Hu, Bei; Zeng, Wei; Wu, Ying-ying; Guo, Tao; Tang, Liang; Hu, Yu

    2015-06-01

    Hereditary coagulation factor VII deficiency (FVIID) is a rare autosomal recessive inherited hemorrhagic disorder related to a variety of mutations or polymorphisms throughout the factor VII (FVII) gene (F7). The aims of this study were to characterize the molecular defect of the F7 gene in four unrelated patients with FVIID and to find the genotype-phenotype correlation. All nine exons, exon-intron boundaries, and 5' and 3'-untranslated regions of the F7 gene were amplified by PCR and the purified PCR products were sequenced directly. Suspected mutations were confirmed by another PCR and sequencing of the opposite strand. Family studies were also performed. A total of five unique lesions were identified, including three missense mutations (c.384A>G, c.839A>C, c.1163T>G, predicting p.Tyr128Cys, p.Glu280Ala and p.Phe388Cys substitution, respectively) and two splice junction mutations (c.572-1G>A, c.681+1G>T), among which two (p.Glu280Ala, p.Phe388Cys) were novel. A previously reported mutation, p.Tyr128Cys, was seen in the homozygous state in two unrelated patients. The other two cases were both compound heterozygotes for a missense mutation and a splice site mutation. Multiple sequence alignment using DNAMAN analysis showed that all the missense mutations were found in residues that are highly conserved across species and vitamin K-dependent serine proteases. The online software tools PolyPhen and SIFT were used to confirm the pathogenicity of the missense mutations. p.Tyr128Cys seems to be a mutation hotspot of the F7 gene in the ethnic Han Chinese population.
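    The splice junction variants listed above use coding-DNA notation in which an offset after the exonic position marks an intronic base: "+" counts downstream from the last base of an exon (donor side) and "-" counts upstream from the first base of the next exon (acceptor side). The short Python sketch below shows how such names could be sorted into donor-side and acceptor-side changes. The regular expression, function name and returned fields are illustrative assumptions; the pattern covers only simple single-base substitutions, not the full notation.

```python
import re

# Simplified pattern for intronic substitutions such as "c.681+1G>T".
# This is an assumption for illustration and ignores deletions,
# insertions, UTR positions and other forms of the notation.
_INTRONIC = re.compile(r"^c\.(\d+)([+-])(\d+)([ACGT])>([ACGT])$")


def classify_splice_variant(name):
    """Return a dict describing an intronic substitution, or None.

    "+" offsets lie on the donor (5' splice site) side of the intron,
    "-" offsets on the acceptor (3' splice site) side. Offsets 1 and 2
    correspond to the canonical dinucleotide positions.
    """
    m = _INTRONIC.match(name)
    if m is None:
        return None  # exonic or unsupported notation, e.g. "c.384A>G"
    anchor, sign, offset, ref, alt = m.groups()
    offset = int(offset)
    return {
        "exon_anchor": int(anchor),
        "site": "donor" if sign == "+" else "acceptor",
        "offset": offset,
        "ref": ref,
        "alt": alt,
        "canonical_dinucleotide": offset in (1, 2),
    }


# Variant names taken from the abstract above.
for variant in ("c.572-1G>A", "c.681+1G>T", "c.384A>G"):
    print(variant, classify_splice_variant(variant))
```

    Names without an intronic offset, such as the missense changes, fall through as None, so exonic variants are left for separate handling.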

  16. Investigation of Experimental Factors That Underlie BRCA1/2 mRNA Isoform Expression Variation: Recommendations for Utilizing Targeted RNA Sequencing to Evaluate Potential Spliceogenic Variants

    PubMed Central

    Lattimore, Vanessa L.; Pearson, John F.; Currie, Margaret J.; Spurdle, Amanda B.; Robinson, Bridget A.; Walker, Logan C.

    2018-01-01

    PCR-based RNA splicing assays are commonly used in diagnostic and research settings to assess the potential effects of variants of uncertain clinical significance in BRCA1 and BRCA2. The Evidence-based Network for the Interpretation of Germline Mutant Alleles (ENIGMA) consortium completed a multicentre investigation to evaluate differences in assay design and the integrity of published data, raising a number of methodological questions associated with cell culture conditions and PCR-based protocols. We utilized targeted RNA-seq to re-assess BRCA1 and BRCA2 mRNA isoform expression patterns in lymphoblastoid cell lines (LCLs) previously used in the multicentre ENIGMA study. Capture of the targeted cDNA sequences was carried out using 34 BRCA1 and 28 BRCA2 oligonucleotides from the Illumina Truseq Targeted RNA Expression platform. Our results show that targeted RNA-seq analysis of LCLs overcomes many of the methodology limitations associated with PCR-based assays leading us to make the following observations and recommendations: (1) technical replicates (n > 2) of variant carriers to capture methodology induced variability associated with RNA-seq assays, (2) LCLs can undergo multiple freeze/thaw cycles and can be cultured up to 2 weeks without noticeably influencing isoform expression levels, (3) nonsense-mediated decay inhibitors are essential prior to splicing assays for comprehensive mRNA isoform detection, (4) quantitative assessment of exon:exon junction levels across BRCA1 and BRCA2 can help distinguish between normal and aberrant isoform expression patterns. Experimentally derived recommendations from this study will facilitate the application of targeted RNA-seq platforms for the quantitation of BRCA1 and BRCA2 mRNA aberrations associated with sequence variants of uncertain clinical significance. PMID:29774201

  17. Investigation of Experimental Factors That Underlie BRCA1/2 mRNA Isoform Expression Variation: Recommendations for Utilizing Targeted RNA Sequencing to Evaluate Potential Spliceogenic Variants.

    PubMed

    Lattimore, Vanessa L; Pearson, John F; Currie, Margaret J; Spurdle, Amanda B; Robinson, Bridget A; Walker, Logan C

    2018-01-01

    PCR-based RNA splicing assays are commonly used in diagnostic and research settings to assess the potential effects of variants of uncertain clinical significance in BRCA1 and BRCA2. The Evidence-based Network for the Interpretation of Germline Mutant Alleles (ENIGMA) consortium completed a multicentre investigation to evaluate differences in assay design and the integrity of published data, raising a number of methodological questions associated with cell culture conditions and PCR-based protocols. We utilized targeted RNA-seq to re-assess BRCA1 and BRCA2 mRNA isoform expression patterns in lymphoblastoid cell lines (LCLs) previously used in the multicentre ENIGMA study. Capture of the targeted cDNA sequences was carried out using 34 BRCA1 and 28 BRCA2 oligonucleotides from the Illumina Truseq Targeted RNA Expression platform. Our results show that targeted RNA-seq analysis of LCLs overcomes many of the methodology limitations associated with PCR-based assays leading us to make the following observations and recommendations: (1) technical replicates (n > 2) of variant carriers to capture methodology induced variability associated with RNA-seq assays, (2) LCLs can undergo multiple freeze/thaw cycles and can be cultured up to 2 weeks without noticeably influencing isoform expression levels, (3) nonsense-mediated decay inhibitors are essential prior to splicing assays for comprehensive mRNA isoform detection, (4) quantitative assessment of exon:exon junction levels across BRCA1 and BRCA2 can help distinguish between normal and aberrant isoform expression patterns. Experimentally derived recommendations from this study will facilitate the application of targeted RNA-seq platforms for the quantitation of BRCA1 and BRCA2 mRNA aberrations associated with sequence variants of uncertain clinical significance.

  18. Parameter optimization of fusion splicing of photonic crystal fibers and conventional fibers to increase strength

    NASA Astrophysics Data System (ADS)

    Zhang, Chunxi; Zhang, Zuchen; Song, Jingming; Wu, Chunxiao; Song, Ningfang

    2015-03-01

    A splicing parameter optimization method to increase the tensile strength of splicing joint between photonic crystal fiber (PCF) and conventional fiber is demonstrated. Based on the splicing recipes provided by splicer or fiber manufacturers, the optimal values of some major splicing parameters are obtained in sequence, and a conspicuous improvement in the mechanical strength of splicing joints between PCFs and conventional fibers is validated through experiments.

  19. Evolutionary conservation and regulation of particular alternative splicing events in plant SR proteins

    PubMed Central

    Kalyna, Maria; Lopato, Sergiy; Voronin, Viktor; Barta, Andrea

    2006-01-01

    Alternative splicing is an important mechanism for fine tuning of gene expression at the post-transcriptional level. SR proteins govern splice site selection and spliceosome assembly. The Arabidopsis genome encodes 19 SR proteins, several of which have no orthologues in metazoans. Three of the plant specific subfamilies are characterized by the presence of a relatively long alternatively spliced intron located in their first RNA recognition motif, which potentially results in an extremely truncated protein. In atRSZ33, a member of the RS2Z subfamily, this alternative splicing event was shown to be autoregulated. Here we show that atRSp31, a member of the RS subfamily, does not autoregulate alternative splicing of its similarly positioned intron. Interestingly, this alternative splicing event is regulated by atRSZ33. We demonstrate that the positions of these long introns and their capability for alternative splicing are conserved from green algae to flowering plants. Moreover, in particular alternative splicing events the splicing signals are embedded into highly conserved sequences. In different taxa, these conserved sequences occur in at least one gene within a subfamily. The evolutionary preservation of alternative splice forms together with highly conserved intron features argues for additional functions hidden in the genes of these plant-specific SR proteins. PMID:16936312

  20. An EMT–Driven Alternative Splicing Program Occurs in Human Breast Cancer and Modulates Cellular Phenotype

    PubMed Central

    Flytzanis, Nicholas C.; Balsamo, Michele; Condeelis, John S.; Oktay, Maja H.; Burge, Christopher B.; Gertler, Frank B.

    2011-01-01

    Epithelial-mesenchymal transition (EMT), a mechanism important for embryonic development, plays a critical role during malignant transformation. While much is known about transcriptional regulation of EMT, alternative splicing of several genes has also been correlated with EMT progression, but the extent of splicing changes and their contributions to the morphological conversion accompanying EMT have not been investigated comprehensively. Using an established cell culture model and RNA–Seq analyses, we determined an alternative splicing signature for EMT. Genes encoding key drivers of EMT–dependent changes in cell phenotype, such as actin cytoskeleton remodeling, regulation of cell–cell junction formation, and regulation of cell migration, were enriched among EMT–associated alternative splicing events. Our analysis suggested that most EMT–associated alternative splicing events are regulated by one or more members of the RBFOX, MBNL, CELF, hnRNP, or ESRP classes of splicing factors. The EMT alternative splicing signature was confirmed in human breast cancer cell lines, which could be classified into basal and luminal subtypes based exclusively on their EMT–associated splicing pattern. Expression of EMT–associated alternative mRNA transcripts was also observed in primary breast cancer samples, indicating that EMT–dependent splicing changes occur commonly in human tumors. The functional significance of EMT–associated alternative splicing was tested by expression of the epithelial-specific splicing factor ESRP1 or by depletion of RBFOX2 in mesenchymal cells, both of which elicited significant changes in cell morphology and motility towards an epithelial phenotype, suggesting that splicing regulation alone can drive critical aspects of EMT–associated phenotypic changes. The molecular description obtained here may aid in the development of new diagnostic and prognostic markers for analysis of breast cancer progression. PMID:21876675

  1. Traceless splicing enabled by substrate-induced activation of the Nostoc punctiforme Npu DnaE intein after mutation of a catalytic cysteine to serine.

    PubMed

    Cheriyan, Manoj; Chan, Siu-Hong; Perler, Francine

    2014-12-12

    Inteins self-catalytically cleave out of precursor proteins while ligating the surrounding extein fragments with a native peptide bond. Much attention has been lavished on these molecular marvels with the hope of understanding and harnessing their chemistry for novel biochemical transformations including coupling peptides from synthetic or biological origins and controlling protein function. Despite an abundance of powerful applications, the use of inteins is still hampered by limitations in our understanding of their specificity (defined as flanking sequences that permit splicing) and the challenge of inserting inteins into target proteins. We examined the frequently used Nostoc punctiforme Npu DnaE intein after the C-extein cysteine nucleophile (Cys+1) was mutated to serine or threonine. Previous studies demonstrated reduced rates and/or splicing yields with the Npu DnaE intein after mutation of Cys+1 to Ser+1. In this study, genetic selection identified extein sequences with Ser+1 that enabled the Npu DnaE intein to splice with only a 5-fold reduction in rate compared to the wild-type Cys+1 intein and without mutation of the intein itself to activate Ser+1 as a nucleophile. Three different proteins spliced efficiently after insertion of the intein flanked by the selected sequences. We then used this selected specificity to achieve traceless splicing in a targeted enzyme at a location predicted by primary sequence similarity to only the selected C-extein sequence. This study highlights the latent catalytic potential of the Npu DnaE intein to splice with an alternative nucleophile and enables broader intein utility by increasing insertion site choices. Copyright © 2014. Published by Elsevier Ltd.

  2. SPLICEFINDER – A Fast and Easy Screening Method for Active Protein Trans-Splicing Positions

    PubMed Central

    Eppmann, Simone; Busche, Alena; Dikovskaya, Dina; Dötsch, Volker; Mootz, Henning D.

    2013-01-01

    Split intein enabled protein trans-splicing (PTS) is a powerful method for the ligation of two protein fragments, thereby paving the way for various protein modification or protein function control applications. PTS activity is strongly influenced by the amino acids directly flanking the splice junctions. However, to date no reliable prediction can be made whether or not a split intein is active in a particular foreign extein context. Here we describe SPLICEFINDER, a PCR-based method, allowing fast and easy screening for active split intein insertions in any target protein. Furthermore we demonstrate the applicability of SPLICEFINDER for segmental isotopic labeling as well as for the generation of multi-domain and enzymatically active proteins. PMID:24023792

  3. G to A substitution in 5′ donor splice site of introns 18 and 48 of COL1A1 gene of type I collagen results in different splicing alternatives in osteogenesis imperfecta type I cell strains

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Willing, M.; Deschenes, S.

    We have identified a G to A substitution in the 5′ donor splice site of intron 18 of one COL1A1 allele in two unrelated families with osteogenesis imperfecta (OI) type I. A third OI type I family has a G to A substitution at the identical position in intron 48 of one COL1A1 allele. Both mutations abolish normal splicing and lead to reduced steady-state levels of mRNA from the mutant COL1A1 allele. The intron 18 mutation leads to both exon 18 skipping in the mRNA and to utilization of a single alternative splice site near the 3′ end of exon 18. The latter results in deletion of the last 8 nucleotides of exon 18 from the mRNA, a shift in the translational reading-frame, and the creation of a premature termination codon in exon 19. Of the potential alternative 5′ splice sites in exon 18 and intron 18, the one utilized has a surrounding nucleotide sequence which most closely resembles that of the natural splice site. Although a G to A mutation was detected at the identical position in intron 48 of one COL1A1 allele in another OI type I family, nine complex alternative splicing patterns were identified by sequence analysis of cDNA clones derived from fibroblast mRNA from this cell strain. All result in partial or complete skipping of exon 48, with in-frame deletions of portions of exons 47 and/or 49. The different patterns of RNA splicing were not explained by their sequence homology with naturally occurring 5′ splice sites, but rather by recombination between highly homologous exon sequences, suggesting that we may not have identified the major splicing alternative(s) in this cell strain. Both G to A mutations result in decreased production of type I collagen, the common biochemical correlate of OI type I.

  4. Defective control of pre–messenger RNA splicing in human disease

    PubMed Central

    Shkreta, Lulzim

    2016-01-01

    Examples of associations between human disease and defects in pre–messenger RNA splicing/alternative splicing are accumulating. Although many alterations are caused by mutations in splicing signals or regulatory sequence elements, recent studies have noted the disruptive impact of mutated generic spliceosome components and splicing regulatory proteins. This review highlights recent progress in our understanding of how the altered splicing function of RNA-binding proteins contributes to myelodysplastic syndromes, cancer, and neuropathologies. PMID:26728853

  5. Manananggal - a novel viewer for alternative splicing events.

    PubMed

    Barann, Matthias; Zimmer, Ralf; Birzele, Fabian

    2017-02-21

    Alternative splicing is an important cellular mechanism that can be analyzed by RNA sequencing. However, identification of splicing events in an automated fashion is error-prone. Thus, further validation is required to select reliable instances of alternative splicing events (ASEs). There are only a few tools specifically designed for interactive inspection of ASEs, and available visualization approaches can be significantly improved. Here, we present Manananggal, an application specifically designed for the identification of splicing events in next generation sequencing data. Manananggal includes a web application for visual inspection and a command line tool that allows for ASE detection. We compare the sashimi plots available in the IGV Viewer, the DEXSeq splicing plots and SpliceSeq to the Manananggal interface and discuss the advantages and drawbacks of these tools. We show that sashimi plots (such as those used by the IGV Viewer and SpliceSeq) offer a practical solution for simple ASEs, but also indicate shortcomings for highly complex genes. Manananggal is an interactive web application that offers functions specifically tailored to the identification of alternative splicing events that other tools are lacking. The ability to select a subset of isoforms allows an easier interpretation of complex alternative splicing events. In contrast to SpliceSeq and the DEXSeq splicing plot, Manananggal does not obscure the gene structure by showing full transcript models, which makes it easier to determine which isoforms are expressed and which are not.

  6. Dynamic ASXL1 Exon Skipping and Alternative Circular Splicing in Single Human Cells

    PubMed Central

    Natarajan, Sivaraman; Carter, Robert; Brown, Patrick O.

    2016-01-01

    Circular RNAs comprise a poorly understood new class of noncoding RNA. In this study, we used a combination of targeted deletion, high-resolution splicing detection, and single-cell sequencing to deeply probe ASXL1 circular splicing. We found that efficient circular splicing required the canonical transcriptional start site and inverted AluSx elements. Sequencing-based interrogation of isoforms after ASXL1 overexpression identified promiscuous linear splicing between all exons, with the two most abundant non-canonical linear products skipping the exons that produced the circular isoforms. Single-cell sequencing revealed a strong preference for either the linear or circular ASXL1 isoforms in each cell, and found the predominant exon skipping product is frequently co-expressed with its reciprocal circular isoform. Finally, absolute quantification of ASXL1 isoforms confirmed our findings and suggests that standard methods overestimate circRNA abundance. Taken together, these data reveal a dynamic new view of circRNA genesis, providing additional framework for studying their roles in cellular biology. PMID:27736885

  7. Characterization of cis-Acting RNA Elements of Zika Virus by Using a Self-Splicing Ribozyme-Dependent Infectious Clone.

    PubMed

    Liu, Zhong-Yu; Yu, Jiu-Yang; Huang, Xing-Yao; Fan, Hang; Li, Xiao-Feng; Deng, Yong-Qiang; Ji, Xue; Cheng, Meng-Li; Ye, Qing; Zhao, Hui; Han, Jian-Feng; An, Xiao-Ping; Jiang, Tao; Zhang, Bo; Tong, Yi-Gang; Qin, Cheng-Feng

    2017-11-01

    Zika virus (ZIKV) has caused significant outbreaks and epidemics in the Americas recently, raising global concern due to its ability to cause microcephaly and other neurological complications. A stable and efficient infectious clone of ZIKV is urgently needed. However, the instability and toxicity of flavivirus cDNA clones in Escherichia coli hosts has hindered the development of ZIKV infectious clones. Here, using a novel self-splicing ribozyme-based strategy, we generated a stable infectious cDNA clone of a contemporary ZIKV strain imported from Venezuela to China in 2016. The constructed clone contained a modified version of the group II self-splicing intron P.li.LSUI2 near the junction between the E and NS1 genes, which was removed from the RNA transcripts by an easy-to-establish in vitro splicing reaction. Transfection of the spliced RNAs into BHK-21 cells led to the production of infectious progeny virus that resembled the parental virus. Finally, potential cis-acting RNA elements in ZIKV genomic RNA were identified based on this novel reverse genetics system, and the critical roles of the 5'-SLA promoter and 5'-3' cyclization sequences were characterized by a combination of different assays. Our results provide another stable and reliable reverse genetics system for ZIKV that will help study ZIKV infection and pathogenesis, and the novel self-splicing intron-based strategy could be further expanded for the construction of infectious clones from other emerging and reemerging flaviviruses. IMPORTANCE The ongoing Zika virus (ZIKV) outbreaks have drawn global concern due to the unexpected causal link to fetus microcephaly and other severe neurological complications. The infectious cDNA clones of ZIKV are critical for the research community to study the virus, understand the disease, and inform vaccine design and antiviral screening. A panel of existing technologies has been utilized to develop ZIKV infectious clones. Here, we successfully generated a stable infectious clone of a 2016 ZIKV strain using a novel self-splicing ribozyme-based technology that abolished the potential toxicity of ZIKV cDNA clones to the E. coli host. Moreover, two crucial cis-acting replication elements (5'-SLA and 5'-CS) of ZIKV were first identified using this novel reverse genetics system. This novel self-splicing ribozyme-based reverse genetics platform will be widely utilized in future ZIKV studies and provide insight for the development of infectious clones of other emerging viruses. Copyright © 2017 American Society for Microbiology.

  8. Characterization of cis-Acting RNA Elements of Zika Virus by Using a Self-Splicing Ribozyme-Dependent Infectious Clone

    PubMed Central

    Liu, Zhong-Yu; Yu, Jiu-Yang; Huang, Xing-Yao; Fan, Hang; Li, Xiao-Feng; Deng, Yong-Qiang; Ji, Xue; Cheng, Meng-Li; Ye, Qing; Zhao, Hui; Han, Jian-Feng; An, Xiao-Ping; Jiang, Tao; Zhang, Bo; Tong, Yi-Gang

    2017-01-01

    ABSTRACT Zika virus (ZIKV) has caused significant outbreaks and epidemics in the Americas recently, raising global concern due to its ability to cause microcephaly and other neurological complications. A stable and efficient infectious clone of ZIKV is urgently needed. However, the instability and toxicity of flavivirus cDNA clones in Escherichia coli hosts has hindered the development of ZIKV infectious clones. Here, using a novel self-splicing ribozyme-based strategy, we generated a stable infectious cDNA clone of a contemporary ZIKV strain imported from Venezuela to China in 2016. The constructed clone contained a modified version of the group II self-splicing intron P.li.LSUI2 near the junction between the E and NS1 genes, which was removed from the RNA transcripts by an easy-to-establish in vitro splicing reaction. Transfection of the spliced RNAs into BHK-21 cells led to the production of infectious progeny virus that resembled the parental virus. Finally, potential cis-acting RNA elements in ZIKV genomic RNA were identified based on this novel reverse genetics system, and the critical roles of the 5′-SLA promoter and 5′-3′ cyclization sequences were characterized by a combination of different assays. Our results provide another stable and reliable reverse genetics system for ZIKV that will help study ZIKV infection and pathogenesis, and the novel self-splicing intron-based strategy could be further expanded for the construction of infectious clones from other emerging and reemerging flaviviruses. IMPORTANCE The ongoing Zika virus (ZIKV) outbreaks have drawn global concern due to the unexpected causal link to fetus microcephaly and other severe neurological complications. The infectious cDNA clones of ZIKV are critical for the research community to study the virus, understand the disease, and inform vaccine design and antiviral screening. A panel of existing technologies has been utilized to develop ZIKV infectious clones. Here, we successfully generated a stable infectious clone of a 2016 ZIKV strain using a novel self-splicing ribozyme-based technology that abolished the potential toxicity of ZIKV cDNA clones to the E. coli host. Moreover, two crucial cis-acting replication elements (5′-SLA and 5′-CS) of ZIKV were first identified using this novel reverse genetics system. This novel self-splicing ribozyme-based reverse genetics platform will be widely utilized in future ZIKV studies and provide insight for the development of infectious clones of other emerging viruses. PMID:28814522

  9. Interconnections Between RNA-Processing Pathways Revealed by a Sequencing-Based Genetic Screen for Pre-mRNA Splicing Mutants in Fission Yeast.

    PubMed

    Larson, Amy; Fair, Benjamin Jung; Pleiss, Jeffrey A

    2016-06-01

    Pre-mRNA splicing is an essential component of eukaryotic gene expression and is highly conserved from unicellular yeasts to humans. Here, we present the development and implementation of a sequencing-based reverse genetic screen designed to identify nonessential genes that impact pre-mRNA splicing in the fission yeast Schizosaccharomyces pombe, an organism that shares many of the complex features of splicing in higher eukaryotes. Using a custom-designed barcoding scheme, we simultaneously queried ∼3000 mutant strains for their impact on the splicing efficiency of two endogenous pre-mRNAs. A total of 61 nonessential genes were identified whose deletions resulted in defects in pre-mRNA splicing; enriched among these were factors encoding known or predicted components of the spliceosome. Included among the candidates identified here are genes with well-characterized roles in other RNA-processing pathways, including heterochromatic silencing and 3' end processing. Splicing-sensitive microarrays confirm broad splicing defects for many of these factors, revealing novel functional connections between these pathways. Copyright © 2016 Larson et al.

  10. Interconnections Between RNA-Processing Pathways Revealed by a Sequencing-Based Genetic Screen for Pre-mRNA Splicing Mutants in Fission Yeast

    PubMed Central

    Larson, Amy; Fair, Benjamin Jung; Pleiss, Jeffrey A.

    2016-01-01

    Pre-mRNA splicing is an essential component of eukaryotic gene expression and is highly conserved from unicellular yeasts to humans. Here, we present the development and implementation of a sequencing-based reverse genetic screen designed to identify nonessential genes that impact pre-mRNA splicing in the fission yeast Schizosaccharomyces pombe, an organism that shares many of the complex features of splicing in higher eukaryotes. Using a custom-designed barcoding scheme, we simultaneously queried ∼3000 mutant strains for their impact on the splicing efficiency of two endogenous pre-mRNAs. A total of 61 nonessential genes were identified whose deletions resulted in defects in pre-mRNA splicing; enriched among these were factors encoding known or predicted components of the spliceosome. Included among the candidates identified here are genes with well-characterized roles in other RNA-processing pathways, including heterochromatic silencing and 3ʹ end processing. Splicing-sensitive microarrays confirm broad splicing defects for many of these factors, revealing novel functional connections between these pathways. PMID:27172183

  11. A conserved intronic U1 snRNP-binding sequence promotes trans-splicing in Drosophila

    PubMed Central

    Gao, Jun-Li; Fan, Yu-Jie; Wang, Xiu-Ye; Zhang, Yu; Pu, Jia; Li, Liang; Shao, Wei; Zhan, Shuai; Hao, Jianjiang

    2015-01-01

    Unlike typical cis-splicing, trans-splicing joins exons from two separate transcripts to produce chimeric mRNA and has been detected in most eukaryotes. Trans-splicing in trypanosomes and nematodes has been characterized as a spliced leader RNA-facilitated reaction; in contrast, its mechanism in higher eukaryotes remains unclear. Here we investigate mod(mdg4), a classic trans-spliced gene in Drosophila, and report that two critical RNA sequences in the middle of the last 5′ intron, TSA and TSB, promote trans-splicing of mod(mdg4). In TSA, a 13-nucleotide (nt) core motif is conserved across Drosophila species and is essential and sufficient for trans-splicing, which binds U1 small nuclear RNP (snRNP) through strong base-pairing with U1 snRNA. In TSB, a conserved secondary structure acts as an enhancer. Deletions of TSA and TSB using the CRISPR/Cas9 system result in developmental defects in flies. Although it is not clear how the 5′ intron finds the 3′ introns, compensatory changes in U1 snRNA rescue trans-splicing of TSA mutants, demonstrating that U1 recruitment is critical to promote trans-splicing in vivo. Furthermore, TSA core-like motifs are found in many other trans-spliced Drosophila genes, including lola. These findings represent a novel mechanism of trans-splicing, in which RNA motifs in the 5′ intron are sufficient to bring separate transcripts into close proximity to promote trans-splicing. PMID:25838544

  12. Late-onset spastic paraplegia: Aberrant SPG11 transcripts generated by a novel splice site donor mutation.

    PubMed

    Kawarai, Toshitaka; Miyamoto, Ryosuke; Mori, Atsuko; Oki, Ryosuke; Tsukamoto-Miyashiro, Ai; Matsui, Naoko; Miyazaki, Yoshimichi; Orlacchio, Antonio; Izumi, Yuishin; Nishida, Yoshihiko; Kaji, Ryuji

    2015-12-15

    We identified a novel homozygous mutation in the splice site donor (SSD) of intron 30 (c.5866+1G>A) in consanguineous Japanese SPG11 siblings showing late-onset spastic paraplegia using whole-exome sequencing. Phenotypic variability was observed, including age-at-onset, dysarthria and pes cavus. Coding DNA sequencing revealed that the mutation affected the recognition of the constitutive SSD of intron 30, splicing upstream onto a nearby cryptic SSD in exon 30. The use of constitutive splice sites of intron 29 was confirmed by sequencing. The mutant transcripts are mostly subject to degradation by the nonsense-mediated mRNA decay system. SPG11 transcripts escaping the nonsense-mediated mRNA decay pathway would generate a truncated protein (p.Tyr1900Phefs5X) containing the first 1899 amino acids followed by 4 aberrant amino acids. This study showed a successful clinical application of whole-exome sequencing in spastic paraplegia and provided further evidence of allelic heterogeneity in SPG11. The confirmation of aberrant transcripts caused by a splice site mutation is a prerequisite for a more precise molecular diagnosis. Copyright © 2015 Elsevier B.V. All rights reserved.

  13. Reading the tea leaves: Dead transposon copies reveal novel host and transposon biology.

    PubMed

    McLaughlin, Richard N

    2018-03-01

    Transposable elements comprise a huge portion of most animal genomes. Unlike many pathogens, these elements leave a mark of their impact via their insertion into host genomes. With proper teasing, these sequences can relay information about the evolutionary history of transposons and their hosts. In a new publication, Larson and colleagues describe a previously unappreciated density of long interspersed element-1 (LINE-1) sequences that have been spliced (LINE-1 and other reverse transcribing elements are necessarily intronless). They provide data to suggest that the retention of these potentially deleterious splice sites in LINE-1 results from the sites' overlap with an important transcription factor binding site. These spliced LINE-1s (i.e., spliced integrated retrotransposed elements [SpiREs]) lose their ability to replicate, suggesting they are evolutionary dead ends. However, the lethality of this splicing could be an efficient means of blocking continued replication of LINE-1. In this way, the record of inactive LINE-1 sequences in the human genome revealed a new, though infrequent, event in the LINE-1 replication cycle and motivates future studies to test whether splicing might be another weapon in the anti-LINE-1 arsenal of host genomes.

  14. A KCNH2 branch point mutation causing aberrant splicing contributes to an explanation of genotype-negative long QT syndrome.

    PubMed

    Crotti, Lia; Lewandowska, Marzena A; Schwartz, Peter J; Insolia, Roberto; Pedrazzini, Matteo; Bussani, Erica; Dagradi, Federica; George, Alfred L; Pagani, Franco

    2009-02-01

    Genetic screening of long QT syndrome (LQTS) fails to identify disease-causing mutations in about 30% of patients. So far, molecular screening has focused mainly on coding sequence mutations or on substitutions at canonical splice sites. The purpose of this study was to explore the possibility that intronic variants not at canonical splice sites might affect splicing regulatory elements, lead to aberrant transcripts, and cause LQTS. Molecular screening was performed through DHPLC and sequence analysis. The role of the intronic mutation identified was assessed with a hybrid minigene splicing assay. A three-generation LQTS family was investigated. Molecular screening failed to identify an obvious disease-causing mutation in the coding sequences of the major LQTS genes but revealed an intronic A-to-G substitution in KCNH2 (IVS9-28A/G) cosegregating with the clinical phenotype in family members. In vitro analysis proved that the mutation disrupts the acceptor splice site definition by affecting the branch point (BP) sequence and promoting intron retention. We further demonstrated a tight functional relationship between the BP and the polypyrimidine tract, whose weakness is responsible for the pathological effect of the IVS9-28A/G mutation. We identified a novel BP mutation in KCNH2 that disrupts the intron 9 acceptor splice site definition and causes LQT2. The present finding demonstrates that intronic mutations affecting pre-mRNA processing may contribute to the failure of traditional molecular screening in identifying disease-causing mutations in LQTS subjects and offers a rational strategy for the reduction of genotype-negative cases.

  15. A survey of the sorghum transcriptome using single-molecule long reads

    DOE PAGES

    Abdel-Ghany, Salah E.; Hamilton, Michael; Jacobi, Jennifer L.; ...

    2016-06-24

    Alternative splicing and alternative polyadenylation (APA) of pre-mRNAs greatly contribute to transcriptome diversity, coding capacity of a genome and gene regulatory mechanisms in eukaryotes. Second-generation sequencing technologies have been extensively used to analyse transcriptomes. However, a major limitation of short-read data is that it is difficult to accurately predict full-length splice isoforms. Here we sequenced the sorghum transcriptome using Pacific Biosciences single-molecule real-time long-read isoform sequencing and developed a pipeline called TAPIS (Transcriptome Analysis Pipeline for Isoform Sequencing) to identify full-length splice isoforms and APA sites. Our analysis reveals transcriptome-wide full-length isoforms at an unprecedented scale with over 11,000 novel splice isoforms. Additionally, we uncover APA of ∼11,000 expressed genes and more than 2,100 novel genes. Lastly, these results greatly enhance sorghum gene annotations and aid in studying gene regulation in this important bioenergy crop. The TAPIS pipeline will serve as a useful tool to analyse Iso-Seq data from any organism.

  16. A survey of the sorghum transcriptome using single-molecule long reads

    PubMed Central

    Abdel-Ghany, Salah E.; Hamilton, Michael; Jacobi, Jennifer L.; Ngam, Peter; Devitt, Nicholas; Schilkey, Faye; Ben-Hur, Asa; Reddy, Anireddy S. N.

    2016-01-01

    Alternative splicing and alternative polyadenylation (APA) of pre-mRNAs greatly contribute to transcriptome diversity, coding capacity of a genome and gene regulatory mechanisms in eukaryotes. Second-generation sequencing technologies have been extensively used to analyse transcriptomes. However, a major limitation of short-read data is that it is difficult to accurately predict full-length splice isoforms. Here we sequenced the sorghum transcriptome using Pacific Biosciences single-molecule real-time long-read isoform sequencing and developed a pipeline called TAPIS (Transcriptome Analysis Pipeline for Isoform Sequencing) to identify full-length splice isoforms and APA sites. Our analysis reveals transcriptome-wide full-length isoforms at an unprecedented scale with over 11,000 novel splice isoforms. Additionally, we uncover APA of ∼11,000 expressed genes and more than 2,100 novel genes. These results greatly enhance sorghum gene annotations and aid in studying gene regulation in this important bioenergy crop. The TAPIS pipeline will serve as a useful tool to analyse Iso-Seq data from any organism. PMID:27339290

  17. Multi-species sequence comparison reveals conservation of ghrelin gene-derived splice variants encoding a truncated ghrelin peptide.

    PubMed

    Seim, Inge; Jeffery, Penny L; Thomas, Patrick B; Walpole, Carina M; Maugham, Michelle; Fung, Jenny N T; Yap, Pei-Yi; O'Keeffe, Angela J; Lai, John; Whiteside, Eliza J; Herington, Adrian C; Chopin, Lisa K

    2016-06-01

    The peptide hormone ghrelin is a potent orexigen produced predominantly in the stomach. It has a number of other biological actions, including roles in appetite stimulation, energy balance, the stimulation of growth hormone release and the regulation of cell proliferation. Recently, several ghrelin gene splice variants have been described. Here, we attempted to identify conserved alternative splicing of the ghrelin gene by cross-species sequence comparisons. We identified a novel human exon 2-deleted variant and provide preliminary evidence that this splice variant and in1-ghrelin encode a C-terminally truncated form of the ghrelin peptide, termed minighrelin. These variants are expressed in humans and mice, demonstrating conservation of alternative splicing spanning 90 million years. Minighrelin appears to have similar actions to full-length ghrelin, as treatment with exogenous minighrelin peptide stimulates appetite and feeding in mice. Forced expression of the exon 2-deleted preproghrelin variant mirrors the effect of the canonical preproghrelin, stimulating cell proliferation and migration in the PC3 prostate cancer cell line. This is the first study to characterise an exon 2-deleted preproghrelin variant and to demonstrate sequence conservation of ghrelin gene-derived splice variants that encode a truncated ghrelin peptide. This adds further impetus for studies into the alternative splicing of the ghrelin gene and the function of novel ghrelin peptides in vertebrates.

  18. Cloning and characterization of an alternative splicing transcript of the gene coding for human cytidine deaminase.

    PubMed

    Lisboa, Bianca Cristina Garcia; Machado, Tamara da Rocha; Pimenta, Daniel Carvalho; Han, Sang Won

    2007-02-01

    Human cytidine deaminase (HCD) catalyzes the deamination of cytidine or deoxycytidine to uridine or deoxyuridine, respectively. The genomic sequence of HCD spans 31 kb with 4 exons and several alternative splicing signals, but an alternative form of HCD has yet to be reported. Here we describe the cloning and characterization of a small form of HCD, HSCD, which is likely to be a product of alternative splicing of HCD. The alignment of DNA sequences shows that HSCD matches HCD in 2 parts, except for a deletion of 170 bp. Based on the HCD genome organization, exons 1 and 4 should be joined and all sequences of introns and exons 2 and 3 should be deleted by splicing. This alternative splicing shifted the reading frame of translation from the point of splicing. The estimated molecular mass is 9.8 kDa, and this value was confirmed by Western blot and mass spectroscopy after expressing the gene fused with glutathione S-transferase in the pGEX vector. The deletion and shift of the reading frame caused a loss of HCD activity, which was confirmed by enzyme assay and also with NIH3T3 cells modified to express HSCD and challenged against cytosine arabinoside. In this work we describe the identification and characterization of HSCD, which is the product of alternative splicing of the HCD gene.
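
    The reading-frame shift reported in this abstract is consistent with the deletion length alone, since 170 is not a multiple of 3. A minimal Python sketch of that arithmetic follows; the function name is illustrative and not from the paper, and it assumes the whole 170 bp deletion lies within coding sequence.

```python
def shifts_reading_frame(deleted_bp: int) -> bool:
    """Return True if removing `deleted_bp` coding nucleotides changes the reading frame."""
    # Codons are 3 nt long, so only deletions whose length is a multiple of 3 stay in frame.
    return deleted_bp % 3 != 0


# 170 % 3 == 2, so translation downstream of the exon 1 / exon 4 junction is out of frame.
print(shifts_reading_frame(170))
```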

  19. Precise Maps of RNA Polymerase Reveal How Promoters Direct Initiation and Pausing

    PubMed Central

    Kwak, Hojoong; Fuda, Nicholas J.; Core, Leighton J.; Lis, John T.

    2014-01-01

    Transcription regulation occurs frequently through promoter-associated pausing of RNA polymerase II (Pol II). We developed a Precision nuclear Run-On and sequencing assay (PRO-seq) to map the genome-wide distribution of transcriptionally-engaged Pol II at base-pair resolution. Pol II accumulates immediately downstream of promoters, at intron-exon junctions that are efficiently used for splicing, and over 3' poly-adenylation sites. Focused analyses of promoters reveal that pausing is not fixed relative to initiation sites nor is it specified directly by the position of a particular core promoter element or the first nucleosome. Core promoter elements function beyond initiation, and when optimally positioned they act collectively to dictate the position and strength of pausing. We test this ‘Complex Interaction’ model with insertional mutagenesis of the Drosophila Hsp70 core promoter. PMID:23430654

  20. SEQassembly: A Practical Tools Program for Coding Sequences Splicing

    NASA Astrophysics Data System (ADS)

    Lee, Hongbin; Yang, Hang; Fu, Lei; Qin, Long; Li, Huili; He, Feng; Wang, Bo; Wu, Xiaoming

    A CDS (coding sequence) is the portion of an mRNA sequence that is composed of a number of exon sequence segments. Constructing the CDS is important for in-depth genetic analyses such as genotyping. A program for the MATLAB environment is presented that processes batches of sample sequences into coding segments under the guidance of reference exon models, and splices the coding segments from the same sample into a CDS according to the exon order given in a queue file. The program is useful in transcriptional polymorphism detection and gene function studies.
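
    The splicing step described above can be sketched as follows. This is an illustrative Python sketch, not the SEQassembly MATLAB code: the function name, the dictionary of per-sample segments, and the exon-order list standing in for the queue file are all assumptions.

```python
def assemble_cds(segments: dict, exon_order: list) -> str:
    """Join one sample's exon segments into a CDS, following the given exon order.

    segments   -- hypothetical mapping of exon name -> coding segment for one sample
    exon_order -- hypothetical list of exon names, standing in for the queue file
    """
    missing = [exon for exon in exon_order if exon not in segments]
    if missing:
        # Refuse to build a partial CDS if any expected exon segment is absent.
        raise KeyError(f"missing exon segments: {missing}")
    return "".join(segments[exon] for exon in exon_order)


sample = {"exon2": "GGG", "exon1": "ATG", "exon3": "TAA"}
print(assemble_cds(sample, ["exon1", "exon2", "exon3"]))
```

    Raising an error for a missing exon, rather than silently skipping it, keeps a truncated CDS from being mistaken for a polymorphism.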

  1. THE GRK4 SUBFAMILY OF G PROTEIN-COUPLED RECEPTOR KINASES: ALTERNATIVE SPLICING, GENE ORGANIZATION, AND SEQUENCE CONSERVATION

    EPA Science Inventory

    The GRK4 subfamily of G protein-coupled receptor kinases. Alternative splicing, gene organization, and sequence conservation.

    Premont RT, Macrae AD, Aparicio SA, Kendall HE, Welch JE, Lefkowitz RJ.

    Department of Medicine, Howard Hughes Medical Institute, Duke Univer...

  2. iSS-PseDNC: identifying splicing sites using pseudo dinucleotide composition.

    PubMed

    Chen, Wei; Feng, Peng-Mian; Lin, Hao; Chou, Kuo-Chen

    2014-01-01

    In eukaryotic genes, exons are generally interrupted by introns. Accurately removing introns and joining exons together are essential processes in eukaryotic gene expression. With the avalanche of genome sequences generated in the postgenomic age, it is highly desirable to develop automated methods for rapid and effective detection of splice sites that play important roles in gene structure annotation and even in RNA splicing. Although a series of computational methods were proposed for splice site identification, most of them neglected the intrinsic local structural properties. In the present study, a predictor called "iSS-PseDNC" was developed for identifying splice sites. In the new predictor, the sequences were formulated by a novel feature-vector called "pseudo dinucleotide composition" (PseDNC) into which six DNA local structural properties were incorporated. It was observed by the rigorous cross-validation tests on two benchmark datasets that the overall success rates achieved by iSS-PseDNC in identifying splice donor sites and splice acceptor sites were 85.45% and 87.73%, respectively. It is anticipated that iSS-PseDNC may become a useful tool for identifying splice sites and that the six DNA local structural properties described in this paper may provide novel insights for in-depth investigations into the mechanism of RNA splicing.
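
    To make the feature-vector idea concrete, the sketch below computes only the plain dinucleotide composition, that is, the 16 normalized dinucleotide frequencies of a sequence. It is a simplified illustration in Python: the additional terms that PseDNC derives from the six local structural properties are not reproduced here, and the function name is an assumption.

```python
from itertools import product

# The 16 dinucleotides in a fixed order: AA, AC, AG, AT, CA, ..., TT.
DINUCLEOTIDES = ["".join(pair) for pair in product("ACGT", repeat=2)]


def dinucleotide_composition(seq: str) -> list:
    """Return the 16 normalized dinucleotide frequencies of `seq`."""
    seq = seq.upper()
    counts = dict.fromkeys(DINUCLEOTIDES, 0)
    total = 0
    for i in range(len(seq) - 1):
        pair = seq[i:i + 2]
        if pair in counts:  # skip pairs containing ambiguous bases such as N
            counts[pair] += 1
            total += 1
    if total == 0:
        return [0.0] * len(DINUCLEOTIDES)
    return [counts[d] / total for d in DINUCLEOTIDES]


features = dinucleotide_composition("ACGT")
print(dict(zip(DINUCLEOTIDES, features)))
```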

  3. Control of calcitonin/calcitonin gene-related peptide pre-mRNA processing by constitutive intron and exon elements.

    PubMed Central

    Yeakley, J M; Hedjran, F; Morfin, J P; Merillat, N; Rosenfeld, M G; Emeson, R B

    1993-01-01

    The calcitonin/calcitonin gene-related peptide (CGRP) primary transcript is alternatively spliced in thyroid C cells and neurons, resulting in the tissue-specific production of calcitonin and CGRP mRNAs. Analyses of mutated calcitonin/CGRP transcription units in permanently transfected cell lines have indicated that alternative splicing is regulated by a differential capacity to utilize the calcitonin-specific splice acceptor. The analysis of an extensive series of mutations suggests that tissue-specific regulation of calcitonin mRNA production does not depend on the presence of a single, unique cis-active element but instead appears to be a consequence of suboptimal constitutive splicing signals. While only those mutations that altered constitutive splicing signals affected splice choices, the action of multiple regulatory sequences cannot be formally excluded. Further, we have identified a 13-nucleotide purine-rich element from a constitutive exon that, when placed in exon 4, entirely switches splice site usage in CGRP-producing cells. These data suggest that specific exon recruitment sequences, in combination with other constitutive elements, serve an important function in exon recognition. These results are consistent with the hypothesis that tissue-specific alternative splicing of the calcitonin/CGRP primary transcript is mediated by cell-specific differences in components of the constitutive splicing machinery. PMID:8413203

  4. Dwarfism with joint laxity in Friesian horses is associated with a splice site mutation in B4GALT7.

    PubMed

    Leegwater, Peter A; Vos-Loohuis, Manon; Ducro, Bart J; Boegheim, Iris J; van Steenbeek, Frank G; Nijman, Isaac J; Monroe, Glen R; Bastiaansen, John W M; Dibbits, Bert W; van de Goor, Leanne H; Hellinga, Ids; Back, Willem; Schurink, Anouk

    2016-10-28

    Inbreeding and population bottlenecks in the ancestry of Friesian horses have led to health issues such as dwarfism. The limbs of dwarfs are short and the ribs protrude inwards at the costochondral junction, while the head and back appear normal. A striking feature of the condition is the flexor tendon laxity that leads to hyperextension of the fetlock joints. The growth plates of dwarfs display disorganized and thickened chondrocyte columns. The aim of this study was to identify the gene defect that causes the recessively inherited trait in Friesian horses to understand the disease process at the molecular level. We have localized the genetic cause of the dwarfism phenotype by a genome wide approach to a 3 Mb region on the p-arm of equine chromosome 14. The DNA of two dwarfs and one control Friesian horse was sequenced completely and we identified the missense mutation ECA14:g.4535550C > T that cosegregated with the phenotype in all Friesians analyzed. The mutation leads to the amino acid substitution p.(Arg17Lys) of xylosylprotein beta 1,4-galactosyltransferase 7 encoded by B4GALT7. The protein is one of the enzymes that synthesize the tetrasaccharide linker between protein and glycosaminoglycan moieties of proteoglycans of the extracellular matrix. The mutation not only affects a conserved arginine codon but also the last nucleotide of the first exon of the gene and we show that it impedes splicing of the primary transcript in cultured fibroblasts from a heterozygous horse. As a result, the level of B4GALT7 mRNA in fibroblasts from a dwarf is only 2% compared to normal levels. Mutations in B4GALT7 in humans are associated with Ehlers-Danlos syndrome progeroid type 1 and Larsen of Reunion Island syndrome. Growth retardation and ligamentous laxity are common manifestations of these syndromes. We suggest that the identified mutation of equine B4GALT7 leads to the typical dwarfism phenotype in Friesian horses due to deficient splicing of transcripts of the gene. The mutated gene implicates the extracellular matrix in the regular organization of chondrocyte columns of the growth plate. Conservation of individual amino acids may not be necessary at the protein level but instead may reflect underlying conservation of nucleotide sequences that are required for efficient splicing.

  5. Analysis of the neuroligin 4Y gene in patients with autism.

    PubMed

    Yan, Jin; Feng, Jinong; Schroer, Richard; Li, Wenyan; Skinner, Cindy; Schwartz, Charles E; Cook, Edwin H; Sommer, Steve S

    2008-08-01

    Frameshift and missense mutations in the X-linked neuroligin 4 (NLGN4, MIM# 300427) and neuroligin 3 (NLGN3, MIM# 300336) genes have been identified in patients with autism, Asperger syndrome and mental retardation. We hypothesize that sequence variants in NLGN4Y are associated with autism or mental retardation. The coding sequences and splice junctions of the NLGN4Y gene were analyzed in 335 male samples (290 with autism and 45 with mental retardation). A total of 1.1 Mb of genomic DNA was sequenced. One missense variant, p.I679V, was identified in a patient with autism, as well as his father with learning disabilities. The I679 residue is highly conserved in three members of the neuroligin family. The absence of p.I679V in 2986 control Y chromosomes and the high similarity of NLGN4 and NLGN4Y are consistent with the hypothesis that p.I679V contributes to the etiology of autism. The presence of only one structural variant in our population of 335 males with autism/mental retardation, the unavailability of significant family cosegregation and an absence of functional assays are, however, important limitations of this study.

  6. Quantitation of normal CFTR mRNA in CF patients with splice-site mutations

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zhou, Z.; Olsen, J.C.; Silverman, L.M.

    Previously we identified two mutations in introns of the CFTR gene associated with partially active splice sites and unusual clinical phenotypes. One mutation in intron 19 (3849+10 kb C to T) is common in CF patients with normal sweat chloride values; an 84 bp sequence from intron 19, which contains a stop codon, is inserted between exon 19 and exon 20 in most nasal CFTR transcripts. The other mutation in intron 14B (2789+5 G to A) is associated with elevated sweat chloride levels, but mild pulmonary disease; exon 14B (38 bp) is spliced out of most nasal CFTR transcripts. The remaining CFTR cDNA sequences, other than the 84 bp insertion or exon 14B deletion, are identical to the published sequence. To correlate genotype and phenotype, we used quantitative RT-PCR to determine the levels of normally-spliced CFTR mRNA in nasal epithelia from these patients. CFTR cDNA was amplified (25 cycles) by using primers specific for normally-spliced species; γ-actin cDNA was amplified as a standard.

  7. CSReport: A New Computational Tool Designed for Automatic Analysis of Class Switch Recombination Junctions Sequenced by High-Throughput Sequencing.

    PubMed

    Boyer, François; Boutouil, Hend; Dalloul, Iman; Dalloul, Zeinab; Cook-Moreau, Jeanne; Aldigier, Jean-Claude; Carrion, Claire; Herve, Bastien; Scaon, Erwan; Cogné, Michel; Péron, Sophie

    2017-05-15

    B cells ensure humoral immune responses due to the production of Ag-specific memory B cells and Ab-secreting plasma cells. In secondary lymphoid organs, Ag-driven B cell activation induces terminal maturation and Ig isotype class switch (class switch recombination [CSR]). CSR creates a virtually unique IgH locus in every B cell clone by intrachromosomal recombination between two switch (S) regions upstream of each C region gene. Amount and structural features of CSR junctions reveal valuable information about the CSR mechanism, and analysis of CSR junctions is useful in basic and clinical research studies of B cell functions. To provide an automated tool able to analyze large data sets of CSR junction sequences produced by high-throughput sequencing (HTS), we designed CSReport, a software program dedicated to support analysis of CSR recombination junctions sequenced with a HTS-based protocol (Ion Torrent technology). CSReport was assessed using simulated data sets of CSR junctions and then used for analysis of Sμ-Sα and Sμ-Sγ1 junctions from CH12F3 cells and primary murine B cells, respectively. CSReport identifies junction segment breakpoints on reference sequences and junction structure (blunt-ended junctions or junctions with insertions or microhomology). Besides the ability to analyze unprecedentedly large libraries of junction sequences, CSReport will provide a unified framework for CSR junction studies. Our results show that CSReport is an accurate tool for analysis of sequences from our HTS-based protocol for CSR junctions, thereby facilitating and accelerating their study. Copyright © 2017 by The American Association of Immunologists, Inc.
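
    The three junction structures named in this abstract (blunt-ended, insertion, microhomology) can be told apart from where the two aligned segments end and begin on a read. The Python sketch below illustrates one simple way to do that; it is not CSReport's actual algorithm, and the function name and coordinate convention are assumptions.

```python
def classify_junction(donor_end: int, acceptor_start: int) -> tuple:
    """Classify a junction from read coordinates.

    donor_end      -- read position just past the last base aligned to the donor region
    acceptor_start -- read position of the first base aligned to the acceptor region
    Both are 0-based positions on the same read (an assumed convention).
    """
    gap = acceptor_start - donor_end
    if gap == 0:
        return ("blunt", 0)          # the two alignments abut exactly
    if gap > 0:
        return ("insertion", gap)    # unaligned bases sit between the two segments
    return ("microhomology", -gap)   # bases that align to both segments


print(classify_junction(50, 50))
print(classify_junction(50, 53))
print(classify_junction(50, 46))
```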

  8. Women with steroid 5 alpha-reductase 2 deficiency have normal concentrations of plasma 5 alpha-dihydroprogesterone during the luteal phase.

    PubMed

    Milewich, L; Mendonca, B B; Arnhold, I; Wallace, A M; Donaldson, M D; Wilson, J D; Russell, D W

    1995-11-01

    Steroid 5 alpha-reductase 2 deficiency has been identified in two adult women from unrelated families, one a homozygote and the other a compound heterozygote. The homozygote carries the G183S mutation and is the sister of an affected male; the compound heterozygote (R246W/splice junction abnormality) is married to a heterozygote (splice junction abnormality) and is the mother of two compound heterozygotes and two homozygotes. The fact that these two women are the mothers of seven children and appear to be endocrinologically normal confirms the previous deduction that this disorder is not manifest in women. Concentrations of plasma 5 alpha-dihydroprogesterone were normal in these two women during the luteal phase; this finding implies that circulating 5 alpha-dihydroprogesterone in women is derived principally from the steroid 5 alpha-reductase 1 isoenzyme and leaves unresolved the question of whether 5 alpha-dihydroprogesterone plays a physiological role in women.

  9. Long-read sequencing of nascent RNA reveals coupling among RNA processing events.

    PubMed

    Herzel, Lydia; Straube, Korinna; Neugebauer, Karla M

    2018-06-14

    Pre-mRNA splicing is accomplished by the spliceosome, a megadalton complex that assembles de novo on each intron. Because spliceosome assembly and catalysis occur cotranscriptionally, we hypothesized that introns are removed in the order of their transcription in genomes dominated by constitutive splicing. Remarkably little is known about splicing order and the regulatory potential of nascent transcript remodeling by splicing, due to the limitations of existing methods that focus on analysis of mature splicing products (mRNAs) rather than substrates and intermediates. Here, we overcome this obstacle through long-read RNA sequencing of nascent, multi-intron transcripts in the fission yeast Schizosaccharomyces pombe. Most multi-intron transcripts were fully spliced, consistent with rapid cotranscriptional splicing. However, an unexpectedly high proportion of transcripts were either fully spliced or fully unspliced, suggesting that splicing of any given intron is dependent on the splicing status of other introns in the transcript. Supporting this, mild inhibition of splicing by a temperature-sensitive mutation in prp2, the homolog of vertebrate U2AF65, increased the frequency of fully unspliced transcripts. Importantly, fully unspliced transcripts displayed transcriptional read-through at the polyA site and were degraded cotranscriptionally by the nuclear exosome. Finally, we show that cellular mRNA levels were reduced in genes with a high number of unspliced nascent transcripts during caffeine treatment, showing regulatory significance of cotranscriptional splicing. Therefore, overall splicing of individual nascent transcripts, 3' end formation, and mRNA half-life depend on the splicing status of neighboring introns, suggesting crosstalk among spliceosomes and the polyA cleavage machinery during transcription elongation. © 2018 Herzel et al.; Published by Cold Spring Harbor Laboratory Press.

  10. Insertion of part of an intron into the 5' untranslated region of a Caenorhabditis elegans gene converts it into a trans-spliced gene

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Conrad, R.; Thomas, J.; Spieth, J.

    In nematodes, the RNA products of some genes are trans-spliced to a 22-nucleotide spliced leader (SL), while the RNA products of other genes are not. In Caenorhabditis elegans, there are two SLs, SL1 and SL2, donated by two distinct small nuclear ribonucleoprotein particles in a process functionally quite similar to nuclear intron removal. The authors demonstrate here that it is possible to convert a non-trans-spliced gene into a trans-spliced gene by placement of an intron missing only the 5' splice site into the 5' untranslated region. Stable transgenic strains were isolated expressing a gene in which 69 nucleotides of a vit-5 intron, including the 3' splice site, were inserted into the 5' untranslated region of a vit-2/vit-6 fusion gene. The RNA product of this gene was examined by primer extension and PCR amplification. Although the vit-2/vit-6 transgene product is not normally trans-spliced, the majority of transcripts from this altered gene were trans-spliced to SL1. They termed the region of a trans-spliced mRNA precursor between the 5' end and the first 3' splice site an 'outrun'. The results suggest that if a transcript begins with intronlike sequence followed by a 3' splice site, this alone may constitute an outrun and be sufficient to demarcate a transcript as a trans-splice acceptor. These findings leave open the possibility that specific sequences are required to increase the efficiency of trans-splicing.

  11. The Acheta domesticus Densovirus, Isolated from the European House Cricket, Has Evolved an Expression Strategy Unique among Parvoviruses

    PubMed Central

    Liu, Kaiyu; Li, Yi; Jousset, Françoise-Xavière; Zadori, Zoltan; Szelei, Jozsef; Yu, Qian; Pham, Hanh Thi; Lépine, François; Bergoin, Max; Tijssen, Peter

    2011-01-01

    The Acheta domesticus densovirus (AdDNV), isolated from crickets, has been endemic in Europe for at least 35 years. Severe epizootics have also been observed in American commercial rearings since 2009 and 2010. The AdDNV genome was cloned and sequenced for this study. The transcription map showed that splicing occurred in both the nonstructural (NS) and capsid protein (VP) multicistronic RNAs. The splicing pattern of NS mRNA predicted 3 nonstructural proteins (NS1 [576 codons], NS2 [286 codons], and NS3 [213 codons]). The VP gene cassette contained two VP open reading frames (ORFs), of 597 (ORF-A) and 268 (ORF-B) codons. The VP2 sequence was shown by N-terminal Edman degradation and mass spectrometry to correspond with ORF-A. Mass spectrometry, sequencing, and Western blotting of baculovirus-expressed VPs versus native structural proteins demonstrated that the VP1 structural protein was generated by joining ORF-A and -B via splicing (splice II), eliminating the N terminus of VP2. This splice resulted in a nested set of VP1 (816 codons), VP3 (467 codons), and VP4 (429 codons) structural proteins. In contrast, the two splices within ORF-B (Ia and Ib) removed the donor site of intron II and resulted in VP2, VP3, and VP4 expression. ORF-B may also code for several nonstructural proteins, of 268, 233, and 158 codons. The small ORF-B contains the coding sequence for a phospholipase A2 motif found in VP1, which was shown previously to be critical for cellular uptake of the virus. These splicing features are unique among parvoviruses and define a new genus of ambisense densoviruses. PMID:21775445

  12. Evolutionary pattern of mutation in the factor IX genes of great apes: How does it compare to the pattern of recent germline mutation in patients with hemophilia B?

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Grouse, L.H.; Ketterling, R.P.; Sommer, S.S.

    Most mutations causing hemophilia B have arisen within the past 150 years. By correcting for multiple biases, the underlying rates of spontaneous germline mutation have been estimated in the factor IX gene. From these rates, an underlying pattern of mutation has emerged. To determine if this pattern compares to an underlying pattern found in the great apes, sequence changes were determined in intronic regions of the factor IX gene. The following species were studied: Gorilla gorilla, Pan troglodytes (chimpanzee), Pongo pygmaeus (orangutan) and Homo sapiens. Intronic sequences at least 200 bp from a splice junction were randomly chosen, amplified by cross-species PCR, and sequenced. These regions are expected to be subject to little if any selective pressure. Early diverged species of Old World monkeys were also studied to help determine the direction of mutational changes. A total of 62 sequence changes were observed. Initial data suggest that the average pattern since evolution of the great apes has a paucity of transitions at CpG dinucleotides and an excess of microinsertions to microdeletions when compared to the pattern observed in humans during the past 150 years (p<.05). A larger study is in progress to confirm these results.

  13. Rapid and efficient cDNA library screening by self-ligation of inverse PCR products (SLIP).

    PubMed

    Hoskins, Roger A; Stapleton, Mark; George, Reed A; Yu, Charles; Wan, Kenneth H; Carlson, Joseph W; Celniker, Susan E

    2005-12-02

    cDNA cloning is a central technology in molecular biology. cDNA sequences are used to determine mRNA transcript structures, including splice junctions, open reading frames (ORFs) and 5'- and 3'-untranslated regions (UTRs). cDNA clones are valuable reagents for functional studies of genes and proteins. Expressed Sequence Tag (EST) sequencing is the method of choice for recovering cDNAs representing many of the transcripts encoded in a eukaryotic genome. However, EST sequencing samples a cDNA library at random, and it recovers transcripts with low expression levels inefficiently. We describe a PCR-based method for directed screening of plasmid cDNA libraries. We demonstrate its utility in a screen of libraries used in our Drosophila EST projects for 153 transcription factor genes that were not represented by full-length cDNA clones in our Drosophila Gene Collection. We recovered high-quality, full-length cDNAs for 72 genes and variously compromised clones for an additional 32 genes. The method can be used at any scale, from the isolation of cDNA clones for a particular gene of interest, to the improvement of large gene collections in model organisms and the human. Finally, we discuss the relative merits of directed cDNA library screening and RT-PCR approaches.

  14. On the path to genetic novelties: insights from programmed DNA elimination and RNA splicing.

    PubMed

    Catania, Francesco; Schmitz, Jürgen

    2015-01-01

    Understanding how genetic novelties arise is a central goal of evolutionary biology. To this end, programmed DNA elimination and RNA splicing deserve special consideration. While programmed DNA elimination reshapes genomes by eliminating chromatin during organismal development, RNA splicing rearranges genetic messages by removing intronic regions during transcription. Small RNAs help to mediate this class of sequence reorganization, which is not error-free. It is this imperfection that makes programmed DNA elimination and RNA splicing excellent candidates for generating evolutionary novelties. Leveraging a number of these two processes' mechanistic and evolutionary properties, which have been uncovered over the past years, we present recently proposed models and empirical evidence for how splicing can shape the structure of protein-coding genes in eukaryotes. We also chronicle a number of intriguing similarities between the processes of programmed DNA elimination and RNA splicing, and highlight the role that the variation in the population-genetic environment may play in shaping their target sequences. © 2015 Wiley Periodicals, Inc.

  15. A Dentin Sialophosphoprotein Mutation That Partially Disrupts a Splice Acceptor Site Causes Type II Dentin Dysplasia

    PubMed Central

    Lee, Sook-Kyung; Hu, Jan C.-C.; Lee, Kyung-Eun; Simmer, James P.; Kim, Jung-Wook

    2009-01-01

    The dentin sialophosphoprotein (DSPP) gene on chromosome 4q21.3 encodes the major noncollagenous protein in tooth dentin. DSPP mutations are the principal cause of dentin dysplasia type II, dentinogenesis imperfecta type II, and dentinogenesis imperfecta type III. We have identified a DSPP splice junction mutation (IVS2-6T>G) in a family with dentin dysplasia type II. The primary dentition is discolored brown with severe attrition. The mildly discolored permanent dentition has thistle-shaped pulp chambers, pulp stones, and eventual pulp obliteration. The mutation is in the sixth nucleotide from the end of intron 2, perfectly segregates with the disease phenotype, and is absent in 200 normal control chromosomes. An in vitro splicing assay shows that pre-mRNA splicing of the mutant allele generates wild-type mRNA and mRNA lacking exon 3 in approximately equal amounts. Skipping exon 3 might interfere with signal peptide cleavage, causing endoplasmic reticulum stress, and also reduce DSPP secretion, leading to haploinsufficiency. PMID:19026876

  16. Multiple cis-acting sequence elements are required for efficient splicing of simian virus 40 small-t antigen pre-mRNA.

    PubMed Central

    Fu, X Y; Colgan, J D; Manley, J L

    1988-01-01

    We have determined the effects of a number of mutations in the small-t antigen mRNA intron on the alternative splicing pattern of the simian virus 40 early transcript. Expansion of the distance separating the small-t pre-mRNA lariat branch point and the shared large T-small t 3' splice site from 18 to 29 nucleotides (nt) resulted in a relative enhancement of small-t splicing in vivo. This finding, coupled with the observation that large-T pre-RNA splicing in vitro was not affected by this expansion, suggests that small-t splicing is specifically constrained by a short branch point-3' splice site distance. Similarly, the distance separating the 5' splice site and branch point (48 nt) was found to be at or near a minimum for small-t splicing, because deletions in this region as small as 2 nt dramatically reduced the ratio of small-t to large-T mRNA that accumulated in transfected cells. Finally, a specific sequence within the small-t intron, encompassing the upstream branch sites used in large-T splicing, was found to be an important element in the cell-specific pattern of early alternative splicing. Substitutions within this region reduced the ratio of small-t to large-T mRNA produced in HeLa cells but had only minor effects in human 293 cells. PMID:2851720

  17. QUANTIFYING ALTERNATIVE SPLICING FROM PAIRED-END RNA-SEQUENCING DATA.

    PubMed

    Rossell, David; Stephan-Otto Attolini, Camille; Kroiss, Manuel; Stöcker, Almond

    2014-03-01

    RNA-sequencing has revolutionized biomedical research and, in particular, our ability to study gene alternative splicing. The problem has important implications for human health, as alternative splicing may be involved in malfunctions at the cellular level and multiple diseases. However, the high-dimensional nature of the data and the existence of experimental biases pose serious data analysis challenges. We find that the standard data summaries used to study alternative splicing are severely limited, as they ignore a substantial amount of valuable information. Current data analysis methods are based on such summaries and are hence sub-optimal. Further, they have limited flexibility in accounting for technical biases. We propose novel data summaries and a Bayesian modeling framework that overcome these limitations and determine biases in a non-parametric, highly flexible manner. These summaries adapt naturally to the rapid improvements in sequencing technology. We provide efficient point estimates and uncertainty assessments. The approach allows the study of alternative splicing patterns for individual samples and can also be the basis for downstream analyses. We found a severalfold improvement in estimation mean square error compared with popular approaches in simulations, and substantially higher consistency between replicates in experimental data. Our findings indicate the need for adjusting the routine summarization and analysis of alternative splicing RNA-seq studies. We provide a software implementation in the R package casper.

  18. An UPF3-based nonsense-mediated decay in Paramecium.

    PubMed

    Contreras, Julia; Begley, Victoria; Macias, Sandra; Villalobo, Eduardo

    2014-12-01

    Nonsense-mediated decay recognises mRNAs containing premature termination codons. One of its components, UPF3, links nonsense-mediated decay and splicing through its binding to the exon junction complex. In protists UPF3 has not been identified yet. We report that Paramecium tetraurelia bears an UPF3 gene and that it has a role in nonsense-mediated decay. Interestingly, the identified UPF3 has not conserved the essential amino acids required to bind the exon junction complex. However, our data indicate that this ciliate bears genes coding for core proteins of the exon junction complex. Copyright © 2014 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.

  19. HS3D, A Dataset of Homo Sapiens Splice Regions, and its Extraction Procedure from a Major Public Database

    NASA Astrophysics Data System (ADS)

    Pollastro, Pasquale; Rampone, Salvatore

    The aim of this work is to describe a cleaning procedure of GenBank data, producing material to train and to assess the prediction accuracy of computational approaches for gene characterization. A procedure (GenBank2HS3D) has been defined, producing a dataset (HS3D - Homo Sapiens Splice Sites Dataset) of Homo Sapiens Splice regions extracted from GenBank (Rel.123 at this time). It selects, from the complete GenBank Primate Division, entries of Human Nuclear DNA according to several assessed criteria; then it extracts exons and introns from these entries (currently 4523 + 3802). Donor and acceptor sites are then extracted as windows of 140 nucleotides around each splice site (3799 + 3799). After discarding windows not including canonical GT-AG junctions (65 + 74), including insufficient data (not enough material for a 140 nucleotide window) (686 + 589), including non-ACGT bases (29 + 30), or being redundant (218 + 226), the remaining windows (2796 + 2880) are reported in the dataset. Finally, windows of false splice sites are selected by searching canonical GT-AG pairs in non-splicing positions (271 937 + 332 296). The false sites in a range +/- 60 from a true splice site are marked as proximal. HS3D, release 1.2 at this time, is available at the Web server of the University of Sannio: http://www.sci.unisannio.it/docenti/rampone/.
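
    The window-cleaning filters listed in this abstract can be sketched as below. This is a hedged Python illustration, not the GenBank2HS3D procedure itself: the function name, the input format, and the placement of the canonical dinucleotide at the midpoint of the 140-nucleotide window are assumptions.

```python
def clean_windows(candidates, dinucleotide, size=140):
    """Extract fixed-size windows around splice sites and apply the four filters.

    candidates   -- iterable of (sequence, site) pairs; `site` is the 0-based index
                    of the first base of the splice-site dinucleotide (assumed layout)
    dinucleotide -- "GT" for donor sites or "AG" for acceptor sites
    """
    half = size // 2
    kept, seen = [], set()
    for seq, site in candidates:
        start, end = site - half, site + half
        if start < 0 or end > len(seq):
            continue  # insufficient data for a full window
        window = seq[start:end].upper()
        if window[half:half + 2] != dinucleotide:
            continue  # not a canonical junction
        if set(window) - set("ACGT"):
            continue  # contains non-ACGT bases
        if window in seen:
            continue  # redundant window
        seen.add(window)
        kept.append(window)
    return kept


donor = "A" * 70 + "GT" + "C" * 68
print(len(clean_windows([(donor, 70), (donor, 70), ("ACGT", 2)], "GT")))
```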

  20. TSUNAMI: an antisense method to phenocopy splicing-associated diseases in animals

    PubMed Central

    Sahashi, Kentaro; Hua, Yimin; Ling, Karen K.Y.; Hung, Gene; Rigo, Frank; Horev, Guy; Katsuno, Masahisa; Sobue, Gen; Ko, Chien-Ping; Bennett, C. Frank; Krainer, Adrian R.

    2012-01-01

    Antisense oligonucleotides (ASOs) are versatile molecules that can be designed to specifically alter splicing patterns of target pre-mRNAs. Here we exploit this feature to phenocopy a genetic disease. Spinal muscular atrophy (SMA) is a motor neuron disease caused by loss-of-function mutations in the SMN1 gene. The related SMN2 gene expresses suboptimal levels of functional SMN protein due to alternative splicing that skips exon 7; correcting this defect—e.g., with ASOs—is a promising therapeutic approach. We describe the use of ASOs that exacerbate SMN2 missplicing and phenocopy SMA in a dose-dependent manner when administered to transgenic Smn−/− mice. Intracerebroventricular ASO injection in neonatal mice recapitulates SMA-like progressive motor dysfunction, growth impairment, and shortened life span, with α-motor neuron loss and abnormal neuromuscular junctions. These SMA-like phenotypes are prevented by a therapeutic ASO that restores correct SMN2 splicing. We uncovered starvation-induced splicing changes, particularly in SMN2, which likely accelerate disease progression. These results constitute proof of principle that ASOs designed to cause sustained splicing defects can be used to induce pathogenesis and rapidly and accurately model splicing-associated diseases in animals. This approach allows the dissection of pathogenesis mechanisms, including spatial and temporal features of disease onset and progression, as well as testing of candidate therapeutics. PMID:22895255

  1. Novel Junction-specific and Quantifiable In Situ Detection of AR-V7 and its Clinical Correlates in Metastatic Castration-resistant Prostate Cancer.

    PubMed

    Zhu, Yezi; Sharp, Adam; Anderson, Courtney M; Silberstein, John L; Taylor, Maritza; Lu, Changxue; Zhao, Pei; De Marzo, Angelo M; Antonarakis, Emmanuel S; Wang, Mindy; Wu, Xingyong; Luo, Yuling; Su, Nan; Nava Rodrigues, Daniel; Figueiredo, Ines; Welti, Jonathan; Park, Emily; Ma, Xiao-Jun; Coleman, Ilsa; Morrissey, Colm; Plymate, Stephen R; Nelson, Peter S; de Bono, Johann S; Luo, Jun

    2018-05-01

    Androgen receptor splice variant 7 (AR-V7) has been implicated in resistance to abiraterone and enzalutamide treatment in men with metastatic castration-resistant prostate cancer (mCRPC). Tissue- or cell-based in situ detection of AR-V7, however, has been limited by lack of specificity. We aimed to address current limitations in precision measurement of AR-V7 by developing a novel junction-specific AR-V7 RNA in situ hybridization (RISH) assay compatible with automated quantification. We designed a RISH method to visualize single splice junctions in cells and tissue. Using the validated assay for junction-specific detection of the full-length AR (AR-FL) and AR-V7, we generated quantitative data, blinded to clinical data, for 63 prostate tumor biopsies. We evaluated clinical correlates of AR-FL/AR-V7 measurements, including association with prostate-specific antigen progression-free survival (PSA-PFS) and clinical and radiographic progression-free survival (PFS), in a subset of patients starting treatment with abiraterone or enzalutamide following biopsy. Quantitative AR-FL/AR-V7 data were generated from 56 of the 63 (88.9%) biopsy specimens examined, of which 44 were mCRPC biopsies. Positive AR-V7 signals were detected in 34.1% (15/44) of mCRPC specimens, all of which also co-expressed AR-FL. The median AR-V7/AR-FL ratio was 11.9% (range 2.7-30.3%). Positive detection of AR-V7 was correlated with indicators of high disease burden at baseline. Among the 25 CRPC biopsies collected before treatment with abiraterone or enzalutamide, positive AR-V7 detection, but not higher AR-FL, was significantly associated with shorter PSA-PFS (hazard ratio 2.789, 95% confidence interval 1.12-6.95; p=0.0081). We report for the first time a RISH method for highly specific and quantifiable detection of splice junctions, allowing further characterization of AR-V7 and its clinical significance. Higher AR-V7 levels detected and quantified using a novel method were associated with poorer response to abiraterone or enzalutamide in prostate cancer. Copyright © 2017 European Association of Urology. Published by Elsevier B.V. All rights reserved.

  2. Possibility of cytoplasmic pre-tRNA splicing: the yeast tRNA splicing endonuclease mainly localizes on the mitochondria.

    PubMed

    Yoshihisa, Tohru; Yunoki-Esaki, Kaori; Ohshima, Chie; Tanaka, Nobuyuki; Endo, Toshiya

    2003-08-01

    Pre-tRNA splicing has been believed to occur in the nucleus. In yeast, the tRNA splicing endonuclease that cleaves the exon-intron junctions of pre-tRNAs consists of Sen54p, Sen2p, Sen34p, and Sen15p and was thought to be an integral membrane protein of the inner nuclear envelope. Here we show that the majority of Sen2p, Sen54p, and the endonuclease activity are not localized in the nucleus, but on the mitochondrial surface. The endonuclease is peripherally associated with the cytosolic surface of the outer mitochondrial membrane. A Sen54p derivative artificially fixed on the mitochondria as an integral membrane protein can functionally replace the authentic Sen54p, whereas mutant proteins defective in mitochondrial localization are not fully active. sen2 mutant cells accumulate unspliced pre-tRNAs in the cytosol under the restrictive conditions, and this export of the pre-tRNAs partly depends on Los1p, yeast exportin-t. It is difficult to explain these results from the view of tRNA splicing in the nucleus. We rather propose a new possibility that tRNA splicing occurs on the mitochondrial surface in yeast.

  3. Detection of Splice Sites Using Support Vector Machine

    NASA Astrophysics Data System (ADS)

    Varadwaj, Pritish; Purohit, Neetesh; Arora, Bhumika

    Automatic identification and annotation of the exon and intron regions of genes from DNA sequences has been an important research area in computational biology. Several approaches, viz. Hidden Markov Models (HMM), Artificial Intelligence (AI) based machine learning and Digital Signal Processing (DSP) techniques, have been used extensively and independently by various researchers to address this challenging task. In this work, we propose a Support Vector Machine based kernel learning approach for detection of splice sites (the exon-intron boundaries) in a gene. Electron-Ion Interaction Potential (EIIP) values of nucleotides have been used for mapping character sequences to corresponding numeric sequences. A Radial Basis Function (RBF) kernel SVM is trained using the EIIP numeric sequences. It was then tested on a test gene dataset for detection of splice sites by shifting a window (of 12 residues). The window size and various important parameters of the SVM kernel have been optimized for better accuracy. Receiver Operating Characteristic (ROC) curves have been used to display the sensitivity of the classifier, and results showed 94.82% accuracy for splice site detection on the test dataset.
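
    The numeric encoding and windowing described here can be illustrated with a short sketch. It covers only the feature construction and the RBF kernel function, not SVM training; the EIIP values are the commonly cited ones for DNA nucleotides, while the function names and the default `gamma` are assumptions made for illustration and are not taken from the paper.

```python
import math
from typing import List, Sequence

# Commonly cited EIIP values for the four DNA nucleotides.
EIIP = {"A": 0.1260, "C": 0.1340, "G": 0.0806, "T": 0.1335}
WINDOW = 12  # window length mentioned in the abstract


def encode(seq: str) -> List[float]:
    """Map a DNA string to its EIIP numeric sequence."""
    return [EIIP[base] for base in seq.upper()]


def sliding_windows(values: Sequence[float], size: int = WINDOW) -> List[Sequence[float]]:
    """Shift a fixed-size window along the sequence one residue at a time."""
    return [values[i:i + size] for i in range(len(values) - size + 1)]


def rbf_kernel(x: Sequence[float], y: Sequence[float], gamma: float = 1.0) -> float:
    """RBF kernel exp(-gamma * ||x - y||^2); gamma here is a placeholder value."""
    squared_distance = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * squared_distance)
```

    Each window returned by `sliding_windows(encode(seq))` would be one feature vector handed to an RBF-kernel classifier; the kernel width and window size are the kind of parameters the abstract reports tuning.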

  4. Modelling reveals kinetic advantages of co-transcriptional splicing.

    PubMed

    Aitken, Stuart; Alexander, Ross D; Beggs, Jean D

    2011-10-01

    Messenger RNA splicing is an essential and complex process for the removal of intron sequences. Whereas the composition of the splicing machinery is mostly known, the kinetics of splicing, the catalytic activity of splicing factors and the interdependency of transcription, splicing and mRNA 3' end formation are less well understood. We propose a stochastic model of splicing kinetics that explains data obtained from high-resolution kinetic analyses of transcription, splicing and 3' end formation during induction of an intron-containing reporter gene in budding yeast. Modelling reveals co-transcriptional splicing to be the most probable and most efficient splicing pathway for the reporter transcripts, due in part to a positive feedback mechanism for co-transcriptional second step splicing. Model comparison is used to assess the alternative representations of reactions. Modelling also indicates the functional coupling of transcription and splicing, because both the rate of initiation of transcription and the probability that step one of splicing occurs co-transcriptionally are reduced, when the second step of splicing is abolished in a mutant reporter.
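
    To make the idea of a stochastic kinetic model concrete, the sketch below runs a Gillespie-style simulation of a generic two-step reaction (unspliced, intermediate, spliced). It is not the authors' model: the species, the first-order rate laws, and the rate constants `k1` and `k2` are all hypothetical choices made purely for illustration, and transcription and 3' end formation are left out.

```python
import random
from typing import List, Tuple

State = Tuple[float, int, int, int]  # (time, unspliced, intermediate, spliced)


def gillespie_two_step(n_pre: int, k1: float, k2: float,
                       t_end: float, seed: int = 0) -> List[State]:
    """Simulate unspliced -> intermediate -> spliced with first-order rates.

    k1 and k2 are hypothetical per-molecule rate constants for the first and
    second step; the trajectory stops at t_end or when nothing can react.
    """
    rng = random.Random(seed)
    t = 0.0
    pre, mid, done = n_pre, 0, 0
    trajectory: List[State] = [(t, pre, mid, done)]
    while True:
        a1 = k1 * pre   # propensity of step one
        a2 = k2 * mid   # propensity of step two
        total = a1 + a2
        if total == 0.0:
            break
        t += rng.expovariate(total)  # exponential waiting time to next event
        if t > t_end:
            break
        if rng.random() < a1 / total:  # pick the reaction in proportion to its propensity
            pre -= 1
            mid += 1
        else:
            mid -= 1
            done += 1
        trajectory.append((t, pre, mid, done))
    return trajectory
```

    Repeating the simulation over many seeds and comparing the distribution of completion times under alternative reaction schemes is the general flavour of model comparison, although the study's actual reaction set is richer than this.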

  5. Jannovar: a java library for exome annotation.

    PubMed

    Jäger, Marten; Wang, Kai; Bauer, Sebastian; Smedley, Damian; Krawitz, Peter; Robinson, Peter N

    2014-05-01

    Transcript-based annotation and pedigree analysis are two basic steps in the computational analysis of whole-exome sequencing experiments in genetic diagnostics and disease-gene discovery projects. Here, we present Jannovar, a stand-alone Java application as well as a Java library designed to be used in larger software frameworks for exome and genome analysis. Jannovar uses an interval tree to identify all transcripts affected by a given variant, and provides Human Genome Variation Society-compliant annotations both for variants affecting coding sequences and splice junctions as well as untranslated regions and noncoding RNA transcripts. Jannovar can also perform family-based pedigree analysis with Variant Call Format (VCF) files with data from members of a family segregating a Mendelian disorder. Using a desktop computer, Jannovar requires a few seconds to annotate a typical VCF file with exome data. Jannovar is freely available under the BSD2 license. Source code as well as the Java application and library file can be downloaded from http://compbio.charite.de (with tutorial) and https://github.com/charite/jannovar. © 2014 WILEY PERIODICALS, INC.
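
    The interval-tree lookup mentioned in this abstract can be illustrated with a small centred interval tree. Jannovar itself is a Java library, and this Python sketch does not reproduce its implementation or API: the class name, the closed-coordinate convention, and the transcript identifiers are hypothetical.

```python
from typing import List, Optional, Tuple

Interval = Tuple[int, int, str]  # (start, end, transcript id), closed coordinates


class IntervalNode:
    """Centred interval tree over a non-empty list of intervals."""

    def __init__(self, intervals: List[Interval]):
        points = sorted(p for s, e, _ in intervals for p in (s, e))
        self.center = points[len(points) // 2]
        here = [iv for iv in intervals if iv[0] <= self.center <= iv[1]]
        left = [iv for iv in intervals if iv[1] < self.center]
        right = [iv for iv in intervals if iv[0] > self.center]
        # Intervals spanning the centre, sorted two ways for early exit.
        self.by_start = sorted(here, key=lambda iv: iv[0])
        self.by_end = sorted(here, key=lambda iv: iv[1], reverse=True)
        self.left: Optional["IntervalNode"] = IntervalNode(left) if left else None
        self.right: Optional["IntervalNode"] = IntervalNode(right) if right else None

    def query(self, pos: int) -> List[str]:
        """Return ids of all intervals that contain position `pos`."""
        hits: List[str] = []
        if pos <= self.center:
            for start, _, name in self.by_start:
                if start > pos:
                    break
                hits.append(name)
            if pos < self.center and self.left is not None:
                hits += self.left.query(pos)
        else:
            for _, end, name in self.by_end:
                if end < pos:
                    break
                hits.append(name)
            if self.right is not None:
                hits += self.right.query(pos)
        return hits
```

    For example, building `IntervalNode([(100, 500, "tx1"), (300, 800, "tx2")])` and calling `query(400)` returns both transcript ids, which is the kind of lookup needed before a variant can be annotated against each affected transcript.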

  6. SL2-like spliced leader RNAs in the basal nematode Prionchulus punctatus: New insight into the evolution of nematode SL2 RNAs.

    PubMed

    Harrison, Neale; Kalbfleisch, Andreas; Connolly, Bernadette; Pettitt, Jonathan; Müller, Berndt

    2010-08-01

    Spliced-leader (SL) trans-splicing has been found in all molecularly characterized nematode species to date, and it is likely to be a nematode synapomorphy. Most information regarding SL trans-splicing has come from the study of nematodes from a single monophyletic group, the Rhabditida, all of which employ SL RNAs that are identical to, or variants of, the SL1 RNA first characterized in Caenorhabditis elegans. In contrast, the more distantly related Trichinella spiralis, belonging to the subclass Dorylaimia, utilizes a distinct set of SL RNAs that display considerable sequence diversity. To investigate whether this is true of other members of the Dorylaimia, we have characterized SL RNAs from Prionchulus punctatus. Surprisingly, this revealed the presence of a set of SLs that show clear sequence similarity to the SL2 family of spliced leaders, which have previously only been found within the rhabditine group (which includes C. elegans). Expression of one of the P. punctatus SL RNAs in C. elegans reveals that it can compete specifically with the endogenous C. elegans SL2 spliced leaders, being spliced to the pre-mRNAs derived from downstream genes in operons, but does not compete with the SL1 spliced leaders. This discovery raises the possibility that SL2-like spliced leaders were present in the last common ancestor of the nematode phylum.

  7. Experimental Assessment of Splicing Variants Using Expression Minigenes and Comparison with In Silico Predictions

    PubMed Central

    Sharma, Neeraj; Sosnay, Patrick R.; Ramalho, Anabela S.; Douville, Christopher; Franca, Arianna; Gottschalk, Laura B.; Park, Jeenah; Lee, Melissa; Vecchio-Pagan, Briana; Raraigh, Karen S.; Amaral, Margarida D.; Karchin, Rachel; Cutting, Garry R.

    2015-01-01

    Assessment of the functional consequences of variants near splice sites is a major challenge in the diagnostic laboratory. To address this issue, we created expression minigenes (EMGs) to determine the RNA and protein products generated by splice site variants (n = 10) implicated in cystic fibrosis (CF). Experimental results were compared with the splicing predictions of eight in silico tools. EMGs containing the full-length Cystic Fibrosis Transmembrane Conductance Regulator (CFTR) coding sequence and flanking intron sequences generated wild-type transcript and fully processed protein in Human Embryonic Kidney (HEK293) and CF bronchial epithelial (CFBE41o-) cells. Quantification of variant induced aberrant mRNA isoforms was concordant using fragment analysis and pyrosequencing. The splicing patterns of c.1585−1G>A and c.2657+5G>A were comparable to those reported in primary cells from individuals bearing these variants. Bioinformatics predictions were consistent with experimental results for 9/10 variants (MES), 8/10 variants (NNSplice), and 7/10 variants (SSAT and Sroogle). Programs that estimate the consequences of mis-splicing predicted 11/16 (HSF and ASSEDA) and 10/16 (Fsplice and SplicePort) experimentally observed mRNA isoforms. EMGs provide a robust experimental approach for clinical interpretation of splice site variants and refinement of in silico tools. PMID:25066652

  8. Quaking and PTB control overlapping splicing regulatory networks during muscle cell differentiation

    PubMed Central

    Hall, Megan P.; Nagel, Roland J.; Fagg, W. Samuel; Shiue, Lily; Cline, Melissa S.; Perriman, Rhonda J.; Donohue, John Paul; Ares, Manuel

    2013-01-01

    Alternative splicing contributes to muscle development, but a complete set of muscle-splicing factors and their combinatorial interactions are unknown. Previous work identified ACUAA (“STAR” motif) as an enriched intron sequence near muscle-specific alternative exons such as Capzb exon 9. Mass spectrometry of myoblast proteins selected by the Capzb exon 9 intron via RNA affinity chromatography identifies Quaking (QK), a protein known to regulate mRNA function through ACUAA motifs in 3′ UTRs. We find that QK promotes inclusion of Capzb exon 9 in opposition to repression by polypyrimidine tract-binding protein (PTB). QK depletion alters inclusion of 406 cassette exons whose adjacent intron sequences are also enriched in ACUAA motifs. During differentiation of myoblasts to myotubes, QK levels increase two- to threefold, suggesting a mechanism for QK-responsive exon regulation. Combined analysis of the PTB- and QK-splicing regulatory networks during myogenesis suggests that 39% of regulated exons are under the control of one or both of these splicing factors. This work provides the first evidence that QK is a global regulator of splicing during muscle development in vertebrates and shows how overlapping splicing regulatory networks contribute to gene expression programs during differentiation. PMID:23525800
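
    Counting ACUAA motifs in intron sequence adjacent to exons can be sketched as below. This is only an illustration of motif counting, with hypothetical function names; the enrichment analysis in the study would additionally require a background set and a statistical test, neither of which is shown here.

```python
from typing import Iterable


def count_motif(seq: str, motif: str = "ACUAA") -> int:
    """Count (possibly overlapping) motif occurrences; DNA input is read as RNA."""
    rna = seq.upper().replace("T", "U")
    count, start = 0, 0
    while True:
        idx = rna.find(motif, start)
        if idx == -1:
            return count
        count += 1
        start = idx + 1  # advance by one so overlapping hits are counted


def motif_density(seqs: Iterable[str], motif: str = "ACUAA") -> float:
    """Motif occurrences per 1000 nt across a set of intron flanks."""
    seqs = list(seqs)
    total_length = sum(len(s) for s in seqs)
    if total_length == 0:
        return 0.0
    return 1000.0 * sum(count_motif(s, motif) for s in seqs) / total_length
```

    Comparing `motif_density` for introns flanking regulated exons against a set of control introns gives a rough sense of whether the motif is over-represented.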

  9. Integrative analysis of Arabidopsis thaliana transcriptomics reveals intuitive splicing mechanism for circular RNA.

    PubMed

    Sun, Xiaoyong; Wang, Lin; Ding, Jiechao; Wang, Yanru; Wang, Jiansheng; Zhang, Xiaoyang; Che, Yulei; Liu, Ziwei; Zhang, Xinran; Ye, Jiazhen; Wang, Jie; Sablok, Gaurav; Deng, Zhiping; Zhao, Hongwei

    2016-10-01

    A new regulatory class of small endogenous RNAs called circular RNAs (circRNAs) has been described as miRNA sponges in animals. Using 16 Arabidopsis thaliana RNA-Seq data sets, we identified 803 circRNAs in RNase R-/non-RNase R-treated samples. The results revealed the following features: Canonical and noncanonical splicing can generate circRNAs; chloroplasts are a hotspot for circRNA generation; furthermore, limited complementary sequences exist not only in introns, but also in the sequences flanking splice sites. The latter finding suggests that multiple combinations between complementary sequences may facilitate the formation of the circular structure. Our results contribute to a better understanding of this novel class of plant circRNAs. © 2016 Federation of European Biochemical Societies.

  10. Splicing-related genes are alternatively spliced upon changes in ambient temperatures in plants

    PubMed Central

    Bucher, Johan; Lammers, Michiel; Busscher-Lange, Jacqueline; Bonnema, Guusje; Rodenburg, Nicole; Proveniers, Marcel C. G.; Angenent, Gerco C.

    2017-01-01

    Plants adjust their development and architecture to small variations in ambient temperature. At a time when temperatures are rising worldwide, the mechanism by which plants are able to sense temperature fluctuations and adapt to them is becoming of special interest. By performing RNA-sequencing on two Arabidopsis accessions and one Brassica species exposed to temperature alterations, we showed that alternative splicing is an important mechanism in ambient temperature sensing and adaptation. We found that amongst the differentially alternatively spliced genes, splicing-related genes are enriched, suggesting that the splicing machinery itself is targeted for alternative splicing when temperature changes. Moreover, we showed that many different components of the splicing machinery are targeted for ambient temperature-regulated alternative splicing. Mutant analysis of a splicing-related gene that was differentially spliced in two of the genotypes showed an altered flowering time response to different temperatures. We propose a two-step mechanism where temperature directly influences alternative splicing of the splicing machinery genes, followed by a second step where the altered splicing machinery affects splicing of downstream genes involved in the adaptation to altered temperatures. PMID:28257507

  11. Mutation testing in Treacher Collins Syndrome.

    PubMed

    Ellis, P E; Dawson, M; Dixon, M J

    2002-12-01

    To report on a study where 97 subjects were screened for mutations in the Treacher Collins syndrome (TCS) gene TCOF1. Ninety-seven subjects with a clinical diagnosis of TCS were screened for potential mutations in TCOF1, by means of single strand conformation polymorphism (SSCP) analysis. In those subjects where potential mutations were detected, sequence analysis was performed to determine the site and type of mutation present. Thirty-six TCS-specific mutations are reported including 27 deletions, six point mutations, two splice junction mutations, and one insertion/deletion. This brings the total number of mutations reported to date to 105. The importance of detection of these mutations is mainly in postnatal diagnosis and genetic counselling. Knowledge of the family specific mutation may also be used in prenatal diagnosis to confirm whether the foetus is affected or not, and give the parents the choice of whether to continue with the pregnancy.

  12. Genetic therapies for RNA mis-splicing diseases.

    PubMed

    Hammond, Suzan M; Wood, Matthew J A

    2011-05-01

    RNA mis-splicing diseases account for up to 15% of all inherited diseases, ranging from neurological to myogenic and metabolic disorders. With greatly increased genomic sequencing being performed for individual patients, the number of known mutations affecting splicing has risen to 50-60% of all disease-causing mutations. During the past 10 years, genetic therapy directed toward correction of RNA mis-splicing in disease has progressed from theoretical work in cultured cells to promising clinical trials. In this review, we discuss the use of antisense oligonucleotides to modify splicing as well as the principles and latest work in bifunctional RNA, trans-splicing and modification of U1 and U7 snRNA to target splice sites. The success of clinical trials for modifying splicing to treat Duchenne muscular dystrophy opens the door for the use of splicing modification for most of the mis-splicing diseases. Copyright © 2011 Elsevier Ltd. All rights reserved.

  13. Mapping neurofibromatosis 1 homologous loci by fluorescence in situ hybridization

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Viskochil, D.; Breidenbach, H.H.; Cawthon, R.

    Neurofibromatosis 1 maps to chromosome band 17q11.2 and the NF1 gene is comprised of 59 exons that span approximately 335 kb of genomic DNA. In order to further analyze the structure of NF1 from exons 2 through 27b, we isolated a number of cosmid and bacteriophage P-1 genomic clones using NF1-exon probes under high-stringency hybridization conditions. Using tagged, intron-based primers and DNA from various clones as a template, we PCR-amplified and sequenced individual NF1 exons. The exon sequences in PCR products from several genomic clones differed from the exon sequence derived from cloned NF1 cDNAs. Clones with variant sequences were mapped by fluorescence in situ hybridization under high-stringency conditions. Three clones mapped to chromosome band 15q11.2, one mapped to 14q11.2, one mapped to both 2q14.1-14.3 and 14q11.2, one mapped to 2q33-34, and one mapped to both 18q11.2 and 21q21. Even though some PCR-product sequences retained proper splice junctions and open reading frames, we have yet to identify cDNAs that correspond to the variant exon sequences. We are now sequencing clones that map to NF1-homologous loci in order to develop discriminating primer pairs for the exclusive amplification of NF1-specific sequences in our efforts to develop a comprehensive NF1 mutation screen using genomic DNA as template. The role that NF1-homologous sequences may play in neurofibromatosis 1 is not clear.

  14. The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture.

    PubMed

    Pai, Athma A; Henriques, Telmo; McCue, Kayla; Burkholder, Adam; Adelman, Karen; Burge, Christopher B

    2017-12-27

    Production of most eukaryotic mRNAs requires splicing of introns from pre-mRNA. The splicing reaction requires definition of splice sites, which are initially recognized in either intron-spanning ('intron definition') or exon-spanning ('exon definition') pairs. To understand how exon and intron length and splice site recognition mode impact splicing, we measured splicing rates genome-wide in Drosophila, using metabolic labeling/RNA sequencing and new mathematical models to estimate rates. We found that the modal intron length range of 60-70 nt represents a local maximum of splicing rates, but that much longer exon-defined introns are spliced even faster and more accurately. We observed unexpectedly low variation in splicing rates across introns in the same gene, suggesting the presence of gene-level influences, and we identified multiple gene-level variables associated with splicing rate. Together our data suggest that developmental and stress response genes may have preferentially evolved exon definition in order to enhance the rate or accuracy of splicing.

  15. Intragenic motifs regulate the transcriptional complexity of Pkhd1/PKHD1

    PubMed Central

    Boddu, Ravindra; Yang, Chaozhe; O’Connor, Amber K.; Hendrickson, Robert Curtis; Boone, Braden; Cui, Xiangqin; Garcia-Gonzalez, Miguel; Igarashi, Peter; Onuchic, Luiz F.; Germino, Gregory G.

    2014-01-01

    Autosomal recessive polycystic kidney disease (ARPKD) results from mutations in the human PKHD1 gene. Both this gene, and its mouse ortholog, Pkhd1, are primarily expressed in renal and biliary ductal structures. The mouse protein product, fibrocystin/polyductin complex (FPC), is a 445-kDa protein encoded by a 67-exon transcript that spans >500 kb of genomic DNA. In the current study, we observed multiple alternatively spliced Pkhd1 transcripts that varied in size and exon composition in embryonic mouse kidney, liver, and placenta samples, as well as among adult mouse pancreas, brain, heart, lung, testes, liver, and kidney. Using reverse transcription PCR and RNASeq, we identified 22 novel Pkhd1 kidney transcripts with unique exon junctions. Various mechanisms of alternative splicing were observed, including exon skipping, use of alternate acceptor/donor splice sites, and inclusion of novel exons. Bioinformatic analyses identified, and exon-trapping minigene experiments validated, consensus binding sites for serine/arginine-rich proteins that modulate alternative splicing. Using site-directed mutagenesis, we examined the functional importance of selected splice enhancers. In addition, we demonstrated that many of the novel transcripts were polysome bound, thus likely translated. Finally, we determined that the human PKHD1 R760H missense variant alters a splice enhancer motif that disrupts exon splicing in vitro and is predicted to truncate the protein. Taken together, these data provide evidence of the complex transcriptional regulation of Pkhd1/PKHD1 and identified motifs that regulate its splicing. Our studies indicate that Pkhd1/PKHD1 transcription is modulated, in part by intragenic factors, suggesting that aberrant PKHD1 splicing represents an unappreciated pathogenic mechanism in ARPKD. PMID:24984783

  16. Carcinoembryonic antigen promotes colorectal cancer progression by targeting adherens junction complexes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bajenova, Olga, E-mail: o.bazhenova@spbu.ru; Department of Genetics and Biotechnology, St. Petersburg State University, St. Petersburg 199034; Department of Surgery and Biomedical Sciences, Creighton University, Omaha, NE 68178

    2014-06-10

    Oncomarkers play important roles in the detection and management of human malignancies. Carcinoembryonic antigen (CEA, CEACAM5) and epithelial cadherin (E-cadherin) are considered as independent tumor markers in monitoring metastatic colorectal cancer. They are both expressed by cancer cells and can be detected in the blood serum. We investigated the effect of CEA production by MIP101 colorectal carcinoma cell lines on E-cadherin adherens junction (AJ) protein complexes. No direct interaction between E-cadherin and CEA was detected; however, the functional relationships between E-cadherin and its AJ partners: α-, β- and p120 catenins were impaired. We discovered a novel interaction between CEA and beta-catenin protein in the CEA producing cells. It is shown in the current study that CEA overexpression alters the splicing of p120 catenin and triggers the release of soluble E-cadherin. The influence of CEA production by colorectal cancer cells on the function of E-cadherin junction complexes may explain the link between the elevated levels of CEA and the increase in soluble E-cadherin during the progression of colorectal cancer. - Highlights: • Elevated level of CEA increases the release of soluble E-cadherin during the progression of colorectal cancer. • CEA over-expression alters the binding preferences between E-cadherin and its partners: α-, β- and p120 catenins in adherens junction complexes. • CEA produced by colorectal cancer cells interacts with beta-catenin protein. • CEA over-expression triggers the increase in nuclear beta-catenin. • CEA over-expression alters the splicing of p120 catenin protein.

  17. U2 small nuclear ribonucleoprotein particle (snRNP) auxiliary factor of 65 kDa, U2AF65, can promote U1 snRNP recruitment to 5' splice sites.

    PubMed Central

    Förch, Patrik; Merendino, Livia; Martínez, Concepción; Valcárcel, Juan

    2003-01-01

    The splicing factor U2AF(65), U2 small nuclear ribonucleoprotein particle (snRNP) auxiliary factor of 65 kDa, binds to pyrimidine-rich sequences at 3' splice sites to recruit U2 snRNP to pre-mRNAs. We report that U2AF(65) can also promote the recruitment of U1 snRNP to weak 5' splice sites that are followed by uridine-rich sequences. The arginine- and serine-rich domain of U2AF(65) is critical for U1 recruitment, and we discuss the role of its RNA-RNA annealing activity in this novel function of U2AF(65). PMID:12558503

  18. A few nucleotide polymorphisms are sufficient to recruit nuclear factors differentially to the intron 1 of HPV-16 intratypic variants.

    PubMed

    López-Urrutia, Eduardo; Valdés, Jesús; Bonilla-Moreno, Raúl; Martínez-Salazar, Martha; Martínez-Garcia, Martha; Berumen, Jaime; Villegas-Sepúlveda, Nicolás

    2012-06-01

    The HPV-16 E6/E7 genes, which contain intron 1, are processed by alternative splicing and their transcripts are detected with a heterogeneous profile in tumour cells. Frequently, HPV-16 positive carcinoma cells bear viral variants that contain single nucleotide polymorphisms in their DNA sequence. We were interested in analysing the contribution of this polymorphism to the heterogeneity in the pattern of the E6/E7 spliced transcripts. Using the E6/E7 sequences from three closely related HPV-16 variants, we have shown that a few nucleotide changes are sufficient to produce heterogeneity in the splicing profile. Furthermore, using mutants that contained a single SNP, we also showed that one nucleotide change was sufficient to reproduce the heterogeneous splicing profile. Additionally, a difference of two or three SNPs among these viral sequences was sufficient to recruit differentially several splicing factors to the polymorphic E6/E7 transcripts. Moreover, only one SNP was sufficient to alter the binding site of at least one splicing factor, changing the ability of splicing factors to bind the transcript. Finally, the factors that were differentially bound to the short form of intron 1 of one of these E6/E7 variants were identified as TIA1 and/or TIAR and U1-70k, while U2AF65, U5-52k and PTB were preferentially bound to the transcript of the other variants. Copyright © 2012 Elsevier B.V. All rights reserved.

  19. Detection of alternative splice variants at the proteome level in Aspergillus flavus.

    PubMed

    Chang, Kung-Yen; Georgianna, D Ryan; Heber, Steffen; Payne, Gary A; Muddiman, David C

    2010-03-05

    Identification of proteins from proteolytic peptides or intact proteins plays an essential role in proteomics. Researchers use search engines to match the acquired peptide sequences to the target proteins. However, search engines depend on protein databases to provide candidates for consideration. Alternative splicing (AS), the mechanism by which the exons of pre-mRNAs can be spliced and rearranged to generate distinct mRNA and therefore protein variants, enables higher eukaryotic organisms, with only a limited number of genes, to have the requisite complexity and diversity at the proteome level. Multiple alternative isoforms from one gene often share common segments of sequence. However, many protein databases include only a limited number of isoforms to keep redundancy minimal. As a result, the database search might not identify a target protein even with high quality tandem MS data and accurate intact precursor ion mass. We computationally predicted an exhaustive list of putative isoforms of Aspergillus flavus proteins from 20 371 expressed sequence tags to investigate whether an alternative splicing protein database can assign a greater proportion of mass spectrometry data. The newly constructed AS database provided 9807 new alternatively spliced variants in addition to 12 832 previously annotated proteins. The searches of the existing tandem MS spectra data set using the AS database identified 29 new proteins encoded by 26 genes. Nine fungal genes appeared to have multiple protein isoforms. In addition to the discovery of splice variants, the AS database also showed potential to improve genome annotation. In summary, the introduction of an alternative splicing database helps identify more proteins and unveils more information about a proteome.

  20. Exploration of Molecular Factors Impairing Superoxide Dismutase Isoforms Activity in Human Senile Cataractous Lenses

    PubMed Central

    Rajkumar, Sankaranarayanan; Vasavada, Abhay R.; Praveen, Mamidipudi R.; Ananthan, Rajendran; Reddy, Geereddy B.; Tripathi, Harsha; Ganatra, Darshini A.; Arora, Anshul I.; Patel, Alpesh R.

    2013-01-01

    Purpose. To explore different molecular factors impairing the activities of superoxide dismutase (SOD) isoforms in senile cataractous lenses. Methods. Enzyme activity of SOD isoforms, levels of their corresponding cofactors copper (Cu), manganese (Mn), zinc (Zn), and expression of mRNA transcripts and proteins were determined in the lenses of human subjects with and without cataract. DNA from lens epithelium (LE) and peripheral blood was isolated. Polymerase chain reaction–single strand conformation polymorphism (PCR-SSCP) followed by sequencing was carried out to screen somatic mutations. The impact of intronic insertion/deletion (INDEL) variations on the splicing process and on the resultant transcript was evaluated. Genotyping of IVS4+42delG polymorphism of SOD1 gene was done by PCR–restriction fragment length polymorphism (RFLP). Results. A significant decrease in Cu/Zn- and Mn-SOD activity (P < 0.001) and in Cu/Zn-SOD transcript (P < 0.001) and its protein (P < 0.05) was found in cataractous lenses. No significant change in the level of copper (P = 0.36) and an increase in the level of manganese (P = 0.01) and zinc (P = 0.02) were observed in cataractous lenses. A significant positive correlation between the level of Cu/Zn-SOD activity and the levels of Cu (P = 0.003) and Zn (P = 0.005) was found in the cataractous lenses. DNA sequencing revealed three intronic INDEL variations in exon4 of SOD1 gene. Splice-junction analysis showed the potential of IVS4+42delG in creating a new cryptic acceptor site. If it is involved in alternate splicing, it could result in generation of SOD1 mRNA transcripts lacking exon4 region. Transcript analysis revealed the presence of complete SOD1 mRNA transcripts. Genotyping revealed the presence of IVS4+42delG polymorphism in all subjects. Conclusions. The decrease in the activity of SOD1 isoform in cataractous lenses was associated with the decreased level of mRNA transcripts and their protein expression and was not associated with either modulation in the level of enzyme cofactors or with INDEL variations. PMID:23970468

  1. Spliced synthetic genes as internal controls in RNA sequencing experiments.

    PubMed

    Hardwick, Simon A; Chen, Wendy Y; Wong, Ted; Deveson, Ira W; Blackburn, James; Andersen, Stacey B; Nielsen, Lars K; Mattick, John S; Mercer, Tim R

    2016-09-01

    RNA sequencing (RNA-seq) can be used to assemble spliced isoforms, quantify expressed genes and provide a global profile of the transcriptome. However, the size and diversity of the transcriptome, the wide dynamic range in gene expression and inherent technical biases confound RNA-seq analysis. We have developed a set of spike-in RNA standards, termed 'sequins' (sequencing spike-ins), that represent full-length spliced mRNA isoforms. Sequins have an entirely artificial sequence with no homology to natural reference genomes, but they align to gene loci encoded on an artificial in silico chromosome. The combination of multiple sequins across a range of concentrations emulates alternative splicing and differential gene expression, and it provides scaling factors for normalization between samples. We demonstrate the use of sequins in RNA-seq experiments to measure sample-specific biases and determine the limits of reliable transcript assembly and quantification in accompanying human RNA samples. In addition, we have designed a complementary set of sequins that represent fusion genes arising from rearrangements of the in silico chromosome to aid in cancer diagnosis. RNA sequins provide a qualitative and quantitative reference with which to navigate the complexity of the human transcriptome.

  2. Fox-2 Splicing Factor Binds to a Conserved Intron Motif to Promote Inclusion of Protein 4.1R Alternative Exon 16

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ponthier, Julie L.; Schluepen, Christina; Chen, Weiguo

    Activation of protein 4.1R exon 16 (E16) inclusion during erythropoiesis represents a physiologically important splicing switch that increases 4.1R affinity for spectrin and actin. Previous studies showed that negative regulation of E16 splicing is mediated by the binding of hnRNP A/B proteins to silencer elements in the exon and that downregulation of hnRNP A/B proteins in erythroblasts leads to activation of E16 inclusion. This paper demonstrates that positive regulation of E16 splicing can be mediated by Fox-2 or Fox-1, two closely related splicing factors that possess identical RNA recognition motifs. SELEX experiments with human Fox-1 revealed highly selective binding to the hexamer UGCAUG. Both Fox-1 and Fox-2 were able to bind the conserved UGCAUG elements in the proximal intron downstream of E16, and both could activate E16 splicing in HeLa cell co-transfection assays in a UGCAUG-dependent manner. Conversely, knockdown of Fox-2 expression, achieved with two different siRNA sequences, resulted in decreased E16 splicing. Moreover, immunoblot experiments demonstrate that mouse erythroblasts express Fox-2, but not Fox-1. These findings suggest that Fox-2 is a physiological activator of E16 splicing in differentiating erythroid cells in vivo. Recent experiments show that UGCAUG is present in the proximal intron sequence of many tissue-specific alternative exons, and we propose that the Fox family of splicing enhancers plays an important role in alternative splicing switches during differentiation in metazoan organisms.

  3. Isolation and characterization of alternatively spliced variants of the mouse sigma1 receptor gene, Sigmar1.

    PubMed

    Pan, Ling; Pasternak, David A; Xu, Jin; Xu, Mingming; Lu, Zhigang; Pasternak, Gavril W; Pan, Ying-Xian

    2017-01-01

    The sigma1 receptor acts as a chaperone at the endoplasmic reticulum, associates with multiple proteins in various cellular systems, and is involved in a number of diseases, such as addiction, pain, cancer and psychiatric disorders. The sigma1 receptor is encoded by the single copy SIGMAR1 gene. The current study identifies five alternatively spliced variants of the mouse sigma1 receptor gene using a polymerase chain reaction cloning approach. All the splice variants are generated by exon skipping or alternative 3' or 5' splicing, producing truncated sigma1 receptors. Similar alternative splicing has been observed in the human SIGMAR1 gene based on molecular cloning or genome sequence prediction, suggesting conservation of alternative splicing of the SIGMAR1 gene. Using quantitative polymerase chain reactions, we demonstrate differential expression of several splice variants in mouse tissues and brain regions. When expressed in HEK293 cells, all the splice variants fail to bind sigma ligands, implying that each truncated region in these splice variants is important for ligand binding. However, a co-immunoprecipitation (Co-IP) study in HEK293 cells co-transfected with tagged constructs reveals that all the splice variants maintain their ability to physically associate with a mu opioid receptor (mMOR-1), providing useful information to correlate the motifs/sequences necessary for their physical association. Furthermore, a competition Co-IP study showed that all the variants can disrupt, in a dose-dependent manner, the dimerization of the original sigma1 receptor with mMOR-1, suggesting a potential dominant negative function and providing significant insights into their function.

  4. Mechanisms and Regulation of Alternative Pre-mRNA Splicing

    PubMed Central

    Lee, Yeon

    2015-01-01

    Precursor messenger RNA (pre-mRNA) splicing is a critical step in the posttranscriptional regulation of gene expression, providing significant expansion of the functional proteome of eukaryotic organisms with limited gene numbers. Split eukaryotic genes contain intervening sequences or introns disrupting protein-coding exons, and intron removal occurs by repeated assembly of a large and highly dynamic ribonucleoprotein complex termed the spliceosome, which is composed of five small nuclear ribonucleoprotein particles, U1, U2, U4/U6, and U5. Biochemical studies over the past 10 years have allowed the isolation as well as compositional, functional, and structural analysis of splicing complexes at distinct stages along the spliceosome cycle. The average human gene contains eight exons and seven introns, producing an average of three or more alternatively spliced mRNA isoforms. Recent high-throughput sequencing studies indicate that 100% of human genes produce at least two alternative mRNA isoforms. Mechanisms of alternative splicing include RNA–protein interactions of splicing factors with regulatory sites termed silencers or enhancers, RNA–RNA base-pairing interactions, or chromatin-based effects that can change or determine splicing patterns. Disease-causing mutations can often occur in splice sites near intron borders or in exonic or intronic RNA regulatory silencer or enhancer elements, as well as in genes that encode splicing factors. Together, these studies provide mechanistic insights into how spliceosome assembly, dynamics, and catalysis occur; how alternative splicing is regulated and evolves; and how splicing can be disrupted by cis- and trans-acting mutations leading to disease states. These findings make the spliceosome an attractive new target for small-molecule, antisense, and genome-editing therapeutic interventions. PMID:25784052

  5. The prediction of human exons by oligonucleotide composition and discriminant analysis of spliceable open reading frames

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Solovyev, V.V.; Salamov, A.A.; Lawrence, C.B.

    1994-12-31

    Discriminant analysis is applied to the problem of recognition of 5'-, internal and 3'-exons in human DNA sequences. Specific recognition functions were developed for revealing exons of particular types. The method is based on a splice site prediction algorithm that uses the linear Fisher discriminant to combine the information about significant triplet frequencies of various functional parts of splice site regions and preferences of oligonucleotides in protein coding and intron regions. The accuracy of our splice site recognition function is about 97%. A discriminant function for 5'-exon prediction includes hexanucleotide composition of upstream region, triplet composition around the ATG codon, ORF coding potential, donor splice site potential and composition of downstream intron region. For internal exon prediction, we combine in a discriminant function the characteristics describing the 5'-intron region, donor splice site, coding region, acceptor splice site and 3'-intron region for each open reading frame flanked by GT and AG base pairs. The accuracy of precise internal exon recognition on a test set of 451 exon and 246693 pseudoexon sequences is 77% with a specificity of 79% and a level of pseudoexon ORF prediction of 99.96%. The recognition quality computed at the level of individual nucleotides is 89% for exon sequences and 98% for intron sequences. A discriminant function for 3'-exon prediction includes octanucleotide composition of upstream intron region, triplet composition around the stop codon, ORF coding potential, acceptor splice site potential and hexanucleotide composition of downstream region. We unite these three discriminant functions in the exon predicting program FEX (find exons). FEX exactly predicts 70% of 1016 exons from the test set of 181 complete genes with specificity 73%, and 89% of exons are exactly or partially predicted. On average, 85% of nucleotides were predicted accurately with specificity 91%.
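
    Illustrative note (not part of the record): the abstract above scores each open reading frame flanked by AG and GT. A minimal Python sketch of only that enumeration step is given below. The function names, the length limits and the "any stop-free frame" test are assumptions made for illustration; the discriminant scoring that FEX applies to each candidate is not reproduced.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def has_open_frame(segment):
    """Return True if at least one of the three reading frames has no stop codon."""
    for frame in range(3):
        codons = (segment[i:i + 3] for i in range(frame, len(segment) - 2, 3))
        if not any(codon in STOP_CODONS for codon in codons):
            return True
    return False


def candidate_internal_exons(seq, min_len=30, max_len=300):
    """Yield (start, end) half-open intervals preceded by AG and followed by GT.

    min_len and max_len are hypothetical limits chosen for this sketch.
    """
    seq = seq.upper()
    # An exon would start right after an acceptor AG and end right before a donor GT.
    acceptors = [i + 2 for i in range(len(seq) - 1) if seq[i:i + 2] == "AG"]
    donors = [i for i in range(len(seq) - 1) if seq[i:i + 2] == "GT"]
    for start in acceptors:
        for end in donors:
            if min_len <= end - start <= max_len and has_open_frame(seq[start:end]):
                yield (start, end)
```

    Each interval produced this way would then be passed to a classifier; the very large number of pseudoexons in the test set quoted above suggests why the enumeration alone is far from sufficient.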

  6. Resolution of model Holliday junctions by yeast endonuclease: effect of DNA structure and sequence.

    PubMed Central

    Parsons, C A; Murchie, A I; Lilley, D M; West, S C

    1989-01-01

    The resolution of Holliday junctions in DNA involves specific cleavage at or close to the site of the junction. A nuclease from Saccharomyces cerevisiae cleaves model Holliday junctions in vitro by the introduction of nicks in regions of duplex DNA adjacent to the crossover point. In previous studies [Parsons and West (1988) Cell, 52, 621-629] it was shown that cleavage occurred within homologous arm sequences with precise symmetry across the junction. In contrast, junctions with heterologous arm sequences were cleaved asymmetrically. In this work, we have studied the effect of sequence changes and base modification upon the site of cleavage. It is shown that the specificity of cleavage is unchanged provided that perfect homology is maintained between opposing arm sequences. However, in the absence of homology, cleavage depends upon sequence context and is affected by minor changes such as base modification. These data support the proposed mechanism for cleavage of a Holliday junction, which requires homologous alignment of arm sequences in an enzyme-DNA complex as a prerequisite for symmetrical cleavage by the yeast endonuclease. PMID:2653810

  7. The splicing of tiny introns of Paramecium is controlled by MAGO.

    PubMed

    Contreras, Julia; Begley, Victoria; Marsella, Laura; Villalobo, Eduardo

    2018-07-15

    The exon junction complex (EJC) is a key element of the splicing machinery. The EJC core is composed of eIF4A3, MAGO, Y14 and MLN51. A few accessory proteins, such as CWC22 or UPF3, bind transiently to the EJC. The EJC has been implicated in the control of the splicing of long introns. To ascertain whether the EJC controls the splicing of short introns, we used Paramecium tetraurelia as a model organism, since it has thousands of very tiny introns. To elucidate whether the EJC affects intron splicing in P. tetraurelia, we searched for EJC protein-coding genes, and silenced those genes coding for eIF4A3, MAGO and CWC22. We found that P. tetraurelia likely assembles an active EJC with only three of the core proteins, since MLN51 is lacking. Silencing of eIF4A3 or CWC22 genes, but not that of MAGO, caused lethality. Silencing of the MAGO gene caused either an increase, decrease, or no change in intron retention levels of some intron-containing mRNAs used as reporters. We suggest that fine-tuned expression of EJC genes is required for steady intron removal in P. tetraurelia. Taking into consideration our results and those published by others, we conclude that the EJC controls splicing independently of the intron size. Copyright © 2018 Elsevier B.V. All rights reserved.

  8. Computational identification and validation of alternative splicing in ZSF1 rat RNA-seq data, a preclinical model for type 2 diabetic nephropathy.

    PubMed

    Zhang, Chi; Dower, Ken; Zhang, Baohong; Martinez, Robert V; Lin, Lih-Ling; Zhao, Shanrong

    2018-05-16

    Obese ZSF1 rats exhibit spontaneous time-dependent diabetic nephropathy and are considered to be a highly relevant animal model of progressive human diabetic kidney disease. We previously identified gene expression changes between disease and control animals across six time points from 12 to 41 weeks. In this study, the same data were analysed at the isoform and exon levels to reveal additional disease mechanisms that may be governed by alternative splicing. Our analyses identified alternative splicing patterns in genes that may be implicated in disease pathogenesis (such as Shc1, Serpinc1, Epb4.1l5, and Il-33), which would have been overlooked in standard gene-level analysis. The alternatively spliced genes were enriched in pathways related to cell adhesion, cell-cell interactions/junctions, and cytoskeleton signalling, whereas the differentially expressed genes were enriched in pathways related to immune response, G protein-coupled receptor, and cAMP signalling. Our findings indicate that additional mechanistic insights can be gained from exon- and isoform-level data analyses over standard gene-level analysis. Considering alternative splicing is poorly conserved between rodents and humans, it is noted that this work is not translational, but the point holds true that additional insights can be gained from alternative splicing analysis of RNA-seq data.

  9. Regulation of alternative mRNA splicing: old players and new perspectives.

    PubMed

    Dvinge, Heidi

    2018-06-01

    Nearly all human multi-exon genes are subject to alternative splicing in one or more cell types. The splicing machinery, therefore, has to select between multiple splice sites in a context-dependent manner, relying on sequence features in cis and trans-acting splicing regulators that either promote or repress splice site recognition and spliceosome assembly. However, the functional coupling between multiple gene regulatory layers signifies that splicing can also be modulated by transcriptional or epigenetic characteristics. Other, less obvious, aspects of alternative splicing have come to light in recent years, often involving core components of the spliceosome previously thought to perform a basal rather than a regulatory role in splicing. Together this paints a highly dynamic picture of splicing regulation, where the final splice site choice is governed by the entire transcriptional environment of a gene and its cellular context. This article is protected by copyright. All rights reserved.

  10. RNA sequencing enables systematic identification of platelet transcriptomic alterations in NSCLC patients.

    PubMed

    Zhang, Qun; Hu, Huan; Liu, Hongda; Jin, Jiajia; Zhu, Peiyuan; Wang, Shujun; Shen, Kaikai; Hu, Yangbo; Li, Zhou; Zhan, Ping; Zhu, Suhua; Fan, Hang; Zhang, Jianya; Lv, Tangfeng; Song, Yong

    2018-05-29

    Platelets are implicated as key players in the metastatic dissemination of tumor cells. Previous evidence demonstrated that platelets retain physiologically active cytoplasmic RNAs, splicing pre-mRNA to mRNA and translating it into functional proteins in response to external stimulation. Recently, the platelet gene profiles of healthy and diseased individuals have been characterized with the help of RNA sequencing (RNA-Seq) in some studies, leading to new insights into the mechanisms underlying disease pathogenesis. In this study, we performed RNA-seq in platelets from 7 healthy individuals and 15 non-small cell lung cancer (NSCLC) patients. Our data revealed a subset of near universal differentially expressed gene (DEG) profiles in platelets of metastatic NSCLC compared to healthy individuals, including 626 up-regulated RNAs (mRNAs and ncRNAs) and 1497 down-regulated genes. The significantly over-expressed genes showed enrichment in focal adhesion, platelet activation, gap junction and adherens junction pathways. The DEGs also included previously reported tumor-related genes such as PDGFR, VEGF, EGF, etc., verifying the consistency and significance of platelet RNA-Seq in oncology studies. We also validated several up-regulated DEGs involved in tumor cell-induced platelet aggregation (TCIPA) and tumorigenesis. Additionally, transcriptomic comparison analyses of NSCLC subgroups were conducted. Between non-metastatic and metastatic NSCLC patients, 526 platelet DEGs were identified with the most altered expression. The outcomes from subgroup analysis between lung adenocarcinoma and lung squamous cell carcinoma demonstrated the diagnostic potential of platelet RNA-Seq for distinguishing tumor histological types. Copyright © 2018 Elsevier Masson SAS. All rights reserved.

  11. NMR studies of two spliced leader RNAs using isotope labeling

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lapham, J.; Crothers, D.M.

    1994-12-01

    Spliced leader RNAs are a class of RNA molecules (<200 nts) involved in the trans splicing of messenger RNA found in trypanosomes, nematodes, and other lower eukaryotes. The spliced leader RNA from the trypanosome Leptomonas collosoma exists in two alternate structural forms with similar thermal stabilities. The 54 nucleotides on the 5' end of the SL molecule are structurally independent from the 3' half of the RNA, and display the two structural forms. Furthermore, the favored of the two structures was shown to contain anomalous nuclease sensitivity and thermal stability features, which suggests that there may be tertiary interactions between the splice site and other nucleotides in the 5' end. Multidimensional NMR studies are underway to elucidate the structural elements present in the SL RNAs that give rise to their physical properties. Two spliced leader sequences have been studied. The first, the 54 nucleotides on the 5' end of the L. collosoma sequence, was selected because of earlier studies in our laboratory. The second sequence is the 5' end of the trypanosome Crithidia fasciculata, which was chosen because of its greater sequence homology to other SL sequences. Given the complexity of the NMR spectra for RNA molecules of this size, we have incorporated ¹⁵N/¹³C-labeled nucleotides into the RNA. One of the techniques we have developed to simplify the spectra of these RNA molecules is isotope labeling of specific regions of the RNA. This has been especially helpful in assigning the secondary structure of molecules that may be able to adopt multiple conformations. Using this technique one can examine a part of the molecule without spectral interference from the unlabeled portion. We hope this approach will promote an avenue for studying the structure of larger RNAs in their native surroundings.

  12. Regulation of alternative splicing in Drosophila by 56 RNA binding proteins

    DOE PAGES

    Brooks, Angela N.; Duff, Michael O.; May, Gemma; ...

    2015-08-20

    Alternative splicing is regulated by RNA binding proteins (RBPs) that recognize pre-mRNA sequence elements and activate or repress adjacent exons. Here, we used RNA interference and RNA-seq to identify splicing events regulated by 56 Drosophila proteins, some previously unknown to regulate splicing. Nearly all proteins affected alternative first exons, suggesting that RBPs play important roles in first exon choice. Half of the splicing events were regulated by multiple proteins, demonstrating extensive combinatorial regulation. We observed that SR and hnRNP proteins tend to act coordinately with each other, not antagonistically. We also identified a cross-regulatory network where splicing regulators affected the splicing of pre-mRNAs encoding other splicing regulators. In conclusion, this large-scale study substantially enhances our understanding of recent models of splicing regulation and provides a resource of thousands of exons that are regulated by 56 diverse RBPs.

  13. The Human Splicing Factor ASF/SF2 can Specifically Recognize Pre-mRNA 5' Splice Sites

    NASA Astrophysics Data System (ADS)

    Zuo, Ping; Manley, James L.

    1994-04-01

    ASF/SF2 is a human protein previously shown to function in in vitro pre-mRNA splicing as an essential factor necessary for all splices and also as an alternative splicing factor, capable of switching selection of 5' splice sites. To begin to study the protein's mechanism of action, we have investigated the RNA binding properties of purified recombinant ASF/SF2. Using UV crosslinking and gel shift assays, we demonstrate that the RNA binding region of ASF/SF2 can interact with RNA in a sequence-specific manner, recognizing the 5' splice site in each of two different pre-mRNAs. Point mutations in the 5' splice site consensus can reduce binding by as much as a factor of 100, with the largest effects observed in competition assays. These findings support a model in which ASF/SF2 aids in the recognition of pre-mRNA 5' splice sites.

  14. In silico prediction of splice-altering single nucleotide variants in the human genome.

    PubMed

    Jian, Xueqiu; Boerwinkle, Eric; Liu, Xiaoming

    2014-12-16

    In silico tools have been developed to predict variants that may have an impact on pre-mRNA splicing. The major limitation of the application of these tools to basic research and clinical practice is the difficulty in interpreting the output. Most tools only predict potential splice sites given a DNA sequence without measuring splicing signal changes caused by a variant. Another limitation is the lack of large-scale evaluation studies of these tools. We compared eight in silico tools on 2959 single nucleotide variants within splicing consensus regions (scSNVs) using receiver operating characteristic analysis. The Position Weight Matrix model and MaxEntScan outperformed other methods. Two ensemble learning methods, adaptive boosting and random forests, were used to construct models that take advantage of individual methods. Both models further improved prediction, with outputs of directly interpretable prediction scores. We applied our ensemble scores to scSNVs from the Catalogue of Somatic Mutations in Cancer database. Analysis showed that predicted splice-altering scSNVs are enriched in recurrent scSNVs and known cancer genes. We pre-computed our ensemble scores for all potential scSNVs across the human genome, providing a whole genome level resource for identifying splice-altering scSNVs discovered from large-scale sequencing studies.
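
    Illustrative note (not part of the record): the abstract above names a Position Weight Matrix model among the compared tools and stresses measuring the signal change caused by a variant. A minimal Python sketch of that idea follows. The 4-nt matrix, the uniform background and the function names are hypothetical values chosen for illustration; they are not the matrices or scores used in the study.

```python
import math

# Hypothetical per-position base frequencies for a toy 4-nt splice-site window.
TOY_FREQS = [
    {"A": 0.10, "C": 0.10, "G": 0.70, "T": 0.10},
    {"A": 0.05, "C": 0.05, "G": 0.85, "T": 0.05},
    {"A": 0.05, "C": 0.05, "G": 0.05, "T": 0.85},
    {"A": 0.60, "C": 0.10, "G": 0.20, "T": 0.10},
]
BACKGROUND = 0.25  # assumed uniform background base frequency


def pwm_score(window, freqs=TOY_FREQS):
    """Sum of per-position log2 odds (position frequency over background)."""
    if len(window) != len(freqs):
        raise ValueError("window length must match the matrix width")
    return sum(math.log2(col[base] / BACKGROUND)
               for col, base in zip(freqs, window.upper()))


def score_change(ref_window, alt_window, freqs=TOY_FREQS):
    """Score of the variant window minus score of the reference window."""
    return pwm_score(alt_window, freqs) - pwm_score(ref_window, freqs)
```

    Under this toy matrix, a negative value of score_change("GGTA", "GATA") would indicate that the substitution weakens the match to the motif, which is the kind of directly comparable change the abstract says most site-prediction tools do not report.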

  15. Evolution of Rubisco activase gene in plants.

    PubMed

    Nagarajan, Ragupathi; Gill, Kulvinder S

    2018-01-01

    Rubisco activase of plants evolved in a stepwise manner without losing its function to adapt to the major evolutionary events including endosymbiosis and land colonization. Rubisco activase is an essential enzyme for photosynthesis, which removes inhibitory sugar phosphates from the active sites of Rubisco, a process necessary for Rubisco activation and carbon fixation. The gene probably evolved in cyanobacteria as different species differ for its presence. However, the gene is present in all other plant species. At least a single gene copy was maintained throughout plant evolution; but various genome and gene duplication events, which occurred during plant evolution, increased its copy number in some species. The exons and exon-intron junctions of present day higher plant's Rca, which is conserved in most species seem to have evolved in charophytes. A unique tandem duplication of Rca gene occurred in a common grass ancestor, and the two genes evolved differently for gene structure, sequence, and expression pattern. At the protein level, starting with a primitive form in cyanobacteria, RCA of chlorophytes evolved by integrating chloroplast transit peptide (cTP), and N-terminal domains to the ATPase, Rubisco recognition and C-terminal domains. The redox regulated C-terminal extension (CTE) and the associated alternate splicing mechanism, which splices the RCA-α and RCA-β isoforms were probably gained from another gene in charophytes, conserved in most species except the members of Solanaceae family.

  16. Another heritage from the RNA world: self-excision of intron sequence from nuclear pre-tRNAs.

    PubMed

    Weber, U; Beier, H; Gross, H J

    1996-06-15

    The intervening sequences of nuclear tRNA precursors are known to be excised by tRNA splicing endonuclease. We show here that a T7 transcript corresponding to a pre-tRNA(Tyr) from Arabidopsis thaliana has a highly specific activity for autolytic intron excision. Self-cleavage occurs precisely at the authentic 3'-splice site and at the phosphodiester bond one nucleotide downstream of the authentic 5'-splice site. The reaction results in fragments with 2',3'-cyclic phosphate and 5'-OH termini. It is resistant to proteinase K and/or SDS treatment and is not inhibited by added tRNA. The self-cleavage depends on Mg2+ and is stimulated by spermine and Triton X-100. A set of sequence variants at the cleavage sites has been analysed for autolytic intron excision and, in parallel, for enzymatic in vitro splicing in wheat germ S23 extract. Single-stranded loops are a prerequisite for both reactions. Self-cleavage not only occurs at pyrimidine-A but also at U-U bonds. Since intron self-excision is only about five times slower than the enzymatic intron excision in a wheat germ S23 extract, we propose that the splicing endonuclease may function by improving the preciseness and efficiency of an inherent pre-tRNA self-cleavage activity.

  17. Phylogenetic Analysis of Nuclear-Encoded RNA Maturases

    PubMed Central

    Malik, Sunita; Upadhyaya, KC; Khurana, SM Paul

    2017-01-01

    Posttranscriptional processes, such as splicing, play a crucial role in gene expression and are prevalent not only in nuclear genes but also in plant mitochondria, where splicing of group II introns is catalyzed by a class of proteins termed maturases. In plant mitochondria, there are 22 mitochondrial group II introns. matR, nMAT1, nMAT2, nMAT3, and nMAT4 proteins have been shown to be required for efficient splicing of several group II introns in Arabidopsis thaliana. Nuclear maturases (nMATs) are necessary for splicing of mitochondrial genes, leading to normal oxidative phosphorylation. Sequence analysis through a phylogenetic tree (including bootstrapping) revealed high homology with maturase sequences of A. thaliana and other plants. This study shows the phylogenetic relationship of nMAT proteins between A. thaliana and other nonredundant plant species taken from BLASTP analysis. PMID:28607538

  18. A study of alternative splicing in the pig

    PubMed Central

    2010-01-01

    Background Since at least half of the genes in mammalian genomes are subjected to alternative splicing, alternative pre-mRNA splicing makes an important contribution to the complexity of the mammalian proteome. Expressed sequence tags (ESTs) provide evidence of a great number of possible alternative isoforms. With the EST resource for the domestic pig now containing more than one million porcine ESTs, it is possible to identify alternative splice forms of the individual transcripts in this species from the EST data with some confidence. Results The pig EST data generated by the Sino-Danish Pig Genome project has been assembled with publicly available ESTs and made available in the PigEST database. Using the Distiller package, 2,515 EST clusters with candidate alternative isoforms were identified in the EST data with high confidence. In agreement with general observations in human and mouse, we find putative splice variants in about 30% of the contigs with more than 50 ESTs. Based on the criterion that a minimum of two EST sequences confirmed each splice event, a list of 100 genes with the most distinct tissue-specific alternative splice events was generated from the list of candidates. To confirm the tissue specificity of the splice events, 10 genes with functional annotation were randomly selected, from which 16 individual splice events were chosen for experimental verification by quantitative PCR (qPCR). Six genes were shown to have tissue specific alternatively spliced transcripts with expression patterns matching those of the EST data. The remaining four genes had tissue-restricted expression of alternatively spliced transcripts. Five out of the 16 splice events that were experimentally verified were found to be putatively pig specific. Conclusions In accordance with human and rodent studies we estimate that approximately 30% of the porcine genes undergo alternative splicing. We found a good correlation between EST predicted tissue-specificity and experimentally validated splice events in different porcine tissues. This study indicates that a cluster size of around 50 ESTs is optimal for in silico detection of alternative splicing. Although based on a limited number of splice events, the study supports the notion that alternative splicing could have an important impact on species differentiation since 31% of the splice events studied appear to be species specific. PMID:20444244

  19. The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture

    PubMed Central

    Pai, Athma A; Henriques, Telmo; McCue, Kayla; Burkholder, Adam; Adelman, Karen

    2017-01-01

    Production of most eukaryotic mRNAs requires splicing of introns from pre-mRNA. The splicing reaction requires definition of splice sites, which are initially recognized in either intron-spanning (‘intron definition’) or exon-spanning (‘exon definition’) pairs. To understand how exon and intron length and splice site recognition mode impact splicing, we measured splicing rates genome-wide in Drosophila, using metabolic labeling/RNA sequencing and new mathematical models to estimate rates. We found that the modal intron length range of 60–70 nt represents a local maximum of splicing rates, but that much longer exon-defined introns are spliced even faster and more accurately. We observed unexpectedly low variation in splicing rates across introns in the same gene, suggesting the presence of gene-level influences, and we identified multiple gene level variables associated with splicing rate. Together our data suggest that developmental and stress response genes may have preferentially evolved exon definition in order to enhance the rate or accuracy of splicing. PMID:29280736

  20. The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture

    DOE PAGES

    Pai, Athma A.; Henriques, Telmo; McCue, Kayla; ...

    2017-12-27

    Production of most eukaryotic mRNAs requires splicing of introns from pre-mRNA. The splicing reaction requires definition of splice sites, which are initially recognized in either intron-spanning (‘intron definition’) or exon-spanning (‘exon definition’) pairs. To understand how exon and intron length and splice site recognition mode impact splicing, we measured splicing rates genome-wide in Drosophila, using metabolic labeling/RNA sequencing and new mathematical models to estimate rates. We found that the modal intron length range of 60–70 nt represents a local maximum of splicing rates, but that much longer exon-defined introns are spliced even faster and more accurately. We observed unexpectedly low variation in splicing rates across introns in the same gene, suggesting the presence of gene-level influences, and we identified multiple gene level variables associated with splicing rate. Together our data suggest that developmental and stress response genes may have preferentially evolved exon definition in order to enhance the rate or accuracy of splicing.

  1. The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Pai, Athma A.; Henriques, Telmo; McCue, Kayla

    Production of most eukaryotic mRNAs requires splicing of introns from pre-mRNA. The splicing reaction requires definition of splice sites, which are initially recognized in either intron-spanning (‘intron definition’) or exon-spanning (‘exon definition’) pairs. To understand how exon and intron length and splice site recognition mode impact splicing, we measured splicing rates genome-wide in Drosophila, using metabolic labeling/RNA sequencing and new mathematical models to estimate rates. We found that the modal intron length range of 60–70 nt represents a local maximum of splicing rates, but that much longer exon-defined introns are spliced even faster and more accurately. We observed unexpectedly low variation in splicing rates across introns in the same gene, suggesting the presence of gene-level influences, and we identified multiple gene level variables associated with splicing rate. Together our data suggest that developmental and stress response genes may have preferentially evolved exon definition in order to enhance the rate or accuracy of splicing.

  2. Microprocessor-dependent processing of Splice site Overlapping microRNA exons does not result in changes in alternative splicing.

    PubMed

    Pianigiani, Giulia; Licastro, Danilo; Fortugno, Paola; Castiglia, Daniele; Petrovic, Ivana; Pagani, Franco

    2018-06-12

    MicroRNAs are found throughout the genome and are processed by the microprocessor complex (MPC) from longer precursors. Some precursor miRNAs overlap intron:exon junctions. These Splice site Overlapping microRNAs (SO-miRNAs) are mostly located in coding genes. It has been intimated, in the rarer examples of SO-miRNAs in non-coding RNAs, that the competition between the spliceosome and the MPC modulates alternative splicing. However, the effect of this overlap on coding transcripts is unknown. Unexpectedly, we show that neither Drosha silencing nor SF3b1 silencing changed the inclusion ratio of SO-miRNA exons. Two SO-miRNAs, located in genes that code for basal membrane proteins, are known to inhibit proliferation in primary keratinocytes. These SO-miRNAs were upregulated during differentiation and the host mRNAs were downregulated, but again there was no change in inclusion ratio of the SO-miRNA exons. Interestingly, Drosha silencing increased nascent RNA density, on chromatin, downstream of SO-miRNA exons. Overall our data suggest a novel mechanism for regulating gene expression in which MPC-dependent cleavage of SO-miRNA exons could cause premature transcriptional termination of coding genes rather than affecting alternative splicing. Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  3. Spliced integrated retrotransposed element (SpIRE) formation in the human genome.

    PubMed

    Larson, Peter A; Moldovan, John B; Jasti, Naveen; Kidd, Jeffrey M; Beck, Christine R; Moran, John V

    2018-03-01

    Human Long interspersed element-1 (L1) retrotransposons contain an internal RNA polymerase II promoter within their 5' untranslated region (UTR) and encode two proteins (ORF1p and ORF2p) required for their mobilization (i.e., retrotransposition). The evolutionary success of L1 relies on the continuous retrotransposition of full-length L1 mRNAs. Previous studies identified functional splice donor (SD), splice acceptor (SA), and polyadenylation sequences in L1 mRNA and provided evidence that a small number of spliced L1 mRNAs retrotransposed in the human genome. Here, we demonstrate that the retrotransposition of intra-5'UTR or 5'UTR/ORF1 spliced L1 mRNAs leads to the generation of spliced integrated retrotransposed elements (SpIREs). We identified a new intra-5'UTR SpIRE that is ten times more abundant than previously identified SpIREs. Functional analyses demonstrated that both intra-5'UTR and 5'UTR/ORF1 SpIREs lack cis-acting transcription factor binding sites and exhibit reduced promoter activity. The 5'UTR/ORF1 SpIREs also produce nonfunctional ORF1p variants. Finally, we demonstrate that sequence changes within the L1 5'UTR over evolutionary time, which permitted L1 to evade the repressive effects of a host protein, can lead to the generation of new L1 splicing events, which, upon retrotransposition, generate a new SpIRE subfamily. We conclude that splicing inhibits L1 retrotransposition, that SpIREs generally represent evolutionary "dead-ends" in the L1 retrotransposition process, that mutations within the L1 5'UTR alter L1 splicing dynamics, and that retrotransposition of the resultant spliced transcripts can generate interindividual genomic variation.

  4. Spliced integrated retrotransposed element (SpIRE) formation in the human genome

    PubMed Central

    Larson, Peter A.; Moldovan, John B.; Jasti, Naveen; Kidd, Jeffrey M.; Beck, Christine R.; Moran, John V.

    2018-01-01

    Human Long interspersed element-1 (L1) retrotransposons contain an internal RNA polymerase II promoter within their 5′ untranslated region (UTR) and encode two proteins (ORF1p and ORF2p) required for their mobilization (i.e., retrotransposition). The evolutionary success of L1 relies on the continuous retrotransposition of full-length L1 mRNAs. Previous studies identified functional splice donor (SD), splice acceptor (SA), and polyadenylation sequences in L1 mRNA and provided evidence that a small number of spliced L1 mRNAs retrotransposed in the human genome. Here, we demonstrate that the retrotransposition of intra-5′UTR or 5′UTR/ORF1 spliced L1 mRNAs leads to the generation of spliced integrated retrotransposed elements (SpIREs). We identified a new intra-5′UTR SpIRE that is ten times more abundant than previously identified SpIREs. Functional analyses demonstrated that both intra-5′UTR and 5′UTR/ORF1 SpIREs lack cis-acting transcription factor binding sites and exhibit reduced promoter activity. The 5′UTR/ORF1 SpIREs also produce nonfunctional ORF1p variants. Finally, we demonstrate that sequence changes within the L1 5′UTR over evolutionary time, which permitted L1 to evade the repressive effects of a host protein, can lead to the generation of new L1 splicing events, which, upon retrotransposition, generate a new SpIRE subfamily. We conclude that splicing inhibits L1 retrotransposition, that SpIREs generally represent evolutionary “dead-ends” in the L1 retrotransposition process, that mutations within the L1 5′UTR alter L1 splicing dynamics, and that retrotransposition of the resultant spliced transcripts can generate interindividual genomic variation. PMID:29505568

  5. Isolation and characterization of alternatively spliced variants of the mouse sigma1 receptor gene, Sigmar1

    PubMed Central

    Pan, Ling; Pasternak, David A.; Xu, Jin; Xu, Mingming; Lu, Zhigang; Pasternak, Gavril W.

    2017-01-01

    The sigma1 receptor acts as a chaperone at the endoplasmic reticulum, associates with multiple proteins in various cellular systems, and is involved in a number of diseases, such as addiction, pain, cancer and psychiatric disorders. The sigma1 receptor is encoded by the single-copy SIGMAR1 gene. The current study identifies five alternatively spliced variants of the mouse sigma1 receptor gene using a polymerase chain reaction cloning approach. All the splice variants are generated by exon skipping or alternative 3′ or 5′ splicing, producing truncated sigma1 receptors. Similar alternative splicing has been observed in the human SIGMAR1 gene based on molecular cloning or genome sequence prediction, suggesting conservation of alternative splicing of the SIGMAR1 gene. Using quantitative polymerase chain reactions, we demonstrate differential expression of several splice variants in mouse tissues and brain regions. When expressed in HEK293 cells, all the splice variants fail to bind sigma ligands, implying that each truncated region in these splice variants is important for ligand binding. However, a co-immunoprecipitation (Co-IP) study in HEK293 cells co-transfected with tagged constructs reveals that all the splice variants maintain their ability to physically associate with a mu opioid receptor (mMOR-1), providing useful information to correlate the motifs/sequences necessary for their physical association. Furthermore, a competition Co-IP study showed that all the variants can disrupt, in a dose-dependent manner, the dimerization of the original sigma1 receptor with mMOR-1, suggesting a potential dominant negative function and providing significant insights into their function. PMID:28350844

  6. Novel down-regulatory mechanism of the surface expression of the vasopressin V2 receptor by an alternative splice receptor variant.

    PubMed

    Sarmiento, José M; Añazco, Carolina C; Campos, Danae M; Prado, Gregory N; Navarro, Javier; González, Carlos B

    2004-11-05

    In rat kidney, two alternatively spliced transcripts are generated from the V2 vasopressin receptor gene. The large transcript (1.2 kb) encodes the canonical V2 receptor, whereas the small transcript encodes a splice variant displaying a distinct sequence corresponding to the putative seventh transmembrane domain and the intracellular C terminus of the V2 receptor. This work showed that the small spliced transcript is translated in the rat kidney collecting tubules. However, the protein encoded by the small transcript (here called the V2b splice variant) is retained inside the cell, in contrast to the preferential surface distribution of the V2 receptor (here called the V2a receptor). Cells expressing the V2b splice variant do not exhibit binding to 3H-labeled vasopressin. Interestingly, we found that expression of the splice variant V2b down-regulates the surface expression of the V2a receptor, most likely via the formation of V2a.V2b heterodimers as demonstrated by co-immunoprecipitation and fluorescence resonance energy transfer experiments between the V2a receptor and the V2b splice variant. The V2b splice variant would then be acting as a dominant negative. The effect of the V2b splice variant is specific, as it does not affect the surface expression of the G protein-coupled interleukin-8 receptor (CXCR1). Furthermore, the sequence encompassing residues 242-339, corresponding to the C-terminal domain of the V2b splice variant, also down-regulates the surface expression of the V2a receptor. We suggest that some forms of nephrogenic diabetes insipidus are due to overexpression of the splice variant V2b, which could retain the wild-type V2a receptor inside the cell via the formation of V2a.V2b heterodimers.

  7. Novel splice mutation in microthalmia-associated transcription factor in Waardenburg Syndrome.

    PubMed

    Brenner, Laura; Burke, Kelly; Leduc, Charles A; Guha, Saurav; Guo, Jiancheng; Chung, Wendy K

    2011-01-01

    Waardenburg Syndrome (WS) is a syndromic form of hearing loss associated with mutations in six different genes. We identified a large family with WS that had previously undergone clinical testing, with no reported pathogenic mutation. Using linkage analysis, a region on 3p14.1 with an LOD score of 6.6 was identified. Microphthalmia-Associated Transcription Factor, a gene known to cause WS, is located within this region of linkage. Sequencing of Microphthalmia-Associated Transcription Factor demonstrated a c.1212 G>A synonymous variant that segregated with WS in the family and was predicted to create a novel splice site, which was confirmed by expression analysis of the mRNA. This case illustrates the need to computationally analyze novel synonymous sequence variants for possible effects on splicing to maximize the clinical sensitivity of sequence-based genetic testing.

  8. Mutation analysis of pre-mRNA splicing genes in Chinese families with retinitis pigmentosa

    PubMed Central

    Pan, Xinyuan; Chen, Xue; Liu, Xiaoxing; Gao, Xiang; Kang, Xiaoli; Xu, Qihua; Chen, Xuejuan; Zhao, Kanxing; Zhang, Xiumei; Chu, Qiaomei; Wang, Xiuying

    2014-01-01

    Purpose Seven genes involved in precursor mRNA (pre-mRNA) splicing have been implicated in autosomal dominant retinitis pigmentosa (adRP). We sought to detect mutations in all seven genes in Chinese families with RP, to characterize the relevant phenotypes, and to evaluate the prevalence of mutations in splicing genes in patients with adRP. Methods Six unrelated families from our adRP cohort (42 families) and two additional families with RP with uncertain inheritance mode were clinically characterized in the present study. Targeted sequence capture with next-generation massively parallel sequencing (NGS) was performed to screen mutations in 189 genes including all seven pre-mRNA splicing genes associated with adRP. Variants detected with NGS were filtered with bioinformatics analyses, validated with Sanger sequencing, and prioritized with pathogenicity analysis. Results Mutations in pre-mRNA splicing genes were identified in three individual families including one novel frameshift mutation in PRPF31 (p.Leu366fs*1) and two known mutations in SNRNP200 (p.Arg681His and p.Ser1087Leu). The patients carrying SNRNP200 p.R681H showed rapid disease progression, and the family carrying p.S1087L presented earlier onset ages and more severe phenotypes compared to another previously reported family with p.S1087L. In five other families, we identified mutations in other RP-related genes, including RP1 p.Ser781* (novel), RP2 p.Gln65* (novel) and p.Ile137del (novel), IMPDH1 p.Asp311Asn (recurrent), and RHO p.Pro347Leu (recurrent). Conclusions Mutations in splicing genes identified in the present and our previous study account for 9.5% of our adRP cohort, indicating the important role of pre-mRNA splicing deficiency in the etiology of adRP. Mutations in the same splicing gene, or even the same mutation, could correlate with different phenotypic severities, complicating the genotype–phenotype correlation and clinical prognosis. PMID:24940031

  9. A novel AVPR2 splice site mutation leads to partial X-linked nephrogenic diabetes insipidus in two brothers.

    PubMed

    Schernthaner-Reiter, Marie Helene; Adams, David; Trivellin, Giampaolo; Ramnitz, Mary Scott; Raygada, Margarita; Golas, Gretchen; Faucz, Fabio R; Nilsson, Ola; Nella, Aikaterini A; Dileepan, Kavitha; Lodish, Maya; Lee, Paul; Tifft, Cynthia; Markello, Thomas; Gahl, William; Stratakis, Constantine A

    2016-05-01

    X-linked nephrogenic diabetes insipidus (NDI, OMIM#304800) is caused by mutations in the arginine vasopressin (AVP, OMIM*192340) receptor type 2 (AVPR2, OMIM*300538) gene. A 20-month-old boy and his 8-year-old brother presented with polyuria, polydipsia, and failure to thrive. Both boys demonstrated partial DDAVP (1-desamino-8-D AVP or desmopressin) responses; thus, NDI diagnosis was delayed. While routine sequencing of AVPR2 showed a potential splice site variant, it was not until exome sequencing confirmed the AVPR2 splice site variant and did not reveal any more likely candidates that the patients' diagnosis was made and proper treatment was instituted. Both patients were hemizygous for two AVPR2 variants predicted in silico to affect AVPR2 messenger RNA (mRNA) splicing. A minigene assay revealed that the novel AVPR2 c.276A>G mutation creates a novel splice acceptor site leading to 5' truncation of AVPR2 exon 2 in HEK293 human kidney cells. Both patients have been treated with high-dose DDAVP with a remarkable improvement of their symptoms and accelerated linear growth and weight gain. We present here a unique case of partial X-linked NDI due to an AVPR2 splice site mutation; patients with diabetes insipidus of unknown etiology may harbor splice site mutations that are initially underestimated in their pathogenicity on sequence analysis. What is Known: • X-linked nephrogenic diabetes insipidus is caused by AVPR2 mutations, and disease severity can vary depending on the functional effect of the mutation. What is New: • We demonstrate here that a splice site mutation in AVPR2 leads to partial X-linked NDI in two brothers. • Treatment with high-dose DDAVP led to improvement of polyuria and polydipsia, weight gain, and growth.

  10. Postnatal Expression of V2 Vasopressin Receptor Splice Variants in the Rat Cerebellum

    PubMed Central

    Vargas, Karina J.; Sarmiento, José M.; Ehrenfeld, Pamela; Añazco, Carolina C.; Villanueva, Carolina I.; Carmona, Pamela L.; Brenet, Marianne; Navarro, Javier; Müller-Esterl, Werner; Figueroa, Carlos D.; González, Carlos B.

    2010-01-01

    The V2 vasopressin receptor gene contains an alternative splice site in exon 3, which leads to the generation of two splice variants (V2a and V2b) first identified in the kidney. The open reading frame of the alternatively spliced V2b transcript encodes a truncated receptor, showing the same amino acid sequence as the canonical V2a receptor up to the 6th transmembrane segment, but displaying a distinct sequence in the region corresponding to the 7th transmembrane segment and C-terminal domain of the V2a receptor. Here, we demonstrate the postnatal expression of V2a and V2b variants in the rat cerebellum. Most importantly, we showed by in situ hybridization and immunocytochemistry that both V2 splice variants were preferentially expressed in Purkinje cells, from early to late postnatal development. In addition, both variants were transiently expressed in the neuroblastic external granule cells and Bergmann fibers. These results indicate that the cellular distributions of both splice variants are developmentally regulated, and suggest that the transient expression of the V2 receptor is involved in the mechanisms of cerebellar cytodifferentiation by AVP. Finally, transfected CHO-K1 cells expressing similar amounts of both V2 splice variants, as found in the cerebellum, showed a significant reduction in the surface expression of V2a receptors, suggesting that the differential expression of the V2 splice variants regulates vasopressin signaling in the cerebellum. PMID:19281786

  11. Random Splicing of Several Exons Caused by a Single Base Change in the Target Exon of CRISPR/Cas9 Mediated Gene Knockout.

    PubMed

    Kapahnke, Marcel; Banning, Antje; Tikkanen, Ritva

    2016-12-14

    The clustered regularly interspaced short palindromic repeats (CRISPR)-associated sequence 9 (CRISPR/Cas9) system is widely used for genome editing purposes as it facilitates an efficient knockout of a specific gene in, e.g., cultured cells. Targeted double-strand breaks are introduced to the target sequence of the guide RNAs, which activates the cellular DNA repair mechanism for non-homologous end joining, resulting in imprecise repair and introduction of small deletions or insertions. Due to this, sequence alterations in the coding region of the target gene frequently cause frame-shift mutations, facilitating degradation of the mRNA. We here show that such CRISPR/Cas9-mediated alterations in the target exon may also result in altered splicing of the respective pre-mRNA, most likely due to mutations of splice-regulatory sequences. Using the human FLOT-1 gene as an example, we demonstrate that such altered splicing products also give rise to aberrant protein products. These may potentially function as dominant-negative proteins and thus interfere with the interpretation of the data generated with these cell lines. Since most researchers only control the consequences of CRISPR knockout at the genomic and protein levels, our data should encourage researchers to also check the alterations at the mRNA level.

  12. Splice-site mutations identified in PDE6A responsible for retinitis pigmentosa in consanguineous Pakistani families

    PubMed Central

    Khan, Shahid Y.; Ali, Shahbaz; Naeem, Muhammad Asif; Khan, Shaheen N.; Husnain, Tayyab; Butt, Nadeem H.; Qazi, Zaheeruddin A.; Akram, Javed; Riazuddin, Sheikh; Ayyagari, Radha; Hejtmancik, J. Fielding

    2015-01-01

    Purpose This study was conducted to localize and identify causal mutations associated with autosomal recessive retinitis pigmentosa (RP) in consanguineous familial cases of Pakistani origin. Methods Ophthalmic examinations that included funduscopy and electroretinography (ERG) were performed to confirm the affectation status. Blood samples were collected from all participating individuals, and genomic DNA was extracted. A genome-wide scan was performed, and two-point logarithm of odds (LOD) scores were calculated. Sanger sequencing was performed to identify the causative variants. Subsequently, we performed whole exome sequencing to rule out the possibility of a second causal variant within the linkage interval. Sequence conservation was assessed with alignment analyses of PDE6A orthologs, and in silico splicing analysis was completed with Human Splicing Finder version 2.4.1. Results A large multigenerational consanguineous family diagnosed with early-onset RP was ascertained. An ophthalmic clinical examination consisting of fundus photography and electroretinography confirmed the diagnosis of RP. A genome-wide scan was performed, and suggestive two-point LOD scores were observed with markers on chromosome 5q. Haplotype analyses identified the region; however, the region did not segregate with the disease phenotype in the family. Subsequently, we performed a second genome-wide scan that excluded the entire genome except the chromosome 5q region harboring PDE6A. Next-generation whole exome sequencing identified a splice acceptor site mutation in intron 16: c.2028–1G>A, which was completely conserved in PDE6A orthologs and was absent in 350 ethnically matched control chromosomes, the 1000 Genomes database, and the NHLBI Exome Sequencing Project. Subsequently, we investigated our entire cohort of RP familial cases and identified a second family who harbored a splice acceptor site mutation in intron 10: c.1408–2A>G. In silico analysis suggested that these mutations will result in the elimination of wild-type splice acceptor sites, which would result in either skipping of the respective exon or the creation of a new cryptic splice acceptor site; both possibilities would result in retinal photoreceptor cells that lack PDE6A wild-type protein. Conclusions We report two splice acceptor site variations in PDE6A in consanguineous Pakistani families who manifested cardinal symptoms of RP. Taken together with our previously published work, our data suggest that mutations in PDE6A account for about 2% of the total genetic load of RP in our cohort and possibly in the Pakistani population as well. PMID:26321862

  13. 76 FR 41237 - Public Utility District No. 1 of Snohomish County, WA; Notice Concluding Pre-Filing Process and...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-07-13

    ... monitoring plans; (2) a request for waivers of certain Integrated Licensing Process (ILP) regulations...Hydro Group Ltd., mounted on completely submerged gravity foundations; (2) two 250-meter service cables connected at a subsea junction box or spliced to a 0.5-kilometer subsea transmission cable, connecting to a...

  14. The in vivo use of alternate 3'-splice sites in group I introns.

    PubMed

    Sellem, C H; Belcour, L

    1994-04-11

    Alternative splicing of group I introns has been postulated as a possible mechanism that would ensure the translation of proteins encoded in intronic open reading frames that are discontinuous with the upstream exon and lack an initiation signal. Alternate splice sites were previously depicted according to secondary structures of several group I introns. We present here strong evidence that, in the case of the Podospora anserina nad1-i4 and cox1-i7 mitochondrial introns, alternative splicing events do occur in vivo. Indeed, by PCR experiments we have detected molecules whose sequence is precisely that expected if the predicted alternate 3'-splice sites were used.

  15. The spliced leader trans-splicing mechanism in different organisms: molecular details and possible biological roles

    PubMed Central

    Bitar, Mainá; Boroni, Mariana; Macedo, Andréa M.; Machado, Carlos R.; Franco, Glória R.

    2013-01-01

    The spliced leader (SL) is a gene that generates a functional ncRNA that is composed of two regions: an intronic region of unknown function (SLi) and an exonic region (SLe), which is transferred to the 5′ end of independent transcripts yielding mature mRNAs, in a process known as spliced leader trans-splicing (SLTS). The best described function for SLTS is to resolve polycistronic transcripts into monocistronic units, specifically in Trypanosomatids. In other metazoans, it is speculated that the SLe addition could lead to increased mRNA stability, differential recruitment of the translational machinery, modification of the 5′ region or a combination of these effects. Although important aspects of this mechanism have been revealed, several features remain to be elucidated. We have analyzed 157 SLe sequences from 148 species from seven phyla and found a high degree of conservation among the sequences of species from the same phylum, although no considerable similarity seems to exist between sequences of species from different phyla. When analyzing case studies, we found evidence that a given SLe will always be related to a given set of transcripts in different species from the same phylum, and therefore, different SLe sequences from the same species would regulate different sets of transcripts. In addition, we have observed distinct transcript categories to be preferential targets for the SLe addition in different phyla. This work sheds light on crucial and controversial aspects of the SLTS mechanism. It represents a comprehensive study concerning various species and different characteristics of this important post-transcriptional regulatory mechanism. PMID:24130571

  16. Characterization of intronic uridine-rich sequence elements acting as possible targets for nuclear proteins during pre-mRNA splicing in Nicotiana plumbaginifolia.

    PubMed

    Gniadkowski, M; Hemmings-Mieszczak, M; Klahre, U; Liu, H X; Filipowicz, W

    1996-02-15

    Introns of nuclear pre-mRNAs in dicotyledonous plants, unlike introns in vertebrates or yeast, are distinctly rich in A+U nucleotides and this feature is essential for their processing. In order to define more precisely sequence elements important for intron recognition in plants, we investigated the effects of short insertions, either U-rich or A-rich, on splicing of synthetic introns in transfected protoplasts of Nicotiana plumbaginifolia. It was found that insertions of U-rich (sequence UUUUUAU) but not A-rich (AUAAAAA) segments can activate splicing of a GC-rich synthetic intron, and that U-rich segments, or multimers thereof, can function irrespective of the site of insertion within the intron. Insertions of multiple U-rich segments, either at the same or different locations, generally had an additive, stimulatory effect on splicing. Mutational analysis showed that replacement of one or two U residues in the UUUUUAU sequence with A or C residues had only a small effect on splicing, but replacement with G residues was strongly inhibitory. Proteins that interact with fragments of natural and synthetic pre-mRNAs in vitro were identified in nuclear extracts of N. plumbaginifolia by UV cross-linking. The profile of cross-linked plant proteins was considerably less complex than that obtained with a HeLa cell nuclear extract. Two major cross-linkable plant proteins had apparent molecular masses of 50 and 54 kDa and showed affinity for oligouridylates present in synGC introns or for poly(U).

  17. Characterization of intronic uridine-rich sequence elements acting as possible targets for nuclear proteins during pre-mRNA splicing in Nicotiana plumbaginifolia.

    PubMed Central

    Gniadkowski, M; Hemmings-Mieszczak, M; Klahre, U; Liu, H X; Filipowicz, W

    1996-01-01

    Introns of nuclear pre-mRNAs in dicotyledonous plants, unlike introns in vertebrates or yeast, are distinctly rich in A+U nucleotides and this feature is essential for their processing. In order to define more precisely sequence elements important for intron recognition in plants, we investigated the effects of short insertions, either U-rich or A-rich, on splicing of synthetic introns in transfected protoplasts of Nicotiana plumbaginifolia. It was found that insertions of U-rich (sequence UUUUUAU) but not A-rich (AUAAAAA) segments can activate splicing of a GC-rich synthetic intron, and that U-rich segments, or multimers thereof, can function irrespective of the site of insertion within the intron. Insertions of multiple U-rich segments, either at the same or different locations, generally had an additive, stimulatory effect on splicing. Mutational analysis showed that replacement of one or two U residues in the UUUUUAU sequence with A or C residues had only a small effect on splicing, but replacement with G residues was strongly inhibitory. Proteins that interact with fragments of natural and synthetic pre-mRNAs in vitro were identified in nuclear extracts of N. plumbaginifolia by UV cross-linking. The profile of cross-linked plant proteins was considerably less complex than that obtained with a HeLa cell nuclear extract. Two major cross-linkable plant proteins had apparent molecular masses of 50 and 54 kDa and showed affinity for oligouridylates present in synGC introns or for poly(U). PMID:8604302

  18. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Vidaud, M.; Vidaud, D.; Amselem, S.

    The authors have characterized a Mediterranean β-thalassemia allele containing a sequence change at codon 30 that alters both β-globin pre-mRNA splicing and the structure of the hemoglobin product. Presumably, this G→C transversion at position −1 of intron 1 severely reduces the utilization of the normal 5′ splice site, since the level of the Arg→Thr mutant hemoglobin (designated hemoglobin Kairouan) found in the erythrocytes of the patient is very low (2% of total hemoglobin). Since no natural mutations of the guanine located at position −1 of the CAG/GTAAGT consensus sequence had been isolated previously, they investigated the role of this nucleotide in the constitution of an active 5′ splice site by studying the splicing of the pre-mRNA in cell-free extracts. They demonstrate that correct splicing of the mutant pre-mRNA is 98% inhibited. Their results provide further insights into the mechanisms of pre-mRNA maturation by revealing that the last residue of the exon plays a role at least equivalent to that of the intron residue at position +5.
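
    The comparison against the CAG/GTAAGT consensus can be illustrated with a minimal Python sketch. The function name `consensus_mismatches`, the position labels and both example windows are assumptions made for illustration; they are not sequences or methods from the study, and practical splice-site scoring usually relies on position weight matrices rather than exact matching.

```python
# Consensus 5' splice site quoted in the abstract: CAG/GTAAGT
# (exon positions -3..-1 followed by intron positions +1..+6).
CONSENSUS = "CAGGTAAGT"
POSITIONS = [-3, -2, -1, 1, 2, 3, 4, 5, 6]


def consensus_mismatches(site):
    """Return the positions at which a 9-nt window differs from the consensus."""
    if len(site) != len(CONSENSUS):
        raise ValueError("expected a 9-nucleotide window")
    return [pos for pos, a, b in zip(POSITIONS, site.upper(), CONSENSUS) if a != b]


# Hypothetical windows: a perfect consensus site, and the same site
# carrying a G>C change at the last exon position (-1).
print(consensus_mismatches("CAGGTAAGT"))  # []
print(consensus_mismatches("CACGTAAGT"))  # [-1]
```

    A mismatch count of this kind only flags where a site departs from the consensus; it says nothing about how strongly each position affects splicing, which is what the cell-free assays in the record address.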

  19. Cryptic splice site in the complementary DNA of glucocerebrosidase causes inefficient expression.

    PubMed

    Bukovac, Scott W; Bagshaw, Richard D; Rigat, Brigitte A; Callahan, John W; Clarke, Joe T R; Mahuran, Don J

    2008-10-15

    The low levels of human lysosomal glucocerebrosidase activity expressed in transiently transfected Chinese hamster ovary (CHO) cells were investigated. Reverse transcription PCR (RT-PCR) demonstrated that a significant portion of the transcribed RNA was misspliced owing to the presence of a cryptic splice site in the complementary DNA (cDNA). Missplicing results in the deletion of 179 bp of coding sequence and a premature stop codon. A repaired cDNA was constructed abolishing the splice site without changing the amino acid sequence. The level of glucocerebrosidase expression was increased sixfold. These data demonstrate that for maximum expression of any cDNA construct, the transcription products should be examined.
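
    The link between a 179 bp deletion and a premature stop codon is consistent with a reading-frame shift, which can be checked with simple arithmetic. The short Python sketch below is illustrative only; the helper name `shifts_reading_frame` is an assumption and not part of the study.

```python
def shifts_reading_frame(deleted_bp):
    """A deletion preserves the reading frame only if its length is a multiple of 3."""
    return deleted_bp % 3 != 0


deleted = 179  # deletion length reported in the abstract
print(deleted % 3)                    # 2
print(shifts_reading_frame(deleted))  # True
```

    Because 179 leaves a remainder of 2 when divided by 3, every codon downstream of the deletion is read out of frame, which typically exposes a stop codon soon after the junction.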

  20. Circular RNAs Are the Predominant Transcript Isoform from Hundreds of Human Genes in Diverse Cell Types

    PubMed Central

    Wang, Peter Lincoln; Lacayo, Norman; Brown, Patrick O.

    2012-01-01

    Most human pre-mRNAs are spliced into linear molecules that retain the exon order defined by the genomic sequence. By deep sequencing of RNA from a variety of normal and malignant human cells, we found RNA transcripts from many human genes in which the exons were arranged in a non-canonical order. Statistical estimates and biochemical assays provided strong evidence that a substantial fraction of the spliced transcripts from hundreds of genes are circular RNAs. Our results suggest that a non-canonical mode of RNA splicing, resulting in a circular RNA isoform, is a general feature of the gene expression program in human cells. PMID:22319583

  1. Gene organization and alternative splicing of human prohormone convertase PC8.

    PubMed Central

    Goodge, K A; Thomas, R J; Martin, T J; Gillespie, M T

    1998-01-01

    The mammalian Ca2+-dependent serine protease prohormone convertase PC8 is expressed ubiquitously, being transcribed as 3.5, 4.3 and 6.0 kb mRNA isoforms in various tissues. To determine the origin of these various mRNA isoforms we report the characterization of the human PC8 gene, which has been previously localized to chromosome 11q23-24. Consisting of 16 exons, the human PC8 gene spans approx. 27 kb. A comparison of the position of intron-exon junctions of the human PC8 gene with the gene structures of previously reported prohormone convertase genes demonstrated a divergence of the human PC8 from the highly conserved nature of the gene organization of this enzyme family. The nucleotide sequence of the 5'-flanking region of the human PC8 is reported and possesses putative promoter elements characteristic of a GC-rich promoter. Further supporting the potential role of a GC-rich promoter element, multiple transcriptional initiation sites within a 200 bp region were demonstrated. We propose that the various mRNA isoforms of PC8 result from the inclusion of intronic sequences within transcripts. PMID:9820811

  2. The alternatively-included 11a sequence modifies the effects of Mena on actin cytoskeletal organization and cell behavior

    PubMed Central

    Balsamo, Michele; Mondal, Chandrani; Carmona, Guillaume; McClain, Leslie M.; Riquelme, Daisy N.; Tadros, Jenny; Ma, Duan; Vasile, Eliza; Condeelis, John S.; Lauffenburger, Douglas A.; Gertler, Frank B.

    2016-01-01

    During tumor progression, alternative splicing gives rise to different Mena protein isoforms. We analyzed how Mena11a, an isoform enriched in epithelia and epithelial-like cells, affects Mena-dependent regulation of actin dynamics and cell behavior. While other Mena isoforms promote actin polymerization and drive membrane protrusion, we find that Mena11a decreases actin polymerization and growth factor-stimulated membrane protrusion at lamellipodia. Ectopic Mena11a expression slows mesenchymal-like cell motility, while isoform-specific depletion of endogenous Mena11a in epithelial-like tumor cells perturbs cell:cell junctions and increases membrane protrusion and overall cell motility. Mena11a can dampen membrane protrusion and reduce actin polymerization in the absence of other Mena isoforms, indicating that it is not simply an inactive Mena isoform. We identify a phosphorylation site within 11a that is required for some Mena11a-specific functions. RNA-seq data analysis from patient cohorts demonstrates that the difference between mRNAs encoding constitutive Mena sequences and those containing the 11a exon correlates with metastasis in colorectal cancer, suggesting that 11a exon exclusion contributes to invasive phenotypes and leads to poor clinical outcomes. PMID:27748415

  3. The alternatively-included 11a sequence modifies the effects of Mena on actin cytoskeletal organization and cell behavior.

    PubMed

    Balsamo, Michele; Mondal, Chandrani; Carmona, Guillaume; McClain, Leslie M; Riquelme, Daisy N; Tadros, Jenny; Ma, Duan; Vasile, Eliza; Condeelis, John S; Lauffenburger, Douglas A; Gertler, Frank B

    2016-10-17

    During tumor progression, alternative splicing gives rise to different Mena protein isoforms. We analyzed how Mena11a, an isoform enriched in epithelia and epithelial-like cells, affects Mena-dependent regulation of actin dynamics and cell behavior. While other Mena isoforms promote actin polymerization and drive membrane protrusion, we find that Mena11a decreases actin polymerization and growth factor-stimulated membrane protrusion at lamellipodia. Ectopic Mena11a expression slows mesenchymal-like cell motility, while isoform-specific depletion of endogenous Mena11a in epithelial-like tumor cells perturbs cell:cell junctions and increases membrane protrusion and overall cell motility. Mena11a can dampen membrane protrusion and reduce actin polymerization in the absence of other Mena isoforms, indicating that it is not simply an inactive Mena isoform. We identify a phosphorylation site within 11a that is required for some Mena11a-specific functions. RNA-seq data analysis from patient cohorts demonstrates that the difference between mRNAs encoding constitutive Mena sequences and those containing the 11a exon correlates with metastasis in colorectal cancer, suggesting that 11a exon exclusion contributes to invasive phenotypes and leads to poor clinical outcomes.

  4. Proteogenomic analysis reveals alternative splicing and translation as part of the abscisic acid response in Arabidopsis seedlings.

    PubMed

    Zhu, Fu-Yuan; Chen, Mo-Xian; Ye, Neng-Hui; Shi, Lu; Ma, Kai-Long; Yang, Jing-Fang; Cao, Yun-Ying; Zhang, Youjun; Yoshida, Takuya; Fernie, Alisdair R; Fan, Guang-Yi; Wen, Bo; Zhou, Ruo; Liu, Tie-Yuan; Fan, Tao; Gao, Bei; Zhang, Di; Hao, Ge-Fei; Xiao, Shi; Liu, Ying-Gao; Zhang, Jianhua

    2017-08-01

    In eukaryotes, mechanisms such as alternative splicing (AS) and alternative translation initiation (ATI) contribute to organismal protein diversity. Specifically, splicing factors play crucial roles in responses to environment and development cues; however, the underlying mechanisms are not well investigated in plants. Here, we report the parallel employment of short-read RNA sequencing, single molecule long-read sequencing and proteomic identification to unravel AS isoforms and previously unannotated proteins in response to abscisic acid (ABA) treatment. Combining the data from the two sequencing methods, approximately 83.4% of intron-containing genes were alternatively spliced. Two AS types, which are referred to as alternative first exon (AFE) and alternative last exon (ALE), were more abundant than intron retention (IR); however, by contrast to AS events detected under normal conditions, differentially expressed AS isoforms were more likely to be translated. ABA extensively affects the AS pattern, indicated by the increasing number of non-conventional splicing sites. This work also identified thousands of unannotated peptides and proteins by ATI based on mass spectrometry and a virtual peptide library deduced from both strands of coding regions within the Arabidopsis genome. The results enhance our understanding of AS and alternative translation mechanisms under normal conditions, and in response to ABA treatment. © 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.

  5. Engineered U7 snRNA mediates sustained splicing correction in erythroid cells from β-thalassemia/HbE patients.

    PubMed

    Preedagasamzin, Sarinthip; Nualkaew, Tiwaporn; Pongrujikorn, Tanjitti; Jinawath, Natini; Kole, Ryszard; Fucharoen, Suthat; Jearawiriyapaisarn, Natee; Svasti, Saovaros

    2018-04-30

    Repair of a splicing defect of β-globin pre-mRNA harboring hemoglobin E (HbE) mutation was successfully accomplished in erythroid cells from patients with β-thalassemia/HbE disorder by a synthetic splice-switching oligonucleotide (SSO). However, its application is limited by short-term effectiveness and requirement of lifelong periodic administration of SSO, especially for chronic diseases like thalassemias. Here, we engineered lentiviral vectors that stably express U7 small nuclear RNA (U7 snRNA) carrying the splice-switching sequence of the SSO that restores correct splicing of βE-globin pre-mRNA and achieves a long-term therapeutic effect. Using a two-step tiling approach, we systematically screened U7 snRNAs carrying splice-switching SSO sequences targeted to the cryptic 5' splice site created by HbE mutation. We tested this approach and identified the most responsive element for mediating splicing correction in engineered U7 snRNAs in the HeLa-βE model cell line. Remarkably, the U7 snRNA lentiviral vector (U7 βE4+1) targeted to this region effectively restored the correctly-spliced βE-globin mRNA for at least 5 months. Moreover, the effects of the U7 βE4+1 snRNA lentiviral vector were also evident as upregulation of the correctly-spliced βE-globin mRNA in erythroid progenitor cells from β-thalassemia/HbE patients treated with the vector, which led to improvements of pathologies in erythroid progenitor cells from thalassemia patients. These results suggest that the splicing correction of βE-globin pre-mRNA by the engineered U7 snRNA lentiviral vector provides a promising, long-term treatment for β-thalassemia/HbE. Copyright © 2018 Elsevier Inc. All rights reserved.

  6. Polyoma virus small tumor antigen pre-mRNA splicing requires cooperation between two 3' splice sites.

    PubMed Central

    Ge, H; Noble, J; Colgan, J; Manley, J L

    1990-01-01

    We have studied splicing of the polyoma virus early region pre-mRNA in vitro. This RNA is alternatively spliced in vivo to produce mRNA encoding the large, middle-sized (MTAg), and small (StAg) tumor antigens. Our primary interest was to learn how the 48-nucleotide StAg intron is excised, because the length of this intron is significantly less than the apparent minimum established for mammalian introns. Although the products of all three splices are detected in vitro, characterization of the pathway and sequence requirements of StAg splicing suggests that splicing factors interact with the precursor RNA in an unexpected way to catalyze removal of this intron. Specifically, StAg splicing uses either of two lariat branch points, one of which is located only 4 nucleotides from the 3' splice site. Furthermore, the StAg splice absolutely requires that the alternative MTAg 3' splice site, located 14 nucleotides downstream of the StAg 3' splice site, be intact. Insertion mutations that increase or decrease the quality of the MTAg pyrimidine stretch enhance or repress StAg as well as MTAg splicing, and a single-base change in the MTAg AG splice acceptor totally blocks both splices. These results demonstrate the ability of two 3' splice sites to cooperate with each other to bring about removal of a single intron. PMID:2159146

  7. An RNAi-Enhanced Logic Circuit for Cancer Specific Detection and Destruction

    DTIC Science & Technology

    2013-02-01

    monomeric protein secreted by Corynebacterium diphtheriae, and pro-apoptotic members of Bcl-2 family: mBax (Mus musculus), hBax (Homo sapiens), and its...Gata3 mStaple. Intron-feature sequences – donor site, branch point, polypyrimidine tract, and acceptor site – were selected based on previously...sequences found in literature our intron features were chosen according SplicePort [4], an online analyzer that detects the likelihood of splicing to

  8. Alternative Splicing in Neurogenesis and Brain Development.

    PubMed

    Su, Chun-Hao; D, Dhananjaya; Tarn, Woan-Yuh

    2018-01-01

    Alternative splicing of precursor mRNA is an important mechanism that increases transcriptomic and proteomic diversity and also post-transcriptionally regulates mRNA levels. Alternative splicing occurs at high frequency in brain tissues and contributes to every step of nervous system development, including cell-fate decisions, neuronal migration, axon guidance, and synaptogenesis. Genetic manipulation and RNA sequencing have provided insights into the molecular mechanisms underlying the effects of alternative splicing in stem cell self-renewal and neuronal fate specification. Timely expression and perhaps post-translational modification of neuron-specific splicing regulators play important roles in neuronal development. Alternative splicing of many key transcription regulators or epigenetic factors reprograms the transcriptome and hence contributes to stem cell fate determination. During neuronal differentiation, alternative splicing also modulates signaling activity, centriolar dynamics, and metabolic pathways. Moreover, alternative splicing impacts cortical lamination and neuronal development and function. In this review, we focus on recent progress toward understanding the contributions of alternative splicing to neurogenesis and brain development, which has shed light on how splicing defects may cause brain disorders and diseases.

  9. Functional assessment of a novel COL4A5 splice region variant and immunostaining of plucked hair follicles as an alternative method of diagnosis in X-linked Alport syndrome.

    PubMed

    Malone, Andrew F; Funk, Steven D; Alhamad, Tarek; Miner, Jeffrey H

    2017-06-01

    Many COL4A5 splice region variants have been described in patients with X-linked Alport syndrome, but few have been confirmed by functional analysis to actually cause defective splicing. We sought to demonstrate that a novel COL4A5 splice region variant in a family with Alport syndrome is pathogenic using functional studies. We also describe an alternative method of diagnosis. Targeted next-generation sequencing results of an individual with Alport syndrome were analyzed and the results confirmed by Sanger sequencing in family members. A splicing reporter minigene assay was used to examine the variant's effect on splicing in transfected cells. Plucked hair follicles from patients and controls were examined for collagen IV proteins using immunofluorescence microscopy. A novel splice region mutation in COL4A5, c.1780-6T>G, was identified and segregated with disease in this family. This variant caused frequent skipping of exon 25, resulting in a frameshift and truncation of collagen α5(IV) protein. We also developed and validated a new approach to characterize the expression of collagen α5(IV) protein in the basement membranes of plucked hair follicles. Using this approach we demonstrated reduced collagen α5(IV) protein in affected male and female individuals in this family, supporting frequent failure of normal splicing. Differing normal to abnormal transcript ratios in affected individuals carrying splice region variants may contribute to variable disease severity observed in Alport families. Examination of plucked hair follicles in suspected X-linked Alport syndrome patients may offer a less invasive alternative method of diagnosis and serve as a pathogenicity test for COL4A5 variants of uncertain significance.

  10. Functional assessment of a novel COL4A5 splice region variant and immunostaining of plucked hair follicles as an alternative method of diagnosis in X-linked Alport syndrome

    PubMed Central

    Malone, Andrew F.; Funk, Steven D.; Alhamad, Tarek; Miner, Jeffrey H.

    2016-01-01

    Introduction Many COL4A5 splice region variants have been described in patients with X-linked Alport syndrome, but few have been confirmed by functional analysis to actually cause defective splicing. We sought to demonstrate that a novel COL4A5 splice region variant in a family with Alport syndrome is pathogenic using functional studies. We also describe an alternative method of diagnosis. Methods We analyzed targeted next-generation sequencing results of an individual with Alport syndrome and confirmed results by Sanger sequencing in family members. A splicing reporter minigene assay was used to examine the variant’s effect on splicing in transfected cells. Plucked hair follicles from patients and controls were examined for collagen IV proteins using immunofluorescence microscopy. Results A novel splice region mutation in COL4A5, c.1780-6T>G, was identified and segregated with disease in this family. This variant caused frequent skipping of exon 25, resulting in a frameshift and truncation of collagen α5(IV) protein. We also developed and validated a new approach to characterize the expression of collagen α5(IV) protein in the basement membranes of plucked hair follicles. We demonstrated reduced collagen α5(IV) protein in affected male and female individuals in this family, supporting frequent failure of normal splicing. Conclusions Differing normal to abnormal transcript ratios in affected individuals carrying splice region variants may contribute to variable disease severity observed in Alport families. Examination of plucked hair follicles in suspected X-linked Alport syndrome patients may offer a less invasive alternative method of diagnosis and serve as a pathogenicity test for COL4A5 variants of uncertain significance. PMID:28013382

  11. RNA editing in nascent RNA affects pre-mRNA splicing

    PubMed Central

    Hsiao, Yun-Hua Esther; Bahn, Jae Hoon; Yang, Yun; Lin, Xianzhi; Tran, Stephen; Yang, Ei-Wen; Quinones-Valdez, Giovanni

    2018-01-01

    In eukaryotes, nascent RNA transcripts undergo an intricate series of RNA processing steps to achieve mRNA maturation. RNA editing and alternative splicing are two major RNA processing steps that can introduce significant modifications to the final gene products. By tackling these processes in isolation, recent studies have enabled substantial progress in understanding their global RNA targets and regulatory pathways. However, the interplay between individual steps of RNA processing, an essential aspect of gene regulation, remains poorly understood. By sequencing the RNA of different subcellular fractions, we examined the timing of adenosine-to-inosine (A-to-I) RNA editing and its impact on alternative splicing. We observed that >95% A-to-I RNA editing events occurred in the chromatin-associated RNA prior to polyadenylation. We report about 500 editing sites in the 3′ acceptor sequences that can alter splicing of the associated exons. These exons are highly conserved during evolution and reside in genes with important cellular function. Furthermore, we identified a second class of exons whose splicing is likely modulated by RNA secondary structures that are recognized by the RNA editing machinery. The genome-wide analyses, supported by experimental validations, revealed remarkable interplay between RNA editing and splicing and expanded the repertoire of functional RNA editing sites. PMID:29724793

  12. RNA editing in nascent RNA affects pre-mRNA splicing.

    PubMed

    Hsiao, Yun-Hua Esther; Bahn, Jae Hoon; Yang, Yun; Lin, Xianzhi; Tran, Stephen; Yang, Ei-Wen; Quinones-Valdez, Giovanni; Xiao, Xinshu

    2018-06-01

    In eukaryotes, nascent RNA transcripts undergo an intricate series of RNA processing steps to achieve mRNA maturation. RNA editing and alternative splicing are two major RNA processing steps that can introduce significant modifications to the final gene products. By tackling these processes in isolation, recent studies have enabled substantial progress in understanding their global RNA targets and regulatory pathways. However, the interplay between individual steps of RNA processing, an essential aspect of gene regulation, remains poorly understood. By sequencing the RNA of different subcellular fractions, we examined the timing of adenosine-to-inosine (A-to-I) RNA editing and its impact on alternative splicing. We observed that >95% A-to-I RNA editing events occurred in the chromatin-associated RNA prior to polyadenylation. We report about 500 editing sites in the 3' acceptor sequences that can alter splicing of the associated exons. These exons are highly conserved during evolution and reside in genes with important cellular function. Furthermore, we identified a second class of exons whose splicing is likely modulated by RNA secondary structures that are recognized by the RNA editing machinery. The genome-wide analyses, supported by experimental validations, revealed remarkable interplay between RNA editing and splicing and expanded the repertoire of functional RNA editing sites. © 2018 Hsiao et al.; Published by Cold Spring Harbor Laboratory Press.

  13. A short conserved motif in ALYREF directs cap- and EJC-dependent assembly of export complexes on spliced mRNAs

    PubMed Central

    Gromadzka, Agnieszka M.; Steckelberg, Anna-Lena; Singh, Kusum K.; Hofmann, Kay; Gehring, Niels H.

    2016-01-01

    The export of messenger RNAs (mRNAs) is the final of several nuclear posttranscriptional steps of gene expression. The formation of export-competent mRNPs involves the recruitment of export factors that are assumed to facilitate transport of the mature mRNAs. Using in vitro splicing assays, we show that a core set of export factors, including ALYREF, UAP56 and DDX39, readily associate with the spliced RNAs in an EJC (exon junction complex)- and cap-dependent manner. In order to elucidate how ALYREF and other export adaptors mediate mRNA export, we conducted a computational analysis and discovered four short, conserved, linear motifs present in RNA-binding proteins. We show that mutation in one of the new motifs (WxHD) in an unstructured region of ALYREF reduced RNA binding and abolished the interaction with eIF4A3 and CBP80. Additionally, the mutation impaired proper localization to nuclear speckles and export of a spliced reporter mRNA. Our results reveal important details of the orchestrated recruitment of export factors during the formation of export competent mRNPs. PMID:26773052

  14. Sequence variation between 462 human individuals fine-tunes functional sites of RNA processing

    NASA Astrophysics Data System (ADS)

    Ferreira, Pedro G.; Oti, Martin; Barann, Matthias; Wieland, Thomas; Ezquina, Suzana; Friedländer, Marc R.; Rivas, Manuel A.; Esteve-Codina, Anna; Estivill, Xavier; Guigó, Roderic; Dermitzakis, Emmanouil; Antonarakis, Stylianos; Meitinger, Thomas; Strom, Tim M.; Palotie, Aarno; François Deleuze, Jean; Sudbrak, Ralf; Lerach, Hans; Gut, Ivo; Syvänen, Ann-Christine; Gyllensten, Ulf; Schreiber, Stefan; Rosenstiel, Philip; Brunner, Han; Veltman, Joris; Hoen, Peter A. C. T.; Jan van Ommen, Gert; Carracedo, Angel; Brazma, Alvis; Flicek, Paul; Cambon-Thomsen, Anne; Mangion, Jonathan; Bentley, David; Hamosh, Ada; Rosenstiel, Philip; Strom, Tim M.; Lappalainen, Tuuli; Guigó, Roderic; Sammeth, Michael

    2016-09-01

    Recent advances in the cost-efficiency of sequencing technologies enabled the combined DNA- and RNA-sequencing of human individuals at the population-scale, making genome-wide investigations of the inter-individual genetic impact on gene expression viable. Employing mRNA-sequencing data from the Geuvadis Project and genome sequencing data from the 1000 Genomes Project we show that the computational analysis of DNA sequences around splice sites and poly-A signals is able to explain several observations in the phenotype data. In contrast to widespread assessments of statistically significant associations between DNA polymorphisms and quantitative traits, we developed a computational tool to pinpoint the molecular mechanisms by which genetic markers drive variation in RNA-processing, cataloguing and classifying alleles that change the affinity of core RNA elements to their recognizing factors. The in silico models we employ further suggest RNA editing can moonlight as a splicing-modulator, albeit less frequently than genomic sequence diversity. Beyond existing annotations, we demonstrate that the ultra-high resolution of RNA-Seq combined from 462 individuals also provides evidence for thousands of bona fide novel elements of RNA processing—alternative splice sites, introns, and cleavage sites—which are often rare and lowly expressed but in other characteristics similar to their annotated counterparts.

  15. Short intronic repeat sequences facilitate circular RNA production

    PubMed Central

    Liang, Dongming

    2014-01-01

    Recent deep sequencing studies have revealed thousands of circular noncoding RNAs generated from protein-coding genes. These RNAs are produced when the precursor messenger RNA (pre-mRNA) splicing machinery “backsplices” and covalently joins, for example, the two ends of a single exon. However, the mechanism by which the spliceosome selects only certain exons to circularize is largely unknown. Using extensive mutagenesis of expression plasmids, we show that miniature introns containing the splice sites along with short (∼30- to 40-nucleotide) inverted repeats, such as Alu elements, are sufficient to allow the intervening exons to circularize in cells. The intronic repeats must base-pair to one another, thereby bringing the splice sites into close proximity to each other. More than simple thermodynamics is clearly at play, however, as not all repeats support circularization, and increasing the stability of the hairpin between the repeats can sometimes inhibit circular RNA biogenesis. The intronic repeats and exonic sequences must collaborate with one another, and a functional 3′ end processing signal is required, suggesting that circularization may occur post-transcriptionally. These results suggest detailed and generalizable models that explain how the splicing machinery determines whether to produce a circular noncoding RNA or a linear mRNA. PMID:25281217
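
    The base-pairing requirement for the flanking intronic repeats can be sketched in Python as a search for a segment of the upstream intron whose reverse complement occurs in the downstream intron. The function names, the `min_len` parameter and the toy sequences are assumptions for illustration, not the authors' method; the sketch considers Watson-Crick pairs only and ignores hairpin stability, which the abstract notes also matters.

```python
COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement(rna):
    """Reverse complement of an RNA sequence (Watson-Crick pairs only)."""
    return rna.upper().translate(COMPLEMENT)[::-1]


def longest_inverted_repeat(upstream, downstream, min_len=4):
    """Longest segment of the upstream intron whose reverse complement
    occurs in the downstream intron, i.e. a stretch that could base-pair."""
    upstream, downstream = upstream.upper(), downstream.upper()
    best = ""
    n = len(upstream)
    for i in range(n):
        # Only try segments longer than the best one found so far.
        for j in range(i + max(min_len, len(best) + 1), n + 1):
            segment = upstream[i:j]
            if reverse_complement(segment) in downstream:
                best = segment
            else:
                break  # a longer segment from the same start cannot match either
    return best


# Toy flanking introns (hypothetical sequences).
print(longest_inverted_repeat("AAAGGCUCAAAA", "CCCUGAGCCCCC"))  # GGCUCA
```

    The repeats described in the record are roughly 30 to 40 nucleotides long; the toy example uses a 6-nucleotide repeat only to keep the output easy to verify by hand.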

  16. Lessons from non-canonical splicing

    PubMed Central

    Ule, Jernej

    2016-01-01

    Recent improvements in experimental and computational techniques used to study the transcriptome have enabled an unprecedented view of RNA processing, revealing many previously unknown non-canonical splicing events. This includes cryptic events located far from the currently annotated exons, and unconventional splicing mechanisms that have important roles in regulating gene expression. These non-canonical splicing events are a major source of newly emerging transcripts during evolution, especially when they involve sequences derived from transposable elements. They are therefore under precise regulation and quality control, which minimises their potential to disrupt gene expression. While non-canonical splicing can lead to aberrant transcripts that cause many diseases, we also explain how it can be exploited for new therapeutic strategies. PMID:27240813

  17. Diverse alternative back-splicing and alternative splicing landscape of circular RNAs

    PubMed Central

    Zhang, Xiao-Ou; Dong, Rui; Zhang, Yang; Zhang, Jia-Lin; Luo, Zheng; Zhang, Jun; Chen, Ling-Ling; Yang, Li

    2016-01-01

    Circular RNAs (circRNAs) derived from back-spliced exons have been widely identified as being co-expressed with their linear counterparts. A single gene locus can produce multiple circRNAs through alternative back-splice site selection and/or alternative splice site selection; however, a detailed map of alternative back-splicing/splicing in circRNAs is lacking. Here, with the upgraded CIRCexplorer2 pipeline, we systematically annotated different types of alternative back-splicing and alternative splicing events in circRNAs from various cell lines. Compared with their linear cognate RNAs, circRNAs exhibited distinct patterns of alternative back-splicing and alternative splicing. Alternative back-splice site selection was correlated with the competition of putative RNA pairs across introns that bracket alternative back-splice sites. In addition, all four basic types of alternative splicing that have been identified in the (linear) mRNA process were found within circRNAs, and many exons were predominantly spliced in circRNAs. Unexpectedly, thousands of previously unannotated exons were detected in circRNAs from the examined cell lines. Although these novel exons had similar splice site strength, they were much less conserved than known exons in sequences. Finally, both alternative back-splicing and circRNA-predominant alternative splicing were highly diverse among the examined cell lines. All of the identified alternative back-splicing and alternative splicing in circRNAs are available in the CIRCpedia database (http://www.picb.ac.cn/rnomics/circpedia). Collectively, the annotation of alternative back-splicing and alternative splicing in circRNAs provides a valuable resource for depicting the complexity of circRNA biogenesis and for studying the potential functions of circRNAs in different cells. PMID:27365365

  18. 75 FR 11533 - Public Utility District No. 1 of Snohomish County, WA; Notice of Technical Meeting To Discuss...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-03-11

    ... supplied by OpenHydro Group Ltd., mounted on completely submerged gravity foundations; (2) two 250-meter service cables connected at a subsea junction box or spliced to a 0.5-kilometer subsea transmission cable... building; (4) a 140-meter long buried cable from the control building to the grid; and (5) appurtenant...

  19. RNA splicing regulated by RBFOX1 is essential for cardiac function in zebrafish.

    PubMed

    Frese, Karen S; Meder, Benjamin; Keller, Andreas; Just, Steffen; Haas, Jan; Vogel, Britta; Fischer, Simon; Backes, Christina; Matzas, Mark; Köhler, Doreen; Benes, Vladimir; Katus, Hugo A; Rottbauer, Wolfgang

    2015-08-15

    Alternative splicing is one of the major mechanisms through which the proteomic and functional diversity of eukaryotes is achieved. However, the complex nature of the splicing machinery, its associated splicing regulators and the functional implications of alternatively spliced transcripts are only poorly understood. Here, we investigated the functional role of the splicing regulator rbfox1 in vivo using the zebrafish as a model system. We found that loss of rbfox1 led to progressive cardiac contractile dysfunction and heart failure. By using deep-transcriptome sequencing and quantitative real-time PCR, we show that depletion of rbfox1 in zebrafish results in an altered isoform expression of several crucial target genes, such as actn3a and hug. This study underlines that tightly regulated splicing is necessary for unconstrained cardiac function and renders the splicing regulator rbfox1 an interesting target for investigation in human heart failure and cardiomyopathy. © 2015. Published by The Company of Biologists Ltd.

  20. An alternative splicing program promotes adipose tissue thermogenesis

    PubMed Central

    Vernia, Santiago; Edwards, Yvonne JK; Han, Myoung Sook; Cavanagh-Kyros, Julie; Barrett, Tamera; Kim, Jason K; Davis, Roger J

    2016-01-01

    Alternative pre-mRNA splicing expands the complexity of the transcriptome and controls isoform-specific gene expression. Whether alternative splicing contributes to metabolic regulation is largely unknown. Here we investigated the contribution of alternative splicing to the development of diet-induced obesity. We found that obesity-induced changes in adipocyte gene expression include alternative pre-mRNA splicing. Bioinformatics analysis associated part of this alternative splicing program with sequence specific NOVA splicing factors. This conclusion was confirmed by studies of mice with NOVA deficiency in adipocytes. Phenotypic analysis of the NOVA-deficient mice demonstrated increased adipose tissue thermogenesis and improved glycemia. We show that NOVA proteins mediate a splicing program that suppresses adipose tissue thermogenesis. Together, these data provide quantitative analysis of gene expression at exon-level resolution in obesity and identify a novel mechanism that contributes to the regulation of adipose tissue function and the maintenance of normal glycemia. DOI: http://dx.doi.org/10.7554/eLife.17672.001 PMID:27635635

  1. Evaluating approaches to find exon chains based on long reads.

    PubMed

    Kuosmanen, Anna; Norri, Tuukka; Mäkinen, Veli

    2018-05-01

    Transcript prediction can be modeled as a graph problem where exons are modeled as nodes and reads spanning two or more exons are modeled as exon chains. Pacific Biosciences third-generation sequencing technology produces significantly longer reads than earlier second-generation sequencing technologies, which gives valuable information about longer exon chains in a graph. However, with the high error rates of third-generation sequencing, aligning long reads correctly around the splice sites is a challenging task. Incorrect alignments lead to spurious nodes and arcs in the graph, which in turn lead to incorrect transcript predictions. We survey several approaches to find the exon chains corresponding to long reads in a splicing graph, and experimentally study the performance of these methods using simulated data to allow for sensitivity/precision analysis. Our experiments show that short reads from second-generation sequencing can be used to significantly improve exon chain correctness either by error-correcting the long reads before splicing graph creation, or by using them to create a splicing graph on which the long-read alignments are then projected. We also study the memory and time consumption of various modules, and show that accurate exon chains lead to significantly increased transcript prediction accuracy. The simulated data and in-house scripts used for this article are available at http://www.cs.helsinki.fi/group/gsa/exon-chains/exon-chains-bib.tar.bz2.
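
    The graph model described in this abstract can be illustrated with a small sketch. This is not the authors' implementation: the function names (exon_chain, build_splicing_graph), the half-open coordinate convention, and the toy coordinates are all assumptions made for illustration, and exons are assumed to be sorted and non-overlapping.

```python
from collections import defaultdict


def exon_chain(read_blocks, exons):
    """Return the ordered list of exon indices touched by one read.

    read_blocks: aligned segments of the read as (start, end), half-open.
    exons: sorted, non-overlapping exons as (start, end), half-open.
    """
    chain = []
    for b_start, b_end in read_blocks:
        for idx, (e_start, e_end) in enumerate(exons):
            # Two half-open intervals overlap when each starts before the other ends.
            if b_start < e_end and e_start < b_end:
                if not chain or chain[-1] != idx:
                    chain.append(idx)
    return chain


def build_splicing_graph(reads, exons):
    """Count arcs between consecutive exons in every read's exon chain."""
    arcs = defaultdict(int)
    for blocks in reads:
        chain = exon_chain(blocks, exons)
        for a, b in zip(chain, chain[1:]):
            arcs[(a, b)] += 1
    return dict(arcs)


# Toy data (hypothetical): three exons and two spliced reads.
exons = [(100, 200), (300, 400), (500, 600)]
reads = [
    [(150, 200), (300, 400), (500, 550)],  # spans all three exons
    [(180, 200), (500, 560)],              # skips the middle exon
]
chains = [exon_chain(r, exons) for r in reads]
graph = build_splicing_graph(reads, exons)
print(chains)
print(graph)
```

    In this toy setting a misaligned block near a splice site would show up as an extra or missing exon index in a chain, which is the kind of spurious node or arc the abstract refers to.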

  2. LEDGF/p75 interacts with mRNA splicing factors and targets HIV-1 integration to highly spliced genes

    PubMed Central

    Singh, Parmit Kumar; Plumb, Matthew R.; Ferris, Andrea L.; Iben, James R.; Wu, Xiaolin; Fadel, Hind J.; Luke, Brian T.; Esnault, Caroline; Poeschla, Eric M.; Hughes, Stephen H.; Kvaratskhelia, Mamuka; Levin, Henry L.

    2015-01-01

    The host chromatin-binding factor LEDGF/p75 interacts with HIV-1 integrase and directs integration to active transcription units. To understand how LEDGF/p75 recognizes transcription units, we sequenced 1 million HIV-1 integration sites isolated from cultured HEK293T cells. Analysis of integration sites showed that cancer genes were preferentially targeted, raising concerns about using lentivirus vectors for gene therapy. Additional analysis led to the discovery that introns and alternative splicing contributed significantly to integration site selection. These correlations were independent of transcription levels, size of transcription units, and length of the introns. Multivariate analysis with five parameters previously found to predict integration sites showed that intron density is the strongest predictor of integration density in transcription units. Analysis of previously published HIV-1 integration site data showed that integration density in transcription units in mouse embryonic fibroblasts also correlated strongly with intron number, and this correlation was absent in cells lacking LEDGF. Affinity purification showed that LEDGF/p75 is associated with a number of splicing factors, and RNA sequencing (RNA-seq) analysis of HEK293T cells lacking LEDGF/p75 or the LEDGF/p75 integrase-binding domain (IBD) showed that LEDGF/p75 contributes to splicing patterns in half of the transcription units that have alternative isoforms. Thus, LEDGF/p75 interacts with splicing factors, contributes to exon choice, and directs HIV-1 integration to transcription units that are highly spliced. PMID:26545813

  3. ASFinder: a tool for genome-wide identification of alternatively splicing transcripts from EST-derived sequences.

    PubMed

    Min, Xiang Jia

    2013-01-01

    Expressed Sequence Tags (ESTs) are a rich resource for identifying Alternatively Splicing (AS) genes. The ASFinder webserver is designed to identify AS isoforms from EST-derived sequences. Two approaches are implemented in ASFinder. If no genomic sequences are provided, the server performs a local BLASTN to identify AS isoforms from ESTs having both ends aligned but an internal segment unaligned. Otherwise, ASFinder uses SIM4 to map ESTs to the genome, then the overlapping ESTs that are mapped to the same genomic locus and have internal variable exon/intron boundaries are identified as AS isoforms. The tool is available at http://proteomics.ysu.edu/tools/ASFinder.html.

  4. Exon trapping: a genetic screen to identify candidate transcribed sequences in cloned mammalian genomic DNA.

    PubMed

    Duyk, G M; Kim, S W; Myers, R M; Cox, D R

    1990-11-01

    Identification and recovery of transcribed sequences from cloned mammalian genomic DNA remains an important problem in isolating genes on the basis of their chromosomal location. We have developed a strategy that facilitates the recovery of exons from random pieces of cloned genomic DNA. The basis of this "exon trapping" strategy is that, during a retroviral life cycle, genomic sequences of nonviral origin are correctly spliced and may be recovered as a cDNA copy of the introduced segment. By using this genetic assay for cis-acting sequences required for RNA splicing, we have screened approximately 20 kilobase pairs of cloned genomic DNA and have recovered all four predicted exons.

  5. Exon trapping: a genetic screen to identify candidate transcribed sequences in cloned mammalian genomic DNA.

    PubMed Central

    Duyk, G M; Kim, S W; Myers, R M; Cox, D R

    1990-01-01

    Identification and recovery of transcribed sequences from cloned mammalian genomic DNA remains an important problem in isolating genes on the basis of their chromosomal location. We have developed a strategy that facilitates the recovery of exons from random pieces of cloned genomic DNA. The basis of this "exon trapping" strategy is that, during a retroviral life cycle, genomic sequences of nonviral origin are correctly spliced and may be recovered as a cDNA copy of the introduced segment. By using this genetic assay for cis-acting sequences required for RNA splicing, we have screened approximately 20 kilobase pairs of cloned genomic DNA and have recovered all four predicted exons. PMID:2247475

  6. Alternative Splicing Studies of the Reactive Oxygen Species Gene Network in Populus Reveal Two Isoforms of High-Isoelectric-Point Superoxide Dismutase

    PubMed Central

    Srivastava, Vaibhav; Srivastava, Manoj Kumar; Chibani, Kamel; Nilsson, Robert; Rouhier, Nicolas; Melzer, Michael; Wingsle, Gunnar

    2009-01-01

    Recent evidence has shown that alternative splicing (AS) is widely involved in the regulation of gene expression, substantially extending the diversity of numerous proteins. In this study, a subset of expressed sequence tags representing members of the reactive oxygen species gene network was selected from the PopulusDB database to investigate AS mechanisms in Populus. Examples of all known types of AS were detected, but intron retention was the most common. Interestingly, the closest Arabidopsis (Arabidopsis thaliana) homologs of half of the AS genes identified in Populus are not reportedly alternatively spliced. Two genes encoding the protein of most interest in our study (high-isoelectric-point superoxide dismutase [hipI-SOD]) have been found in black cottonwood (Populus trichocarpa), designated PthipI-SODC1 and PthipI-SODC2. Analysis of the expressed sequence tag libraries has indicated the presence of two transcripts of PthipI-SODC1 (hipI-SODC1b and hipI-SODC1s). Alignment of these sequences with the PthipI-SODC1 gene showed that hipI-SODC1b was 69 bp longer than hipI-SODC1s due to an AS event involving the use of an alternative donor splice site in the sixth intron. Transcript analysis showed that the splice variant hipI-SODC1b was differentially expressed, being clearly expressed in cambial and xylem, but not phloem, regions. In addition, immunolocalization and mass spectrometric data confirmed the presence of hipI-SOD proteins in vascular tissue. The functionalities of the spliced gene products were assessed by expressing recombinant hipI-SOD proteins and in vitro SOD activity assays. PMID:19176719

  7. Alternative splicing studies of the reactive oxygen species gene network in Populus reveal two isoforms of high-isoelectric-point superoxide dismutase.

    PubMed

    Srivastava, Vaibhav; Srivastava, Manoj Kumar; Chibani, Kamel; Nilsson, Robert; Rouhier, Nicolas; Melzer, Michael; Wingsle, Gunnar

    2009-04-01

    Recent evidence has shown that alternative splicing (AS) is widely involved in the regulation of gene expression, substantially extending the diversity of numerous proteins. In this study, a subset of expressed sequence tags representing members of the reactive oxygen species gene network was selected from the PopulusDB database to investigate AS mechanisms in Populus. Examples of all known types of AS were detected, but intron retention was the most common. Interestingly, the closest Arabidopsis (Arabidopsis thaliana) homologs of half of the AS genes identified in Populus are not reportedly alternatively spliced. Two genes encoding the protein of most interest in our study (high-isoelectric-point superoxide dismutase [hipI-SOD]) have been found in black cottonwood (Populus trichocarpa), designated PthipI-SODC1 and PthipI-SODC2. Analysis of the expressed sequence tag libraries has indicated the presence of two transcripts of PthipI-SODC1 (hipI-SODC1b and hipI-SODC1s). Alignment of these sequences with the PthipI-SODC1 gene showed that hipI-SODC1b was 69 bp longer than hipI-SODC1s due to an AS event involving the use of an alternative donor splice site in the sixth intron. Transcript analysis showed that the splice variant hipI-SODC1b was differentially expressed, being clearly expressed in cambial and xylem, but not phloem, regions. In addition, immunolocalization and mass spectrometric data confirmed the presence of hipI-SOD proteins in vascular tissue. The functionalities of the spliced gene products were assessed by expressing recombinant hipI-SOD proteins and in vitro SOD activity assays.

  8. Deciphering Transcriptome and Complex Alternative Splicing Transcripts in Mammary Gland Tissues from Cows Naturally Infected with Staphylococcus aureus Mastitis

    PubMed Central

    Jiang, Qiang; Yang, Chun Hong; Zhang, Yan; Sun, Yan; Li, Rong Ling; Wang, Chang Fa; Zhong, Ji Feng; Huang, Jin Ming

    2016-01-01

    Alternative splicing (AS) contributes to the complexity of the mammalian proteome and plays an important role in diseases, including infectious diseases. The differential AS patterns of these transcript sequences between the healthy (HS3A) and mastitic (HS8A) cows naturally infected by Staphylococcus aureus were compared to understand the molecular mechanisms underlying mastitis resistance and susceptibility. In this study, using the Illumina paired-end RNA sequencing method, 1352 differentially expressed genes (DEGs) with higher than twofold changes were found in the HS3A and HS8A mammary gland tissues. Gene ontology and KEGG pathway analyses revealed that the cytokine–cytokine receptor interaction pathway is the most significantly enriched pathway. Approximately 16k annotated unigenes were identified in each of the two libraries, based on the bovine Bos taurus UMD3.1 sequence assembly and search. A total of 52.62% and 51.24% of annotated unigenes, respectively, were alternatively spliced in terms of exon skipping, intron retention, alternative 5′ splicing and alternative 3′ splicing. Additionally, 1,317 AS unigenes were HS3A-specific, whereas 1,093 AS unigenes were HS8A-specific. Some immune-related genes, such as ITGB6, MYD88, ADA, ACKR1, and TNFRSF1B, and their potential relationships with mastitis were highlighted. From chromosomes 2, 4, 6, 7, 10, 13, 14, 17, and 20, 3.66% (HS3A) and 5.4% (HS8A) novel transcripts, which harbor known quantitative trait loci associated with clinical mastitis, were identified. Many DEGs in the healthy and mastitic mammary glands are involved in immune, defense, and inflammation responses. These DEGs, which exhibit diverse and specific splicing patterns and events, can endow dairy cattle with the potential complex genetic resistance against mastitis. PMID:27459697

  9. Evolution of the Antisense Overlap between Genes for Thyroid Hormone Receptor and Rev-erbα and Characterization of an Exonic G-Rich Element That Regulates Splicing of TRα2 mRNA

    PubMed Central

    Munroe, Stephen H.; Morales, Christopher H.; Duyck, Tessa H.; Waters, Paul D.

    2015-01-01

    The α-thyroid hormone receptor gene (TRα) codes for two functionally distinct proteins: TRα1, the α-thyroid hormone receptor; and TRα2, a non-hormone-binding variant. The final exon of TRα2 mRNA overlaps the 3′ end of Rev-erbα mRNA, which encodes another nuclear receptor on the opposite strand of DNA. To understand the evolution of this antisense overlap, we sequenced these genes and mRNAs in the platypus Ornithorhynchus anatinus. Despite its strong homology with other mammals, the platypus TRα/Rev-erbα locus lacks elements essential for expression of TRα2. Comparative analysis suggests that alternative splicing of TRα2 mRNA expression evolved in a stepwise fashion before the divergence of eutherian and marsupial mammals. A short G-rich element (G30) located downstream of the alternative 3′ splice site of TRα2 mRNA and antisense to the 3′ UTR of Rev-erbα plays an important role in regulating TRα2 splicing. G30 is tightly conserved in eutherian mammals, but is absent in marsupials and monotremes. Systematic deletions and substitutions within G30 have dramatically different effects on TRα2 splicing, leading to either its inhibition or its enhancement. Mutations that disrupt one or more clusters of G residues enhance splicing two- to three-fold. These results suggest the G30 sequence can adopt a highly structured conformation, possibly a G-quadruplex, and that it is part of a complex splicing regulatory element which exerts both positive and negative effects on TRα2 expression. Since mutations that strongly enhance splicing in vivo have no effect on splicing in vitro, it is likely that the regulatory role of G30 is mediated through linkage of transcription and splicing. PMID:26368571

  10. A Novel Subgenomic Murine Leukemia Virus RNA Transcript Results from Alternative Splicing

    PubMed Central

    Déjardin, Jérôme; Bompard-Maréchal, Guillaume; Audit, Muriel; Hope, Thomas J.; Sitbon, Marc; Mougel, Marylène

    2000-01-01

    Here we show the existence of a novel subgenomic 4.4-kb RNA in cells infected with the prototypic replication-competent Friend or Moloney murine leukemia viruses (MuLV). This RNA derives by splicing from an alternative donor site (SD′) within the capsid-coding region to the canonical envelope splice acceptor site. The position and the sequence of SD′ was highly conserved among mammalian type C and D oncoviruses. Point mutations used to inactivate SD′ without changing the capsid-coding ability affected viral RNA splicing and reduced viral replication in infected cells. PMID:10729146

  11. High mutation detection rate in TCOF1 among Treacher Collins syndrome patients reveals clustering of mutations and 16 novel pathogenic changes.

    PubMed

    Splendore, A; Silva, E O; Alonso, L G; Richieri-Costa, A; Alonso, N; Rosa, A; Carakushanky, G; Cavalcanti, D P; Brunoni, D; Passos-Bueno, M R

    2000-10-01

    Twenty-eight families with a clinical diagnosis of Treacher Collins syndrome were screened for mutations in the 25 coding exons of TCOF1 and their adjacent splice junctions through SSCP and direct sequencing. Pathogenic mutations were detected in 26 patients, yielding the highest detection rate reported so far for this disease (93%) and bringing the number of known disease-causing mutations from 35 to 51. This is the first report to describe clustering of pathogenic mutations. Thirteen novel polymorphic alterations were characterized, confirming previous reports that TCOF1 has an unusually high rate of single-nucleotide polymorphisms (SNPs) within its coding region. We suggest a possible different mechanism leading to TCS or genetic heterogeneity for this condition, as we identified two families with no apparent pathogenic mutation in the gene. Furthermore, our data confirm the absence of genotype-phenotype correlation and reinforce that the apparent anticipation often observed in TCS families is due to ascertainment bias. Copyright 2000 Wiley-Liss, Inc.

  12. The dynamic genome of Hydra.

    PubMed

    Chapman, Jarrod A; Kirkness, Ewen F; Simakov, Oleg; Hampson, Steven E; Mitros, Therese; Weinmaier, Thomas; Rattei, Thomas; Balasubramanian, Prakash G; Borman, Jon; Busam, Dana; Disbennett, Kathryn; Pfannkoch, Cynthia; Sumin, Nadezhda; Sutton, Granger G; Viswanathan, Lakshmi Devi; Walenz, Brian; Goodstein, David M; Hellsten, Uffe; Kawashima, Takeshi; Prochnik, Simon E; Putnam, Nicholas H; Shu, Shengquiang; Blumberg, Bruce; Dana, Catherine E; Gee, Lydia; Kibler, Dennis F; Law, Lee; Lindgens, Dirk; Martinez, Daniel E; Peng, Jisong; Wigge, Philip A; Bertulat, Bianca; Guder, Corina; Nakamura, Yukio; Ozbek, Suat; Watanabe, Hiroshi; Khalturin, Konstantin; Hemmrich, Georg; Franke, André; Augustin, René; Fraune, Sebastian; Hayakawa, Eisuke; Hayakawa, Shiho; Hirose, Mamiko; Hwang, Jung Shan; Ikeo, Kazuho; Nishimiya-Fujisawa, Chiemi; Ogura, Atshushi; Takahashi, Toshio; Steinmetz, Patrick R H; Zhang, Xiaoming; Aufschnaiter, Roland; Eder, Marie-Kristin; Gorny, Anne-Kathrin; Salvenmoser, Willi; Heimberg, Alysha M; Wheeler, Benjamin M; Peterson, Kevin J; Böttger, Angelika; Tischler, Patrick; Wolf, Alexander; Gojobori, Takashi; Remington, Karin A; Strausberg, Robert L; Venter, J Craig; Technau, Ulrich; Hobmayer, Bert; Bosch, Thomas C G; Holstein, Thomas W; Fujisawa, Toshitaka; Bode, Hans R; David, Charles N; Rokhsar, Daniel S; Steele, Robert E

    2010-03-25

    The freshwater cnidarian Hydra was first described in 1702 and has been the object of study for 300 years. Experimental studies of Hydra between 1736 and 1744 culminated in the discovery of asexual reproduction of an animal by budding, the first description of regeneration in an animal, and successful transplantation of tissue between animals. Today, Hydra is an important model for studies of axial patterning, stem cell biology and regeneration. Here we report the genome of Hydra magnipapillata and compare it to the genomes of the anthozoan Nematostella vectensis and other animals. The Hydra genome has been shaped by bursts of transposable element expansion, horizontal gene transfer, trans-splicing, and simplification of gene structure and gene content that parallel simplification of the Hydra life cycle. We also report the sequence of the genome of a novel bacterium stably associated with H. magnipapillata. Comparisons of the Hydra genome to the genomes of other animals shed light on the evolution of epithelia, contractile tissues, developmentally regulated transcription factors, the Spemann-Mangold organizer, pluripotency genes and the neuromuscular junction.

  13. Novel LAMP2 mutations in Chinese patients with Danon disease cause varying degrees of clinical severity.

    PubMed

    Luo, Su-shan; Xi, Jian-ying; Cai, Shuang; Zhao, Chong-bo; Lu, Jia-hong; Zhu, Wen-hua; Lin, Jie; Qiao, Kai; Wang, Yin; Ye, Zhu-rong

    2014-01-01

    Danon disease is an X-linked dominant lysosomal glycogen storage disorder characterized by cardiomyopathy, skeletal myopathy, and mental retardation. This study described two Chinese cases of Danon disease in order to broaden the phenotypic and genetic spectrum. Clinical data were collected and LAMP2 mutations were analyzed. Patient A had fluctuating limb weakness during 6 months of follow-up and was diagnosed with drug-induced myopathy due to anti-hepatitis B therapy with lamivudine. However, the first muscle biopsy with large cytoplasmic vacuoles confused the diagnosis and led to the second biopsy that allowed for the final diagnosis. Patient B had severe cardiac disturbances leading to sudden death. Molecularly, patient A harbored a synonymous mutation adjacent to the exon 6-intron 6 junction; mRNA analysis provided evidence that the mutation totally abolished the donor site and caused skipping of exon 6. Patient B harbored a frame-shift deletion mutation in exon 3 (c.396delA) leading to a truncated protein. To our knowledge, this is the first report of Danon disease caused by a synonymous exon mutation that affected mRNA splicing, which indicates that a synonymous substitution may not be silent when it is in the exon sequences close to the splice sites. It is also the first description of Danon disease clinically presenting as drug-induced myopathy at onset; the pathological changes might be the key point for making a differential diagnosis.

  14. Loss of ERLIN2 function leads to juvenile primary lateral sclerosis.

    PubMed

    Al-Saif, Amr; Bohlega, Saeed; Al-Mohanna, Futwan

    2012-10-01

    Primary lateral sclerosis (PLS) is a motor neuron disorder that exclusively affects upper motor neurons leading to their degeneration. Mutations in the ALS2 gene encoding the protein Alsin have been described previously in the juvenile form of the disease. In this study, we identify mutation of the ERLIN2 gene in juvenile PLS patients and describe an in vitro model for loss of ERLIN2 function. Single nucleotide polymorphism arrays were used for homozygosity mapping. DNA sequencing of candidate genes was used to detect the underlying mutation. Level of ERLIN2 mRNA was measured by quantitative real time polymerase chain reaction. Knocking down ERLIN2 in NSC34 cells was accomplished by short-hairpin RNA interference. We identified a splice junction mutation in the ERLIN2 gene, a component of the endoplasmic reticulum (ER) lipid rafts, that resulted in abnormal splicing of ERLIN2 transcript and nonsense-mediated decay of ERLIN2 mRNA. Knocking down ERLIN2 in NSC34 cells suppressed their growth in culture. Recently, we found that mutation of SIGMAR1, a component of ER lipid rafts, leads to juvenile amyotrophic lateral sclerosis. The identification of mutation in another component of the ER lipid rafts in juvenile PLS patients emphasizes their role in motor neuron function. Furthermore, the discovered effect of ERLIN2 loss on cell growth may advance understanding of the mechanism behind motor neuron degeneration in PLS. Copyright © 2012 American Neurological Association.

  15. Familial retinoblastoma due to intronic LINE-1 insertion causes aberrant and noncanonical mRNA splicing of the RB1 gene.

    PubMed

    Rodríguez-Martín, Carlos; Cidre, Florencia; Fernández-Teijeiro, Ana; Gómez-Mariano, Gema; de la Vega, Leticia; Ramos, Patricia; Zaballos, Ángel; Monzón, Sara; Alonso, Javier

    2016-05-01

    Retinoblastoma (RB, MIM 180200) is the paradigm of hereditary cancer. Individuals harboring a constitutional mutation in one allele of the RB1 gene have a high predisposition to develop RB. Here, we present the first case of familial RB caused by a de novo insertion of a full-length long interspersed element-1 (LINE-1) into intron 14 of the RB1 gene that caused a highly heterogeneous splicing pattern of RB1 mRNA. LINE-1 insertion was inferred by mRNA studies and sequenced at full length by massively parallel sequencing. Some of the aberrant mRNAs were produced by noncanonical acceptor splice sites, a new finding that to date has not been described to occur upon LINE-1 retrotransposition. Our results clearly show that RNA-based strategies have the potential to detect disease-causing transposon insertions. They also confirm that the incorporation of new genetic approaches, such as massively parallel sequencing, helps to characterize these unique and exceptional genetic alterations at the sequence level.

  16. RNA editing in the anticodon of tRNA Leu (CAA) occurs before group I intron splicing in plastids of a moss Takakia lepidozioides S. Hatt. & Inoue.

    PubMed

    Miyata, Y; Sugita, C; Maruyama, K; Sugita, M

    2008-03-01

    RNA editing through cytidine (C) to uridine (U) transitions occurs in plastids and mitochondria of most land plants. In this study, we amplified and sequenced the group I intron-containing tRNA Leu gene, trnL-CAA, from Takakia lepidozioides, a moss. DNA sequence analysis revealed that the T. lepidozioides tRNA Leu gene consisted of a 35-bp 5' exon, a 469-bp group I intron and a 50-bp 3' exon. The intron was inserted between the first and second position of the tRNA Leu anticodon. In general, plastid tRNA Leu genes with a group I intron code for a TAA anticodon in most land plants. This strongly suggests that the first nucleotide of the CAA anticodon could be edited in T. lepidozioides plastids. To investigate this possibility, we analysed cDNAs derived from the trnL-CAA transcripts. We demonstrated that the first nucleotide C of the anticodon was edited to create a canonical UAA anticodon in T. lepidozioides plastids. cDNA sequencing analyses of the spliced or unspliced tRNA Leu transcripts revealed that, while the spliced tRNA was completely edited, editing in the unspliced tRNAs was only partial. This is the first experimental evidence that the anticodon editing of tRNA occurs before RNA splicing in plastids. This suggests that this editing is a prerequisite to splicing of pre-tRNA Leu.

  17. regSNPs-splicing: a tool for prioritizing synonymous single-nucleotide substitution.

    PubMed

    Zhang, Xinjun; Li, Meng; Lin, Hai; Rao, Xi; Feng, Weixing; Yang, Yuedong; Mort, Matthew; Cooper, David N; Wang, Yue; Wang, Yadong; Wells, Clark; Zhou, Yaoqi; Liu, Yunlong

    2017-09-01

    While synonymous single-nucleotide variants (sSNVs) have largely been unstudied, since they do not alter protein sequence, mounting evidence suggests that they may affect RNA conformation, splicing, and the stability of nascent mRNAs to promote various diseases. Accurately prioritizing deleterious sSNVs from a pool of neutral ones can significantly improve our ability to select functional genetic variants identified from various genome-sequencing projects, and, therefore, advance our understanding of disease etiology. In this study, we develop a computational algorithm to prioritize sSNVs based on their impact on mRNA splicing and protein function. In addition to genomic features that potentially affect splicing regulation, our proposed algorithm also includes dozens of structural features that characterize the effects of alternatively spliced exons on protein function. Our systematic evaluation on thousands of sSNVs suggests that several structural features, including intrinsic disorder protein scores, solvent accessible surface areas, protein secondary structures, and known and predicted protein family domains, show significant differences between disease-causing and neutral sSNVs. Our result suggests that the protein structure features offer an added dimension of information while distinguishing disease-causing and neutral synonymous variants. The inclusion of structural features increases the predictive accuracy for functional sSNV prioritization.

  18. hnRNP L regulates differences in expression of mouse integrin alpha2beta1.

    PubMed

    Cheli, Yann; Kunicki, Thomas J

    2006-06-01

    There is a 2-fold variation in platelet integrin alpha2beta1 levels among inbred mouse strains. Decreased alpha2beta1 in 4 strains carrying Itga2 haplotype 2 results from decreased affinity of heterogeneous ribonucleoprotein L (hnRNP L) for a 6 CA repeat sequence (CA6) within intron 1. Seven strains bearing haplotype 1 and a 21 CA repeat sequence at this position (CA21) express twice the level of platelet alpha2beta1 and exhibit an equivalent gain of platelet function in vitro. By UV crosslinking and immunoprecipitation, hnRNP L binds more avidly to CA21, relative to CA6. By cell-free, in vitro mRNA splicing, decreased binding of hnRNP L results in decreased splicing efficiency and an increased proportion of alternatively spliced product. The splicing enhancer activity of CA21 in vivo is abolished by prior treatment with hnRNP L-specific siRNA. Thus, decreased surface alpha2beta1 results from decreased Itga2 pre-mRNA splicing regulated by hnRNP L and depends on CA repeat length at a specific site in intron 1.

  19. hnRNP L regulates differences in expression of mouse integrin α2β1

    PubMed Central

    Cheli, Yann; Kunicki, Thomas J.

    2006-01-01

    There is a 2-fold variation in platelet integrin α2β1 levels among inbred mouse strains. Decreased α2β1 in 4 strains carrying Itga2 haplotype 2 results from decreased affinity of heterogeneous ribonucleoprotein L (hnRNP L) for a 6 CA repeat sequence (CA6) within intron 1. Seven strains bearing haplotype 1 and a 21 CA repeat sequence at this position (CA21) express twice the level of platelet α2β1 and exhibit an equivalent gain of platelet function in vitro. By UV crosslinking and immunoprecipitation, hnRNP L binds more avidly to CA21, relative to CA6. By cell-free, in vitro mRNA splicing, decreased binding of hnRNP L results in decreased splicing efficiency and an increased proportion of alternatively spliced product. The splicing enhancer activity of CA21 in vivo is abolished by prior treatment with hnRNP L–specific siRNA. Thus, decreased surface α2β1 results from decreased Itga2 pre-mRNA splicing regulated by hnRNP L and depends on CA repeat length at a specific site in intron 1. PMID:16455949

  20. Detection and Analysis of Circular RNAs by RT-PCR.

    PubMed

    Panda, Amaresh C; Gorospe, Myriam

    2018-03-20

    Gene expression in eukaryotic cells is tightly regulated at the transcriptional and posttranscriptional levels. Posttranscriptional processes, including pre-mRNA splicing, mRNA export, mRNA turnover, and mRNA translation, are controlled by RNA-binding proteins (RBPs) and noncoding (nc)RNAs. The vast family of ncRNAs comprises diverse regulatory RNAs, such as microRNAs and long noncoding (lnc)RNAs, but also the poorly explored class of circular (circ)RNAs. Although first discovered more than three decades ago by electron microscopy, only the advent of high-throughput RNA-sequencing (RNA-seq) and the development of innovative bioinformatic pipelines have begun to allow the systematic identification of circRNAs (Szabo and Salzman, 2016; Panda et al., 2017b; Panda et al., 2017c). However, the validation of true circRNAs identified by RNA sequencing requires other molecular biology techniques including reverse transcription (RT) followed by conventional or quantitative (q) polymerase chain reaction (PCR), and Northern blot analysis (Jeck and Sharpless, 2014). RT-qPCR analysis of circular RNAs using divergent primers has been widely used for the detection, validation, and sometimes quantification of circRNAs (Abdelmohsen et al., 2015 and 2017; Panda et al., 2017b). As detailed here, divergent primers designed to span the circRNA backsplice junction sequence can specifically amplify the circRNAs and not the counterpart linear RNA. In sum, RT-PCR analysis using divergent primers allows direct detection and quantification of circRNAs.
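
    The reason a junction-spanning amplicon is specific to the circular form can be sketched in a few lines. The sequence, the flank length, and the function name backsplice_junction below are hypothetical and chosen only for illustration; this is not the protocol's primer-design procedure.

```python
def backsplice_junction(circ_exon_seq, flank):
    """Join the 3' end of the circularized sequence to its 5' start.

    In a circRNA the last nucleotide of the final exon is ligated to the
    first nucleotide of the first exon, so the junction reads
    (last `flank` nt) + (first `flank` nt).
    """
    if flank <= 0 or flank > len(circ_exon_seq):
        raise ValueError("flank must be between 1 and the sequence length")
    return circ_exon_seq[-flank:] + circ_exon_seq[:flank]


# Hypothetical 26-nt sequence standing in for the circularized exon(s).
linear = "ATGGCCTTAGCAGTCCGATTACGGAT"
junction = backsplice_junction(linear, 5)

# A circular template is modelled by writing the sequence twice, so a read
# or amplicon can run through the point where the two ends meet.
circular_template = linear + linear

print(junction)                       # 3' end followed by 5' start
print(junction in linear)             # the linear RNA lacks this order
print(junction in circular_template)  # the circular form contains it
```

    For this toy sequence the junction string is found only in the circular template, which mirrors why an amplicon spanning the backsplice junction is not expected from the linear counterpart.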

  1. Therapeutic Potential of a Scorpion Venom-Derived Antimicrobial Peptide and Its Homologs Against Antibiotic-Resistant Gram-Positive Bacteria.

    PubMed

    Liu, Gaomin; Yang, Fan; Li, Fangfang; Li, Zhongjie; Lang, Yange; Shen, Bingzheng; Wu, Yingliang; Li, Wenxin; Harrison, Patrick L; Strong, Peter N; Xie, Yingqiu; Miller, Keith; Cao, Zhijian

    2018-01-01

    The alarming rise in the prevalence of antibiotic resistance among pathogenic bacteria poses a unique challenge for the development of effective therapeutic agents. Antimicrobial peptides (AMPs) have attracted a great deal of attention as a possible solution to the increasing problem of antibiotic-resistant bacteria. Marcin-18 was identified from the scorpion Mesobuthus martensii at both DNA and protein levels. The genomic sequence revealed that the marcin-18 coding gene contains a phase-I intron with a GT-AG splice junction located in the DNA region encoding the N-terminal part of the signal peptide. The peptide marcin-18 was also isolated from scorpion venom. A protein sequence homology search revealed that marcin-18 shares extremely high sequence identity to the AMPs meucin-18 and megicin-18. In vitro, chemically synthesized marcin-18 and its homologs (meucin-18 and megicin-18) showed highly potent inhibitory activity against Gram-positive bacteria, including some clinical antibiotic-resistant strains. Importantly, in a mouse acute peritonitis model, these peptides significantly decreased the bacterial load in ascites and rescued nearly all mice heavily infected with clinical methicillin-resistant Staphylococcus aureus from lethal bacteremia. Peptides exerted antimicrobial activity via a bactericidal mechanism and killed bacteria through membrane disruption. Taken together, marcin-18 and its homologs have potential for development as therapeutic agents for treating antibiotic-resistant, Gram-positive bacterial infections.

  2. The spectrum of mutations that underlie the neuromuscular junction synaptopathy in DOK7 congenital myasthenic syndrome.

    PubMed

    Cossins, Judith; Liu, Wei Wei; Belaya, Katsiaryna; Maxwell, Susan; Oldridge, Michael; Lester, Tracy; Robb, Stephanie; Beeson, David

    2012-09-01

    Congenital myasthenic syndromes (CMS) are a group of inherited diseases that affect synaptic transmission at the neuromuscular junction and result in fatiguable muscle weakness. A subgroup of CMS patients have a recessively inherited limb-girdle pattern of weakness caused by mutations in DOK7. DOK7 encodes DOK7, an adaptor protein that is expressed in the skeletal muscle and heart and that is essential for the development and maintenance of the neuromuscular junction. We have screened the DOK7 gene for mutations by polymerase chain reaction amplification and bi-directional sequencing of exonic and promoter regions and performed acetylcholine receptor (AChR) clustering assays and used exon trapping to determine the pathogenicity of detected variants. Approximately 18% of genetically diagnosed CMSs in the UK have mutations in DOK7, with mutations in this gene identified in more than 60 kinships to date. Thirty-four different pathogenic mutations were identified as well as 27 variants likely to be non-pathogenic. An exon 7 frameshift duplication c.1124_1127dupTGCC is commonly found in at least one allele. We analyse the effect of the common frameshift c.1124_1127dupTGCC and show that 10/11 suspected missense mutations have a deleterious effect on AChR clustering. We identify for the first time homozygous or compound heterozygous mutations that are localized 5' to exon 7. In addition, three silent variants in the N-terminal half of DOK7 are predicted to alter the splicing of the DOK7 RNA transcript. The DOK7 gene is highly polymorphic, and within these many variants, we define a spectrum of mutations that can underlie DOK7 CMS that will inform in managing this disorder.

  3. Hereditary cancer genes are highly susceptible to splicing mutations

    PubMed Central

    Soemedi, Rachel; Maguire, Samantha; Murray, Michael F.; Monaghan, Sean F.

    2018-01-01

    Substitutions that disrupt pre-mRNA splicing are a common cause of genetic disease. On average, 13.4% of all hereditary disease alleles are classified as splicing mutations mapping to the canonical 5′ and 3′ splice sites. However, splicing mutations present in exons and deeper intronic positions are vastly underreported. A recent re-analysis of coding mutations in exon 10 of the Lynch Syndrome gene, MLH1, revealed an extremely high rate (77%) of mutations that lead to defective splicing. This finding is confirmed by extending the sampling to five other exons in the MLH1 gene. Further analysis suggests a more general phenomenon of defective splicing driving Lynch Syndrome. Of the 36 mutations tested, 11 disrupted splicing. Furthermore, analysis of past reports suggests that MLH1 mutations in canonical splice sites also occupy a much higher fraction (36%) of total mutations than expected. When performing a comprehensive analysis of splicing mutations in human disease genes, we found that three main causal genes of Lynch Syndrome, MLH1, MSH2, and PMS2, belonged to a class of 86 disease genes which are enriched for splicing mutations. Other cancer genes were also enriched in the 86 susceptible genes. The enrichment of splicing mutations in hereditary cancers strongly argues for additional priority in interpreting clinical sequencing data in relation to cancer and splicing. PMID:29505604

  4. Antisense oligonucleotide-induced alternative splicing of the APOB mRNA generates a novel isoform of APOB.

    PubMed

    Khoo, Bernard; Roca, Xavier; Chew, Shern L; Krainer, Adrian R

    2007-01-17

    Apolipoprotein B (APOB) is an integral part of the LDL, VLDL, IDL, Lp(a) and chylomicron lipoprotein particles. The APOB pre-mRNA consists of 29 constitutively-spliced exons. APOB exists as two natural isoforms: the full-length APOB100 isoform, assembled into LDL, VLDL, IDL and Lp(a) and secreted by the liver in humans; and the C-terminally truncated APOB48, assembled into chylomicrons and secreted by the intestine in humans. Down-regulation of APOB100 is a potential therapy to lower circulating LDL and cholesterol levels. We investigated the ability of 2'O-methyl RNA antisense oligonucleotides (ASOs) to induce the skipping of exon 27 in endogenous APOB mRNA in HepG2 cells. These ASOs are directed towards the 5' and 3' splice-sites of exon 27, the branch-point sequence (BPS) of intron 26-27 and several predicted exonic splicing enhancers within exon 27. ASOs targeting either the 5' or 3' splice-site, in combination with the BPS, are the most effective. The splicing of other alternatively spliced genes is not influenced by these ASOs, suggesting that the effects seen are not due to non-specific changes in alternative splicing. The skip 27 mRNA is translated into a truncated isoform, APOB87SKIP27. The induction of APOB87SKIP27 expression in vivo should lead to decreased LDL and cholesterol levels, by analogy to patients with hypobetalipoproteinemia. As intestinal APOB mRNA editing and APOB48 expression rely on sequences within exon 26, exon 27 skipping should not affect APOB48 expression unlike other methods of down-regulating APOB100 expression which also down-regulate APOB48.

  5. Partial androgen insensitivity syndrome caused by a deep intronic mutation creating an alternative splice acceptor site of the AR gene.

    PubMed

    Ono, Hiroyuki; Saitsu, Hirotomo; Horikawa, Reiko; Nakashima, Shinichi; Ohkubo, Yumiko; Yanagi, Kumiko; Nakabayashi, Kazuhiko; Fukami, Maki; Fujisawa, Yasuko; Ogata, Tsutomu

    2018-02-02

    Although partial androgen insensitivity syndrome (PAIS) is caused by attenuated responsiveness to androgens, androgen receptor gene (AR) mutations on the coding regions and their splice sites have been identified only in <25% of patients with a diagnosis of PAIS. We performed extensive molecular studies including whole exome sequencing in a Japanese family with PAIS, identifying a deep intronic variant beyond the branch site at intron 6 of AR (NM_000044.4:c.2450-42 G > A). This variant created the splice acceptor motif that was accompanied by pyrimidine-rich sequence and two candidate branch sites. Consistent with this, reverse transcriptase (RT)-PCR experiments for cycloheximide-treated lymphoblastoid cell lines revealed a relatively large amount of aberrant mRNA produced by the newly created splice acceptor site and a relatively small amount of wildtype mRNA produced by the normal splice acceptor site. Furthermore, most of the aberrant mRNA was shown to undergo nonsense mediated decay (NMD) and, if a small amount of aberrant mRNA may have escaped NMD, such mRNA was predicted to generate a truncated AR protein missing some functional domains. These findings imply that the deep intronic mutation creating an alternative splice acceptor site resulted in the production of a relatively small amount of wildtype AR mRNA, leading to PAIS.

  6. The neuron-specific RNA-binding protein ELAV regulates neuroglian alternative splicing in neurons and binds directly to its pre-mRNA

    PubMed Central

    Lisbin, Michael J.; Qiu, Jan; White, Kalpana

    2001-01-01

    Drosophila melanogaster neural-specific protein, ELAV, has been shown to regulate the neural-specific splicing of three genes: neuroglian (nrg), erect wing, and armadillo. Alternative splicing of the nrg transcript involves alternative inclusion of a 3′-terminal exon. Here, using a minigene reporter, we show that the nrg alternatively spliced intron (nASI) has all the determinants required to recreate proper neural-specific RNA processing seen with the endogenous nrg transcript, including regulation by ELAV. An in vitro UV cross-linking assay revealed that ELAV from nuclear extracts cross-links to four distinct sites along the 3200 nucleotide long nASI; one EXS is positioned at the polypyrimidine tract of the default 3′ splice site. ELAV cross-linking sites (EXSs) have in common long tracts of (U)-rich sequence rather than a precise consensus; moreover, each tract has at least two 8/10U elements; their importance is validated by mutant transgene reporter analysis. Further, we propose criteria for ELAV target sequence recognition based on the four EXSs, sites within the nASI that are (U) rich but do not cross-link with ELAV, and predicted EXSs from a phylogenetic comparison with Drosophila virilis nASI. These results suggest that ELAV regulates nrg alternative splicing by direct interaction with the nASI. PMID:11581160

  7. The neuron-specific RNA-binding protein ELAV regulates neuroglian alternative splicing in neurons and binds directly to its pre-mRNA.

    PubMed

    Lisbin, M J; Qiu, J; White, K

    2001-10-01

    Drosophila melanogaster neural-specific protein, ELAV, has been shown to regulate the neural-specific splicing of three genes: neuroglian (nrg), erect wing, and armadillo. Alternative splicing of the nrg transcript involves alternative inclusion of a 3'-terminal exon. Here, using a minigene reporter, we show that the nrg alternatively spliced intron (nASI) has all the determinants required to recreate proper neural-specific RNA processing seen with the endogenous nrg transcript, including regulation by ELAV. An in vitro UV cross-linking assay revealed that ELAV from nuclear extracts cross-links to four distinct sites along the 3200 nucleotide long nASI; one EXS is positioned at the polypyrimidine tract of the default 3' splice site. ELAV cross-linking sites (EXSs) have in common long tracts of (U)-rich sequence rather than a precise consensus; moreover, each tract has at least two 8/10U elements; their importance is validated by mutant transgene reporter analysis. Further, we propose criteria for ELAV target sequence recognition based on the four EXSs, sites within the nASI that are (U) rich but do not cross-link with ELAV, and predicted EXSs from a phylogenetic comparison with Drosophila virilis nASI. These results suggest that ELAV regulates nrg alternative splicing by direct interaction with the nASI.

  8. iSS-PC: Identifying Splicing Sites via Physical-Chemical Properties Using Deep Sparse Auto-Encoder.

    PubMed

    Xu, Zhao-Chun; Wang, Peng; Qiu, Wang-Ren; Xiao, Xuan

    2017-08-15

    Gene splicing is one of the most significant biological processes in eukaryotic gene expression, such as RNA splicing, which can cause a pre-mRNA to produce one or more mature messenger RNAs containing the coded information with multiple biological functions. Thus, identifying splicing sites in DNA/RNA sequences is significant for both the bio-medical research and the discovery of new drugs. However, it is expensive and time consuming based only on experimental technique, so new computational methods are needed. To identify the splice donor sites and splice acceptor sites accurately and quickly, a deep sparse auto-encoder model with two hidden layers, called iSS-PC, was constructed based on minimum error law, in which we incorporated twelve physical-chemical properties of the dinucleotides within DNA into PseDNC to formulate given sequence samples via a battery of cross-covariance and auto-covariance transformations. In this paper, five-fold cross-validation test results based on the same benchmark data-sets indicated that the new predictor remarkably outperformed the existing prediction methods in this field. Furthermore, it is expected that many other related problems can be also studied by this approach. To implement classification accurately and quickly, an easy-to-use web-server for identifying splicing sites has been established for free access at: http://www.jci-bioinfo.cn/iSS-PC.

  9. RNA splicing process analysis for identifying antisense oligonucleotide inhibitors with padlock probe-based isothermal amplification (DOI: 10.1039/c7sc01336a)

    PubMed Central

    Ren, Xiaojun; Deng, Ruijie; Wang, Lida; Zhang, Kaixiang

    2017-01-01

    RNA splicing, which mainly involves two transesterification steps, is a fundamental process of gene expression and its abnormal regulation contributes to serious genetic diseases. Antisense oligonucleotides (ASOs) are genetic control tools that can be used to specifically control genes through alteration of the RNA splicing pathway. Despite intensive research, how ASOs or various other factors influence the multiple processes of RNA splicing still remains obscure. This is largely due to an inability to analyze the splicing efficiency of each step in the RNA splicing process with high sensitivity. We addressed this limitation by introducing a padlock probe-based isothermal amplification assay to achieve quantification of the specific products in different splicing steps. With this amplified assay, the roles that ASOs play in RNA splicing inhibition in the first and second steps could be distinguished. We identified that 5′-ASO could block RNA splicing by inhibiting the first step, while 3′-ASO could block RNA splicing by inhibiting the second step. This method provides a versatile tool for assisting efficient ASO design and discovering new splicing modulators and therapeutic drugs. PMID:28989608

  10. Genomic overview of mRNA 5′-leader trans-splicing in the ascidian Ciona intestinalis

    PubMed Central

    Satou, Yutaka; Hamaguchi, Makoto; Takeuchi, Keisuke; Hastings, Kenneth E. M.; Satoh, Nori

    2006-01-01

    Although spliced leader (SL) trans-splicing in the chordates was discovered in the tunicate Ciona intestinalis, there has been no genomic overview analysis of the extent of trans-splicing or the make-up of the trans-spliced and non-trans-spliced gene populations of this model organism. Here we report such an analysis for Ciona based on the oligo-capping full-length cDNA approach. We randomly sampled 2078 5′-full-length ESTs representing 668 genes, or 4.2% of the entire genome. Our results indicate that Ciona contains a single major SL, which is efficiently trans-spliced to mRNAs transcribed from a specific set of genes representing ∼50% of the total number of expressed genes, and that individual trans-spliced mRNA species are, on average, 2–3-fold less abundant than non-trans-spliced mRNA species. Our results also identify a relationship between trans-splicing status and gene functional classification; ribosomal protein genes fall predominantly into the non-trans-spliced category. In addition, our data provide the first evidence for the occurrence of polycistronic transcription in Ciona. An interesting feature of the Ciona polycistronic transcription units is that the great majority entirely lack intercistronic sequences. PMID:16822859

  11. High-throughput sequence analysis of Ciona intestinalis SL trans-spliced mRNAs: alternative expression modes and gene function correlates.

    PubMed

    Matsumoto, Jun; Dewar, Ken; Wasserscheid, Jessica; Wiley, Graham B; Macmil, Simone L; Roe, Bruce A; Zeller, Robert W; Satou, Yutaka; Hastings, Kenneth E M

    2010-05-01

    Pre-mRNA 5' spliced-leader (SL) trans-splicing occurs in some metazoan groups but not in others. Genome-wide characterization of the trans-spliced mRNA subpopulation has not yet been reported for any metazoan. We carried out a high-throughput analysis of the SL trans-spliced mRNA population of the ascidian tunicate Ciona intestinalis by 454 Life Sciences (Roche) pyrosequencing of SL-PCR-amplified random-primed reverse transcripts of tailbud embryo RNA. We obtained approximately 250,000 high-quality reads corresponding to 8790 genes, approximately 58% of the Ciona total gene number. The great depth of this data revealed new aspects of trans-splicing, including the existence of a significant class of "infrequently trans-spliced" genes, accounting for approximately 28% of represented genes, that generate largely non-trans-spliced mRNAs, but also produce trans-spliced mRNAs, in part through alternative promoter use. Thus, the conventional qualitative dichotomy of trans-spliced versus non-trans-spliced genes should be supplanted by a more accurate quantitative view recognizing frequently and infrequently trans-spliced gene categories. Our data include reads representing approximately 80% of Ciona frequently trans-spliced genes. Our analysis also revealed significant use of closely spaced alternative trans-splice acceptor sites which further underscores the mechanistic similarity of cis- and trans-splicing and indicates that the prevalence of +/-3-nt alternative splicing events at tandem acceptor sites, NAGNAG, is driven by spliceosomal mechanisms, and not nonsense-mediated decay, or selection at the protein level. The breadth of gene representation data enabled us to find new correlations between trans-splicing status and gene function, namely the overrepresentation in the frequently trans-spliced gene class of genes associated with plasma/endomembrane system, Ca(2+) homeostasis, and actin cytoskeleton.
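
    The tandem acceptor motif NAGNAG mentioned above is two AG acceptor dinucleotides spaced 3 nt apart, which is why switching between them shifts the splice junction by exactly 3 nt. A minimal Python sketch of how such motifs could be located in a sequence follows; the function name and the toy sequence are hypothetical, and this is not the analysis pipeline used in the study.

```python
import re

# N-A-G-N-A-G, where N is any nucleotide. The lookahead lets overlapping
# occurrences be reported as well.
NAGNAG = re.compile(r"(?=([ACGT]AG[ACGT]AG))")


def find_nagnag(seq: str) -> list[tuple[int, str]]:
    """Return (0-based start, motif) for every NAGNAG occurrence in `seq`."""
    return [(m.start(), m.group(1)) for m in NAGNAG.finditer(seq.upper())]


# Toy 3' splice-site region (hypothetical): pyrimidine tract, then CAGCAG.
region = "TTTCTTTTCAGCAGGTACC"
hits = find_nagnag(region)
print(hits)
```

    Either AG of a reported motif can serve as the acceptor, so each hit marks a candidate +/-3-nt alternative splicing event.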

  12. Coordinated tissue-specific regulation of adjacent alternative 3′ splice sites in C. elegans

    PubMed Central

    Ragle, James Matthew; Katzman, Sol; Akers, Taylor F.; Barberan-Soler, Sergio; Zahler, Alan M.

    2015-01-01

    Adjacent alternative 3′ splice sites, those separated by ≤18 nucleotides, provide a unique problem in the study of alternative splicing regulation; there is overlap of the cis-elements that define the adjacent sites. Identification of the intron's 3′ end depends upon sequence elements that define the branchpoint, polypyrimidine tract, and terminal AG dinucleotide. Starting with RNA-seq data from germline-enriched and somatic cell-enriched Caenorhabditis elegans samples, we identify hundreds of introns with adjacent alternative 3′ splice sites. We identify 203 events that undergo tissue-specific alternative splicing. For these, the regulation is monodirectional, with somatic cells preferring to splice at the distal 3′ splice site (furthest from the 5′ end of the intron) and germline cells showing a distinct shift toward usage of the adjacent proximal 3′ splice site (closer to the 5′ end of the intron). Splicing patterns in somatic cells follow C. elegans consensus rules of 3′ splice site definition; a short stretch of pyrimidines preceding an AG dinucleotide. Splicing in germline cells occurs at proximal 3′ splice sites that lack a preceding polypyrimidine tract, and in three instances the germline-specific site lacks the AG dinucleotide. We provide evidence that use of germline-specific proximal 3′ splice sites is conserved across Caenorhabditis species. We propose that there are differences between germline and somatic cells in the way that the basal splicing machinery functions to determine the intron terminus. PMID:25922281

  13. Mutually Exclusive Splicing of the Insect Dscam Pre-mRNA Directed by Competing Intronic RNA Secondary Structures

    PubMed Central

    Graveley, Brenton R.

    2008-01-01

    Summary Drosophila Dscam encodes 38,016 distinct axon guidance receptors through the mutually exclusive alternative splicing of 95 variable exons. Importantly, known mechanisms that ensure the mutually exclusive splicing of pairs of exons cannot explain this phenomenon in Dscam. I have identified two classes of conserved elements in the Dscam exon 6 cluster, which contains 48 alternative exons—the docking site, located in the intron downstream of constitutive exon 5, and the selector sequences, which are located upstream of each exon 6 variant. Strikingly, each selector sequence is complementary to a portion of the docking site, and this pairing juxtaposes one, and only one, alternative exon to the upstream constitutive exon. The mutually exclusive nature of the docking site:selector sequence interactions suggests that the formation of these competing RNA structures is a central component of the mechanism guaranteeing that only one exon 6 variant is included in each Dscam mRNA. PMID:16213213
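
    The figures of 38,016 isoforms and 95 variable exons follow from simple combinatorics: one variant is chosen from each of four mutually exclusive exon clusters. The short Python check below uses the cluster sizes commonly cited for Drosophila Dscam (12, 48, 33 and 2 variants for exons 4, 6, 9 and 17); only the exon 6 count of 48 is stated in the abstract above, and the other three are supplied here as an assumption.

```python
from math import prod

# Number of alternative variants per mutually exclusive exon cluster.
# Only "exon 6": 48 comes from the abstract; the rest are assumed values.
cluster_sizes = {"exon 4": 12, "exon 6": 48, "exon 9": 33, "exon 17": 2}

# Each mRNA includes exactly one variant per cluster, so counts multiply.
total_variable_exons = sum(cluster_sizes.values())
possible_isoforms = prod(cluster_sizes.values())

print(total_variable_exons)  # 95
print(possible_isoforms)     # 38016
```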

  14. Short intronic repeat sequences facilitate circular RNA production.

    PubMed

    Liang, Dongming; Wilusz, Jeremy E

    2014-10-15

    Recent deep sequencing studies have revealed thousands of circular noncoding RNAs generated from protein-coding genes. These RNAs are produced when the precursor messenger RNA (pre-mRNA) splicing machinery "backsplices" and covalently joins, for example, the two ends of a single exon. However, the mechanism by which the spliceosome selects only certain exons to circularize is largely unknown. Using extensive mutagenesis of expression plasmids, we show that miniature introns containing the splice sites along with short (∼30- to 40-nucleotide) inverted repeats, such as Alu elements, are sufficient to allow the intervening exons to circularize in cells. The intronic repeats must base-pair to one another, thereby bringing the splice sites into close proximity to each other. More than simple thermodynamics is clearly at play, however, as not all repeats support circularization, and increasing the stability of the hairpin between the repeats can sometimes inhibit circular RNA biogenesis. The intronic repeats and exonic sequences must collaborate with one another, and a functional 3' end processing signal is required, suggesting that circularization may occur post-transcriptionally. These results suggest detailed and generalizable models that explain how the splicing machinery determines whether to produce a circular noncoding RNA or a linear mRNA. © 2014 Liang and Wilusz; Published by Cold Spring Harbor Laboratory Press.
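
    Two intronic segments form an inverted repeat when one is the reverse complement of the other, which is what allows them to base-pair across the intervening exon. A minimal Python sketch of that check follows; it considers Watson-Crick pairs only, ignores G-U wobble pairs and thermodynamics (which the abstract notes also matter), and uses hypothetical function names and toy sequences.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence (A/C/G/T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def paired_fraction(upstream: str, downstream: str) -> float:
    """Fraction of positions at which two equal-length segments could base-pair.

    The downstream segment is reverse-complemented and compared position by
    position with the upstream segment; 1.0 means a perfect inverted repeat.
    """
    if not upstream or len(upstream) != len(downstream):
        raise ValueError("segments must be non-empty and of equal length")
    rc = reverse_complement(downstream)
    matches = sum(a == b for a, b in zip(upstream.upper(), rc))
    return matches / len(upstream)


# Toy flanking-intron segments (hypothetical).
upstream_repeat = "GGATCCAGT"
downstream_repeat = reverse_complement(upstream_repeat)
print(downstream_repeat)
print(paired_fraction(upstream_repeat, downstream_repeat))
```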

  15. RNA splicing. The human splicing code reveals new insights into the genetic determinants of disease.

    PubMed

    Xiong, Hui Y; Alipanahi, Babak; Lee, Leo J; Bretschneider, Hannes; Merico, Daniele; Yuen, Ryan K C; Hua, Yimin; Gueroussov, Serge; Najafabadi, Hamed S; Hughes, Timothy R; Morris, Quaid; Barash, Yoseph; Krainer, Adrian R; Jojic, Nebojsa; Scherer, Stephen W; Blencowe, Benjamin J; Frey, Brendan J

    2015-01-09

    To facilitate precision medicine and whole-genome annotation, we developed a machine-learning technique that scores how strongly genetic variants affect RNA splicing, whose alteration contributes to many diseases. Analysis of more than 650,000 intronic and exonic variants revealed widespread patterns of mutation-driven aberrant splicing. Intronic disease mutations that are more than 30 nucleotides from any splice site alter splicing nine times as often as common variants, and missense exonic disease mutations that have the least impact on protein function are five times as likely as others to alter splicing. We detected tens of thousands of disease-causing mutations, including those involved in cancers and spinal muscular atrophy. Examination of intronic and exonic variants found using whole-genome sequencing of individuals with autism revealed misspliced genes with neurodevelopmental phenotypes. Our approach provides evidence for causal variants and should enable new discoveries in precision medicine. Copyright © 2015, American Association for the Advancement of Science.

  16. [Alternative splicing regulation: implications in cancer diagnosis and treatment].

    PubMed

    Martínez-Montiel, Nancy; Rosas-Murrieta, Nora; Martínez-Contreras, Rebeca

    2015-04-08

    The accurate expression of the genetic information is regulated by processes like mRNA splicing, proposed after the discoveries of Phil Sharp and Richard Roberts, who demonstrated the existence of intronic sequences, present in almost every structural eukaryotic gene, which should be precisely removed. This intron removal is called "splicing", which generates different proteins from a single mRNA, with different or even antagonistic functions. We currently know that alternative splicing is the most important source of protein diversity, given that 70% of the human genes undergo splicing and that mutations causing defects in this process could originate up to 50% of genetic diseases, including cancer. When these defects occur in genes involved in cell adhesion, proliferation and cell cycle regulation, there is an impact on cancer progression, raising the opportunity to diagnose and treat some types of cancer according to a particular splicing profile. Copyright © 2013 Elsevier España, S.L.U. All rights reserved.

  17. Evolution of a tissue-specific splicing network

    PubMed Central

    Taliaferro, J. Matthew; Alvarez, Nehemiah; Green, Richard E.; Blanchette, Marco; Rio, Donald C.

    2011-01-01

    Alternative splicing of precursor mRNA (pre-mRNA) is a strategy employed by most eukaryotes to increase transcript and proteomic diversity. Many metazoan splicing factors are members of multigene families, with each member having different functions. How these highly related proteins evolve unique properties has been unclear. Here we characterize the evolution and function of a new Drosophila splicing factor, termed LS2 (Large Subunit 2), that arose from a gene duplication event of dU2AF50, the large subunit of the highly conserved heterodimeric general splicing factor U2AF (U2-associated factor). The quickly evolving LS2 gene has diverged from the splicing-promoting, ubiquitously expressed dU2AF50 such that it binds a markedly different RNA sequence, acts as a splicing repressor, and is preferentially expressed in testes. Target transcripts of LS2 are also enriched for performing testes-related functions. We therefore propose a path for the evolution of a new splicing factor in Drosophila that regulates specific pre-mRNAs and contributes to transcript diversity in a tissue-specific manner. PMID:21406555

  18. Cancer-Associated Perturbations in Alternative Pre-messenger RNA Splicing.

    PubMed

    Shkreta, Lulzim; Bell, Brendan; Revil, Timothée; Venables, Julian P; Prinos, Panagiotis; Elela, Sherif Abou; Chabot, Benoit

    2013-01-01

    For most of our 25,000 genes, the removal of introns by pre-messenger RNA (pre-mRNA) splicing represents an essential step toward the production of functional messenger RNAs (mRNAs). Alternative splicing of a single pre-mRNA results in the production of different mRNAs. Although complex organisms use alternative splicing to expand protein function and phenotypic diversity, patterns of alternative splicing are often altered in cancer cells. Alternative splicing contributes to tumorigenesis by producing splice isoforms that can stimulate cell proliferation and cell migration or induce resistance to apoptosis and anticancer agents. Cancer-specific changes in splicing profiles can occur through mutations that are affecting splice sites and splicing control elements, and also by alterations in the expression of proteins that control splicing decisions. Recent progress in global approaches that interrogate splicing diversity should help to obtain specific splicing signatures for cancer types. The development of innovative approaches for annotating and reprogramming splicing events will more fully establish the essential contribution of alternative splicing to the biology of cancer and will hopefully provide novel targets and anticancer strategies. Metazoan genes are usually made up of several exons interrupted by introns. The introns are removed from the pre-mRNA by RNA splicing. In conjunction with other maturation steps, such as capping and polyadenylation, the spliced mRNA is then transported to the cytoplasm to be translated into a functional protein. The basic mechanism of splicing requires accurate recognition of each extremity of each intron by the spliceosome. Introns are identified by the binding of U1 snRNP to the 5' splice site and the U2AF65/U2AF35 complex to the 3' splice site. Following these interactions, other proteins and snRNPs are recruited to generate the complete spliceosomal complex needed to excise the intron. 
    While many introns are constitutively removed by the spliceosome, other splice junctions are not used systematically, generating the phenomenon of alternative splicing. Alternative splicing is therefore the process by which a single species of pre-mRNA can be matured to produce different mRNA molecules (Fig. 1). Depending on the number and types of alternative splicing events, a pre-mRNA can generate from two to several thousand different mRNAs leading to the production of a corresponding number of proteins. It is now believed that the expression of at least 70% of human genes is subjected to alternative splicing, implying an enormous contribution to proteomic diversity, and by extension, to the development and the evolution of complex animals. Defects in splicing have been associated with human diseases (Caceres and Kornblihtt, Trends Genet 18(4):186-93, 2002, Cartegni et al., Nat Rev Genet 3(4):285-98, 2002, Pagani and Baralle, Nat Rev Genet 5(5):389-96, 2004), including cancer (Brinkman, Clin Biochem 37(7):584-94, 2004, Venables, Bioessays 28(4):378-86, 2006, Srebrow and Kornblihtt, J Cell Sci 119(Pt 13):2635-2641, 2006, Revil et al., Bull Cancer 93(9):909-919, 2006, Venables, Transworld Res Network, 2006, Pajares et al., Lancet Oncol 8(4):349-57, 2007, Skotheim and Nees, Int J Biochem Cell Biol 39:1432-1449, 2007). Numerous studies have now confirmed the existence of specific differences in the alternative splicing profiles between normal and cancer tissues. Although there are a few cases where specific mutations are the primary cause for these changes, global alterations in alternative splicing in cancer cells may be primarily derived from changes in the expression of RNA-binding proteins that control splice site selection. Overall, these cancer-specific differences in alternative splicing offer an immense potential to improve the diagnosis and the prognosis of cancer.
    This review will focus on the functional impact of cancer-associated alternative splicing variants, the molecular determinants that alter the splicing decisions in cancer cells, and future therapeutic strategies.

  19. Informational structure of genetic sequences and nature of gene splicing

    NASA Astrophysics Data System (ADS)

    Trifonov, E. N.

    1991-10-01

    Only about 1/20 of the DNA of higher organisms codes for proteins by means of the classical triplet code. The rest of the DNA sequence is largely silent, with unclear functions, if any. The triplet code is not the only code (message) carried by the sequences. There are three levels of molecular communication, where the same sequence "talks" to various biomolecules while having, respectively, three different appearances: DNA, RNA and protein. Since the molecular structures and, hence, the sequence-specific preferences of these are substantially different, the original DNA sequence has to carry simultaneously three types of sequence patterns (codes, messages), thus being a composite structure in which one and the same letter (nucleotide) is frequently involved in several overlapping codes of different nature. This multiplicity and overlapping of the codes is a unique feature of Gnomic, the language of genetic sequences. The coexisting codes have to be degenerate in various degrees to allow an optimal and concerted performance of all the encoded functions. There is an obvious conflict between the best possible performance of a given function and the necessity to compromise the quality of a given sequence pattern in favor of other patterns. It appears that the major role of various changes in the sequences on their "ontogenetic" way from DNA to RNA to protein, like RNA editing and splicing or protein post-translational modifications, is to resolve such conflicts. New data are presented strongly indicating that gene splicing is such a device to resolve the conflict between the code of DNA folding in chromatin and the triplet code for protein synthesis.

  20. Automated eukaryotic gene structure annotation using EVidenceModeler and the Program to Assemble Spliced Alignments

    PubMed Central

    Haas, Brian J; Salzberg, Steven L; Zhu, Wei; Pertea, Mihaela; Allen, Jonathan E; Orvis, Joshua; White, Owen; Buell, C Robin; Wortman, Jennifer R

    2008-01-01

    EVidenceModeler (EVM) is presented as an automated eukaryotic gene structure annotation tool that reports eukaryotic gene structures as a weighted consensus of all available evidence. EVM, when combined with the Program to Assemble Spliced Alignments (PASA), yields a comprehensive, configurable annotation system that predicts protein-coding genes and alternatively spliced isoforms. Our experiments on both rice and human genome sequences demonstrate that EVM produces automated gene structure annotation approaching the quality of manual curation. PMID:18190707

  1. A short conserved motif in ALYREF directs cap- and EJC-dependent assembly of export complexes on spliced mRNAs.

    PubMed

    Gromadzka, Agnieszka M; Steckelberg, Anna-Lena; Singh, Kusum K; Hofmann, Kay; Gehring, Niels H

    2016-03-18

    The export of messenger RNAs (mRNAs) is the final of several nuclear posttranscriptional steps of gene expression. The formation of export-competent mRNPs involves the recruitment of export factors that are assumed to facilitate transport of the mature mRNAs. Using in vitro splicing assays, we show that a core set of export factors, including ALYREF, UAP56 and DDX39, readily associate with the spliced RNAs in an EJC (exon junction complex)- and cap-dependent manner. In order to elucidate how ALYREF and other export adaptors mediate mRNA export, we conducted a computational analysis and discovered four short, conserved, linear motifs present in RNA-binding proteins. We show that mutation in one of the new motifs (WxHD) in an unstructured region of ALYREF reduced RNA binding and abolished the interaction with eIF4A3 and CBP80. Additionally, the mutation impaired proper localization to nuclear speckles and export of a spliced reporter mRNA. Our results reveal important details of the orchestrated recruitment of export factors during the formation of export competent mRNPs. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  2. Large-scale transcriptome analysis in chickpea (Cicer arietinum L.), an orphan legume crop of the semi-arid tropics of Asia and Africa.

    PubMed

    Hiremath, Pavana J; Farmer, Andrew; Cannon, Steven B; Woodward, Jimmy; Kudapa, Himabindu; Tuteja, Reetu; Kumar, Ashish; Bhanuprakash, Amindala; Mulaosmanovic, Benjamin; Gujaria, Neha; Krishnamurthy, Laxmanan; Gaur, Pooran M; Kavikishor, Polavarapu B; Shah, Trushar; Srinivasan, Ramamurthy; Lohse, Marc; Xiao, Yongli; Town, Christopher D; Cook, Douglas R; May, Gregory D; Varshney, Rajeev K

    2011-10-01

    Chickpea (Cicer arietinum L.) is an important legume crop in the semi-arid regions of Asia and Africa. Gains in crop productivity have been low however, particularly because of biotic and abiotic stresses. To help enhance crop productivity using molecular breeding techniques, next generation sequencing technologies such as Roche/454 and Illumina/Solexa were used to determine the sequence of most gene transcripts and to identify drought-responsive genes and gene-based molecular markers. A total of 103,215 tentative unique sequences (TUSs) have been produced from 435,018 Roche/454 reads and 21,491 Sanger expressed sequence tags (ESTs). Putative functions were determined for 49,437 (47.8%) of the TUSs, and gene ontology assignments were determined for 20,634 (41.7%) of the TUSs. Comparison of the chickpea TUSs with the Medicago truncatula genome assembly (Mt 3.5.1 build) resulted in 42,141 aligned TUSs with putative gene structures (including 39,281 predicted intron/splice junctions). Alignment of ∼37 million Illumina/Solexa tags generated from drought-challenged root tissues of two chickpea genotypes against the TUSs identified 44,639 differentially expressed TUSs. The TUSs were also used to identify a diverse set of markers, including 728 simple sequence repeats (SSRs), 495 single nucleotide polymorphisms (SNPs), 387 conserved orthologous sequence (COS) markers, and 2088 intron-spanning region (ISR) markers. This resource will be useful for basic and applied research for genome analysis and crop improvement in chickpea. Plant Biotechnology Journal © 2011 Society for Experimental Biology, Association of Applied Biologists and Blackwell Publishing Ltd. No claim to original US government works.

  3. Identifying RNA splicing factors using IFT genes in Chlamydomonas reinhardtii.

    PubMed

    Lin, Huawen; Zhang, Zhengyan; Iomini, Carlo; Dutcher, Susan K

    2018-03-01

    Intraflagellar transport moves proteins in and out of flagella/cilia and it is essential for the assembly of these organelles. Using whole-genome sequencing, we identified splice site mutations in two IFT genes, IFT81 (fla9) and IFT121 (ift121-2), which lead to flagellar assembly defects in the unicellular green alga Chlamydomonas reinhardtii. The splicing defects in these ift mutants are partially corrected by mutations in two conserved spliceosome proteins, DGR14 and FRA10. We identified a dgr14 deletion mutant, which suppresses the 3' splice site mutation in IFT81, and a frameshift mutant of FRA10, which suppresses the 5' splice site mutation in IFT121. Surprisingly, we found dgr14-1 and fra10 mutations suppress both splice site mutations. We suggest these two proteins are involved in facilitating splice site recognition/interaction; in their absence some splice site mutations are tolerated. Nonsense mutations in SMG1, which is involved in nonsense-mediated decay, lead to accumulation of aberrant transcripts and partial restoration of flagellar assembly in the ift mutants. The high density of introns and the conservation of noncore splicing factors, together with the ease of scoring the ift mutant phenotype, make Chlamydomonas an attractive organism to identify new proteins involved in splicing through suppressor screening. © 2018 The Authors.

  4. Mis-Spliced Lr34 Transcript Events in Winter Wheat.

    PubMed

    Fang, Tilin; Carver, Brett F; Hunger, Robert M; Yan, Liuling

    2017-01-01

    Lr34 in wheat is a non-race-specific gene that confers resistance against multiple fungal pathogens. The resistant allele Lr34r and the susceptible allele Lr34s can be distinguished by three polymorphisms that cause alteration of the deduced amino acid sequences of Lr34 at the protein level. In seedlings of a cultivar carrying the resistant Lr34r allele, only a portion (35%) of its transcripts was correctly spliced and the majority (65%) of its transcripts were incorrectly spliced due to multiple mis-splicing events. Lr34 mis-splicing events were also observed at adult plant age, when this gene exerts its function. All of the mis-spliced Lr34r cDNA transcripts observed in this study resulted in a premature stop codon due to a shift of the open reading frame; hence, the mis-spliced Lr34r cDNAs were deduced to encode incomplete proteins. Even if a cultivar has a functional Lr34 gene, its transcripts might not all be spliced in the correct pattern. These findings suggested that the partial resistance conferred by a quantitative gene might be due to mis-splicing events in its transcripts; hence, the resistance of the gene could be increased by eliminating or mutating regulators that cause mis-splicing events in wheat.

  5. Regulatory RNA binding proteins contribute to the transcriptome-wide splicing alterations in human cellular senescence.

    PubMed

    Dong, Qiongye; Wei, Lei; Zhang, Michael Q; Wang, Xiaowo

    2018-06-24

    Dysregulation of mRNA splicing has been observed in certain cellular senescence processes. However, the common splicing alterations across the whole transcriptome shared by various types of senescence are poorly understood. In order to systematically identify senescence-associated transcriptomic changes on a genome-wide scale, we collected RNA sequencing datasets of different human cell types with a variety of senescence-inducing methods from public databases and performed a meta-analysis. First, we discovered that a group of RNA binding proteins were consistently down-regulated in diverse senescent samples and identified 406 senescence-associated common differential splicing events. Then, eight differentially expressed RNA binding proteins were predicted to regulate these senescence-associated splicing alterations through an enrichment analysis of their RNA binding information, including motif scanning and enhanced cross-linking immunoprecipitation data. In addition, we constructed the splicing regulatory modules that might contribute to senescence-associated biological processes. Finally, it was confirmed that knockdown of the predicted senescence-associated potential splicing regulators through shRNAs in the HepG2 cell line could result in senescence-like splicing changes. Taken together, our work demonstrated a broad range of common changes in mRNA splicing switches and detected their central regulatory RNA binding proteins during senescence. These findings would help to better understand the coordinated splicing alterations in cellular senescence.

  6. In silico study of breast cancer associated gene 3 using LION Target Engine and other tools.

    PubMed

    León, Darryl A; Cànaves, Jaume M

    2003-12-01

    Sequence analysis of individual targets is an important step in annotation and validation. As a test case, we investigated human breast cancer associated gene 3 (BCA3) with LION Target Engine and with other bioinformatics tools. LION Target Engine confirmed that the BCA3 gene is located on 11p15.4 and that the two most likely splice variants (lacking exon 3 and exons 3 and 5, respectively) exist. Based on our manual curation of sequence data, it is proposed that an additional variant (missing only exon 5) published in a public sequence repository, is a prediction artifact. A significant number of new orthologs were also identified, and these were the basis for a high-quality protein secondary structure prediction. Moreover, our research confirmed several distinct functional domains as described in earlier reports. Sequence conservation from multiple sequence alignments, splice variant identification, secondary structure predictions, and predicted phosphorylation sites suggest that the removal of interaction sites through alternative splicing might play a modulatory role in BCA3. This in silico approach shows the depth and relevance of an analysis that can be accomplished by including a variety of publicly available tools with an integrated and customizable life science informatics platform.

  7. Novel C8orf37 mutations cause retinitis pigmentosa in consanguineous families of Pakistani origin

    PubMed Central

    Ravesh, Zeinab; El Asrag, Mohammed E.; Weisschuh, Nicole; McKibbin, Martin; Reuter, Peggy; Watson, Christopher M.; Baumann, Britta; Poulter, James A.; Sajid, Sundus; Panagiotou, Evangelia S.; O’Sullivan, James; Abdelhamed, Zakia; Bonin, Michael; Soltanifar, Mehdi; Black, Graeme C.M.; Din, Muhammad Amin-ud; Toomes, Carmel; Ansar, Muhammad; Inglehearn, Chris F.; Wissinger, Bernd

    2015-01-01

    Purpose To investigate the molecular basis of retinitis pigmentosa in two consanguineous families of Pakistani origin with multiple affected members. Methods Homozygosity mapping and Sanger sequencing of candidate genes were performed in one family while the other was analyzed with whole exome next-generation sequencing. A minigene splicing assay was used to confirm the splicing defects. Results In family MA48, a novel homozygous nucleotide substitution in C8orf37, c.244–2A>C, that disrupted the consensus splice acceptor site of exon 3 was found. The minigene splicing assay revealed that this mutation activated a cryptic splice site within exon 3, causing a 22 bp deletion in the transcript that is predicted to lead to a frameshift followed by premature protein truncation. In family MA13, a novel homozygous null mutation in C8orf37, c.555G>A, p.W185*, was identified. Both mutations segregated with the disease phenotype as expected in a recessive manner and were absent in 8,244 unrelated individuals of South Asian origin. Conclusions In this report, we describe C8orf37 mutations that cause retinal dystrophy in two families of Pakistani origin, contributing further data on the phenotype and the spectrum of mutations in this form of retinitis pigmentosa. PMID:25802487

  8. Wire Crimp Termination Verification Using Ultrasonic Inspection

    NASA Technical Reports Server (NTRS)

    Perey, Daniel F.; Cramer, K. Elliott; Yost, William T.

    2007-01-01

    The development of a new ultrasonic measurement technique to quantitatively assess wire crimp terminations is discussed. The amplitude change of a compressional ultrasonic wave propagating through the junction of a crimp termination and wire is shown to correlate with the results of a destructive pull test, which is a standard for assessing crimp wire junction quality. Various crimp junction pathologies such as undercrimping, missing wire strands, incomplete wire insertion, partial insulation removal, and incorrect wire gauge are ultrasonically tested, and their results are correlated with pull tests. Results show that the nondestructive ultrasonic measurement technique consistently (as evidenced with destructive testing) predicts good crimps when ultrasonic transmission is above a certain threshold amplitude level. A physics-based model, solved by finite element analysis, describes the compressional ultrasonic wave propagation through the junction during the crimping process. This model is in agreement within 6% of the ultrasonic measurements. A prototype instrument for applying this technique while wire crimps are installed is also presented. The instrument is based on a two-jaw type crimp tool suitable for butt-splice type connections. Finally, an approach for application to multipin indenter type crimps will be discussed.

  9. U2AF1 mutations alter splice site recognition in hematological malignancies.

    PubMed

    Ilagan, Janine O; Ramakrishnan, Aravind; Hayes, Brian; Murphy, Michele E; Zebari, Ahmad S; Bradley, Philip; Bradley, Robert K

    2015-01-01

    Whole-exome sequencing studies have identified common mutations affecting genes encoding components of the RNA splicing machinery in hematological malignancies. Here, we sought to determine how mutations affecting the 3' splice site recognition factor U2AF1 alter its normal role in RNA splicing. We find that U2AF1 mutations influence the similarity of splicing programs in leukemias, but do not give rise to widespread splicing failure. U2AF1 mutations cause differential splicing of hundreds of genes, affecting biological pathways such as DNA methylation (DNMT3B), X chromosome inactivation (H2AFY), the DNA damage response (ATR, FANCA), and apoptosis (CASP8). We show that U2AF1 mutations alter the preferred 3' splice site motif in patients, in cell culture, and in vitro. Mutations affecting the first and second zinc fingers give rise to different alterations in splice site preference and largely distinct downstream splicing programs. These allele-specific effects are consistent with a computationally predicted model of U2AF1 in complex with RNA. Our findings suggest that U2AF1 mutations contribute to pathogenesis by causing quantitative changes in splicing that affect diverse cellular pathways, and give insight into the normal function of U2AF1's zinc finger domains. © 2015 Ilagan et al.; Published by Cold Spring Harbor Laboratory Press.

  10. Optimized approach for Ion Proton RNA sequencing reveals details of RNA splicing and editing features of the transcriptome.

    PubMed

    Brown, Roger B; Madrid, Nathaniel J; Suzuki, Hideaki; Ness, Scott A

    2017-01-01

    RNA-sequencing (RNA-seq) has become the standard method for unbiased analysis of gene expression but also provides access to more complex transcriptome features, including alternative RNA splicing, RNA editing, and even detection of fusion transcripts formed through chromosomal translocations. However, differences in library methods can adversely affect the ability to recover these different types of transcriptome data. For example, some methods have bias for one end of transcripts or rely on low-efficiency steps that limit the complexity of the resulting library, making detection of rare transcripts less likely. We tested several commonly used methods of RNA-seq library preparation and found vast differences in the detection of advanced transcriptome features, such as alternatively spliced isoforms and RNA editing sites. By comparing several different protocols available for the Ion Proton sequencer and by utilizing detailed bioinformatics analysis tools, we were able to develop an optimized random primer based RNA-seq technique that is reliable at uncovering rare transcript isoforms and RNA editing features, as well as fusion reads from oncogenic chromosome rearrangements. The combination of optimized libraries and rapid Ion Proton sequencing provides a powerful platform for the transcriptome analysis of research and clinical samples.

  11. Method of artificial DNA splicing by directed ligation (SDL).

    PubMed Central

    Lebedenko, E N; Birikh, K R; Plutalov, O V; Berlin YuA

    1991-01-01

    An approach to directed genetic recombination in vitro has been devised, which allows for joining together, in a predetermined way, a series of DNA segments to give a precisely spliced polynucleotide sequence (DNA splicing by directed ligation, SDL). The approach makes use of amplification, by means of several polymerase chain reactions (PCR), of a chosen set of DNA segments. Primers for the amplifications contain recognition sites of the class IIS restriction endonucleases, which transform blunt ends of the amplification products into protruding ends of unique primary structures, the ends to be used for joining segments together being mutually complementary. Ligation of the mixture of the segments so synthesized gives the desired sequence in an unambiguous way. The suggested approach has been exemplified by the synthesis of a totally processed (intronless) gene encoding human mature interleukin-1 alpha. PMID:1662363
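
    The joining logic described in this abstract (unique, mutually complementary protruding ends that fix the order of ligation) can be illustrated with a minimal sketch. This is not the authors' protocol: the `Segment` type, the segment names and the four-letter overhangs are hypothetical, and each protruding end is labeled by its top-strand sequence so that two complementary ends carry the same label.

```python
from typing import List, NamedTuple, Optional


class Segment(NamedTuple):
    name: str
    left: Optional[str]   # label of the left protruding end, None for the outer end
    right: Optional[str]  # label of the right protruding end, None for the outer end


def assembly_order(segments: List[Segment]) -> List[str]:
    """Return the single order in which segments can ligate, or raise ValueError."""
    by_left = {}
    for seg in segments:
        if seg.left is None:
            continue
        if seg.left in by_left:
            raise ValueError(f"overhang {seg.left} is not unique")
        by_left[seg.left] = seg

    starts = [seg for seg in segments if seg.left is None]
    if len(starts) != 1:
        raise ValueError("expected exactly one segment with a free left end")

    order = [starts[0]]
    # Follow matching overhangs from the first segment to the last one.
    while order[-1].right is not None:
        nxt = by_left.get(order[-1].right)
        if nxt is None:
            raise ValueError(f"no partner for overhang {order[-1].right}")
        if len(order) >= len(segments):
            raise ValueError("overhangs form a cycle")
        order.append(nxt)

    if len(order) != len(segments):
        raise ValueError("some segments cannot be joined to the chain")
    return [seg.name for seg in order]


# Hypothetical three-segment mixture, listed out of order.
mixture = [
    Segment("segment2", "ACGT", "GGCA"),
    Segment("segment1", None, "ACGT"),
    Segment("segment3", "GGCA", None),
]
print(assembly_order(mixture))
```

    Because every overhang label occurs only once on each side, the chain can be followed in exactly one way, which is the sense in which ligation of the mixture is "unambiguous".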

  12. RNA-Seq analysis identifies aberrant RNA splicing of TRIP12 in acute myeloid leukemia patients at remission.

    PubMed

    Gao, Panke; Jin, Zhen; Cheng, Yingying; Cao, Xiangshan

    2014-10-01

    Aberrant splicing events play important roles in the pathogenesis of acute myeloid leukemia (AML). To investigate the aberrant splicing events in AML during treatment, we carried out RNA sequencing in peripheral mononuclear cell samples from a patient with complete remission. In addition to the sequencing samples, selected splicing events were confirmed and validated with real-time quantitative RT-PCR in another seven pairs of samples. A total of 4.05 and 3.39 GB of clean data were generated for the AML and remission samples, respectively, and 2,223 differentially expressed genes (DEGs) were identified. When integrated with gene expression profiling on T cells from AML patients compared with healthy donors, 82 DEGs were also differentially expressed in AML CD4 T cells and CD8 T cells. Twenty-three alternative splicing events were considered to be of high confidence, and they were involved in many biological processes, such as RNA processing, cellular macromolecule catabolic process, and DNA binding process. An exon3-skipping event in TRIP12 was detected in patients at remission and further validated in another three independent samples. TRIP12 is a ubiquitin ligase of ARF, which suppresses aberrant cell growth by activating p53 responses. The exon3-skipping isoform of TRIP12 increased significantly after treatment. Our results may provide new understanding of AML, and the confirmed alternative splicing event of TRIP12 may be used as a potential target for future investigations.

  13. MYCN controls an alternative RNA splicing program in high-risk metastatic neuroblastoma

    PubMed Central

    Zhang, Shile; Wei, Jun S.; Li, Samuel Q.; Badgett, Tom C.; Song, Young K.; Agarwal, Saurabh; Coarfa, Cristian; Tolman, Catherine; Hurd, Laura; Liao, Hongling; He, Jianbin; Wen, Xinyu; Liu, Zhihui; Thiele, Carol J.; Westermann, Frank; Asgharzadeh, Shahab; Seeger, Robert C.; Maris, John M.; Auvil, Jamie M Guidry; Smith, Malcolm A; Kolaczyk, Eric D; Shohet, Jason; Khan, Javed

    2016-01-01

    The molecular mechanisms underlying the aggressive behavior of MYCN-driven neuroblastoma (NBL) are under intense investigation; however, little is known about the impact of this family of transcription factors on the splicing program. Here we used high-throughput RNA sequencing to systematically study the expression of RNA isoforms in stage 4 MYCN-amplified NBL, an aggressive subtype of metastatic NBL. We show that MYCN-amplified NBL tumors display a distinct gene splicing pattern affecting multiple cancer hallmark functions. Six splicing factors displayed unique differential expression patterns in MYCN-amplified tumors and cell lines, and the binding motifs for some of these splicing factors are significantly enriched in differentially-spliced genes. Direct binding of MYCN to promoter regions of the splicing factors PTBP1 and HNRNPA1 detected by ChIP-seq demonstrates that MYCN controls the splicing pattern by direct regulation of the expression of these key splicing factors. Furthermore, high expression of PTBP1 and HNRNPA1 was significantly associated with poor overall survival of stage 4 NBL patients (p≤0.05). Knocking down PTBP1, HNRNPA1 and their downstream target PKM2, an isoform that promotes tumor growth, results in repressed growth of NBL cells. Therefore, our study reveals a novel role of MYCN in controlling the global splicing program through regulation of splicing factors, in addition to its well-known role in the transcription program. These findings suggest a therapeutic potential to target the key splicing factors or gene isoforms in high-risk NBL with MYCN amplification. PMID:26683771

  14. A mutational analysis of U12-dependent splice site dinucleotides

    PubMed Central

    DIETRICH, ROSEMARY C.; FULLER, JOHN D.; PADGETT, RICHARD A.

    2005-01-01

    Introns spliced by the U12-dependent minor spliceosome are divided into two classes based on their splice site dinucleotides. The /AU-AC/ class accounts for about one-third of U12-dependent introns in humans, while the /GU-AG/ class accounts for the other two-thirds. We have investigated the in vivo and in vitro splicing phenotypes of mutations in these dinucleotide sequences. A 5′ A residue can splice to any 3′ residue, although C is preferred. A 5′ G residue can splice to 3′ G or U residues with a preference for G. Little or no splicing was observed to 3′ A or C residues. A 5′ U or C residue is highly deleterious for U12-dependent splicing, although some combinations, notably 5′ U to 3′ U produced detectable spliced products. The dependence of 3′ splice site activity on the identity of the 5′ residue provides evidence for communication between the first and last nucleotides of the intron. Most mutants in the second position of the 5′ splice site and the next to last position of the 3′ splice site were defective for splicing. Double mutants of these residues showed no evidence of communication between these nucleotides. Varying the distance between the branch site and the 3′ splice site dinucleotide in the /GU-AG/ class showed that a somewhat larger range of distances was functional than for the /AU-AC/ class. The optimum branch site to 3′ splice site distance of 11–12 nucleotides appears to be the same for both classes. PMID:16043500
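
    The two classes named in this abstract are defined by the first and last two nucleotides of the intron, which can be read off a sequence as in the minimal sketch below. The function names and example sequences are hypothetical. Note that terminal dinucleotides alone do not identify an intron as U12-dependent, since GU-AG termini are also the norm for major (U2-dependent) introns; the sketch only labels the termini.

```python
def terminal_dinucleotide_class(intron: str) -> str:
    """Return the terminal dinucleotides of an intron in RNA letters, e.g. 'GU-AG'."""
    seq = intron.upper().replace("T", "U")  # accept DNA or RNA letters
    if len(seq) < 4:
        raise ValueError("intron must contain at least four nucleotides")
    return f"{seq[:2]}-{seq[-2:]}"


def in_described_classes(intron: str) -> bool:
    """True if the termini match one of the two classes named in the abstract."""
    return terminal_dinucleotide_class(intron) in {"AU-AC", "GU-AG"}


# Hypothetical toy sequences, far shorter than real introns.
print(terminal_dinucleotide_class("GTATCCTTTTAG"))
print(terminal_dinucleotide_class("ATATCCTTAC"))
print(in_described_classes("CTATCCTTAG"))
```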

  15. Conditional protein splicing: a new tool to control protein structure and function in vitro and in vivo.

    PubMed

    Mootz, Henning D; Blum, Elyse S; Tyszkiewicz, Amy B; Muir, Tom W

    2003-09-03

    Protein splicing is a naturally occurring process in which an intervening intein domain excises itself out of a precursor polypeptide in an autocatalytic fashion with concomitant linkage of the two flanking extein sequences by a native peptide bond. We have recently reported an engineered split VMA intein whose splicing activity in trans between two polypeptides can be triggered by the small molecule rapamycin. In this report, we show that this conditional protein splicing (CPS) system can be used in mammalian cells. Two model constructs harboring maltose-binding protein (MBP) and a His-tag as exteins were expressed from a constitutive promoter after transient transfection. The splicing product MBP-His was detected by Western blotting and immunoprecipitation in cells treated with rapamycin or a nontoxic analogue thereof. No background splicing in the absence of the small-molecule inducer was observed over a 24-h time course. Product formation could be detected within 10 min of addition of rapamycin, indicating the advantage of the posttranslational nature of CPS for quick responses. The level of protein splicing was dose dependent and could be competitively attenuated with the small molecule ascomycin. In related studies, the geometric flexibility of the CPS components was investigated with a series of purified proteins. The FKBP and FRB domains, which are dimerized by rapamycin and thereby induce the reconstitution of the split intein, were fused to the extein sequences of the split intein halves. CPS was still triggered by rapamycin when FKBP and FRB occupied one or both of the extein positions. This finding suggests yet further applications of CPS in the area of proteomics. In summary, CPS holds great promise to become a powerful new tool to control protein structure and function in vitro and in living cells.

  16. Characterization of an apparently synonymous F5 mutation causing aberrant splicing and factor V deficiency.

    PubMed

    Nuzzo, F; Bulato, C; Nielsen, B I; Lee, K; Wielders, S J; Simioni, P; Key, N S; Castoldi, E

    2015-03-01

    Coagulation factor V (FV) deficiency is a rare autosomal recessive bleeding disorder. We investigated a patient with severe FV deficiency (FV:C < 3%) and moderate bleeding symptoms. Thrombin generation experiments showed residual FV expression in the patient's plasma, which was quantified as 0.7 ± 0.3% by a sensitive prothrombinase-based assay. F5 gene sequencing identified a novel missense mutation in exon 4 (c.578G>C, p.Cys193Ser), predicting the abolition of a conserved disulphide bridge, and an apparently synonymous variant in exon 8 (c.1281C>G). The observation that half of the patient's F5 mRNA lacked the last 18 nucleotides of exon 8 prompted us to re-evaluate the c.1281C>G variant for its possible effects on splicing. Bioinformatics sequence analysis predicted that this transversion would activate a cryptic donor splice site and abolish an exonic splicing enhancer. Characterization in a F5 minigene model confirmed that the c.1281C>G variant was responsible for the patient's splicing defect, which could be partially corrected by a mutation-specific morpholino antisense oligonucleotide. The aberrantly spliced F5 mRNA, whose stability was similar to that of the normal mRNA, encoded a putative FV mutant lacking amino acids 427-432. Expression in COS-1 cells indicated that the mutant protein is poorly secreted and not functional. In conclusion, the c.1281C>G mutation, which was predicted to be translationally silent and hence neutral, causes FV deficiency by impairing pre-mRNA splicing. This finding underscores the importance of cDNA analysis for the correct assessment of exonic mutations. © 2014 John Wiley & Sons Ltd.
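
    The figures in this abstract are mutually consistent: a loss of 18 nucleotides is a multiple of three, so the reading frame is preserved and six codons are removed, matching the stated loss of amino acids 427-432. The sketch below spells out that arithmetic. The function names are hypothetical, and the sketch assumes that the deletion starts at a codon boundary.

```python
from typing import Tuple


def deletion_effect(deleted_nt: int) -> str:
    """Classify a deletion within the coding sequence by its length."""
    if deleted_nt <= 0:
        raise ValueError("deleted_nt must be positive")
    if deleted_nt % 3 == 0:
        return f"in-frame: {deleted_nt // 3} codons removed"
    return "frameshift"


def residues_removed(first_residue: int, deleted_nt: int) -> Tuple[int, int]:
    """First and last residue removed by an in-frame deletion.

    Assumes the deletion begins at the first nucleotide of codon first_residue.
    """
    if deleted_nt <= 0 or deleted_nt % 3 != 0:
        raise ValueError("deletion is not in-frame")
    return first_residue, first_residue + deleted_nt // 3 - 1


print(deletion_effect(18))
print(residues_removed(427, 18))
```

    By the same arithmetic, a deletion whose length is not a multiple of three, such as the 22 bp deletion reported for C8orf37 in record 7 of this listing, shifts the reading frame.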

  17. [Genetic diagnostics of pathogenic splicing abnormalities in the clinical laboratory--pitfalls and screening approaches].

    PubMed

    Niimi, Hideki; Ogawa, Tomomi; Note, Rhougou; Hayashi, Shirou; Ueno, Tomohiro; Harada, Kenu; Uji, Yoshinori; Kitajima, Isao

    2010-12-01

    In recent years, genetic diagnostics of pathogenic splicing abnormalities has been increasingly recognized as critically important in clinical genetic diagnostics. It is reported that approximately 10% of pathogenic mutations causing human inherited diseases are splicing mutations. Nonetheless, it is still difficult to identify splicing abnormalities in routine genetic diagnostic settings. Here, we studied two different kinds of cases with splicing abnormalities. The first case is a protein S deficiency. Nucleotide analyses revealed that the proband had a previously reported G to C substitution in the invariant AG dinucleotide at the splicing acceptor site of intron 1/exon 2, which produces multiple splicing abnormalities resulting in protein S deficiency. The second case is an antithrombin (AT) deficiency. This proband had a previously reported G to A substitution at nucleotide position 9788 in intron 4, 14 bp in front of exon 5, which created a de novo exon 5 splice site and resulted in AT deficiency. From a practical standpoint, we discuss the pitfalls, points requiring attention, and screening approaches in genetic diagnostics of pathogenic splicing abnormalities. Due to the difficulty of full-length sequence analysis of introns and the lack of RNA samples, splicing mutations may escape identification. Although current genetic testing remains to be improved, to screen for splicing abnormalities more efficiently it is important to use an appropriate combination of various approaches, such as DNA and/or RNA samples, splicing mutation databases, bioinformatic tools to detect splice sites and cis-regulatory elements, and in vitro and/or in vivo experimental methods as needed.

  18. Transcriptome Bioinformatical Analysis of Vertebrate Stages of Schistosoma japonicum Reveals Alternative Splicing Events

    PubMed Central

    Wang, Xinye; Xu, Xindong; Lu, Xingyu; Zhang, Yuanbin; Pan, Weiqing

    2015-01-01

    Alternative splicing is a molecular process that contributes greatly to the diversification of the proteome and to gene functions. Understanding the mechanisms of stage-specific alternative splicing can provide a better understanding of the development of eukaryotes and the functions of different genes. Schistosoma japonicum is an infectious blood-dwelling trematode with a complex lifecycle that causes the tropical disease schistosomiasis. In this study, we analyzed the transcriptome of Schistosoma japonicum to discover alternative splicing events in this parasite by applying RNA-seq to cDNA libraries of adults and schistosomula. Results were validated by RT-PCR and sequencing. We found 11,623 alternative splicing events among 7,099 protein-encoding genes, and the average proportion of alternative splicing events per gene was 42.14%. We showed that exon skipping is the most common type of alternative splicing event, as found in higher eukaryotes, whereas intron retention is the least common type. According to intron boundary analysis, the parasite possesses the same intron boundaries as other organisms, namely the classic "GT-AG" rule; in alternatively spliced introns or exons, this rule is less strict. We have also attempted to detect alternative splicing events in genes encoding proteins with signal peptides and transmembrane helices, suggesting that alternative splicing could change the subcellular locations of specific gene products. Our results indicate that alternative splicing is prevalent in this parasitic worm, and that the worm is close to its hosts. The revealed secretome involved in alternative splicing offers a new perspective for understanding the interaction between the parasite and its host. PMID:26407301

  19. Alternative splicing by participation of the group II intron ORF in extremely halotolerant and alkaliphilic Oceanobacillus iheyensis.

    PubMed

    Chee, Gab-Joo; Takami, Hideto

    2011-01-01

    Group II introns inserted into genes often undergo splicing at unexpected sites, and participate in the transcription of host genes. We identified five copies of a group II intron, designated Oi.Int, in the genome of an extremely halotolerant and alkaliphilic bacillus, Oceanobacillus iheyensis. The Oi.Int4 differs from the Oi.Int3 at four bases. The ligated exons of the Oi.Int4 could not be detected by RT-PCR assays in vivo or in vitro although group II introns can generally self-splice in vitro without the involvement of an intron-encoded open reading frame (ORF). In the Oi.Int4 mutants with base substitutions within the ORF, ligated exons were detected by in vitro self-splicing. It was clear that the ligation of exons during splicing is affected by the sequence of the intron-encoded ORF since the splice sites corresponded to the joining sites of the intron. In addition, the mutant introns showed unexpected multiple products with alternative 5' splice sites. These findings imply that alternative 5' splicing which causes a functional change of ligated exons presumably has influenced past adaptations of O. iheyensis to various environmental changes.

  20. 17β-estradiol regulates the RNA-binding protein Nova1, which then regulates the alternative splicing of estrogen receptor β in the aging female rat brain.

    PubMed

    Shults, Cody L; Dingwall, Caitlin B; Kim, Chun K; Pinceti, Elena; Rao, Yathindar S; Pak, Toni R

    2018-01-01

    Alternative RNA splicing results in the translation of diverse protein products arising from a common nucleotide sequence. These alternative protein products are often functional and can have widely divergent actions from the canonical protein. Studies in humans and other vertebrate animals have demonstrated that alternative splicing events increase with advanced age, sometimes resulting in pathological consequences. Menopause represents a critical transition for women, where the beneficial effects of estrogens are no longer evident; therefore, factors underlying increased pathological conditions in women are confounded by the dual factors of aging and declining estrogens. Estrogen receptors (ERs) are subject to alternative splicing, the spliced variants increase following menopause, and they fail to efficiently activate estrogen-dependent signaling pathways. However, the factors that regulate the alternative splicing of ERs remain unknown. We demonstrate novel evidence supporting a potential biological feedback loop where 17β-estradiol regulates the RNA-binding protein Nova1, which, in turn, regulates the alternative splicing of ERβ. These data increase our understanding of ER alternative splicing and could have potential implications for women taking hormone replacement therapy after menopause. Copyright © 2017 Elsevier Inc. All rights reserved.

  1. Role of TAR RNA splicing in translational regulation of simian immunodeficiency virus from rhesus macaques.

    PubMed Central

    Viglianti, G A; Rubinstein, E P; Graves, K L

    1992-01-01

    The untranslated leader sequences of rhesus macaque simian immunodeficiency virus mRNAs form a stable secondary structure, TAR. This structure can be modified by RNA splicing. In this study, the role of TAR splicing in virus replication was investigated. The proportion of viral RNAs containing a spliced TAR structure is high early after infection and decreases at later times. Moreover, proviruses containing mutations which prevent TAR splicing are significantly delayed in replication. These mutant viruses require approximately 20 days to achieve half-maximal virus production, in contrast to wild-type viruses, which require approximately 8 days. We attribute this delay to the inefficient translation of unspliced-TAR-containing mRNAs. The molecular basis for this translational effect was examined in in vitro assays. We found that spliced-TAR-containing mRNAs were translated up to 8.5 times more efficiently than were similar mRNAs containing an unspliced TAR leader. Furthermore, these spliced-TAR-containing mRNAs were more efficiently associated with ribosomes. We postulate that the level of TAR splicing provides a balance for the optimal expression of both viral proteins and genomic RNA and therefore ultimately controls the production of infectious virions. PMID:1629957

  2. RBFOX and PTBP1 proteins regulate the alternative splicing of micro-exons in human brain transcripts

    PubMed Central

    Sanchez-Pulido, Luis; Haerty, Wilfried

    2015-01-01

    Ninety-four percent of mammalian protein-coding exons exceed 51 nucleotides (nt) in length. The paucity of micro-exons (≤ 51 nt) suggests that their recognition and correct processing by the splicing machinery present greater challenges than for longer exons. Yet, because thousands of human genes harbor processed micro-exons, specialized mechanisms may be in place to promote their splicing. Here, we survey deep genomic data sets to define 13,085 micro-exons and to study their splicing mechanisms and molecular functions. More than 60% of annotated human micro-exons exhibit a high level of sequence conservation, an indicator of functionality. While most human micro-exons require splicing-enhancing genomic features to be processed, the splicing of hundreds of micro-exons is enhanced by the adjacent binding of splice factors in the introns of pre-messenger RNAs. Notably, splicing of a significant number of micro-exons was found to be facilitated by the binding of RBFOX proteins, which promote their inclusion in the brain, muscle, and heart. Our analyses suggest that accurate regulation of micro-exon inclusion by RBFOX proteins and PTBP1 plays an important role in the maintenance of tissue-specific protein–protein interactions. PMID:25524026
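
    The length cutoff used in this abstract (micro-exons are exons of ≤ 51 nt) can be expressed as a short Python sketch. The function name and the 0-based end-exclusive coordinates are assumptions for illustration and do not come from the study.

```python
MICRO_EXON_MAX_NT = 51  # cutoff stated in the abstract


def is_micro_exon(exon_start: int, exon_end: int) -> bool:
    """Classify an exon by length; coordinates assumed 0-based, end-exclusive."""
    length = exon_end - exon_start
    if length <= 0:
        raise ValueError("exon must have positive length")
    return length <= MICRO_EXON_MAX_NT


# Made-up coordinates: a 51-nt exon qualifies, a 52-nt exon does not.
print(is_micro_exon(100, 151), is_micro_exon(100, 152))
```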

  3. A human-specific mutation leads to the origin of a novel splice form of neuropsin (KLK8), a gene involved in learning and memory.

    PubMed

    Lu, Zhi-xiang; Peng, Jia; Su, Bing

    2007-10-01

    Neuropsin (kallikrein 8, KLK8) is a secreted-type serine protease preferentially expressed in the central nervous system and involved in learning and memory. Its splicing pattern is different in human and mouse, with the longer form (type II) only expressed in human. Sequence analysis suggested a recent origin of type II during primate evolution. Here we demonstrate that the type II form is absent in nonhuman primates, and is thus a human-specific splice form. With the use of an in vitro splicing assay, we show that a human-specific T to A mutation (c.71-127T>A) triggers the change of splicing pattern, leading to the origin of a novel splice form in the human brain. Using mutation assay, we prove that this mutation is not only necessary but also sufficient for type II expression. Our results demonstrate a molecular mechanism for the creation of novel proteins through alternative splicing in the central nervous system during human evolution. Copyright 2007 Wiley-Liss, Inc.

  4. Molecular characterization of a nuclear topoisomerase II from Nicotiana tabacum that functionally complements a temperature-sensitive topoisomerase II yeast mutant.

    PubMed

    Singh, B N; Mudgil, Yashwanti; Sopory, S K; Reddy, M K

    2003-07-01

    We have successfully expressed enzymatically active plant topoisomerase II in Escherichia coli for the first time, which has enabled its biochemical characterization. Using a PCR-based strategy, we obtained a full-length cDNA and the corresponding genomic clone of tobacco topoisomerase II. The genomic clone has 18 exons interrupted by 17 introns. Most of the 5' and 3' splice junctions follow the typical canonical consensus dinucleotide sequence GU-AG present in other plant introns. The position of introns and phasing with respect to primary amino acid sequence in tobacco TopII and Arabidopsis TopII are highly conserved, suggesting that the two genes are evolved from the common ancestral type II topoisomerase gene. The cDNA encodes a polypeptide of 1482 amino acids. The primary amino acid sequence shows a striking sequence similarity, preserving all the structural domains that are conserved among eukaryotic type II topoisomerases in an identical spatial order. We have expressed the full-length polypeptide in E. coli and purified the recombinant protein to homogeneity. The full-length polypeptide relaxed supercoiled DNA and decatenated the catenated DNA in a Mg(2+)- and ATP-dependent manner, and this activity was inhibited by 4'-(9-acridinylamino)-3'-methoxymethanesulfonanilide (m-AMSA). The immunofluorescence and confocal microscopic studies, with antibodies developed against the N-terminal region of tobacco recombinant topoisomerase II, established the nuclear localization of topoisomerase II in tobacco BY2 cells. The regulated expression of tobacco topoisomerase II gene under the GAL1 promoter functionally complemented a temperature-sensitive TopII(ts) yeast mutant.

  5. Comparative oncology DNA sequencing of canine T cell lymphoma via human hotspot panel

    PubMed Central

    Beheshti, Afshin; Pilichowska, Monika; Burgess, Kristine; Ricks-Santi, Luisel; McNiel, Elizabeth; London, Cheryl B.; Ravi, Dashnamoorthy; Evens, Andrew M.

    2018-01-01

    T-cell lymphoma (TCL) is an uncommon and aggressive form of human cancer. Lymphoma is the most common hematopoietic tumor in canines (companion animals), with TCL representing approximately 30% of diagnoses. Collectively, the canine is an appealing model for cancer research given the spontaneous occurrence of cancer, intact immune system, and phylogenetic proximity to humans. We sought to establish mutational congruence of the canine with known human TCL mutations in order to identify potential actionable oncogenic pathways. Following pathologic confirmation, DNA was sequenced in 16 canine TCL (cTCL) cases using a custom Human Cancer Hotspot Panel of 68 genes commonly mutated in human TCL. Sequencing identified 4,527,638 total reads with average length of 229 bases containing 346 unique variants and 1,474 total variants; each sample had an average of 92 variants. Among these, there were 258 germline and 32 somatic variants. Among the 32 somatic variants there were 8 missense variants and 1 splice junction variant; the remaining were intron or synonymous variants. A frequency of 4 somatic mutations per sample was noted, with >7 mutations detected in MET, KDR, STK11 and BRAF. Expression of the associated proteins was also detected via Western blot analyses. In addition, Sanger sequencing confirmed three variants of high quality (MYC, MET, and TP53 missense mutation). Taken together, the mutational spectrum and protein analyses showed mutations in signaling pathways similar to human TCL and also identified novel mutations that may serve as drug targets as well as potential biomarkers. PMID:29854308

  6. Integrative genome-wide analysis of the determinants of RNA splicing in kidney renal clear cell carcinoma.

    PubMed

    Lehmann, Kjong-Van; Kahles, André; Kandoth, Cyriac; Lee, William; Schultz, Nikolaus; Stegle, Oliver; Rätsch, Gunnar

    2015-01-01

    We present a genome-wide analysis of splicing patterns of 282 kidney renal clear cell carcinoma patients in which we integrate data from whole-exome sequencing of tumor and normal samples, RNA-seq and copy number variation. We proposed a scoring mechanism to compare splicing patterns in tumor samples to normal samples in order to rank and detect tumor-specific isoforms that have a potential for new biomarkers. We identified a subset of genes that show introns only observable in tumor but not in normal samples, ENCODE and GEUVADIS samples. In order to improve our understanding of the underlying genetic mechanisms of splicing variation we performed a large-scale association analysis to find links between somatic or germline variants with alternative splicing events. We identified 915 cis- and trans-splicing quantitative trait loci (sQTL) associated with changes in splicing patterns. Some of these sQTL have previously been associated with being susceptibility loci for cancer and other diseases. Our analysis also allowed us to identify the function of several COSMIC variants showing significant association with changes in alternative splicing. This demonstrates the potential significance of variants affecting alternative splicing events and yields insights into the mechanisms related to an array of disease phenotypes.

  7. Mutation in Pyrroline-5-Carboxylate Reductase 1 Gene in Families with Cutis Laxa Type 2

    PubMed Central

    Guernsey, Duane L.; Jiang, Haiyan; Evans, Susan C.; Ferguson, Meghan; Matsuoka, Makoto; Nightingale, Mathew; Rideout, Andrea L.; Provost, Sylvie; Bedard, Karen; Orr, Andrew; Dubé, Marie-Pierre; Ludman, Mark; Samuels, Mark E.

    2009-01-01

    Autosomal-recessive cutis laxa type 2 (ARCL2) is a multisystem disorder characterized by the appearance of premature aging, wrinkled and lax skin, joint laxity, and a general developmental delay. Cutis laxa includes a family of clinically overlapping conditions with confusing nomenclature, generally requiring molecular analyses for definitive diagnosis. Six genes are currently known to mutate to yield one of these related conditions. We ascertained a cohort of typical ARCL2 patients from a subpopulation isolate within eastern Canada. Homozygosity mapping with high-density SNP genotyping excluded all six known genes, and instead identified a single homozygous region near the telomere of chromosome 17, shared identically by state by all genotyped affected individuals from the families. A putative pathogenic variant was identified by direct DNA sequencing of genes within the region. The single nucleotide change leads to a missense mutation adjacent to a splice junction in the gene encoding pyrroline-5-carboxylate reductase 1 (PYCR1). Bioinformatic analysis predicted a pathogenic effect of the variant on splice donor site function. Skipping of the associated exon was confirmed in RNA from blood lymphocytes of affected homozygotes and heterozygous mutation carriers. Exon skipping leads to deletion of the reductase functional domain-coding region and an obligatory downstream frameshift. PYCR1 plays a critical role in proline biosynthesis. Pathogenicity of the genetic variant in PYCR1 is likely, given that a similar clinical phenotype has been documented for mutation carriers of another proline biosynthetic enzyme, pyrroline-5-carboxylate synthase. Our results support a significant role for proline in normal development. PMID:19576563

  8. Homologous SV40 RNA trans-splicing

    PubMed Central

    Eul, Joachim; Patzel, Volker

    2013-01-01

    Simian Virus 40 (SV40) is a polyomavirus found in both monkeys and humans, which causes cancer in some animal models. In humans, SV40 has been reported to be associated with cancers but causality has not been proven yet. The transforming activity of SV40 is mainly due to its 94-kD large T antigen, which binds to the retinoblastoma (pRb) and p53 tumor suppressor proteins, and thereby perturbs their functions. Here we describe a 100 kD super T antigen harboring a duplication of the pRb binding domain that was associated with unusually high cell transformation activity and that was generated by a novel mechanism involving homologous RNA trans-splicing of SV40 early transcripts in transformed rodent cells. Enhanced trans-splice activity was observed in clones carrying a single point mutation in the large T antigen 5′ donor splice site (ss). This mutation impaired cis-splicing in favor of an alternative trans-splice reaction via a cryptic 5′ss within a second cis-spliced SV40 pre-mRNA molecule and enabled detectable gene expression. Next to the cryptic 5′ss we identified additional trans-splice helper functions, including putative dimerization domains and a splice enhancer sequence. Our findings suggest RNA trans-splicing as an SV40-intrinsic mechanism that supports the diversification of viral RNA and phenotypes. PMID:24178438

  9. Structure of the human myelin/oligodendrocyte glycoprotein gene and multiple alternative spliced isoforms

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Pham-Dinh, D.; Gaspera, D.B.; Dautigny, A.

    1995-09-20

    Myelin/oligodendrocyte glycoprotein (MOG), a special component of the central nervous system localized on the outermost lamellae of mature myelin, is a member of the immunoglobulin superfamily. We report here the organization of the human MOG gene, which spans approximately 17 kb, and the characterization of six MOG mRNA splicing variants. The intron/exon structure of the human MOG gene confirmed the splicing pattern, supporting the hypothesis that mRNA isoforms could arise by alternative splicing of a single gene. In addition to the eight exons coding for the major MOG isoform, the human MOG gene also contains, in the 3' region, a previously unknown alternatively spliced coding exon, VIA. Alternative utilization of two acceptor splicing sites for exon VIII could produce two different C-termini. The nucleotide sequences presented here may be a useful tool to study further the possible involvement of the MOG gene in hereditary neurological disorders. 23 refs., 5 figs.

  10. Quantitative imaging of single mRNA splice variants in living cells

    NASA Astrophysics Data System (ADS)

    Lee, Kyuwan; Cui, Yi; Lee, Luke P.; Irudayaraj, Joseph

    2014-06-01

    Alternative messenger RNA (mRNA) splicing is a fundamental process of gene regulation, and errors in RNA splicing are known to be associated with a variety of different diseases. However, there is currently a lack of quantitative technologies for monitoring mRNA splice variants in cells. Here, we show that a combination of plasmonic dimer probes and hyperspectral imaging can be used to detect and quantify mRNA splice variants in living cells. The probes are made from gold nanoparticles functionalized with oligonucleotides and can hybridize to specific mRNA sequences, forming nanoparticle dimers that exhibit distinct spectral shifts due to plasmonic coupling. With this approach, we show that the spatial and temporal distribution of three selected splice variants of the breast cancer susceptibility gene, BRCA1, can be monitored at single-copy resolution by measuring the hybridization dynamics of the nanoplasmonic dimers. Our study provides insights into RNA and its transport in living cells, which could improve our understanding of cellular protein complexes, pharmacogenomics, genetic diagnosis and gene therapies.

  11. Global impact of RNA splicing on transcriptome remodeling in the heart.

    PubMed

    Gao, Chen; Wang, Yibin

    2012-08-01

    In the eukaryotic transcriptome, both the numbers of genes and different RNA species produced by each gene contribute to the overall complexity. These RNA species are generated by the utilization of different transcriptional initiation or termination sites, or more commonly, from different messenger RNA (mRNA) splicing events. Among the 30,000+ genes in human genome, it is estimated that more than 95% of them can generate more than one gene product via alternative RNA splicing. The protein products generated from different RNA splicing variants can have different intracellular localization, activity, or tissue-distribution. Therefore, alternative RNA splicing is an important molecular process that contributes to the overall complexity of the genome and the functional specificity and diversity among different cell types. In this review, we will discuss current efforts to unravel the full complexity of the cardiac transcriptome using a deep-sequencing approach, and highlight the potential of this technology to uncover the global impact of RNA splicing on the transcriptome during development and diseases of the heart.

  12. Alternative splicing of the tyrosinase gene transcript in normal human melanocytes and lymphocytes.

    PubMed

    Fryer, J P; Oetting, W S; Brott, M J; King, R A

    2001-11-01

    We have identified and isolated ectopically expressed tyrosinase transcripts in normal human melanocytes and lymphocytes and in a human melanoma (MNT-1) cell line to establish a baseline for the expression pattern of this gene in normal tissue. Tyrosinase mRNA from human lymphoblastoid cell lines was reverse transcribed and amplified using specific "nested" primers. This amplification yielded eight identifiable transcripts; five that resulted from alternative splicing patterns arising from the utilization of normal and alternative splice sequences. Identical splicing patterns were found in transcripts from human primary melanocytes in culture and a melanoma cell line, indicating that lymphoblastoid cell lines provide an accurate reflection of transcript processing in melanocytes. Similar splicing patterns have also been found with murine melanocyte tyrosinase transcripts. Our results demonstrate that alternative splicing of human tyrosinase gene transcript produces a number of predictable and identifiable transcripts, and that human lymphoblastoid cell lines provide a source of ectopically expressed transcripts that can be used to study the biology of tyrosinase gene expression in humans.

  13. Rare splicing defects of FAS underly severe recessive autoimmune lymphoproliferative syndrome.

    PubMed

    Agrebi, N; Ben-Mustapha, I; Matoussi, N; Dhouib, N; Ben-Ali, M; Mekki, N; Ben-Ahmed, M; Larguèche, B; Ben Becher, S; Béjaoui, M; Barbouche, M R

    2017-10-01

    Autoimmune lymphoproliferative syndrome (ALPS) is a prototypic disorder of impaired apoptosis characterized by autoimmune features and lymphoproliferation. Heterozygous germline or somatic FAS mutations associated with preserved protein expression have been described. Very rare cases of homozygous germline FAS mutations causing severe autosomal recessive form of ALPS with a complete defect of Fas expression have been reported. We report two unrelated patients from highly inbred North African population showing a severe ALPS phenotype and an undetectable Fas surface expression. Two novel homozygous mutations have been identified underlying rare splicing defects mechanisms. The first mutation breaks a branch point sequence and the second alters a regulatory exonic splicing site. These splicing defects induce the skipping of exon 6 encoding the transmembrane domain of CD95. Our findings highlight the requirement of tight regulation of FAS exon 6 splicing for balanced alternative splicing and illustrate the importance of such studies in highly consanguineous populations. Copyright © 2017 Elsevier Inc. All rights reserved.

  14. A genome-wide aberrant RNA splicing in patients with acute myeloid leukemia identifies novel potential disease markers and therapeutic targets.

    PubMed

    Adamia, Sophia; Haibe-Kains, Benjamin; Pilarski, Patrick M; Bar-Natan, Michal; Pevzner, Samuel; Avet-Loiseau, Herve; Lode, Laurence; Verselis, Sigitas; Fox, Edward A; Burke, John; Galinsky, Ilene; Dagogo-Jack, Ibiayi; Wadleigh, Martha; Steensma, David P; Motyckova, Gabriela; Deangelo, Daniel J; Quackenbush, John; Stone, Richard; Griffin, James D

    2014-03-01

    Despite new treatments, acute myeloid leukemia (AML) remains an incurable disease. More effective drug design requires an expanded view of the molecular complexity that underlies AML. Alternative splicing of RNA is used by normal cells to generate protein diversity. Growing evidence indicates that aberrant splicing of genes plays a key role in cancer. We investigated genome-wide splicing abnormalities in AML and, based on these abnormalities, aimed to identify novel potential biomarkers and therapeutic targets. We used genome-wide alternative splicing screening to investigate alternative splicing abnormalities in two independent AML patient cohorts [Dana-Farber Cancer Institute (DFCI) (Boston, MA) and University Hospital de Nantes (UHN) (Nantes, France)] and normal donors. Selected splicing events were confirmed through cloning and sequencing analysis, and then validated in 193 patients with AML. Our results show that approximately 29% of expressed genes genome-wide were differentially and recurrently spliced in patients with AML compared with normal donor bone marrow CD34(+) cells. Results were reproducible in two independent AML cohorts. In both cohorts, annotation analyses indicated similar proportions of differentially spliced genes encoding several oncogenes, tumor suppressor proteins, splicing factors, and heterogeneous-nuclear-ribonucleoproteins, proteins involved in apoptosis, cell proliferation, and spliceosome assembly. Our findings are consistent with reports for other malignancies and indicate that AML-specific aberrations in splicing mechanisms are a hallmark of AML pathogenesis. Overall, our results suggest that aberrant splicing is a common characteristic of AML. Our findings also suggest that splice variant transcripts that are the result of splicing aberrations create novel disease markers and provide potential targets for small molecules or antibody therapeutics for this disease. ©2013 AACR

  15. Evolutionary conservation analysis increases the colocalization of predicted exonic splicing enhancers in the BRCA1 gene with missense sequence changes and in-frame deletions, but not polymorphisms

    PubMed Central

    Pettigrew, Christopher; Wayte, Nicola; Lovelock, Paul K; Tavtigian, Sean V; Chenevix-Trench, Georgia; Spurdle, Amanda B; Brown, Melissa A

    2005-01-01

    Introduction Aberrant pre-mRNA splicing can be more detrimental to the function of a gene than changes in the length or nature of the encoded amino acid sequence. Although predicting the effects of changes in consensus 5' and 3' splice sites near intron:exon boundaries is relatively straightforward, predicting the possible effects of changes in exonic splicing enhancers (ESEs) remains a challenge. Methods As an initial step toward determining which ESEs predicted by the web-based tool ESEfinder in the breast cancer susceptibility gene BRCA1 are likely to be functional, we have determined their evolutionary conservation and compared their location with known BRCA1 sequence variants. Results Using the default settings of ESEfinder, we initially detected 669 potential ESEs in the coding region of the BRCA1 gene. Increasing the threshold score reduced the total number to 464, while taking into consideration the proximity to splice donor and acceptor sites reduced the number to 211. Approximately 11% of these ESEs (23/211) either are identical at the nucleotide level in human, primates, mouse, cow, dog and opossum Brca1 (conserved) or are detectable by ESEfinder in the same position in the Brca1 sequence (shared). The frequency of conserved and shared predicted ESEs between human and mouse is higher in BRCA1 exons (2.8 per 100 nucleotides) than in introns (0.6 per 100 nucleotides). Of conserved or shared putative ESEs, 61% (14/23) were predicted to be affected by sequence variants reported in the Breast Cancer Information Core database. Applying the filters described above increased the colocalization of predicted ESEs with missense changes, in-frame deletions and unclassified variants predicted to be deleterious to protein function, whereas they decreased the colocalization with known polymorphisms or unclassified variants predicted to be neutral. 
    Conclusion In this report we show that evolutionary conservation analysis may be used to improve the specificity of an ESE prediction tool. This is the first report on the prediction of the frequency and distribution of ESEs in the BRCA1 gene, and it is the first reported attempt to predict which ESEs are most likely to be functional and therefore which sequence variants in ESEs are most likely to be pathogenic. PMID:16280041
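
    The filtering counts quoted in this abstract can be checked with a few lines of arithmetic. The sketch below uses only the numbers stated in the abstract; the variable and function names are hypothetical and chosen for illustration.

```python
# Counts quoted in the abstract, in filtering order.
ese_counts = {
    "default settings": 669,
    "raised threshold": 464,
    "splice-site proximity": 211,
}
conserved_or_shared = 23   # of the 211 proximity-filtered ESEs
affected_by_variants = 14  # of the 23 conserved or shared ESEs


def percent(part: int, whole: int) -> float:
    """Percentage of `whole` represented by `part`."""
    return 100.0 * part / whole


# Fraction of ESEs that are conserved or shared (abstract: approximately 11%).
conserved_pct = percent(conserved_or_shared, ese_counts["splice-site proximity"])
# Fraction of those predicted to be affected by reported variants (abstract: 61%).
affected_pct = percent(affected_by_variants, conserved_or_shared)

print(round(conserved_pct), round(affected_pct))
```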

  16. Comprehensive Characterization of Swine Cardiac Troponin T Proteoforms by Top-Down Mass Spectrometry

    NASA Astrophysics Data System (ADS)

    Lin, Ziqing; Guo, Fang; Gregorich, Zachery R.; Sun, Ruixiang; Zhang, Han; Hu, Yang; Shanmuganayagam, Dhanansayan; Ge, Ying

    2018-04-01

    Cardiac troponin T (cTnT) regulates the Ca2+-mediated interaction between myosin thick filaments and actin thin filaments during cardiac contraction and relaxation. cTnT is released into the blood following injury, and increased serum levels of the protein are used clinically as a biomarker for myocardial infarction. Moreover, mutations in cTnT are causative in a number of familial cardiomyopathies. With the increasing use of large animal (swine) models to recapitulate human diseases, it is essential to characterize species-dependent protein sequence variants, alternative RNA splicing, and post-translational modifications (PTMs), but challenges remain due to the incomplete database and lack of validation of the predicted splicing isoforms. Herein, we integrated top-down mass spectrometry (MS) with online liquid chromatography (LC) and immunoaffinity purification to comprehensively characterize miniature swine cTnT proteoforms, including those arising from alternative RNA splicing and PTMs. A total of seven alternative splicing isoforms of cTnT were identified by LC/MS from swine left ventricular tissue, with each isoform containing un-phosphorylated and mono-phosphorylated proteoforms. The phosphorylation site was localized to Ser1 for the mono-phosphorylated proteoforms of cTnT1, 3, 4, and 6 by online MS/MS combining collisionally activated dissociation (CAD) and electron transfer dissociation (ETD). Offline MS/MS on a Fourier-transform ion cyclotron resonance (FT-ICR) mass spectrometer with CAD and electron capture dissociation (ECD) was then utilized to achieve deep sequencing of mono-phosphorylated cTnT1 (35.2 kDa) with a high sequence coverage of 87%. Taken together, this study demonstrated the unique advantage of top-down MS in the comprehensive characterization of protein alternative splicing isoforms together with PTMs.

  17. Exome Sequencing Identified a Splice Site Mutation in FHL1 that Causes Uruguay Syndrome, an X-Linked Disorder With Skeletal Muscle Hypertrophy and Premature Cardiac Death.

    PubMed

    Xue, Yuan; Schoser, Benedikt; Rao, Aliz R; Quadrelli, Roberto; Vaglio, Alicia; Rupp, Verena; Beichler, Christine; Nelson, Stanley F; Schapacher-Tilp, Gudrun; Windpassinger, Christian; Wilcox, William R

    2016-04-01

    Previously, we reported a rare X-linked disorder, Uruguay syndrome, in a single family. The main features are pugilistic facies, skeletal deformities, and muscular hypertrophy despite a lack of exercise and cardiac ventricular hypertrophy leading to premature death. An ≈19 Mb critical region on the X chromosome was identified through identity-by-descent analysis of 3 affected males. Exome sequencing was conducted on one affected male to identify the disease-causing gene and variant. A splice site variant (c.502-2A>G) in the FHL1 gene was highly suspicious among other candidate genes and variants. FHL1A is the predominant isoform of FHL1 in cardiac and skeletal muscle. Sequencing cDNA showed the splice site variant led to skipping of exon 6 of the FHL1A isoform, equivalent to the FHL1C isoform. Targeted analysis showed that this splice site variant cosegregated with disease in the family. Western blot and immunohistochemical analysis of muscle from the proband showed a significant decrease in protein expression of FHL1A. Real-time polymerase chain reaction analysis of different isoforms of FHL1 demonstrated that FHL1C is markedly increased. Mutations in the FHL1 gene have been reported in disorders with skeletal and cardiac myopathy but none has the skeletal or facial phenotype seen in patients with Uruguay syndrome. Our data suggest that a novel FHL1 splice site variant results in the absence of FHL1A and the abundance of FHL1C, which may contribute to the complex and severe phenotype. Mutation screening of the FHL1 gene should be considered for patients with uncharacterized myopathies and cardiomyopathies. © 2016 American Heart Association, Inc.

  18. Potentially pathogenic germline CHEK2 c.319+2T>A among multiple early-onset cancer families.

    PubMed

    Dominguez-Valentin, Mev; Nakken, Sigve; Tubeuf, Hélène; Vodak, Daniel; Ekstrøm, Per Olaf; Nissen, Anke M; Morak, Monika; Holinski-Feder, Elke; Martins, Alexandra; Møller, Pål; Hovig, Eivind

    2018-01-01

    To study the potential contribution of genes other than BRCA1/2, PTEN, and TP53 to the biological and clinical characteristics of multiple early-onset cancers in Norwegian families, including early-onset breast cancer, Cowden-like and Li-Fraumeni-like syndromes (BC, CSL and LFL, respectively). The Hereditary Cancer Biobank from the Norwegian Radium Hospital was used to identify early-onset BC, CSL or LFL for whom no pathogenic variants in BRCA1/2, PTEN, or TP53 had been found in routine diagnostic DNA sequencing. Forty-four cancer susceptibility genes were selected and analyzed by our in-house designed TruSeq amplicon-based assay for targeted sequencing. Protein- and RNA splicing-dedicated in silico analyses were performed for all variants of unknown significance (VUS). Variants predicted as most likely to affect splicing were experimentally analyzed by minigene assay. We identified a CSL individual carrying a variant in CHEK2 (c.319+2T>A, IVS2), here considered as likely pathogenic. Out of the five VUS (BRCA2, CDH1, CHEK2, MAP3K1, NOTCH3) tested in the minigene splicing assay, only NOTCH3 c.1490C>T (p.Ser497Leu) showed a significant effect on RNA splicing, notably by inducing partial skipping of exon 9. Among 13 early-onset BC, CSL and LFL patients, gene panel sequencing identified a potentially pathogenic variant in CHEK2 that affects a canonical RNA splicing signal. Our study provides new information on genetic loci that may affect the risk of developing cancer in these patients and their families, demonstrating that genes presently not routinely tested in molecular diagnostic settings may be important for capturing cancer predisposition in these families.

  19. Free energy landscapes of RNA/RNA complexes: with applications to snRNA complexes in spliceosomes.

    PubMed

    Cao, Song; Chen, Shi-Jie

    2006-03-17

    We develop a statistical mechanical model for RNA/RNA complexes with both intramolecular and intermolecular interactions. As an application of the model, we compute the free energy landscapes, which give the full distribution for all the possible conformations, for U4/U6 and U2/U6 in the major spliceosome and U4atac/U6atac and U12/U6atac in the minor spliceosome. Different snRNA experiments found contrasting structures; our free energy landscape theory shows why these structures emerge and how they compete with each other. For yeast U2/U6, the model predicts that the two distinct experimental structures, the four-helix junction structure and the helix Ib-containing structure, can actually coexist and specifically compete with each other. In addition, the energy landscapes suggest possible mechanisms for the conformational switches in splicing. For instance, our calculation shows that coaxial stacking is essential for stabilizing the four-helix junction in yeast U2/U6. Therefore, inhibition of the coaxial stacking, possibly by protein binding, may activate the conformational switch from the four-helix junction to the helix Ib-containing structure. Moreover, the change of the energy landscape shape gives information about the conformational changes. We find multiple (native-like and misfolded) intermediates formed through base-pairing rearrangements in snRNA complexes. For example, the unfolding of U2/U6 undergoes a transition to a misfolded state which is functional, while in the unfolding of U12/U6atac, the functional helix Ib is found to be the last one to unfold and is thus the most stable structural component. Furthermore, the energy landscape gives the stabilities of all the possible (functional) intermediates and such information is directly related to splicing efficiency.

  20. Genetic heterogeneity in patients with Bartter syndrome type 1

    PubMed Central

    Sun, Mingran; Ning, Jing; Xu, Weihong; Zhang, Han; Zhao, Kaishu; Li, Wenfu; Li, Guiying; Li, Shibo

    2017-01-01

    Bartter syndrome (BS) type 1 is an autosomal recessive kidney disorder caused by loss-of-function mutations in the solute carrier family 12 member 1 (SLC12A1) gene. To date, 72 BS type 1 patients harboring SLC12A1 mutations have been documented. Of these 144 alleles studied, 68 different disease-causing mutations have been detected in 129 alleles, and no mutation was detected in the remaining 15 alleles. The mutation types included missense/nonsense mutations, splicing mutations and small insertions and deletions ranging from 1 to 4 nucleotides. A large deletion encompassing a whole exon in the SLC12A1 gene has not yet been reported. The current study initially identified an undocumented homozygous frameshift mutation (c.1833delT) by Sanger sequencing analysis of a single infant with BS type 1. However, in a subsequent analysis, the mutation was detected only in the father's DNA. Upon further investigation using a next-generation sequencing approach, a deletion in exons 14 and 15 in both the patient and patient's mother was detected. The deletion was subsequently confirmed by use of a long-range polymerase chain reaction and was determined to be 3.16 kb in size based on sequencing of the junction fragment. The results of the present study demonstrated that pathogenic variants of SLC12A1 are heterogeneous. Large deletions appear to serve an etiological role in BS type 1, and may be more prevalent than previously thought. PMID:28000888

  1. Genetic heterogeneity in patients with Bartter syndrome type 1.

    PubMed

    Sun, Mingran; Ning, Jing; Xu, Weihong; Zhang, Han; Zhao, Kaishu; Li, Wenfu; Li, Guiying; Li, Shibo

    2017-02-01

    Bartter syndrome (BS) type 1 is an autosomal recessive kidney disorder caused by loss‑of‑function mutations in the solute carrier family 12 member 1 (SLC12A1) gene. To date, 72 BS type 1 patients harboring SLC12A1 mutations have been documented. Of these 144 alleles studied, 68 different disease‑causing mutations have been detected in 129 alleles, and no mutation was detected in the remaining 15 alleles. The mutation types included missense/nonsense mutations, splicing mutations and small insertions and deletions ranging from 1 to 4 nucleotides. A large deletion encompassing a whole exon in the SLC12A1 gene has not yet been reported. The current study initially identified an undocumented homozygous frameshift mutation (c.1833delT) by Sanger sequencing analysis of a single infant with BS type 1. However, in a subsequent analysis, the mutation was detected only in the father's DNA. Upon further investigation using a next‑generation sequencing approach, a deletion in exons 14 and 15 in both the patient and patient's mother was detected. The deletion was subsequently confirmed by use of a long‑range polymerase chain reaction and was determined to be 3.16 kb in size based on sequencing of the junction fragment. The results of the present study demonstrated that pathogenic variants of SLC12A1 are heterogeneous. Large deletions appear to serve an etiological role in BS type 1, and may be more prevalent than previously thought.

  2. WES homozygosity mapping in a recessive form of Charcot-Marie-Tooth neuropathy reveals intronic GDAP1 variant leading to a premature stop codon.

    PubMed

    Masingue, Marion; Perrot, Jimmy; Carlier, Robert-Yves; Piguet-Lacroix, Guenaelle; Latour, Philippe; Stojkovic, Tanya

    2018-05-01

    Charcot-Marie-Tooth disease (CMT) refers to a group of clinically and genetically heterogeneous inherited neuropathies. Ganglioside-induced differentiation-associated protein 1 (GDAP1)-related CMT has been reported in an autosomal dominant or recessive form in patients presenting either axonal or demyelinating neuropathy. We report two Sri Lankan sisters born to consanguineous parents and presenting with a severe axonal sensorimotor neuropathy. The early onset of the disease, the distal and proximal weakness and atrophy leading to major disability, along with areflexia, and, most notably, vocal cord and diaphragm paralysis were highly evocative of a GDAP1-related CMT. However, sequencing of the coding regions of the gene was normal. Whole-exome sequencing (WES) was performed and revealed that the largest region of homozygosity was around GDAP1 with several variants, mostly in non-coding regions. In view of the high clinical suspicion of GDAP1 gene involvement, we examined the variants in this gene and this, along with functional studies, allowed us to identify an alternative splicing site revealing a cryptic in-frame stop codon in intron 4 responsible for a severe loss of wild-type GDAP1. This work is the first to describe a deleterious mutation in the GDAP1 gene outside of coding sequences or intronic junctions and emphasizes the importance of interpreting molecular analysis, and in particular WES results, in light of the clinical and electrophysiological phenotype.

  3. Developmental expression of a regulatory gene is programmed at the level of splicing.

    PubMed Central

    Chou, T B; Zachar, Z; Bingham, P M

    1987-01-01

    We report sequence and transcript structures for a 6191-base chromosomal segment containing the presumptive regulatory gene from Drosophila, suppressor-of-white-apricot [su(wa)]. Our results indicate that su(wa) expression is controlled by regulating occurrence of specific splices. Seven introns are removed from the su(wa) primary transcript during precellular blastoderm development. The sequence of this mature RNA indicates that it is a conventional messenger RNA. In contrast, after cellular blastoderm the first two of these introns cease to be efficiently removed. The mature RNAs resulting from this failure to remove the first two introns have structures quite unexpected of mRNAs. We propose that postcellular blastoderm su(wa) expression is repressed by preventing splices necessary to produce a functional mRNA. Implications and mechanisms are discussed. PMID:2832151

  4. Pseudoexon activation increases phenotype severity in a Becker muscular dystrophy patient.

    PubMed

    Greer, Kane; Mizzi, Kayla; Rice, Emily; Kuster, Lukas; Barrero, Roberto A; Bellgard, Matthew I; Lynch, Bryan J; Foley, Aileen Reghan; O Rathallaigh, Eoin; Wilton, Steve D; Fletcher, Sue

    2015-07-01

    We report a dystrophinopathy patient with an in-frame deletion of DMD exons 45-47, and therefore a genetic diagnosis of Becker muscular dystrophy, who presented with a more severe than expected phenotype. Analysis of the patient DMD mRNA revealed an 82 bp pseudoexon, derived from intron 44, that disrupts the reading frame and is expected to yield a nonfunctional dystrophin. Since the sequence of the pseudoexon and canonical splice sites does not differ from the reference sequence, we concluded that the genomic rearrangement promoted recognition of the pseudoexon, causing a severe dystrophic phenotype. We characterized the deletion breakpoints and identified motifs that might influence selection of the pseudoexon. We concluded that the donor splice site was strengthened by juxtaposition of intron 47, and loss of intron 44 silencer elements, normally located downstream of the pseudoexon donor splice site, further enhanced pseudoexon selection and inclusion in the DMD transcript in this patient.
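    The reading-frame argument in the abstract above can be checked with simple arithmetic: an inserted or deleted segment preserves the downstream frame only if its length is a multiple of 3. The following is a minimal sketch of that check, not the authors' analysis; the function names are hypothetical, and only the 82 bp pseudoexon length comes from the abstract.

```python
def frame_offset(length_bp: int) -> int:
    """Return the shift (0, 1 or 2) that a segment of this length
    imposes on the downstream reading frame."""
    return length_bp % 3


def preserves_reading_frame(length_bp: int) -> bool:
    """A segment is frame-neutral only if its length is a multiple of 3."""
    return frame_offset(length_bp) == 0


# The 82 bp pseudoexon reported in the abstract (82 = 27 * 3 + 1).
pseudoexon_bp = 82
pseudoexon_in_frame = preserves_reading_frame(pseudoexon_bp)
pseudoexon_offset = frame_offset(pseudoexon_bp)
print(pseudoexon_in_frame, pseudoexon_offset)
```

    Under this check the 82 bp pseudoexon shifts the frame by one nucleotide, which is consistent with the abstract's statement that its inclusion disrupts the reading frame of an otherwise in-frame deletion.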

  5. MYCN controls an alternative RNA splicing program in high-risk metastatic neuroblastoma.

    PubMed

    Zhang, Shile; Wei, Jun S; Li, Samuel Q; Badgett, Tom C; Song, Young K; Agarwal, Saurabh; Coarfa, Cristian; Tolman, Catherine; Hurd, Laura; Liao, Hongling; He, Jianbin; Wen, Xinyu; Liu, Zhihui; Thiele, Carol J; Westermann, Frank; Asgharzadeh, Shahab; Seeger, Robert C; Maris, John M; Guidry Auvil, Jamie M; Smith, Malcolm A; Kolaczyk, Eric D; Shohet, Jason; Khan, Javed

    2016-02-28

    The molecular mechanisms underlying the aggressive behavior of MYCN-driven neuroblastoma (NBL) are under intense investigation; however, little is known about the impact of this family of transcription factors on the splicing program. Here we used high-throughput RNA sequencing to systematically study the expression of RNA isoforms in stage 4 MYCN-amplified NBL, an aggressive subtype of metastatic NBL. We show that MYCN-amplified NBL tumors display a distinct gene splicing pattern affecting multiple cancer hallmark functions. Six splicing factors displayed unique differential expression patterns in MYCN-amplified tumors and cell lines, and the binding motifs for some of these splicing factors are significantly enriched in differentially spliced genes. Direct binding of MYCN to promoter regions of the splicing factors PTBP1 and HNRNPA1 detected by ChIP-seq demonstrates that MYCN controls the splicing pattern by direct regulation of the expression of these key splicing factors. Furthermore, high expression of PTBP1 and HNRNPA1 was significantly associated with poor overall survival of stage 4 NBL patients (p ≤ 0.05). Knocking down PTBP1, HNRNPA1 and their downstream target PKM2, a pro-tumor-growth isoform, results in repressed growth of NBL cells. Therefore, our study reveals a novel role of MYCN in controlling the global splicing program through regulation of splicing factors in addition to its well-known role in the transcription program. These findings suggest therapeutic potential in targeting the key splicing factors or gene isoforms in high-risk NBL with MYCN amplification. Published by Elsevier Ireland Ltd.

  6. Novel BRCA1 splice-site mutation in ovarian cancer patients of Slavic origin.

    PubMed

    Krivokuca, Ana; Dragos, Vita Setrajcic; Stamatovic, Ljiljana; Blatnik, Ana; Boljevic, Ivana; Stegel, Vida; Rakobradovic, Jelena; Skerl, Petra; Jovandic, Stevo; Krajc, Mateja; Magic, Mirjana Brankovic; Novakovic, Srdjan

    2018-04-01

    Mutations in breast cancer susceptibility gene 1 (BRCA1) lead to defects in a number of cellular pathways including DNA damage repair and transcriptional regulation, resulting in elevated genome instability and predisposing to breast and ovarian cancers. We report a novel mutation LRG_292t1:c.4356delA,p.(Ala1453Glnfs*3) in the 12th exon of BRCA1, in the splice site region near the donor site of intron 12. It is a frameshift mutation with the termination codon generated on the third amino acid position from the site of deletion. Human Splice Finder 3.0 and MutationTaster have assessed this variation as disease causing, based on the alteration of splicing, creation of a premature stop codon and other potential alterations initiated by the nucleotide deletion. Among the most important alterations are frameshift and splice site changes (score of the newly created donor splice site: 0.82). c.4356delA was associated with two ovarian cancer cases in two families of Slavic origin. It was detected by next generation sequencing, and confirmed with Sanger sequencing in both cases. Because it changes the reading frame of the protein, the novel mutation c.4356delA p.(Ala1453Glnfs*3) in the BRCA1 gene might be of clinical significance for hereditary ovarian cancer. Further functional as well as segregation analyses within the families are necessary for appropriate clinical classification of this variant. Since it has been detected in two ovarian cancer patients of Slavic origin, it is worth investigating a possible founder effect of this mutation in Slavic populations.

  7. Leveraging transcript quantification for fast computation of alternative splicing profiles.

    PubMed

    Alamancos, Gael P; Pagès, Amadís; Trincado, Juan L; Bellora, Nicolás; Eyras, Eduardo

    2015-09-01

    Alternative splicing plays an essential role in many cellular processes and bears major relevance in the understanding of multiple diseases, including cancer. High-throughput RNA sequencing allows genome-wide analyses of splicing across multiple conditions. However, the increasing number of available data sets represents a major challenge in terms of computation time and storage requirements. We describe SUPPA, a computational tool to calculate relative inclusion values of alternative splicing events, exploiting fast transcript quantification. SUPPA accuracy is comparable and sometimes superior to standard methods using simulated as well as real RNA-sequencing data compared with experimentally validated events. We assess the variability in terms of the choice of annotation and provide evidence that using complete transcripts rather than more transcripts per gene provides better estimates. Moreover, SUPPA coupled with de novo transcript reconstruction methods does not achieve accuracies as high as using quantification of known transcripts, but remains comparable to existing methods. Finally, we show that SUPPA is more than 1000 times faster than standard methods. Coupled with fast transcript quantification, SUPPA provides inclusion values at a much higher speed than existing methods without compromising accuracy, thereby facilitating the systematic splicing analysis of large data sets with limited computational resources. The software is implemented in Python 2.7 and is available under the MIT license at https://bitbucket.org/regulatorygenomicsupf/suppa. © 2015 Alamancos et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
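    The abstract above describes computing relative inclusion values of splicing events from transcript quantification. One common way to express such a value is as the summed abundance of the transcripts that include the event divided by the summed abundance of all transcripts relevant to the event. The sketch below illustrates that ratio only under this assumption; it is not the SUPPA code or its API, and the function name, transcript identifiers and TPM values are all hypothetical.

```python
import math
from typing import Dict, Iterable


def inclusion_value(
    tpm: Dict[str, float],
    inclusion_ids: Iterable[str],
    event_ids: Iterable[str],
) -> float:
    """Relative inclusion of an event: abundance of transcripts that
    include the event divided by abundance of all transcripts that
    define the event. Returns NaN when the event is not expressed."""
    included = sum(tpm.get(t, 0.0) for t in inclusion_ids)
    total = sum(tpm.get(t, 0.0) for t in event_ids)
    if total == 0:
        return math.nan
    return included / total


# Hypothetical transcript abundances (TPM) for one exon-skipping event.
tpm = {"tx_incl_1": 30.0, "tx_incl_2": 10.0, "tx_skip_1": 60.0}
value = inclusion_value(tpm, ["tx_incl_1", "tx_incl_2"], list(tpm))
print(value)
```

    Because the calculation only sums precomputed abundances, its cost is negligible next to read alignment, which is in line with the speed argument made in the abstract.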

  8. An RNA-Sequencing Transcriptome and Splicing Database of Glia, Neurons, and Vascular Cells of the Cerebral Cortex

    PubMed Central

    Chen, Kenian; Sloan, Steven A.; Bennett, Mariko L.; Scholze, Anja R.; O'Keeffe, Sean; Phatnani, Hemali P.; Guarnieri, Paolo; Caneda, Christine; Ruderisch, Nadine; Deng, Shuyun; Liddelow, Shane A.; Zhang, Chaolin; Daneman, Richard; Maniatis, Tom; Barres, Ben A.

    2014-01-01

    The major cell classes of the brain differ in their developmental processes, metabolism, signaling, and function. To better understand the functions and interactions of the cell types that comprise these classes, we acutely purified representative populations of neurons, astrocytes, oligodendrocyte precursor cells, newly formed oligodendrocytes, myelinating oligodendrocytes, microglia, endothelial cells, and pericytes from mouse cerebral cortex. We generated a transcriptome database for these eight cell types by RNA sequencing and used a sensitive algorithm to detect alternative splicing events in each cell type. Bioinformatic analyses identified thousands of new cell type-enriched genes and splicing isoforms that will provide novel markers for cell identification, tools for genetic manipulation, and insights into the biology of the brain. For example, our data provide clues as to how neurons and astrocytes differ in their ability to dynamically regulate glycolytic flux and lactate generation attributable to unique splicing of PKM2, the gene encoding the glycolytic enzyme pyruvate kinase. This dataset will provide a powerful new resource for understanding the development and function of the brain. To ensure the widespread distribution of these datasets, we have created a user-friendly website (http://web.stanford.edu/group/barres_lab/brain_rnaseq.html) that provides a platform for analyzing and comparing transcription and alternative splicing profiles for various cell classes in the brain. PMID:25186741

  9. [Analysis of USH2A gene mutation in a Chinese family affected with Usher syndrome].

    PubMed

    Li, Pengcheng; Liu, Fei; Zhang, Mingchang; Wang, Qiufen; Liu, Mugen

    2015-08-01

    To investigate the disease-causing mutation in a Chinese family affected with Usher syndrome type II. All 11 members of the family underwent comprehensive ophthalmologic examination and hearing tests, and their genomic DNA was isolated from venous leukocytes. PCR and direct sequencing of the USH2A gene were performed for the proband. Wild-type and mutant minigene vectors containing exon 42, intron 42 and exon 43 of the USH2A gene were constructed and transfected into HeLa cells with Lipofectamine reagent. Reverse transcription (RT)-PCR was carried out to verify the splicing of the minigenes. Pedigree analysis and clinical diagnosis indicated that the patients had autosomal recessive Usher syndrome type II. DNA sequencing detected a homozygous c.8559-2A>G mutation of the USH2A gene in the proband, which co-segregated with the disease in the family. The mutation affects a conserved splice site in intron 42 and inactivates it. The minigene experiment confirmed the retention of intron 42 in mature mRNA. The c.8559-2A>G mutation in the USH2A gene probably underlies the Usher syndrome type II in this family. The splice site mutation results in abnormal splicing of USH2A pre-mRNA.

  10. Capturing novel mouse genes encoding chromosomal and other nuclear proteins.

    PubMed

    Tate, P; Lee, M; Tweedie, S; Skarnes, W C; Bickmore, W A

    1998-09-01

    The burgeoning wealth of gene sequences contrasts with our ignorance of gene function. One route to assigning function is by determining the sub-cellular location of proteins. We describe the identification of mouse genes encoding proteins that are confined to nuclear compartments by splicing endogenous gene sequences to a promoterless betageo reporter, using a gene trap approach. Mouse ES (embryonic stem) cell lines were identified that express betageo fusions located within sub-nuclear compartments, including chromosomes, the nucleolus and foci containing splicing factors. The sequences of 11 trapped genes were ascertained, and characterisation of endogenous protein distribution in two cases confirmed the validity of the approach. Three novel proteins concentrated within distinct chromosomal domains were identified, one of which appears to be a serine/threonine kinase. The sequence of a gene whose product co-localises with spliceosome components suggests that this protein may be an E3 ubiquitin-protein ligase. The majority of the other genes isolated represent novel genes. This approach is shown to be a powerful tool for identifying genes encoding novel proteins with specific sub-nuclear localisations and exposes our ignorance of the protein composition of the nucleus. Motifs in two of the isolated genes suggest new links between cellular regulatory mechanisms (ubiquitination and phosphorylation) and mRNA splicing and chromosome structure/function.

  11. The Nucleotide Sequence and Spliced pol mRNA Levels of the Nonprimate Spumavirus Bovine Foamy Virus

    PubMed Central

    Holzschu, Donald L.; Delaney, Mari A.; Renshaw, Randall W.; Casey, James W.

    1998-01-01

    We have determined the complete nucleotide sequence of a replication-competent clone of bovine foamy virus (BFV) and have quantitated the amount of spliced pol mRNA processed early in infection. The 544-amino-acid Gag protein precursor has little sequence similarity with its primate foamy virus homologs, but the putative nucleocapsid (NC) protein, like the primate NCs, contains the three glycine-arginine-rich regions that are postulated to bind genomic RNA during virion assembly. The BFV gag and pol open reading frames overlap, with pro and pol in the same translational frame. As with the human foamy virus (HFV) and feline foamy virus, we have detected a spliced pol mRNA by PCR. Quantitatively, this mRNA approximates the level of full-length genomic RNA early in infection. The integrase (IN) domain of reverse transcriptase does not contain the canonical HH-CC zinc finger motif present in all characterized retroviral INs, but it does contain a nearby histidine residue that could conceivably participate as a member of the zinc finger. The env gene encodes a protein that is over 40% identical in sequence to the HFV Env. By comparison, the Gag precursor of BFV is predicted to be only 28% identical to the HFV protein. PMID:9499074

  12. An RRM–ZnF RNA recognition module targets RBM10 to exonic sequences to promote exon exclusion

    PubMed Central

    Collins, Katherine M.; Kainov, Yaroslav A.; Christodolou, Evangelos; Ray, Debashish; Morris, Quaid; Hughes, Timothy; Taylor, Ian A.

    2017-01-01

    RBM10 is an RNA-binding protein that plays an essential role in development and is frequently mutated in the context of human disease. RBM10 recognizes a diverse set of RNA motifs in introns and exons and regulates alternative splicing. However, the molecular mechanisms underlying this seemingly relaxed sequence specificity are not understood and functional studies have focused on 3′ intronic sites only. Here, we dissect the RNA code recognized by RBM10 and relate it to the splicing regulatory function of this protein. We show that a two-domain RRM1–ZnF unit recognizes a GGA-centered motif enriched in RBM10 exonic sites with high affinity and specificity and test that the interaction with these exonic sequences promotes exon skipping. Importantly, a second RRM domain (RRM2) of RBM10 recognizes a C-rich sequence, which explains its known interaction with the intronic 3′ site of NUMB exon 9 contributing to regulation of the Notch pathway in cancer. Together, these findings explain RBM10's broad RNA specificity and suggest that RBM10 functions as a splicing regulator using two RNA-binding units with different specificities to promote exon skipping. PMID:28379442

  13. An RRM-ZnF RNA recognition module targets RBM10 to exonic sequences to promote exon exclusion.

    PubMed

    Collins, Katherine M; Kainov, Yaroslav A; Christodolou, Evangelos; Ray, Debashish; Morris, Quaid; Hughes, Timothy; Taylor, Ian A; Makeyev, Eugene V; Ramos, Andres

    2017-06-20

    RBM10 is an RNA-binding protein that plays an essential role in development and is frequently mutated in the context of human disease. RBM10 recognizes a diverse set of RNA motifs in introns and exons and regulates alternative splicing. However, the molecular mechanisms underlying this seemingly relaxed sequence specificity are not understood and functional studies have focused on 3′ intronic sites only. Here, we dissect the RNA code recognized by RBM10 and relate it to the splicing regulatory function of this protein. We show that a two-domain RRM1-ZnF unit recognizes a GGA-centered motif enriched in RBM10 exonic sites with high affinity and specificity and test that the interaction with these exonic sequences promotes exon skipping. Importantly, a second RRM domain (RRM2) of RBM10 recognizes a C-rich sequence, which explains its known interaction with the intronic 3′ site of NUMB exon 9 contributing to regulation of the Notch pathway in cancer. Together, these findings explain RBM10's broad RNA specificity and suggest that RBM10 functions as a splicing regulator using two RNA-binding units with different specificities to promote exon skipping. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  14. The dynamic genome of Hydra

    PubMed Central

    Chapman, Jarrod A.; Kirkness, Ewen F.; Simakov, Oleg; Hampson, Steven E.; Mitros, Therese; Weinmaier, Therese; Rattei, Thomas; Balasubramanian, Prakash G.; Borman, Jon; Busam, Dana; Disbennett, Kathryn; Pfannkoch, Cynthia; Sumin, Nadezhda; Sutton, Granger G.; Viswanathan, Lakshmi Devi; Walenz, Brian; Goodstein, David M.; Hellsten, Uffe; Kawashima, Takeshi; Prochnik, Simon E.; Putnam, Nicholas H.; Shu, Shengquiang; Blumberg, Bruce; Dana, Catherine E.; Gee, Lydia; Kibler, Dennis F.; Law, Lee; Lindgens, Dirk; Martinez, Daniel E.; Peng, Jisong; Wigge, Philip A.; Bertulat, Bianca; Guder, Corina; Nakamura, Yukio; Ozbek, Suat; Watanabe, Hiroshi; Khalturin, Konstantin; Hemmrich, Georg; Franke, André; Augustin, René; Fraune, Sebastian; Hayakawa, Eisuke; Hayakawa, Shiho; Hirose, Mamiko; Hwang, Jung Shan; Ikeo, Kazuho; Nishimiya-Fujisawa, Chiemi; Ogura, Atshushi; Takahashi, Toshio; Steinmetz, Patrick R. H.; Zhang, Xiaoming; Aufschnaiter, Roland; Eder, Marie-Kristin; Gorny, Anne-Kathrin; Salvenmoser, Willi; Heimberg, Alysha M.; Wheeler, Benjamin M.; Peterson, Kevin J.; Böttger, Angelika; Tischler, Patrick; Wolf, Alexander; Gojobori, Takashi; Remington, Karin A.; Strausberg, Robert L.; Venter, J. Craig; Technau, Ulrich; Hobmayer, Bert; Bosch, Thomas C. G.; Holstein, Thomas W.; Fujisawa, Toshitaka; Bode, Hans R.; David, Charles N.; Rokhsar, Daniel S.; Steele, Robert E.

    2015-01-01

    The freshwater cnidarian Hydra was first described in 1702 and has been the object of study for 300 years. Experimental studies of Hydra between 1736 and 1744 culminated in the discovery of asexual reproduction of an animal by budding, the first description of regeneration in an animal, and successful transplantation of tissue between animals. Today, Hydra is an important model for studies of axial patterning, stem cell biology and regeneration. Here we report the genome of Hydra magnipapillata and compare it to the genomes of the anthozoan Nematostella vectensis and other animals. The Hydra genome has been shaped by bursts of transposable element expansion, horizontal gene transfer, trans-splicing, and simplification of gene structure and gene content that parallel simplification of the Hydra life cycle. We also report the sequence of the genome of a novel bacterium stably associated with H. magnipapillata. Comparisons of the Hydra genome to the genomes of other animals shed light on the evolution of epithelia, contractile tissues, developmentally regulated transcription factors, the Spemann–Mangold organizer, pluripotency genes and the neuromuscular junction. PMID:20228792

  15. Identification of novel FBN1 and TGFBR2 mutations in 65 probands with Marfan syndrome or Marfan-like phenotypes.

    PubMed

    Chung, Brian Hon-Yin; Lam, Stephen Tak-Sum; Tong, Tony Ming-For; Li, Susanna Yuk-Han; Lun, Kin-Shing; Chan, Daniel Hon-Chuen; Fok, Susanna Fung-Shan; Or, June Siu-Fong; Smith, David Keith; Yang, Wanling; Lau, Yu-Lung

    2009-07-01

    Marfan syndrome (MFS) is an autosomal dominant connective tissue disorder, and mutations in the FBN1 and TGFBR2 genes have been identified in probands with MFS and related phenotypes. Using DHPLC and sequencing, we studied the mutation spectrum in 65 probands with Marfan syndrome and related phenotypes. A total of 24 mutations in FBN1 were identified, of which 19 (nine missense, six frameshift, two nonsense and two affecting splice junctions) were novel. Of the remaining 41 probands, six were found to have novel TGFBR2 mutations (one frameshift and five missense mutations). All novel mutations found in this study were confirmed to be absent in 50 unrelated normal individuals of the same ethnic background. In probands who fulfilled the Ghent criteria (n = 16), mutations in FBN1 were found in 81% of cases. None of those with TGFBR2 mutations fulfilled the Ghent criteria. Novel missense mutations of unknown significance were classified according to the latest ACMG guidelines and their likelihood to be causative was evaluated.

  16. Characterization of variegate porphyria mutations using a minigene approach.

    PubMed

    Granata, Barbara Xoana; Baralle, Marco; De Conti, Laura; Parera, Victoria; Rossetti, Maria Victoria

    2015-01-01

    Porphyrias are a group of metabolic diseases that affect the skin and/or nervous system. In 2008, three unrelated patients were diagnosed with variegate porphyria at the CIPYP (Centro de Investigaciones sobre Porfirinas y Porfirias). Sequencing of the protoporphyrinogen oxidase gene, the gene altered in this type of porphyria, revealed three previously undescribed mutations: c.338+3insT, c.807G>A, and c.808-1G>C. As these mutations do not affect the protein sequence, we hypothesized that they might be splicing mutations. RT-PCRs performed on the patients' mRNAs showed normal mRNA or no amplification at all. This result indicated that the aberrantly spliced transcript is possibly being degraded. In order to establish whether the mutations were responsible for the patients' disease by causing aberrant splicing, we utilized a minigene approach. We found that the three mutations lead to exon skipping; therefore, the abnormal mRNAs are most likely degraded by a mechanism such as nonsense-mediated decay. In conclusion, these mutations are responsible for the disease because they alter the normal splicing pathway, thus providing a functional explanation for the appearance of disease and highlighting the use of minigene assays to complement transcript analysis.

  17. Involvement of Alternative Splicing in Barley Seed Germination

    PubMed Central

    Zhang, Qisen; Zhang, Xiaoqi; Wang, Songbo; Tan, Cong; Zhou, Gaofeng; Li, Chengdao

    2016-01-01

    Seed germination activates many new biological processes including DNA, membrane and mitochondrial repairs and requires active protein synthesis and sufficient energy supply. Alternative splicing (AS) regulates many cellular processes including cell differentiation and environmental adaptations. However, limited information is available on the regulation of seed germination at post-transcriptional levels. We have conducted RNA-sequencing experiments to dissect AS events in barley seed germination. We identified between 552 and 669 common AS transcripts in germinating barley embryos from four barley varieties (Hordeum vulgare L. Bass, Baudin, Harrington and Stirling). Alternative 3’ splicing (34%-45%), intron retention (32%-34%) and alternative 5’ splicing (16%-21%) were three major AS events in germinating embryos. The AS transcripts were predominantly mapped onto ribosome, RNA transport machineries, spliceosome, plant hormone signal transduction, glycolysis, sugar and carbon metabolism pathways. Transcripts of these genes were also very abundant in the early stage of seed germination. Correlation analysis of gene expression showed that AS hormone responsive transcripts could also be co-expressed with genes responsible for protein biosynthesis and sugar metabolisms. Our RNA-sequencing data revealed that AS could play important roles in barley seed germination. PMID:27031341

  18. Organellar maturases: A window into the evolution of the spliceosome.

    PubMed

    Schmitz-Linneweber, Christian; Lampe, Marie-Kristin; Sultan, Laure D; Ostersetzer-Biran, Oren

    2015-09-01

    During the evolution of eukaryotic genomes, many genes have been interrupted by intervening sequences (introns) that must be removed post-transcriptionally from RNA precursors to form mRNAs ready for translation. The origin of nuclear introns is still under debate, but one hypothesis is that the spliceosome and the intron-exon structure of genes have evolved from bacterial-type group II introns that invaded the eukaryotic genomes. The group II introns were most likely introduced into the eukaryotic genome from an α-proteobacterial predecessor of mitochondria early during the endosymbiosis event. These self-splicing and mobile introns spread through the eukaryotic genome and later degenerated. Pieces of introns became part of the general splicing machinery we know today as the spliceosome. In addition, group II introns likely brought intron maturases with them to the nucleus. Maturases are found in most bacterial introns, where they act as highly specific splicing factors for group II introns. In the spliceosome, the core protein Prp8 shows homology to group II intron-encoded maturases. While maturases are entirely intron specific, their descendant of the spliceosomal machinery, the Prp8 protein, is an extremely versatile splicing factor with multiple interacting proteins and RNAs. How could such a general player in spliceosomal splicing evolve from the monospecific bacterial maturases? Analysis of the organellar splicing machinery in plants may give clues on the evolution of nuclear splicing. Plants encode various proteins which are closely related to bacterial maturases. The organellar genomes contain one maturase each, named MatK in chloroplasts and MatR in mitochondria. In addition, several maturase genes have been found in the nucleus as well, which are acting on mitochondrial pre-RNAs. All plant maturases show sequence deviation from their progenitor bacterial maturases, and interestingly are all acting on multiple organellar group II intron targets. 
    Moreover, they seem to function in the splicing of group II introns together with a number of additional nuclear-encoded splicing factors, possibly acting as an organellar proto-spliceosome. Together, this makes them interesting models for the early evolution of nuclear spliceosomal splicing. In this review, we summarize recent advances in our understanding of the role of plant maturases and their accessory factors in plants. This article is part of a Special Issue entitled: Chloroplast Biogenesis. Copyright © 2015 Elsevier B.V. All rights reserved.

  19. RBFOX and PTBP1 proteins regulate the alternative splicing of micro-exons in human brain transcripts.

    PubMed

    Li, Yang I; Sanchez-Pulido, Luis; Haerty, Wilfried; Ponting, Chris P

    2015-01-01

    Ninety-four percent of mammalian protein-coding exons exceed 51 nucleotides (nt) in length. The paucity of micro-exons (≤ 51 nt) suggests that their recognition and correct processing by the splicing machinery present greater challenges than for longer exons. Yet, because thousands of human genes harbor processed micro-exons, specialized mechanisms may be in place to promote their splicing. Here, we survey deep genomic data sets to define 13,085 micro-exons and to study their splicing mechanisms and molecular functions. More than 60% of annotated human micro-exons exhibit a high level of sequence conservation, an indicator of functionality. While most human micro-exons require splicing-enhancing genomic features to be processed, the splicing of hundreds of micro-exons is enhanced by the adjacent binding of splice factors in the introns of pre-messenger RNAs. Notably, splicing of a significant number of micro-exons was found to be facilitated by the binding of RBFOX proteins, which promote their inclusion in the brain, muscle, and heart. Our analyses suggest that accurate regulation of micro-exon inclusion by RBFOX proteins and PTBP1 plays an important role in the maintenance of tissue-specific protein-protein interactions. © 2015 Li et al.; Published by Cold Spring Harbor Laboratory Press.
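
    The length criterion in the abstract above (micro-exons are exons of at most 51 nt) can be illustrated with a minimal sketch. The exon names and coordinates below are hypothetical, and the 1-based inclusive coordinate convention is an assumption; the abstract does not describe how the authors implemented their survey.

```python
# Threshold taken from the abstract: micro-exons are <= 51 nt long.
MICRO_EXON_MAX_NT = 51


def exon_length(start: int, end: int) -> int:
    """Length of an exon given 1-based, inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must be >= start")
    return end - start + 1


def is_micro_exon(start: int, end: int, max_nt: int = MICRO_EXON_MAX_NT) -> bool:
    """True when the exon is no longer than the micro-exon threshold."""
    return exon_length(start, end) <= max_nt


# Hypothetical exons for illustration only: (name, start, end).
exons = [
    ("exon_a", 1000, 1026),  # 27 nt
    ("exon_b", 2000, 2050),  # 51 nt, exactly at the threshold
    ("exon_c", 3000, 3051),  # 52 nt
]

micro_exons = [name for name, start, end in exons if is_micro_exon(start, end)]
```

    With half-open coordinates (as in BED files) the length would be end minus start instead, so the convention should be checked before applying the threshold.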

  20. Diversification of the muscle proteome through alternative splicing.

    PubMed

    Nakka, Kiran; Ghigna, Claudia; Gabellini, Davide; Dilworth, F Jeffrey

    2018-03-06

    Skeletal muscles express a highly specialized proteome that allows the metabolism of energy sources to mediate myofiber contraction. This muscle-specific proteome is partially derived through the muscle-specific transcription of a subset of genes. Surprisingly, RNA sequencing technologies have also revealed a significant role for muscle-specific alternative splicing in generating protein isoforms that give specialized function to the muscle proteome. In this review, we discuss the current knowledge with respect to the mechanisms that allow pre-mRNA transcripts to undergo muscle-specific alternative splicing while identifying some of the key trans-acting splicing factors essential to the process. The importance of specific splicing events to specialized muscle function is presented along with examples in which dysregulated splicing contributes to myopathies. Though there is now an appreciation that alternative splicing is a major contributor to proteome diversification, the emergence of improved "targeted" proteomic methodologies for detection of specific protein isoforms will soon allow us to better appreciate the extent to which alternative splicing modifies the activity of proteins (and their ability to interact with other proteins) in the skeletal muscle. In addition, we highlight a continued need to better explore the signaling pathways that contribute to the temporal control of trans-acting splicing factor activity to ensure specific protein isoforms are expressed in the proper cellular context. An understanding of the signal-dependent and signal-independent events driving muscle-specific alternative splicing has the potential to provide us with novel therapeutic strategies to treat different myopathies.

  1. [Molecular structure and alternative splicing analysis of heat shock factors of Schistosoma japonicum].

    PubMed

    Yu, Xie; Hai-Yan, Liao; Shu-Jie, Chen; Ling-Yu, Shi; Li-Yan, Ou; Ping-Ying, Teng; Dan, Xia; Qi-Wei, Chen; Sinan, Zheng; Xiao-Hong, Zhou

    2016-07-12

    The aim was to clone and identify the heat shock factors (HSFs) of Schistosoma japonicum and to analyze their molecular structure and alternative splicing pattern. New Zealand rabbits were infected with cercariae of Schistosoma japonicum and were killed and dissected 42 days post-infection, and the adult worms of S. japonicum and the livers of the rabbits were harvested. Total RNA was then extracted using Trizol reagent. The Sj-hsf open reading frame (ORF) and the alternative splicing fragments were amplified by RT-PCR from the female, male and egg samples, then cloned and verified by enzyme digestion and sequencing. DNAMAN 8.0, InterPro and Mega 6, combined with Internet databases, were used to clarify the gene structure, functional domains, alternative splicing pattern, homology and phylogenetic tree of the HSFs. The Sj-hsf ORF and the alternative splicing fragments were amplified from the female, male and egg samples of S. japonicum by RT-PCR. After cloning, the positive recombinant plasmids pB Sj HSFf-F, pB Sj HSFf-M and pB Sj HSFf-E, containing the Sj-hsf ORF, and pB Sj HSFs-F, pB Sj HSFs-M and pB Sj HSFs-E, containing the Sj-hsf alternative splicing fragments, were identified by enzyme digestion and sequencing. Three alternatively spliced Sj-hsf isoforms were observed through sequence analysis: Sj-hsf-isoform1 (2 050 bp), Sj-hsf-isoform2 (2 086 bp) and Sj-hsf-isoform3 (2 111 bp); the GenBank accession numbers were KU954546, KX119143 and KX119144, respectively. All three isoforms were located in the same contig, SJC_S000780, of the S. japonicum genome and all were expressed at the female, male and egg stages, with Sj-hsf-isoform1 expressed at a high level. Sj-HSF-isoform1 (671 aa) and Sj-HSF-isoform2 (683 aa) had DBD (DNA binding domain), HR-A/B and HR-C domains, while Sj-HSF-isoform3 (282 aa) terminated prematurely and lacked the HR-C domain. Phylogenetic tree analysis showed that the Sj-HSFs belonged to the HSF1 family, with a close phylogenetic relationship to the Sm-HSFs.
    There are three alternatively spliced isoforms of Sj-HSF in the female, male and egg stages of S. japonicum, with Sj-HSF-isoform1 expressed at a high level. This study lays the foundation for further study of the molecular mechanisms by which Sj-HSFs regulate the heat shock response system.

  2. Incorporating significant amino acid pairs and protein domains to predict RNA splicing-related proteins with functional roles

    NASA Astrophysics Data System (ADS)

    Hsu, Justin Bo-Kai; Huang, Kai-Yao; Weng, Tzu-Ya; Huang, Chien-Hsun; Lee, Tzong-Yi

    2014-01-01

    The machinery of pre-mRNA splicing operates through the interaction of RNA sequence elements and a variety of RNA splicing-related proteins (SRPs) (e.g. spliceosome and splicing factors). Alternative splicing, which is an important post-transcriptional regulation mechanism in eukaryotes, gives rise to multiple mature mRNA isoforms, which encode proteins with functional diversity. However, the regulation of RNA splicing is not yet fully elucidated, partly because SRPs have not yet been exhaustively identified and the experimental identification is labor-intensive. Therefore, we are motivated to design a new method for identifying SRPs with their functional roles in the regulation of RNA splicing. The experimentally verified SRPs were manually curated from research articles. According to the functional annotation of the Splicing Related Gene Database, the collected SRPs were further categorized into four functional groups: small nuclear Ribonucleoprotein, Splicing Factor, Splicing Regulation Factor and Novel Spliceosome Protein. The composition of amino acid pairs indicates that there are remarkable differences among the four functional groups of SRPs. Then, support vector machines (SVMs) were utilized to learn the predictive models for identifying SRPs as well as their functional roles. The cross-validation evaluation shows that the SVM models trained with significant amino acid pairs and functional domains could provide a better predictive performance. In addition, the independent testing demonstrates that the proposed method could accurately identify SRPs in mammals/plants as well as effectively distinguish between SRPs and RNA-binding proteins. This investigation provides a practical means of identifying potential SRPs and a perspective for exploring the regulation of RNA splicing.
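
    As a rough illustration of what an amino acid pair composition feature might look like, the sketch below computes the relative frequency of each of the 400 ordered pairs of adjacent residues in a protein sequence. The abstract does not give the exact feature definition, so the use of adjacent pairs, the normalization, and the example sequence are all assumptions; a vector of this kind is one plausible input to an SVM classifier.

```python
from itertools import product

# The 20 standard amino acids (one-letter codes).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
# All 400 ordered pairs, in a fixed order so feature vectors are comparable.
PAIRS = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]


def pair_composition(sequence: str) -> list[float]:
    """Relative frequency of each adjacent amino acid pair in the sequence.

    Pairs containing non-standard characters are skipped (an assumption).
    """
    sequence = sequence.upper()
    counts = dict.fromkeys(PAIRS, 0)
    total = 0
    for i in range(len(sequence) - 1):
        pair = sequence[i:i + 2]
        if pair in counts:
            counts[pair] += 1
            total += 1
    if total == 0:
        return [0.0] * len(PAIRS)
    return [counts[p] / total for p in PAIRS]


# Hypothetical toy sequence; its adjacent pairs are MK, KK, KL, LL and LA.
features = pair_composition("MKKLLA")
```

    Each protein then maps to a fixed-length vector of 400 values regardless of its length, which is what makes the representation convenient for kernel-based classifiers.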

  3. Incorporating significant amino acid pairs and protein domains to predict RNA splicing-related proteins with functional roles.

    PubMed

    Hsu, Justin Bo-Kai; Huang, Kai-Yao; Weng, Tzu-Ya; Huang, Chien-Hsun; Lee, Tzong-Yi

    2014-01-01

    The machinery of pre-mRNA splicing operates through the interaction of RNA sequence elements and a variety of RNA splicing-related proteins (SRPs) (e.g. spliceosome and splicing factors). Alternative splicing, which is an important post-transcriptional regulation mechanism in eukaryotes, gives rise to multiple mature mRNA isoforms, which encode proteins with functional diversity. However, the regulation of RNA splicing is not yet fully elucidated, partly because SRPs have not yet been exhaustively identified and the experimental identification is labor-intensive. Therefore, we are motivated to design a new method for identifying SRPs with their functional roles in the regulation of RNA splicing. The experimentally verified SRPs were manually curated from research articles. According to the functional annotation of the Splicing Related Gene Database, the collected SRPs were further categorized into four functional groups: small nuclear Ribonucleoprotein, Splicing Factor, Splicing Regulation Factor and Novel Spliceosome Protein. The composition of amino acid pairs indicates that there are remarkable differences among the four functional groups of SRPs. Then, support vector machines (SVMs) were utilized to learn the predictive models for identifying SRPs as well as their functional roles. The cross-validation evaluation shows that the SVM models trained with significant amino acid pairs and functional domains could provide a better predictive performance. In addition, the independent testing demonstrates that the proposed method could accurately identify SRPs in mammals/plants as well as effectively distinguish between SRPs and RNA-binding proteins. This investigation provides a practical means of identifying potential SRPs and a perspective for exploring the regulation of RNA splicing.

  4. Identification of a novel alternative splicing variant of hemocyanin from shrimp Litopenaeus vannamei.

    PubMed

    Zhao, Shan; Lu, Xin; Zhang, Yueling; Zhao, Xianliang; Zhong, Mingqi; Li, Shengkang; Lun, Jingsheng

    2013-01-01

    Recent evidence suggests that invertebrates express families of immune molecules with high levels of sequence diversity. Hemocyanin is an important non-specific immune molecule present in the hemolymph of both mollusks and arthropods. In the present study, we characterized a novel alternative splicing variant of hemocyanin (cHE1) from Litopenaeus vannamei that produced an mRNA transcript of 2579 bp in length. The isoform contained two additional sequences of 296 and 267 bp at the 5'- and 3'-termini, respectively, in comparison to wild-type hemocyanin (cHE). The sequence of cHE1 shows 100% identity to that of hemocyanin genomic DNA (HE, which does not form an open reading frame), suggesting that cHE1 might be an alternative splicing variant due to intron retention. Moreover, cHE1 could be detected by RT-PCR in five tissues (heart, gill, stomach, intestine and brain), and in shrimp at stages from nauplius to mysis larva. Further, cHE1 mRNA transcripts were significantly increased in hearts after 12 h of infection with Vibrio parahaemolyticus or poly I:C, while no significant difference in the transcript levels of hepatopancreas cHE was detected in the pathogen-treated shrimp during the period. In summary, these studies suggest a novel splicing variant of hemocyanin in shrimp, which might be involved in shrimp resistance to pathogenic infection. Copyright © 2013 Elsevier B.V. All rights reserved.

  5. Congenital analbuminemia caused by a novel aberrant splicing in the albumin gene.

    PubMed

    Caridi, Gianluca; Dagnino, Monica; Erdeve, Omer; Di Duca, Marco; Yildiz, Duran; Alan, Serdar; Atasay, Begum; Arsan, Saadet; Campagnoli, Monica; Galliano, Monica; Minchiotti, Lorenzo

    2014-01-01

    Congenital analbuminemia is a rare autosomal recessive disorder manifested by the presence of a very low amount of circulating serum albumin. It is an allelically heterogeneous defect, caused by a variety of mutations within the albumin gene in the homozygous or compound heterozygous state. Herein we report the clinical and molecular characterization of a new case of congenital analbuminemia diagnosed in a female newborn of consanguineous (first-degree cousins) parents from Ankara, Turkey, who presented with a low albumin concentration (< 8 g/L) and severe clinical symptoms. The albumin gene of the index case was screened by single-strand conformation polymorphism, heteroduplex analysis, and direct DNA sequencing. The effect of the splicing mutation was evaluated by examining the cDNA obtained by reverse transcriptase - polymerase chain reaction (RT-PCR) from the albumin mRNA extracted from the proband's leukocytes. DNA sequencing revealed that the proband is homozygous, and both parents are heterozygous, for a novel G>A transition at position c.1652+1, the first base of intron 12, which inactivates the strongly conserved GT dinucleotide at the 5' splice site consensus sequence of this intron. The splicing defect results in the complete skipping of the preceding exon (exon 12) and in a frame-shift within exon 13 with a premature stop codon after the translation of three mutant amino acid residues. Our results confirm the clinical diagnosis of congenital analbuminemia in the proband and the inheritance of the trait, and help to shed light on the molecular genetics of analbuminemia.
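
    The effect of a +1 G>A substitution on the conserved GT dinucleotide at a 5' splice site can be sketched in a few lines. The intron sequence below is hypothetical and is not the albumin intron 12 sequence; the helper names are illustrative only.

```python
def is_canonical_donor(intron_seq: str) -> bool:
    """True when the intron starts with the conserved GT dinucleotide (positions +1 and +2)."""
    return intron_seq[:2].upper() == "GT"


def apply_intronic_substitution(intron_seq: str, position: int, ref: str, alt: str) -> str:
    """Substitute one base at a 1-based intronic position (+1 is the first intron base)."""
    index = position - 1
    if not 0 <= index < len(intron_seq):
        raise ValueError("position is outside the supplied intron sequence")
    if intron_seq[index].upper() != ref.upper():
        raise ValueError("reference base does not match the sequence")
    return intron_seq[:index] + alt + intron_seq[index + 1:]


# Hypothetical start of an intron, used for illustration only.
wild_type_intron = "GTAAGTCT"
# A G>A transition at position +1, analogous in position to c.1652+1G>A.
mutant_intron = apply_intronic_substitution(wild_type_intron, 1, "G", "A")

wild_type_ok = is_canonical_donor(wild_type_intron)
mutant_ok = is_canonical_donor(mutant_intron)
```

    A check of this kind only flags loss of the canonical dinucleotide; it says nothing about which outcome (exon skipping, cryptic site use, intron retention) follows, which is why the abstract relies on RT-PCR of the patient's cDNA.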

  6. Expanding the action of duplex RNAs into the nucleus: redirecting alternative splicing

    PubMed Central

    Liu, Jing; Hu, Jiaxin; Corey, David R.

    2012-01-01

    Double-stranded RNAs are powerful agents for silencing gene expression in the cytoplasm of mammalian cells. The potential for duplex RNAs to control expression in the nucleus has received less attention. Here, we investigate the ability of small RNAs to redirect splicing. We identify RNAs targeting an aberrant splice site that restore splicing and production of functional protein. RNAs can target sequences within exons or introns and affect the inclusion of exons within SMN2 and dystrophin, genes responsible for spinal muscular atrophy and Duchenne muscular dystrophy, respectively. Duplex RNAs recruit argonaute 2 (AGO2) to pre-mRNA transcripts and altered splicing requires AGO2 expression. AGO2 promotes transcript cleavage in the cytoplasm, but recruitment of AGO2 to pre-mRNAs does not reduce transcript levels, exposing a difference between cytoplasmic and nuclear pathways. Involvement of AGO2 in splicing, a classical nuclear process, reinforces the conclusion from studies of RNA-mediated transcriptional silencing that RNAi pathways can be adapted to function in the mammalian nucleus. These data provide a new strategy for controlling splicing and expand the reach of small RNAs within the nucleus of mammalian cells. PMID:21948593

  7. Gene expression and splicing alterations analyzed by high throughput RNA sequencing of chronic lymphocytic leukemia specimens.

    PubMed

    Liao, Wei; Jordaan, Gwen; Nham, Phillipp; Phan, Ryan T; Pelegrini, Matteo; Sharma, Sanjai

    2015-10-16

    To determine differentially expressed and spliced RNA transcripts in chronic lymphocytic leukemia (CLL) specimens, a high-throughput RNA-sequencing (HTS RNA-seq) analysis was performed. Ten CLL specimens and five samples of normal peripheral blood CD19+ B cells were analyzed by HTS RNA-seq. The library preparation was performed with the Illumina TruSeq RNA kit and sequencing was performed on the Illumina HiSeq 2000 system. An average of 48.5 million reads for B cells and 50.6 million reads for CLL specimens were obtained, with 10396 and 10448 assembled transcripts for normal B cells and primary CLL specimens, respectively. With the Cuffdiff analysis, 2091 differentially expressed genes (DEGs) between B cells and CLL specimens were identified based on FPKM (fragments per kilobase of transcript per million reads), a false discovery rate (FDR) q < 0.05 and a fold change >2. Expression of selected DEGs (n = 32) that were up-regulated or down-regulated in CLL according to the RNA-seq data was also analyzed by qRT-PCR in a test cohort of CLL specimens. Even though the fold expression of DEGs varied between RNA-seq and qRT-PCR, more than 90% of analyzed genes were validated by qRT-PCR analysis. Analysis of RNA-seq data for splicing alterations in CLL and B cells was performed by Multivariate Analysis of Transcript Splicing (MATS analysis). Skipped exon was the most frequent splicing alteration in CLL specimens, with 128 significant events (P-value <0.05, minimum inclusion level difference >0.1). The RNA-seq analysis of CLL specimens identifies novel DEGs and alternatively spliced genes that are potential prognostic markers and therapeutic targets. The high level of validation by qRT-PCR for a number of DEGs supports the accuracy of this analysis.
    Global comparison of transcriptomes of B cells, IGVH non-mutated CLL (U-CLL) and mutated CLL specimens (M-CLL) with multidimensional scaling analysis was able to segregate CLL and B cell transcriptomes, but the M-CLL and U-CLL transcriptomes were indistinguishable. The analysis of HTS RNA-seq data to identify alternative splicing events and other genetic abnormalities specific to CLL is an added advantage of RNA-seq that is not feasible with other genome-wide analyses.
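
    The selection thresholds quoted in the abstract above (FDR q < 0.05 and fold change >2) amount to a simple filter over per-gene results. The sketch below applies them to made-up FPKM and q-values; it is not Cuffdiff itself and does not reproduce its statistics, and the pseudocount used to avoid division by zero is an assumption.

```python
import math


def is_differentially_expressed(fpkm_control: float, fpkm_case: float, q_value: float,
                                q_max: float = 0.05, min_fold: float = 2.0,
                                pseudocount: float = 1e-3) -> bool:
    """Apply the thresholds from the abstract: q < 0.05 and fold change > 2 in either direction.

    The pseudocount (an assumption) keeps the ratio defined when an FPKM value is zero.
    """
    log2_fold = math.log2((fpkm_case + pseudocount) / (fpkm_control + pseudocount))
    return q_value < q_max and abs(log2_fold) > math.log2(min_fold)


# Hypothetical per-gene results: (gene, FPKM in B cells, FPKM in CLL, q-value).
results = [
    ("gene_a", 10.0, 45.0, 0.001),  # up about 4.5-fold, significant
    ("gene_b", 10.0, 15.0, 0.001),  # significant, but only 1.5-fold
    ("gene_c", 40.0, 8.0, 0.20),    # 5-fold down, but q too high
    ("gene_d", 30.0, 5.0, 0.01),    # 6-fold down, significant
]

degs = [gene for gene, control, case, q in results
        if is_differentially_expressed(control, case, q)]
```

    Working on the log2 scale treats up- and down-regulation symmetrically, so a twofold decrease is held to the same standard as a twofold increase.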

  8. Analysis of 31-year-old patient with SYNGAP1 gene defect points to importance of variants in broader splice regions and reveals developmental trajectory of SYNGAP1-associated phenotype: case report.

    PubMed

    Prchalova, Darina; Havlovicova, Marketa; Sterbova, Katalin; Stranecky, Viktor; Hancarova, Miroslava; Sedlacek, Zdenek

    2017-06-02

    Whole exome sequencing is a powerful tool for the analysis of genetically heterogeneous conditions. The prioritization of variants identified often focuses on nonsense, frameshift and canonical splice site mutations, and highly deleterious missense variants, although other defects can also play a role. The definition of the phenotype range and course of rare genetic conditions requires long-term clinical follow-up of patients. We report an adult female patient with severe intellectual disability, severe speech delay, epilepsy, autistic features, aggressiveness, sleep problems, broad-based clumsy gait and constipation. Whole exome sequencing identified a de novo mutation in the SYNGAP1 gene. The variant was located in the broader splice donor region of intron 10 and replaced G by A at position +5 of the splice site. The variant was predicted in silico and shown experimentally to abolish the regular splice site and to activate a cryptic donor site within exon 10, causing frameshift and premature termination. The overall clinical picture of the patient corresponded well with the characteristic SYNGAP1-associated phenotype observed in previously reported patients. However, our patient was 31 years old which contrasted with most other published SYNGAP1 cases who were much younger. Our patient had a significant growth delay and microcephaly. Both features normalised later, although the head circumference stayed only slightly above the lower limit of the norm. The patient had a delayed puberty. Her cognitive and language performance remained at the level of a one-year-old child even in adulthood and showed a slow decline. Myopathic facial features and facial dysmorphism became more pronounced with age. Although the gait of the patient was unsteady in childhood, more severe gait problems developed in her teens. While the seizures remained well-controlled, her aggressive behaviour worsened with age and required extensive medication. 
    The finding in our patient underscores the notion that the interpretation of variants identified using whole exome sequencing should focus not only on variants in the canonical splice dinucleotides GT and AG, but also on broader splice regions. The long-term clinical follow-up of our patient contributes to the knowledge of the developmental trajectory in individuals with SYNGAP1 gene defects.

  9. Dehydration-induced tps gene transcripts from an anhydrobiotic nematode contain novel spliced leaders and encode atypical GT-20 family proteins.

    PubMed

    Goyal, K; Browne, J A; Burnell, A M; Tunnacliffe, A

    2005-06-01

    Accumulation of the non-reducing disaccharide trehalose is associated with desiccation tolerance during anhydrobiosis in a number of invertebrates, but there is little information on trehalose biosynthetic genes in these organisms. We have identified two trehalose-6-phosphate synthase (tps) genes in the anhydrobiotic nematode Aphelenchus avenae and determined full length cDNA sequences for both; for comparison, full length tps cDNAs from the model nematode, Caenorhabditis elegans, have also been obtained. The A. avenae genes encode very similar proteins containing the catalytic domain characteristic of the GT-20 family of glycosyltransferases and are most similar to tps-2 of C. elegans; no evidence was found for a gene in A. avenae corresponding to Ce-tps-1. Analysis of A. avenae tps cDNAs revealed several features of interest, including alternative trans-splicing of spliced leader sequences in Aav-tps-1, and four different, novel SL1-related trans-spliced leaders, which were different to the canonical SL1 sequence found in all other nematodes studied. The latter observation suggests that A. avenae does not comply with the strict evolutionary conservation of SL1 sequences observed in other species. Unusual features were also noted in predicted nematode TPS proteins, which distinguish them from homologues in other higher eukaryotes (plants and insects) and in micro-organisms. Phylogenetic analysis confirmed their membership of the GT-20 glycosyltransferase family, but indicated an accelerated rate of molecular evolution. Furthermore, nematode TPS proteins possess N- and C-terminal domains, which are unrelated to those of other eukaryotes: nematode C-terminal domains, for example, do not contain trehalose-6-phosphate phosphatase-like sequences, as seen in plant and insect homologues. During onset of anhydrobiosis, both tps genes in A. avenae are upregulated, but exposure to cold or increased osmolarity also results in gene induction, although to a lesser extent. 
    Trehalose seems likely therefore to play a role in a number of stress responses in nematodes.

  10. Expressed sequence tag analysis of adult human lens for the NEIBank Project: over 2000 non-redundant transcripts, novel genes and splice variants.

    PubMed

    Wistow, Graeme; Bernstein, Steven L; Wyatt, M Keith; Behal, Amita; Touchman, Jeffrey W; Bouffard, Gerald; Smith, Don; Peterson, Katherine

    2002-06-15

    To explore the expression profile of the human lens and to provide a resource for microarray studies, expressed sequence tag (EST) analysis has been performed on cDNA libraries from adult lenses. A cDNA library was constructed from two adult (40 year old) human lenses. Over two thousand clones were sequenced from the unamplified, un-normalized library. The library was then normalized and a further 2200 sequences were obtained. All the data were analyzed using GRIST (GRouping and Identification of Sequence Tags), a procedure for gene identification and clustering. The lens library (by) contains a low percentage of non-mRNA contaminants and a high fraction (over 75%) of apparently full length cDNA clones. Approximately 2000 reads from the unamplified library yields 810 clusters, potentially representing individual genes expressed in the lens. After normalization, the content of crystallins and other abundant cDNAs is markedly reduced and a similar number of reads from this library (fs) yields 1455 unique groups of which only two thirds correspond to named genes in GenBank. Among the most abundant cDNAs is one for a novel gene related to glutamine synthetase, which was designated "lengsin" (LGS). Analyses of ESTs also reveal examples of alternative transcripts, including a major alternative splice form for the lens specific membrane protein MP19. Variant forms for other transcripts, including those encoding the apoptosis inhibitor Livin and the armadillo repeat protein ARVCF, are also described. The lens cDNA libraries are a resource for gene discovery, full length cDNAs for functional studies and microarrays. The discovery of an abundant, novel transcript, lengsin, and a major novel splice form of MP19 reflect the utility of unamplified libraries constructed from dissected tissue. Many novel transcripts and splice forms are represented, some of which may be candidates for genetic diseases.

  11. Global Profiling of the Cellular Alternative RNA Splicing Landscape during Virus-Host Interactions

    PubMed Central

    Boudreault, Simon; Martenon-Brodeur, Camille; Caron, Marie; Garant, Jean-Michel; Tremblay, Marie-Pier; Armero, Victoria E. S.; Durand, Mathieu; Lapointe, Elvy; Thibault, Philippe; Tremblay-Létourneau, Maude; Perreault, Jean-Pierre; Scott, Michelle S.; Lemay, Guy; Bisaillon, Martin

    2016-01-01

    Alternative splicing (AS) is a central mechanism of genetic regulation which modifies the sequence of RNA transcripts in higher eukaryotes. AS has been shown to increase both the variability and diversity of the cellular proteome by changing the composition of resulting proteins through differential choice of exons to be included in mature mRNAs. In the present study, alterations to the global RNA splicing landscape of cellular genes upon viral infection were investigated using mammalian reovirus as a model. Our study provides the first comprehensive portrait of global changes in the RNA splicing signatures that occur in eukaryotic cells following infection with a human virus. We identify 240 modified alternative splicing events upon infection which belong to transcripts frequently involved in the regulation of gene expression and RNA metabolism. Using mass spectrometry, we also confirm modifications to transcript-specific peptides resulting from AS in virus-infected cells. These findings provide additional insights into the complexity of virus-host interactions as these splice variants expand proteome diversity and function during viral infection. PMID:27598998

  12. Global Profiling of the Cellular Alternative RNA Splicing Landscape during Virus-Host Interactions.

    PubMed

    Boudreault, Simon; Martenon-Brodeur, Camille; Caron, Marie; Garant, Jean-Michel; Tremblay, Marie-Pier; Armero, Victoria E S; Durand, Mathieu; Lapointe, Elvy; Thibault, Philippe; Tremblay-Létourneau, Maude; Perreault, Jean-Pierre; Scott, Michelle S; Lemay, Guy; Bisaillon, Martin

    2016-01-01

    Alternative splicing (AS) is a central mechanism of genetic regulation which modifies the sequence of RNA transcripts in higher eukaryotes. AS has been shown to increase both the variability and diversity of the cellular proteome by changing the composition of resulting proteins through differential choice of exons to be included in mature mRNAs. In the present study, alterations to the global RNA splicing landscape of cellular genes upon viral infection were investigated using mammalian reovirus as a model. Our study provides the first comprehensive portrait of global changes in the RNA splicing signatures that occur in eukaryotic cells following infection with a human virus. We identify 240 modified alternative splicing events upon infection which belong to transcripts frequently involved in the regulation of gene expression and RNA metabolism. Using mass spectrometry, we also confirm modifications to transcript-specific peptides resulting from AS in virus-infected cells. These findings provide additional insights into the complexity of virus-host interactions as these splice variants expand proteome diversity and function during viral infection.

  13. Microbial and Natural Metabolites That Inhibit Splicing: A Powerful Alternative for Cancer Treatment.

    PubMed

    Martínez-Montiel, Nancy; Rosas-Murrieta, Nora Hilda; Martínez-Montiel, Mónica; Gaspariano-Cholula, Mayra Patricia; Martínez-Contreras, Rebeca D

    2016-01-01

    In eukaryotes, genes are frequently interrupted with noncoding sequences named introns. Alternative splicing is a nuclear mechanism by which these introns are removed and flanking coding regions named exons are joined together to generate a message that will be translated in the cytoplasm. This mechanism is catalyzed by a complex machinery known as the spliceosome, which comprises more than 300 proteins and ribonucleoproteins that activate and regulate the precision of gene expression when assembled. It has been proposed that several genetic diseases are related to defects in the splicing process, including cancer. For this reason, natural products that show the ability to regulate splicing have attracted enormous attention due to their potential use for cancer treatment. Some microbial metabolites have shown the ability to inhibit gene splicing, and the molecular mechanism responsible for this inhibition is being studied for future applications. Here, we summarize the main types of natural products that have been characterized as splicing inhibitors, the recent advances regarding molecular and cellular effects related to these molecules, and the applications reported so far in cancer therapeutics.

  14. Open reading frames in a 4556 nucleotide sequence within MDV-1 BamHI-D DNA fragment: evidence for splicing of mRNA from a new viral glycoprotein gene.

    PubMed

    Becker, Y; Asher, Y; Tabor, E; Davidson, I; Malkinson, M

    1994-01-01

    A DNA segment of the MDV-1 BamHI-D fragment was sequenced, and the open reading frames (ORFs) present in the 4556 nucleotide fragment were analyzed by computer programs. Computer analysis identified 19 putative ORFs in the sequence ranging from a coding capacity of 37 amino acids (aa) (ORF-1a) to 684aa (ORF-1). The special properties of four ORFs (1a, 1, 2, and 3) were investigated. Two adjacent ORFs, ORF-1a and ORF-1, were found by computer analysis to have the properties of two exons encoding a glycoprotein: ORF-1a encodes an aa sequence with the properties of a signal peptide, and ORF-1 encodes a polypeptide with a membrane anchor domain and putative N-glycosylation sites in the aa sequence. ORF-1a and ORF-1 were found to be transcribed in MDV-1-infected cells. Two RNA transcripts were detected: a precursor RNA and its spliced form. Both are transcribed from a promoter located 5' to ORF-1a, and splice donor and acceptor sites are used to splice the mRNA after cleavage of a 71-nucleotide sequence. This finding suggests that ORF-1a and ORF-1 are two exons of a new MDV-1 glycoprotein gene. The DNA sequence containing ORF-1 was transiently expressed in COS-1 cells, and the viral protein produced in these cells was found to react with anti-MDV serotype-1 Antigen B-specific monoclonal antibodies. These studies indicate that the protein encoded by ORF-1 has antigenic properties resembling Antigen B of MDV-1. A gene homologous to ORF-1 was detected in the genome of both MDV-2(SB1) and MDV-3(HVT), which serve as commercial vaccine strains. Two additional ORFs were noted in the 4556 nucleotide sequence: ORF-2, which encodes a 333 aa polypeptide initiating in the UL and terminating in the TRL prior to the putative origin of replication, and ORF-3, which encodes a 155 aa polypeptide that is partly homologous to the phosphoprotein pp38 encoded by the BamHI-H sequence.
The 65 N-terminal aa of the two gene products are identical, both being derived from the nucleotide sequences in the TRL and IRL, respectively. Additional homologous aa sequences are the hydrophobic aa domain in the middle of both proteins. The functions of ORF-2, ORF-3, and additional ORFs are under study.

  15. Kassiopeia: a database and web application for the analysis of mutually exclusive exomes of eukaryotes

    PubMed Central

    2014-01-01

    Background Alternative splicing is an important process in higher eukaryotes that allows obtaining several transcripts from one gene. A specific case of alternative splicing is mutually exclusive splicing, in which exactly one exon out of a cluster of neighbouring exons is spliced into the mature transcript. Recently, a new algorithm for the prediction of these exons has been developed based on the preconditions that the exons of the cluster have similar lengths, sequence homology, and conserved splice sites, and that they are translated in the same reading frame. Description In this contribution we introduce Kassiopeia, a database and web application for the generation, storage, and presentation of genome-wide analyses of mutually exclusive exomes. Currently, Kassiopeia provides access to the mutually exclusive exomes of twelve Drosophila species, the thale cress Arabidopsis thaliana, the roundworm Caenorhabditis elegans, and human. Mutually exclusive spliced exons (MXEs) were predicted based on gene reconstructions from Scipio. Based on the standard prediction values, with which 83.5% of the annotated MXEs of Drosophila melanogaster were reconstructed, the exomes contain surprisingly more MXEs than previously supposed and identified. The user can search Kassiopeia using BLAST or browse the genes of each species, optionally adjusting the parameters used for the prediction to reveal more divergent or only very similar exon candidates. Conclusions We developed a pipeline to predict MXEs in the genomes of several model organisms and a web interface, Kassiopeia, for their visualization. For each gene Kassiopeia provides a comprehensive gene structure scheme, the sequences and predicted secondary structures of the MXEs, and, if available, further evidence for MXE candidates from cDNA/EST data, predictions of MXEs in homologous genes of closely related species, and RNA secondary structure predictions.
Kassiopeia can be accessed at http://www.motorprotein.de/kassiopeia. PMID:24507667
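
    The preconditions listed in the Kassiopeia abstract (similar exon lengths, sequence homology, same reading frame) can be illustrated with a rough pairwise filter. This is only a sketch under stated assumptions, not the published prediction algorithm: the function name, both thresholds, the use of difflib similarity as a stand-in for homology, and the "length difference divisible by 3" reading-frame test are all hypothetical choices, and the splice-site criterion is omitted because it needs intron context.

```python
from difflib import SequenceMatcher


def is_mxe_candidate(exon_a, exon_b, max_len_diff=30, min_similarity=0.6):
    """Crude pairwise check of two exon sequences (hypothetical thresholds)."""
    len_diff = abs(len(exon_a) - len(exon_b))
    # Criterion 1: similar lengths.
    if len_diff > max_len_diff:
        return False
    # Criterion 2 (assumed proxy): swapping the exons must not shift the
    # downstream reading frame, so lengths must agree modulo 3.
    if len_diff % 3 != 0:
        return False
    # Criterion 3 (assumed proxy): string similarity instead of a real alignment.
    similarity = SequenceMatcher(None, exon_a.upper(), exon_b.upper()).ratio()
    return similarity >= min_similarity
```

    A real pipeline would score homology on translated sequences with a proper alignment; the string ratio here only keeps the sketch dependency-free.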

  16. "iSS-Hyb-mRMR": Identification of splicing sites using hybrid space of pseudo trinucleotide and pseudo tetranucleotide composition.

    PubMed

    Iqbal, Muhammad; Hayat, Maqsood

    2016-05-01

    Gene splicing is a vital source of protein diversity. Precise removal of introns and joining of exons is a prominent task in eukaryotic gene expression, as exons are usually interrupted by introns. Identification of splicing sites through experimental techniques is a complicated and time-consuming task. With the avalanche of genome sequences generated in the post-genomic age, it remains a complicated and challenging task to develop an automatic, robust and reliable computational method for fast and effective identification of splicing sites. In this study, a hybrid model "iSS-Hyb-mRMR" is proposed for quick and accurate identification of splicing sites. Two sample representation methods, namely pseudo trinucleotide composition (PseTNC) and pseudo tetranucleotide composition (PseTetraNC), were used to extract numerical descriptors from DNA sequences. The hybrid model was developed by concatenating PseTNC and PseTetraNC. In order to select highly discriminative features, the minimum redundancy maximum relevance algorithm was applied on the hybrid feature space. The performance of these feature representation methods was tested using various classification algorithms including K-nearest neighbor, probabilistic neural network, general regression neural network, and fitting network. The jackknife test was used for evaluation of its performance on two benchmark datasets, S1 and S2. The predictor proposed in the current study achieved an accuracy of 93.26%, sensitivity of 88.77%, and specificity of 97.78% for S1, and an accuracy of 94.12%, sensitivity of 87.14%, and specificity of 98.64% for S2. It is observed that the performance of the proposed model is higher than that of the existing methods in the literature so far; the model may be useful for studying the mechanism of RNA splicing and in related research. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
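
    The hybrid feature space described in this abstract can be approximated with plain k-mer frequencies. The sketch below is an assumption-laden simplification: it concatenates ordinary trinucleotide and tetranucleotide compositions (64 + 256 values) and leaves out the "pseudo" correlation components, the mRMR feature selection, and the classifiers. The function names are hypothetical; the metric definitions are the standard ones for accuracy, sensitivity, and specificity.

```python
from itertools import product


def kmer_composition(seq, k):
    """Normalized k-mer frequencies over ACGT, in lexicographic k-mer order."""
    seq = seq.upper()
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    counts = dict.fromkeys(kmers, 0)
    total = 0
    for i in range(len(seq) - k + 1):
        window = seq[i:i + k]
        if window in counts:  # windows containing N or other symbols are skipped
            counts[window] += 1
            total += 1
    return [counts[m] / total if total else 0.0 for m in kmers]


def hybrid_features(seq):
    """Concatenate 3-mer and 4-mer compositions (simplified hybrid space)."""
    return kmer_composition(seq, 3) + kmer_composition(seq, 4)


def binary_metrics(tp, tn, fp, fn):
    """Accuracy, sensitivity, and specificity from confusion-matrix counts."""
    accuracy = (tp + tn) / (tp + tn + fp + fn)
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return accuracy, sensitivity, specificity
```

    Each sequence maps to a 320-value vector, which could then be passed to a feature-selection step and a classifier of the reader's choice.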

  17. mRNA trans-splicing in gene therapy for genetic diseases.

    PubMed

    Berger, Adeline; Maire, Séverine; Gaillard, Marie-Claude; Sahel, José-Alain; Hantraye, Philippe; Bemelmans, Alexis-Pierre

    2016-07-01

    Spliceosome-mediated RNA trans-splicing, or SMaRT, is a promising strategy to design innovative gene therapy solutions for currently intractable genetic diseases. SMaRT relies on the correction of mutations at the post-transcriptional level by modifying the mRNA sequence. To achieve this, an exogenous RNA is introduced into the target cell, usually by means of gene transfer, to induce a splice event in trans between the exogenous RNA and the target endogenous pre-mRNA. This produces a chimeric mRNA composed partly of exons of the latter, and partly of exons of the former, encoding a sequence free of mutations. The principal challenge of SMaRT technology is to achieve a reaction as complete as possible, i.e., resulting in 100% repair of the endogenous mRNA target. The proof of concept of SMaRT feasibility has already been established in several models of genetic diseases caused by recessive mutations. In such cases, in fact, the repair of only a portion of the mutant mRNA pool may be sufficient to obtain a significant therapeutic effect. However, in the case of dominant mutations, the target cell must be freed from the majority of mutant mRNA copies, requiring a highly efficient trans-splicing reaction. This likely explains why only a few examples of SMaRT approaches targeting dominant mutations are reported in the literature. In this review, we explain in detail the mechanism of trans-splicing, review the different strategies that are under evaluation to lead to efficient trans-splicing, and discuss the advantages and limitations of SMaRT. WIREs RNA 2016, 7:487-498. doi: 10.1002/wrna.1347 For further resources related to this article, please visit the WIREs website. © 2016 The Authors. WIREs RNA published by Wiley Periodicals, Inc.

  18. Co-evolution of SNF spliceosomal proteins with their RNA targets in trans-splicing nematodes.

    PubMed

    Strange, Rex Meade; Russelburg, L Peyton; Delaney, Kimberly J

    2016-08-01

    Although the mechanism of pre-mRNA splicing has been well characterized, the evolution of spliceosomal proteins is poorly understood. The U1A/U2B″/SNF family (hereafter referred to as the SNF family) of RNA binding spliceosomal proteins participates in both the U1 and U2 small nuclear ribonucleoproteins (snRNPs). The highly constrained nature of this system has inhibited an analysis of co-evolutionary trends between the proteins and their RNA binding targets. Here we report accelerated sequence evolution in the SNF protein family in Phylum Nematoda, which has allowed an analysis of protein:RNA co-evolution. In a comparison of SNF genes from ecdysozoan species, we found a correlation between trans-splicing species (nematodes) and increased phylogenetic branch lengths of the SNF protein family, with respect to their sister clade Arthropoda. In particular, we found that nematodes (~70-80 % of pre-mRNAs are trans-spliced) have experienced higher rates of SNF sequence evolution than arthropods (predominantly cis-spliced) at both the nucleotide and amino acid levels. Interestingly, this increased evolutionary rate correlates with the reliance on trans-splicing by nematodes, which would alter the role of the SNF family of spliceosomal proteins. We mapped amino acid substitutions to functionally important regions of the SNF protein, specifically to sites that are predicted to disrupt protein:RNA and protein:protein interactions. Finally, we investigated SNF's RNA targets: the U1 and U2 snRNAs. Both are more divergent in nematodes than arthropods, suggesting the RNAs have co-evolved with SNF in order to maintain the necessarily high affinity interaction that has been characterized in other species.

  19. Molecular Cloning and Characterization of the Human ErbB4 Gene: Identification of Novel Splice Isoforms in the Developing and Adult Brain

    PubMed Central

    Tan, Wei; Dean, Michael; Law, Amanda J.

    2010-01-01

    ErbB4 is a growth factor receptor tyrosine kinase essential for neurodevelopment. Genetic variation in ErbB4 is associated with schizophrenia and risk-associated polymorphisms predict overexpression of ErbB4 CYT-1 isoforms in the brain in the disorder. The molecular mechanism of association is unclear because the polymorphisms flank exon 3 of the gene and reside 700 kb distal to the CYT-1 defining exon. We hypothesized that the polymorphisms are indirectly associated with ErbB4 CYT-1 via splicing of exon 3 on the CYT-1 background. We report via cloning and sequencing of adult and fetal human brain cDNA libraries the identification of novel splice isoforms of ErbB4, whereby exon 3 is skipped (del.3). ErbB4 del.3 transcripts exist as CYT-2 isoforms and are predicted to produce truncated proteins. Furthermore, our data refine the structure of the human ErbB4 gene, clarify that juxtamembrane (JM) splice variants of ErbB4, JM-a and JM-b respectively, are characterized by the replacement of a 75 nucleotide (nt) sequence with a 45-nt insertion, and demonstrate that there are four alternative exons in the gene. Our analyses reveal that novel splice variants of ErbB4 exist in the developing and adult human brain and, given the failure to identify ErbB4 del.3 CYT-1 transcripts, suggest that the association of risk polymorphisms in the ErbB4 gene with CYT-1 transcript levels is not mediated via an exon 3 splicing event. PMID:20886074

  20. Mechanism for DNA transposons to generate introns on genomic scales

    PubMed Central

    Huff, Jason T.; Zilberman, Daniel; Roy, Scott W.

    2017-01-01

    Discovered four decades ago, the existence of introns was one of the most unexpected findings in molecular biology [1]. Introns are sequences interrupting genes that must be removed as part of mRNA production. Genome sequencing projects have documented that most eukaryotic genes contain at least one and frequently many introns [2,3]. Comparison of these genomes reveals a history of long evolutionary periods with little intron gain punctuated by episodes of rapid, extensive gain [2,3]. However, no detailed mechanism for such episodic intron generation has been empirically supported on a sufficient scale, despite several proposals [4–8]. Here we show how short non-autonomous DNA transposons independently generated hundreds to thousands of introns in the prasinophyte Micromonas pusilla and the pelagophyte Aureococcus anophagefferens. Each transposon carries one splice site. The other splice site is co-opted from gene sequence duplicated upon transposon insertion, allowing perfect splicing out of RNA. The distributions of sequences that can be co-opted are biased with respect to codons, and phasing of transposon-generated introns is similarly biased. These transposons insert between preexisting nucleosomes, so that multiple nearby insertions generate nucleosome-sized intervening segments. Thus, transposon insertion and sequence co-option may explain the intron phase biases [2] and prevalence of nucleosome-sized exons [9] observed in eukaryotes. Overall, the two independent examples of proliferating elements illustrate a general DNA transposon mechanism plausibly accounting for episodes of rapid, extensive intron gain during eukaryotic evolution [2,3]. PMID:27760113

  1. Identification of a novel splicing mutation within SLC17A8 in a Korean family with hearing loss by whole-exome sequencing.

    PubMed

    Ryu, Nari; Lee, Seokwon; Park, Hong-Joon; Lee, Byeonghyeon; Kwon, Tae-Jun; Bok, Jinwoong; Park, Chan Ik; Lee, Kyu-Yup; Baek, Jeong-In; Kim, Un-Kyung

    2017-09-05

    Hereditary hearing loss (HHL) is a common genetically heterogeneous disorder, which follows Mendelian inheritance in humans. Because of this heterogeneity, the identification of the causative gene of HHL by linkage analysis or Sanger sequencing has shown economic and temporal limitations. With recent advances in next-generation sequencing (NGS) techniques, rapid identification of a causative gene via massively parallel sequencing is now possible. We recruited a Korean family with three generations exhibiting autosomal dominant inheritance of hearing loss (HL), and the clinical information about this family revealed that there are no other symptoms accompanying HL. To identify a causative mutation of HL in this family, we performed whole-exome sequencing of 4 family members, 3 affected and 1 unaffected. As a result, a novel splicing mutation, c.763+1G>T, in the solute carrier family 17, member 8 (SLC17A8) gene was identified in the patients, and the genotypes of the mutation co-segregated with the phenotype of HL. Additionally, this mutation was not detected in 100 Koreans with normal hearing. Via NGS, we detected a novel splicing mutation that might influence the hearing ability within the patients with autosomal dominant non-syndromic HL. Our data suggest that this technique is a powerful tool to discover causative genetic factors of HL and facilitate diagnoses of the primary cause of HHL. Copyright © 2017 Elsevier B.V. All rights reserved.

  2. Mammalian splicing factor SF1 interacts with SURP domains of U2 snRNP-associated proteins

    PubMed Central

    Crisci, Angela; Raleff, Flore; Bagdiul, Ivona; Raabe, Monika; Urlaub, Henning; Rain, Jean-Christophe; Krämer, Angela

    2015-01-01

    Splicing factor 1 (SF1) recognizes the branch point sequence (BPS) at the 3′ splice site during the formation of early complex E, thereby pre-bulging the BPS adenosine, thought to facilitate subsequent base-pairing of the U2 snRNA with the BPS. The 65-kDa subunit of U2 snRNP auxiliary factor (U2AF65) interacts with SF1 and was shown to recruit the U2 snRNP to the spliceosome. Co-immunoprecipitation experiments of SF1-interacting proteins from HeLa cell extracts shown here are consistent with the presence of SF1 in early splicing complexes. Surprisingly almost all U2 snRNP proteins were found associated with SF1. Yeast two-hybrid screens identified two SURP domain-containing U2 snRNP proteins as partners of SF1. A short, evolutionarily conserved region of SF1 interacts with the SURP domains, stressing their role in protein–protein interactions. A reduction of A complex formation in SF1-depleted extracts could be rescued with recombinant SF1 containing the SURP-interaction domain, but only partial rescue was observed with SF1 lacking this sequence. Thus, SF1 can initially recruit the U2 snRNP to the spliceosome during E complex formation, whereas U2AF65 may stabilize the association of the U2 snRNP with the spliceosome at later times. In addition, these findings may have implications for alternative splicing decisions. PMID:26420826

  3. Evolutionary dynamics and sites of illegitimate recombination revealed in the interspersion and sequence junctions of two nonhomologous satellite DNAs in cactophilic Drosophila species.

    PubMed

    Kuhn, G C S; Teo, C H; Schwarzacher, T; Heslop-Harrison, J S

    2009-05-01

    Satellite DNA (satDNA) is a major component of genomes but relatively little is known about the fine-scale organization of unrelated satDNAs residing at the same chromosome location, and the sequence structure and dynamics of satDNA junctions. We studied the organization and sequence junctions of two nonhomologous satDNAs, pBuM and DBC-150, in three species from the neotropical Drosophila buzzatii cluster (repleta group). In situ hybridization to microchromosomes, interphase nuclei and extended DNA fibers showed frequent interspersion of the two satellites in D. gouveai, D. antonietae and, to a lesser extent, D. seriema. We isolated by PCR six pBuM x DBC-150 junctions: four are exclusive to D. gouveai and two are exclusive to D. antonietae. The six junction breakpoints occur at different positions within monomers, suggesting independent origin. Four junctions showed abrupt transitions between the two satellites, whereas two junctions showed a distinct 10 bp tandem duplication before the junction. Unlike pBuM, DBC-150 junction repeats are more variable than randomly cloned monomers and showed diagnostic features in common with a 3-monomer higher-order repeat seen in the sister species D. serido. The high levels of interspersion between pBuM and DBC-150 repeats suggest extensive rearrangements between the two satellites, maybe favored by specific features of the microchromosomes. Our interpretation is that the junctions evolved by multiple events of illegitimate recombination between nonhomologous satDNA repeats, with subsequent rounds of unequal crossing-over expanding the copy number of some of the junctions.

  4. Diversified clinical presentations associated with a novel sal-like 4 gene mutation in a Chinese pedigree with Duane retraction syndrome.

    PubMed

    Yang, Ming-ming; Ho, Mary; Lau, Henry H W; Tam, Pancy O S; Young, Alvin L; Pang, Chi Pui; Yip, Wilson W K; Chen, LiJia

    2013-01-01

    To determine the underlying genetic cause of Duane retraction syndrome (DRS) in a non-consanguineous Chinese Han family. Detailed ophthalmic and physical examinations were performed on all members from a pedigree with DRS. All exons and their adjacent splicing junctions of the sal-like 4 (SALL4) gene were amplified with polymerase chain reaction and analyzed with direct sequencing in all the recruited family members and 200 unrelated control subjects. Clinical examination revealed a broad spectrum of phenotypes in the DRS family. Mutation analysis of SALL4 identified a novel heterozygous duplication mutation, c.1919dupT, which was completely cosegregated with the disease in the family and absent in controls. This mutation was predicted to cause a frameshift, introducing a premature stop codon, when translated, resulting in a truncated SALL4 protein, i.e., p.Met640IlefsX25. Bioinformatics analysis showed that the affected region of SALL4 shared a highly conserved sequence across different species. Diversified clinical manifestations were observed in the c.1919dupT carriers of the family. We identified a novel truncating mutation in the SALL4 gene that leads to diversified clinical features of DRS in a Chinese family. This mutation is predicted to result in a truncated SALL4 protein affecting two functional domains and cause disease development due to haploinsufficiency through nonsense-mediated mRNA decay.
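
    The protein-level name p.Met640IlefsX25 can be checked against c.1919dupT with simple codon arithmetic. The sketch below assumes HGVS coding numbering (position 1 is the A of the ATG start codon) and relies on the fact that methionine has a single codon, ATG, in the standard genetic code. The function names and the two-entry codon table are illustrative only; the downstream sequence needed to locate the premature stop is not given in the abstract and is not reconstructed here.

```python
# Minimal codon table: only the two codons this example needs.
CODON_TO_AA = {"ATG": "Met", "ATT": "Ile"}


def codon_index(cds_pos):
    """Map a 1-based coding position to (codon number, position within codon)."""
    return (cds_pos - 1) // 3 + 1, (cds_pos - 1) % 3 + 1


def duplicate_base(seq, pos):
    """Return seq with the base at 1-based position pos duplicated in place."""
    return seq[:pos] + seq[pos - 1] + seq[pos:]


codon_number, offset = codon_index(1919)   # position 1919 lies in codon 640
mutant = duplicate_base("ATG", offset)     # duplicate the T of the Met codon
first_mutant_codon = mutant[:3]            # new codon 640 after the insertion
```

    Position 1919 is the middle base of codon 640, so duplicating that T turns ATG (Met) into ATT (Ile) and shifts every downstream codon by one base, which matches the reported Met640Ile frameshift.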

  5. The Choice of Alternative 5' Splice Sites in Influenza Virus M1 mRNA is Regulated by the Viral Polymerase Complex

    NASA Astrophysics Data System (ADS)

    Shih, Shin-Ru; Nemeroff, Martin E.; Krug, Robert M.

    1995-07-01

    The influenza virus M1 mRNA has two alternative 5' splice sites: a distal 5' splice site producing mRNA_3 that has the coding potential for 9 amino acids and a proximal 5' splice site producing M2 mRNA encoding the essential M2 ion-channel protein. Only mRNA_3 was made in uninfected cells transfected with DNA expressing M1 mRNA. Similarly, using nuclear extracts from uninfected cells, in vitro splicing of M1 mRNA yielded only mRNA_3. Only when the mRNA_3 5' splice site was inactivated by mutation was M2 mRNA made in uninfected cells and in uninfected cell extracts. In influenza virus-infected cells, M2 mRNA was made, but only after a delay, suggesting that newly synthesized viral gene product(s) were needed to activate the M2 5' splice site. We present strong evidence that these gene products are the complex of the three polymerase proteins, the same complex that functions in the transcription and replication of the viral genome. Gel shift experiments showed that the viral polymerase complex bound to the 5' end of the viral M1 mRNA in a sequence-specific and cap-dependent manner. During in vitro splicing catalyzed by uninfected cell extracts, the binding of the viral polymerase complex blocked the mRNA_3 5' splice site, resulting in the switch to the M2 mRNA 5' splice site and the production of M2 mRNA.

  6. Targeted Single-Shot Methods for Diffusion-Weighted Imaging in the Kidneys

    PubMed Central

    Jin, Ning; Deng, Jie; Zhang, Longjiang; Zhang, Zhuoli; Lu, Guangming; Omary, Reed A.; Larson, Andrew C.

    2011-01-01

    Purpose To investigate the feasibility of combining the inner-volume-imaging (IVI) technique with single-shot diffusion-weighted (DW) spin-echo echo-planar imaging (SE-EPI) and DW-SPLICE (split acquisition of fast spin-echo) sequences for renal DW imaging. Materials and Methods Renal DW imaging was performed in 10 healthy volunteers using single-shot DW-SE-EPI, DW-SPLICE, targeted-DW-SE-EPI and targeted-DW-SPLICE. We compared the quantitative diffusion measurement accuracy and image quality of these targeted-DW-SE-EPI and targeted DW-SPLICE methods with conventional full FOV DW-SE-EPI and DW-SPLICE measurements in phantoms and normal volunteers. Results Compared with full FOV DW-SE-EPI and DW-SPLICE methods, targeted-DW-SE-EPI and targeted-DW-SPLICE approaches produced images of superior overall quality with fewer artifacts, less distortion and reduced spatial blurring in both phantom and volunteer studies. The ADC values measured with each of the four methods were similar and in agreement with previously published data. There were no statistically significant differences between the ADC values and intra-voxel incoherent motion (IVIM) measurements in the kidney cortex and medulla using single-shot DW-SE-EPI, targeted-DW-EPI and targeted-DW-SPLICE (p > 0.05). Conclusion Compared with full-FOV DW imaging methods, targeted-DW-SE-EPI and targeted-DW-SPLICE techniques reduced image distortion and artifacts observed in the single-shot DW-SE-EPI images, reduced blurring in DW-SPLICE images and produced comparable quantitative DW and IVIM measurements to those produced with conventional full-FOV approaches. PMID:21591023

  7. Alternative Splicing of STAT3 Is Affected by RNA Editing.

    PubMed

    Goldberg, Lior; Abutbul-Amitai, Mor; Paret, Gideon; Nevo-Caspi, Yael

    2017-05-01

    A-to-I RNA editing, carried out by adenosine deaminase acting on RNA (ADAR) enzymes, is an epigenetic phenomenon of posttranscriptional modifications on pre-mRNA. RNA editing in intronic sequences may influence alternative splicing of flanking exons. We have previously shown that conditions that induce editing result in elevated expression of signal transducer and activator of transcription 3 (STAT3), preferentially the alternatively-spliced STAT3β isoform. Mechanisms regulating alternative splicing of STAT3 have not been elucidated. STAT3 undergoes A-to-I RNA editing in an intron residing in proximity to the alternatively spliced exon. We hypothesized that RNA editing plays a role in regulating alternative splicing toward STAT3β. In this study we extend our observation connecting RNA editing to the preferential induction of STAT3β expression. We study the involvement of ADAR1 in STAT3 editing and reveal the connection between editing and alternative splicing of STAT3. Deferoxamine treatment caused the induction of STAT3 RNA editing and STAT3β expression. Silencing ADAR1 caused a decrease in STAT3 editing and expression with a preferential decrease in STAT3β. Cells transfected with a mutated minigene showed preferential splicing toward the STAT3β transcript. Editing in the STAT3 intron is performed by ADAR1 and affects STAT3 alternative splicing. These results suggest that RNA editing is one of the molecular mechanisms regulating the expression of STAT3β.

  8. Circular RNAs: Unexpected outputs of many protein-coding genes

    PubMed Central

    Wilusz, Jeremy E.

    2017-01-01

    ABSTRACT Pre-mRNAs from thousands of eukaryotic genes can be non-canonically spliced to generate circular RNAs, some of which accumulate to higher levels than their associated linear mRNA. Recent work has revealed widespread mechanisms that dictate whether the spliceosome generates a linear or circular RNA. For most genes, circular RNA biogenesis via backsplicing is far less efficient than canonical splicing, but circular RNAs can accumulate due to their long half-lives. Backsplicing is often initiated when complementary sequences from different introns base pair and bring the intervening splice sites close together. This process is further regulated by the combinatorial action of RNA binding proteins, which allow circular RNAs to be expressed in unique patterns. Some genes do not require complementary sequences to generate RNA circles and instead take advantage of exon skipping events. It is still unclear what most mature circular RNAs do, but future investigations into their functions will be facilitated by recently described methods to modulate circular RNA levels. PMID:27571848

  9. Sequence variants of KHDRBS1 as high penetrance susceptibility risks for primary ovarian insufficiency by mis-regulating mRNA alternative splicing.

    PubMed

    Wang, Binbin; Li, Lin; Zhu, Ying; Zhang, Wei; Wang, Xi; Chen, Beili; Li, Tengyan; Pan, Hong; Wang, Jing; Kee, Kehkooi; Cao, Yunxia

    2017-10-01

    Does a novel heterozygous KHDRBS1 variant, identified using whole-exome sequencing (WES) in two patients with primary ovarian insufficiency (POI) in a pedigree, cause defects in mRNA alternative splicing? The heterozygous variant of KHDRBS1 was confirmed to cause defects in alternative splicing of many genes involved in DNA replication and repair. Studies in mice revealed that Khdrbs1 deficient females are subfertile, which manifests as delayed sexual maturity and significantly reduced numbers of secondary and pre-antral follicles. No mutation of KHDRBS1, however, has been reported in patients with POI. This genetic and functional study used WES to find putative mutations in a POI pedigree. Altogether, 215 idiopathic POI patients and 400 healthy controls were screened for KHDRBS1 mutations. Two POI patients were subjected to WES to identify sequence variants. Mutational analysis of the KHDRBS1 gene in 215 idiopathic POI patients and 400 healthy controls were performed. RNA-sequencing was carried out to find the mis-regulation of gene expression due to KHDRBS1 mutation. Bioinformatics was used to analyze the change in alternative splicing events. We identified a heterozygous mutation (c.460A > G, p.M154V) in KHDRBS1 in two patients. Further mutational analysis of 215 idiopathic POI patients with the KHDRBS1 gene found one heterozygous mutation (c.263C > T, p.P88L). We failed to find these two mutations in 400 healthy control women. Using RNA-sequencing, we found that the KGN cells expressing the M154V KHDRBS1 mutant had different expression of 66 genes compared with wild-type (WT) cells. Furthermore, 145 genes were alternatively spliced in M154V cells, and these genes were enriched for DNA replication and repair function, revealing a potential underlying mechanism of the pathology that leads to POI. 
Although the in vitro assays demonstrated the effect of the KHDRBS1 variant on alternative splicing, further studies are needed to validate the in vivo effects on germ cell and follicle development. This finding provides researchers and clinicians a better understanding of the etiology and molecular mechanism of POI. This study was supported by the Ministry of Science and Technology of China (2012CB944704; 2012CB966702), National Research Institute for Family Planning (2017GJZ05), the National Natural Science Foundation of China (31171429) and Beijing Advanced Innovation Center for Structural Biology. The authors declare no conflict of interest. © The Author 2017. Published by Oxford University Press on behalf of the European Society of Human Reproduction and Embryology. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  10. Genetic variation affecting exon skipping contributes to brain structural atrophy in Alzheimer's disease.

    PubMed

    Lee, Younghee; Han, Seonggyun; Kim, Dongwook; Kim, Dokyoon; Horgousluoglu, Emrin; Risacher, Shannon L; Saykin, Andrew J; Nho, Kwangsik

    2018-01-01

    Genetic variation in cis-regulatory elements related to splicing machinery and splicing regulatory elements (SREs) results in exon skipping and undesired protein products. We developed a splicing decision model to identify actionable loci among common SNPs for gene regulation. The splicing decision model identified SNPs affecting exon skipping by analyzing sequence-driven alternative splicing (AS) models and by scanning the genome for the regions with putative SRE motifs. We used non-Hispanic Caucasians with neuroimaging and fluid biomarkers for Alzheimer's disease (AD) and identified 17,088 common exonic SNPs affecting exon skipping. GWAS identified one SNP (rs1140317) in HLA-DQB1 as significantly associated with entorhinal cortical thickness, an AD neuroimaging biomarker, after controlling for multiple testing. Further analysis revealed that rs1140317 was significantly associated with brain amyloid-β deposition (PET and CSF). HLA-DQB1 is an essential immune gene and may regulate AS, thereby contributing to AD pathology. SREs may hold potential as novel therapeutic targets for AD.

  11. An ARHGEF10 Deletion Is Highly Associated with a Juvenile-Onset Inherited Polyneuropathy in Leonberger and Saint Bernard Dogs

    PubMed Central

    Minor, Katie M.; Shelton, G. Diane; Patterson, Edward E.; Bley, Tim; Oevermann, Anna; Bilzer, Thomas; Leeb, Tosso

    2014-01-01

    An inherited polyneuropathy (PN) observed in Leonberger dogs has clinical similarities to a genetically heterogeneous group of peripheral neuropathies termed Charcot-Marie-Tooth (CMT) disease in humans. The Leonberger disorder is a severe, juvenile-onset, chronic, progressive, and mixed PN, characterized by exercise intolerance, gait abnormalities and muscle atrophy of the pelvic limbs, as well as inspiratory stridor and dyspnea. We mapped a PN locus in Leonbergers to a 250 kb region on canine chromosome 16 (P_raw = 1.16 × 10^-10; P_genome, corrected = 0.006) utilizing a high-density SNP array. Within this interval is the ARHGEF10 gene, a member of the rho family of GTPases known to be involved in neuronal growth and axonal migration, and implicated in human hypomyelination. ARHGEF10 sequencing identified a 10 bp deletion in affected dogs that removes four nucleotides from the 3′-end of exon 17 and six nucleotides from the 5′-end of intron 17 (c.1955_1958+6delCACGGTGAGC). This eliminates the 3′-splice junction of exon 17, creates an alternate splice site immediately downstream in which the processed mRNA contains a frame shift, and generates a premature stop codon predicted to truncate approximately 50% of the protein. Homozygosity for the deletion was highly associated with the severe juvenile-onset PN phenotype in both Leonberger and Saint Bernard dogs. The overall clinical picture of PN in these breeds, and the effects of sex and heterozygosity of the ARHGEF10 deletion, are less clear due to the likely presence of other forms of PN with variable ages of onset and severity of clinical signs. This is the first documented severe polyneuropathy associated with a mutation in ARHGEF10 in any species. PMID:25275565
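
    As a small worked illustration of the deletion notation in this record, the sketch below splits the reported 10 bp deleted sequence into its exonic and intronic parts, using only the counts stated in the abstract (four exonic, six intronic nucleotides). The function name and the check for a leading GT (the canonical intron start) are illustrative assumptions, not part of the study's methods.

```python
# Deleted sequence as written in the abstract: c.1955_1958+6delCACGGTGAGC
DELETED = "CACGGTGAGC"
EXONIC_BASES = 4    # c.1955_1958: last four nucleotides of exon 17
INTRONIC_BASES = 6  # +1 to +6: first six nucleotides of intron 17


def split_deletion(deleted, n_exonic, n_intronic):
    """Split a deleted sequence spanning an exon/intron boundary.

    Hypothetical helper: returns (exonic_part, intronic_part).
    """
    if len(deleted) != n_exonic + n_intronic:
        raise ValueError("deleted sequence length does not match the counts")
    return deleted[:n_exonic], deleted[n_exonic:]


exon_part, intron_part = split_deletion(DELETED, EXONIC_BASES, INTRONIC_BASES)

# Canonical introns begin with GT; losing it removes the original splice signal.
removes_canonical_gt = intron_part.startswith("GT")

print(exon_part, intron_part, removes_canonical_gt)
```

    Under these assumptions the intronic part of the deletion begins with GT, which is consistent with the abstract's statement that the splice junction at the end of exon 17 is eliminated.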

  12. Expression of exon-8-skipped kindlin-1 does not compensate for defects of Kindler syndrome.

    PubMed

    Natsuga, Ken; Nishie, Wataru; Shinkuma, Satoru; Nakamura, Hideki; Matsushima, Yoichiro; Tatsuta, Aya; Komine, Mayumi; Shimizu, Hiroshi

    2011-01-01

    Kindler syndrome (KS) is a rare, inherited skin disease characterized by blister formation and generalized poikiloderma. Mutations in KIND1, which encodes kindlin-1, are responsible for KS. c.1089del/1089+1del is a recurrent splice-site deletion mutation in KS patients. To elucidate the effects of c.1089del/1089+1del at the mRNA and protein level. Two KS patients with c.1089del/1089+1del were included in this study. Immunofluorescence analysis of KS skin samples using antibodies against the dermo-epidermal junction proteins was performed. Exon-trapping experiments were performed to isolate the mRNA sequences transcribed from genomic DNA harbouring c.1089del/1089+1del. β1 integrin activation in HeLa cells transfected with truncated KIND1 cDNA was analyzed. Immunofluorescence study showed positive expression of kindlin-1 in KS skin with c.1089del/1089+1del mutation. We identified the exon-8-skipped in-frame transcript as the main product among multiple splicing variants derived from that mutation. HeLa cells transfected with KIND1 cDNA without exon 8 showed impaired β1 integrin activation. Exon-8-coding amino acids are located in the FERM F2 domain, which is conserved among species, and the unstructured region between F2 and the pleckstrin homology domain. This study suggests that exon-8-skipped truncated kindlin-1 is functionally defective and does not compensate for the defects of KS, even though kindlin-1 expression in skin is positive. Copyright © 2010 Japanese Society for Investigative Dermatology. Published by Elsevier Ireland Ltd. All rights reserved.

  13. Functional analysis of a large set of BRCA2 exon 7 variants highlights the predictive value of hexamer scores in detecting alterations of exonic splicing regulatory elements.

    PubMed

    Di Giacomo, Daniela; Gaildrat, Pascaline; Abuli, Anna; Abdat, Julie; Frébourg, Thierry; Tosi, Mario; Martins, Alexandra

    2013-11-01

    Exonic variants can alter pre-mRNA splicing either by changing splice sites or by modifying splicing regulatory elements. Often these effects are difficult to predict and are only detected by performing RNA analyses. Here, we analyzed, in a minigene assay, 26 variants identified in the exon 7 of BRCA2, a cancer predisposition gene. Our results revealed eight new exon skipping mutations in this exon: one directly altering the 5' splice site and seven affecting potential regulatory elements. This brings the number of splicing regulatory mutations detected in BRCA2 exon 7 to a total of 11, a remarkably high number considering the total number of variants reported in this exon (n = 36), all tested in our minigene assay. We then exploited this large set of splicing data to test the predictive value of splicing regulator hexamers' scores recently established by Ke et al. Comparisons of hexamer-based predictions with our experimental data revealed high sensitivity in detecting variants that increased exon skipping, an important feature for prescreening variants before RNA analysis. In conclusion, hexamer scores represent a promising tool for predicting the biological consequences of exonic variants and may have important applications for the interpretation of variants detected by high-throughput sequencing. © 2013 WILEY PERIODICALS, INC.
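
    The general idea of a hexamer-based prediction can be sketched as follows: sum the scores of every hexamer overlapping the variant position in the wild-type and in the variant sequence, and compare the totals. This is a simplified illustration only. The score table, the sequence, and the sign convention (positive scores treated as enhancer-like) are made-up assumptions, not the published hexamer scores or the authors' exact procedure.

```python
def overlapping_hexamers(seq, pos, k=6):
    """Return every k-mer of seq that contains the 0-based index pos."""
    first = max(0, pos - k + 1)
    last = min(pos, len(seq) - k)
    return [seq[i:i + k] for i in range(first, last + 1)]


def total_score(seq, pos, scores):
    """Sum the scores of all hexamers overlapping pos (unknown hexamers count as 0)."""
    return sum(scores.get(h, 0.0) for h in overlapping_hexamers(seq, pos))


def delta_score(wild_type, pos, alt, scores):
    """Variant total minus wild-type total for a single-nucleotide substitution."""
    variant = wild_type[:pos] + alt + wild_type[pos + 1:]
    return total_score(variant, pos, scores) - total_score(wild_type, pos, scores)


# Toy score table and sequence, invented for this example.
TOY_SCORES = {"AAGAAG": 1.0, "AGAAGA": 0.5, "GAAGAA": 0.5}
WILD_TYPE = "AAGAAGAAGAAG"

delta = delta_score(WILD_TYPE, pos=5, alt="T", scores=TOY_SCORES)
print(delta)  # -4.0 with the toy table above
```

    With this toy table, the substitution destroys all six enhancer-like hexamers that overlap the position, so the total drops. Under the assumed sign convention, a drop of this kind would flag the variant as a candidate for increased exon skipping and for follow-up RNA analysis.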

  14. Expansion of the Preimmune Antibody Repertoire by Junctional Diversity in Bos taurus

    PubMed Central

    Liljavirta, Jenni; Niku, Mikael; Pessa-Morikawa, Tiina; Ekman, Anna; Iivanainen, Antti

    2014-01-01

    Cattle have a limited range of immunoglobulin genes which are further diversified by antigen independent somatic hypermutation in fetuses. Junctional diversity generated during somatic recombination contributes to antibody diversity but its relative significance has not been comprehensively studied. We have investigated the importance of terminal deoxynucleotidyl transferase (TdT) -mediated junctional diversity to the bovine immunoglobulin repertoire. We also searched for new bovine heavy chain diversity (IGHD) genes as the information of the germline sequences is essential to define the junctional boundaries between gene segments. New heavy chain variable genes (IGHV) were explored to address the gene usage in the fetal recombinations. Our bioinformatics search revealed five new IGHD genes, which included the longest IGHD reported so far, 154 bp. By genomic sequencing we found 26 new IGHV sequences that represent potentially new IGHV genes or allelic variants. Sequence analysis of immunoglobulin heavy chain cDNA libraries of fetal bone marrow, ileum and spleen showed 0 to 36 nontemplated N-nucleotide additions between variable, diversity and joining genes. A maximum of 8 N nucleotides were also identified in the light chains. The junctional base profile was biased towards A and T nucleotide additions (64% in heavy chain VD, 52% in heavy chain DJ and 61% in light chain VJ junctions) in contrast to the high G/C content which is usually observed in mice. Sequence analysis also revealed extensive exonuclease activity, providing additional diversity. B-lymphocyte specific TdT expression was detected in bovine fetal bone marrow by reverse transcription-qPCR and immunofluorescence. These results suggest that TdT-mediated junctional diversity and exonuclease activity contribute significantly to the size of the cattle preimmune antibody repertoire already in the fetal period. PMID:24926997

  15. The Mitochondrial Genome of the Prasinophyte Prasinoderma coloniale Reveals Two Trans-Spliced Group I Introns in the Large Subunit rRNA Gene

    PubMed Central

    Pombert, Jean-François; Otis, Christian; Turmel, Monique; Lemieux, Claude

    2013-01-01

    Organelle genes are often interrupted by group I and/or group II introns. Splicing of these mobile genetic elements occurs at the RNA level via serial transesterification steps catalyzed by the introns' own tertiary structures and, sometimes, with the help of external factors. These catalytic ribozymes can be found in cis or trans configuration, and although trans-arrayed group II introns have been known for decades, trans-spliced group I introns have been reported only recently. In the course of sequencing the complete mitochondrial genome of the prasinophyte picoplanktonic green alga Prasinoderma coloniale CCMP 1220 (Prasinococcales, clade VI), we uncovered two additional cases of trans-spliced group I introns. Here, we describe these introns and compare the 54,546 bp-long mitochondrial genome of Prasinoderma with those of four other prasinophytes (clades II, III and V). This comparison underscores the highly variable mitochondrial genome architecture in these ancient chlorophyte lineages. Both Prasinoderma trans-spliced introns reside within the large subunit rRNA gene (rnl) at positions where cis-spliced relatives, often containing homing endonuclease genes, have been found in other organelles. In contrast, all previously reported trans-spliced group I introns occur in different mitochondrial genes (rns or coxI). Each Prasinoderma intron is fragmented into two pieces, forming at the RNA level a secondary structure that resembles those of its cis-spliced counterparts. As observed for other trans-spliced group I introns, the breakpoint of the first intron maps to the variable loop L8, whereas that of the second is uniquely located downstream of P9.1. The breakpoint in each Prasinoderma intron corresponds to the same region where the open reading frame (ORF) occurs when present in cis-spliced orthologs.
This correlation between the intron breakpoint and the ORF location in cis-spliced orthologs also holds for other trans-spliced introns; we discuss the possible implications of this interesting observation for trans-splicing of group I introns. PMID:24386369

  16. High prevalence of mutations affecting the splicing process in a Spanish cohort with autosomal dominant retinitis pigmentosa

    PubMed Central

    Ezquerra-Inchausti, Maitane; Barandika, Olatz; Anasagasti, Ander; Irigoyen, Cristina; López de Munain, Adolfo; Ruiz-Ederra, Javier

    2017-01-01

    Retinitis pigmentosa (RP) is the most frequent group of inherited retinal dystrophies. It is highly heterogeneous, with more than 80 disease-causing genes, 27 of which are known to cause autosomal dominant RP (adRP), having been identified. In this study a total of 29 index cases were ascertained based on a family tree compatible with adRP. A custom panel of 31 adRP genes was analysed by targeted next-generation sequencing using the Ion PGM platform in combination with Sanger sequencing. This allowed us to detect putative disease-causing mutations in 14 out of the 29 (48.28%) families analysed. Remarkably, around 38% of all adRP cases analysed showed mutations affecting the splicing process, mainly due to mutations in genes coding for spliceosome factors (SNRNP200 and PRPF8) but also due to splice-site mutations in RHO. Twelve of the 14 mutations found had been reported previously and two were novel mutations found in PRPF8 in two unrelated patients. In conclusion, our results will lead to more accurate genetic counselling and will contribute to a better characterisation of the disease. In addition, they may have a therapeutic impact in the future given the large number of studies currently underway based on targeted RNA splicing for therapeutic purposes. PMID:28045043

  17. G-quadruplex structure at intron 2 of TFE3 and its role in Xp11.2 translocation and splicing.

    PubMed

    Verma, Shiv Prakash; Das, Parimal

    2018-03-01

    Transcription Factor E3 (TFE3) translocations are found in several different types of cancer, and most of the translocation breakpoints lie in the 5' region of TFE3, which may be considered a breakpoint region (BR). In our in silico study using the QGRS Mapper and non-B DB web servers, we found a potential G-quadruplex-forming sequence (PQS) in intron 2 of the TFE3 gene. In vitro G-quadruplex formation was shown by native PAGE in the presence of pyridostatin (PDS), in which the intermolecular secondary structure reduced mobility and caused slower migration. G-quadruplex formation was mapped at single-base resolution by Sanger sequencing, and circular dichroism showed the formation of a parallel G-quadruplex. FRET analysis revealed increased and decreased G-quadruplex formation in the presence of PDS and an antisense oligonucleotide, respectively. PCR stop assays and transcriptional and translational inhibition by the PQS showed stable G-quadruplex formation affecting these biological processes. A TFE3 minigene splicing study showed the involvement of this G-quadruplex in TFE3 splicing as well. Therefore, the G-quadruplex is evidently the reason behind TFE3-induced oncogenesis executed by translocation and is also involved in mRNA splicing. Copyright © 2017 Elsevier B.V. All rights reserved.

  18. Mutation-adapted U1 snRNA corrects a splicing error of the dopa decarboxylase gene.

    PubMed

    Lee, Ni-Chung; Lee, Yu-May; Chen, Pin-Wen; Byrne, Barry J; Hwu, Wuh-Liang

    2016-12-01

    Aromatic L-amino acid decarboxylase (AADC) deficiency is an inborn error of monoamine neurotransmitter synthesis, which results in dopamine, serotonin, epinephrine and norepinephrine deficiencies. The DDC gene founder mutation IVS6+4A>T is highly prevalent in Chinese patients with AADC deficiency. In this study, we designed several U1 snRNA vectors to adapt U1 snRNA binding sequences of the mutated DDC gene. We found that only the modified U1 snRNA (IVS-AAA) that completely matched both the intronic and exonic U1 binding sequences of the mutated DDC gene could correct splicing errors of either the mutated human DDC minigene or the mouse artificial splicing construct in vitro. We further injected an adeno-associated viral (AAV) vector to express IVS-AAA in the brain of a knock-in mouse model. This treatment was well tolerated and improved both the survival and brain dopamine and serotonin levels of mice with AADC deficiency. Therefore, mutation-adapted U1 snRNA gene therapy can be a promising method to treat genetic diseases caused by splicing errors, but the efficiency of such a treatment still needs improvements. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  19. Integrating alternative splicing detection into gene prediction.

    PubMed

    Foissac, Sylvain; Schiex, Thomas

    2005-02-10

    Alternative splicing (AS) is now considered a major actor in transcriptome/proteome diversity and cannot be neglected in the annotation process of a new genome. Despite considerable progress in terms of accuracy in computational gene prediction, the ability to reliably predict AS variants when there is local experimental evidence of them remains an open challenge for gene finders. We have used a new integrative approach that allows AS detection to be incorporated into ab initio gene prediction. This method relies on the analysis of genomically aligned transcript sequences (ESTs and/or cDNAs), and has been implemented in the dynamic programming algorithm of the graph-based gene finder EuGENE. Given a genomic sequence and a set of aligned transcripts, this new version identifies the set of transcripts carrying evidence of alternative splicing events, and provides, in addition to the classical optimal gene prediction, alternative optimal predictions (among those which are consistent with the AS events detected). This allows for multiple annotations of a single gene in such a way that each predicted variant is supported by transcript evidence (but not necessarily with full-length coverage). This automatic combination of experimental data analysis and ab initio gene finding offers an ideal integration of alternatively spliced gene prediction inside a single annotation pipeline.

  20. Identification of human short introns

    PubMed Central

    Abebrese, Emmanuel L.; Arnold, Zachary R.; Armstrong, Katharine; Burns, Lindsay; Day, R. Thomas; Hsu, Daniel G.; Jarrell, Katherine; Luo, Yi; Mugayo, Daphine

    2017-01-01

    Canonical pre-mRNA splicing requires snRNPs and associated splicing factors to excise conserved intronic sequences, with a minimum intron length required for efficient splicing. Non-canonical splicing (intron excision without the spliceosome) has been documented; most notably, some tRNAs and the XBP1 mRNA contain short introns that are not removed by the spliceosome. There have been some efforts to identify additional short introns, but little is known about how many short introns are processed from mRNAs. Here, we report an approach to identify RNA short introns from RNA-Seq data, discriminating against small genomic deletions. We identify hundreds of short introns conserved among multiple human cell lines. These short introns are often alternatively spliced and are found in a variety of RNAs, both mRNAs and lncRNAs. Short intron splicing efficiency is increased by secondary structure, and we detect both canonical and non-canonical short introns. In many cases, splicing of these short introns from mRNAs is predicted to alter the reading frame and change protein output. Our findings imply that standard gene prediction models, which often assume a lower limit for intron size, fail to predict short introns effectively. We conclude that short introns are abundant in the human transcriptome, and short intron splicing represents an added layer to mRNA regulation. PMID:28520720
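
    One way to picture the distinction between a short intron and a small genomic deletion in aligned RNA-Seq reads is through the CIGAR string of a SAM record, where `N` marks a skipped reference region (typically an intron) and `D` marks a deletion. The sketch below is not the authors' method, which the abstract does not describe in detail. The function names and the `max_len` threshold are hypothetical, and which operation an aligner emits for a short gap depends on the aligner's own settings.

```python
import re

# One CIGAR operation: a length followed by an operation code (SAM specification).
CIGAR_OP = re.compile(r"(\d+)([MIDNSHP=X])")
# Operations that advance the position on the reference sequence.
REF_CONSUMING = set("MDN=X")


def reference_gaps(pos, cigar):
    """List (op, start, length) for each D or N operation.

    pos is the 1-based leftmost reference position of the alignment.
    """
    gaps = []
    ref = pos
    for length, op in CIGAR_OP.findall(cigar):
        length = int(length)
        if op in "DN":
            gaps.append((op, ref, length))
        if op in REF_CONSUMING:
            ref += length
    return gaps


def candidate_short_introns(pos, cigar, max_len=65):
    """Keep only N gaps up to max_len (a hypothetical cutoff); D gaps are ignored."""
    return [(start, length)
            for op, start, length in reference_gaps(pos, cigar)
            if op == "N" and length <= max_len]


print(reference_gaps(100, "20M30N25M2D5M"))           # [('N', 120, 30), ('D', 175, 2)]
print(candidate_short_introns(100, "20M30N25M2D5M"))  # [(120, 30)]
```

    A real pipeline would also need evidence beyond a single read, for example agreement across reads and cell lines or comparison with genomic sequencing, before calling a gap an intron rather than a deletion.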

  1. Transcriptome-wide analysis of alternative RNA splicing events in Epstein-Barr virus-associated gastric carcinomas

    PubMed Central

    Armero, Victoria E. S.; Tremblay, Marie-Pier; Allaire, Andréa; Boudreault, Simon; Martenon-Brodeur, Camille; Duval, Cyntia; Durand, Mathieu; Lapointe, Elvy; Thibault, Philippe; Tremblay-Létourneau, Maude; Perreault, Jean-Pierre; Scott, Michelle S.

    2017-01-01

    Multiple human diseases including cancer have been associated with a dysregulation in RNA splicing patterns. In the current study, modifications to the global RNA splicing landscape of cellular genes were investigated in the context of Epstein-Barr virus-associated gastric cancer. Global alterations to the RNA splicing landscape of cellular genes were examined in a large-scale screen from 295 primary gastric adenocarcinomas using high-throughput RNA sequencing data. RT-PCR analysis, mass spectrometry, and co-immunoprecipitation studies were also used to experimentally validate and investigate the differential alternative splicing (AS) events that were observed through RNA-seq studies. Our study identifies alterations in the AS patterns of approximately 900 genes such as tumor suppressor genes, transcription factors, splicing factors, and kinases. These findings allowed the identification of unique gene signatures for which AS is misregulated in both Epstein-Barr virus-associated gastric cancer and EBV-negative gastric cancer. Moreover, we show that the expression of Epstein-Barr nuclear antigen 1 (EBNA1) leads to modifications in the AS profile of cellular genes and that the EBNA1 protein interacts with cellular splicing factors. These findings provide insights into the molecular differences between various types of gastric cancer and suggest a role for the EBNA1 protein in the dysregulation of cellular AS. PMID:28493890

  2. New discoveries of old SON: a link between RNA splicing and cancer.

    PubMed

    Hickey, Christopher J; Kim, Jung-Hyun; Ahn, Eun-Young Erin

    2014-02-01

    The SON protein is a ubiquitously expressed DNA- and RNA-binding protein primarily localized to nuclear speckles. Although several early studies implicated SON in DNA binding, tumorigenesis and apoptosis, the functional significance of this protein had not been recognized until recent studies discovered SON as a novel RNA splicing co-factor. During constitutive RNA splicing, SON ensures efficient intron removal from transcripts containing suboptimal splice sites. Importantly, SON-mediated splicing is required for proper processing of selective transcripts related to cell cycle, microtubules, centrosome maintenance, and genome stability. Moreover, SON regulates alternative splicing of RNAs from genes involved in apoptosis and epigenetic modification. In addition to its role in RNA splicing, SON is able to suppress transcriptional activation at certain promoter/enhancer DNA sequences. Considering the multiple SON target genes that are directly involved in cell proliferation, genome stability and chromatin modifications, SON is an emerging player in gene regulation during cancer development and progression. Here, we summarize available information from several early studies on SON, and highlight recent discoveries describing molecular mechanisms of SON-mediated gene regulation. We propose that future efforts to better understand the diverse functions of SON will reveal novel targets for cancer therapy. © 2013 Wiley Periodicals, Inc.

  3. Transcriptome-wide analysis of alternative RNA splicing events in Epstein-Barr virus-associated gastric carcinomas.

    PubMed

    Armero, Victoria E S; Tremblay, Marie-Pier; Allaire, Andréa; Boudreault, Simon; Martenon-Brodeur, Camille; Duval, Cyntia; Durand, Mathieu; Lapointe, Elvy; Thibault, Philippe; Tremblay-Létourneau, Maude; Perreault, Jean-Pierre; Scott, Michelle S; Bisaillon, Martin

    2017-01-01

    Multiple human diseases including cancer have been associated with a dysregulation in RNA splicing patterns. In the current study, modifications to the global RNA splicing landscape of cellular genes were investigated in the context of Epstein-Barr virus-associated gastric cancer. Global alterations to the RNA splicing landscape of cellular genes were examined in a large-scale screen from 295 primary gastric adenocarcinomas using high-throughput RNA sequencing data. RT-PCR analysis, mass spectrometry, and co-immunoprecipitation studies were also used to experimentally validate and investigate the differential alternative splicing (AS) events that were observed through RNA-seq studies. Our study identifies alterations in the AS patterns of approximately 900 genes such as tumor suppressor genes, transcription factors, splicing factors, and kinases. These findings allowed the identification of unique gene signatures for which AS is misregulated in both Epstein-Barr virus-associated gastric cancer and EBV-negative gastric cancer. Moreover, we show that the expression of Epstein-Barr nuclear antigen 1 (EBNA1) leads to modifications in the AS profile of cellular genes and that the EBNA1 protein interacts with cellular splicing factors. These findings provide insights into the molecular differences between various types of gastric cancer and suggest a role for the EBNA1 protein in the dysregulation of cellular AS.

  4. The consensus sequence of FAMLF alternative splice variants is overexpressed in undifferentiated hematopoietic cells.

    PubMed

    Chen, W L; Luo, D F; Gao, C; Ding, Y; Wang, S Y

    2015-07-01

    The familial acute myeloid leukemia related factor gene (FAMLF) was previously identified from a familial AML subtractive cDNA library and shown to undergo alternative splicing. This study used real-time quantitative PCR to investigate the expression of the FAMLF alternative-splicing transcript consensus sequence (FAMLF-CS) in peripheral blood mononuclear cells (PBMCs) from 119 patients with de novo acute leukemia (AL) and 104 healthy controls, as well as in CD34+ cells from 12 AL patients and 10 healthy donors. A 429-bp fragment from a novel splicing variant of FAMLF was obtained, and a 363-bp consensus sequence was targeted to quantify total FAMLF expression. Kruskal-Wallis, Nemenyi, Spearman's correlation, and Mann-Whitney U-tests were used to analyze the data. FAMLF-CS expression in PBMCs from AL patients and CD34+ cells from AL patients and controls was significantly higher than in control PBMCs (P < 0.0001). Moreover, FAMLF-CS expression in PBMCs from the AML group was positively correlated with red blood cell count (rs = 0.317, P = 0.006), hemoglobin levels (rs = 0.210, P = 0.049), and percentage of peripheral blood blasts (rs = 0.256, P = 0.027), but inversely correlated with hemoglobin levels in the control group (rs = -0.391, P < 0.0001). AML patients with high CD34+ expression showed significantly higher FAMLF-CS expression than those with low CD34+ expression (P = 0.041). Our results showed that FAMLF is highly expressed in both normal and malignant immature hematopoietic cells, but that expression is lower in normal mature PBMCs.

  5. Two Novel Variants Affecting CDKL5 Transcript Associated with Epileptic Encephalopathy.

    PubMed

    Neupauerová, Jana; Štěrbová, Katalin; Vlčková, Markéta; Sebroňová, Věra; Maříková, Tat'ána; Krůtová, Marcela; David, Staněk; Kršek, Pavel; Žaliová, Markéta; Seeman, Pavel; Laššuthová, Petra

    2017-10-01

    Variants in the human X-linked cyclin-dependent kinase-like 5 (CDKL5) gene have been reported as being etiologically associated with early infantile epileptic encephalopathy type 2 (EIEE2). We report on two patients, a boy and a girl, with EIEE2 who presented with early onset epilepsy, hypotonia, severe intellectual disability, and poor eye contact. Massively parallel sequencing (MPS) of a custom-designed gene panel for epilepsy and epileptic encephalopathy containing 112 epilepsy-related genes was performed. Sanger sequencing was used to confirm the novel variants. For confirmation of the functional consequence of an intronic CDKL5 variant in patient 2, an RNA study was done. DNA sequencing revealed de novo variants in CDKL5: c.2578C>T (p.Gln860*), present in a hemizygous state in a 3-year-old boy, and a potential splice site variant, c.463+5G>A, in a heterozygous state in a 5-year-old girl. Multiple in silico splicing algorithms predicted a highly reduced splice site score for c.463+5G>A. A subsequent mRNA study confirmed an aberrant shorter transcript lacking exon 7. Our data confirmed that variants in CDKL5 are associated with EIEE2. There is credible evidence that the novel identified variants are pathogenic and, therefore, are likely the cause of the disease in the presented patients. In one of the patients a stop codon variant is predicted to produce a truncated protein, and in the other patient an intronic variant results in aberrant splicing.
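
    The intronic variant in this record is written in HGVS coding-DNA notation, where `c.463+5` means the fifth intronic nucleotide after the last exonic nucleotide, c.463. The sketch below is a deliberately narrow parser for single-nucleotide intronic substitutions of this form. The regular expression and the function name are illustrative assumptions and cover only a small subset of the HGVS nomenclature.

```python
import re

# Matches only simple intronic substitutions such as "c.463+5G>A" or "c.100-2A>G".
INTRONIC_SUBSTITUTION = re.compile(r"^c\.(\d+)([+-])(\d+)([ACGT])>([ACGT])$")


def parse_intronic_substitution(hgvs):
    """Parse a simple intronic substitution (hypothetical helper).

    Returns the anchoring exonic position, the signed intronic offset,
    and the reference and alternate bases.
    """
    match = INTRONIC_SUBSTITUTION.match(hgvs)
    if match is None:
        raise ValueError(f"not a simple intronic substitution: {hgvs!r}")
    anchor, sign, offset, ref, alt = match.groups()
    signed_offset = int(offset) if sign == "+" else -int(offset)
    return {"anchor": int(anchor), "offset": signed_offset, "ref": ref, "alt": alt}


variant = parse_intronic_substitution("c.463+5G>A")
print(variant)  # {'anchor': 463, 'offset': 5, 'ref': 'G', 'alt': 'A'}
```

    A positive offset places the variant in the intron downstream of the anchoring exon, here at the +5 position near the start of that intron, which fits the abstract's description of a reduced splice site score. The exonic variant c.2578C>T has no offset and is intentionally rejected by this narrow pattern.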

  6. Congenital analbuminemia caused by a novel aberrant splicing in the albumin gene

    PubMed Central

    Caridi, Gianluca; Dagnino, Monica; Erdeve, Omer; Di Duca, Marco; Yildiz, Duran; Alan, Serdar; Atasay, Begum; Arsan, Saadet; Campagnoli, Monica; Galliano, Monica; Minchiotti, Lorenzo

    2014-01-01

    Introduction: Congenital analbuminemia is a rare autosomal recessive disorder manifested by the presence of a very low amount of circulating serum albumin. It is an allelically heterogeneous defect, caused by a variety of mutations within the albumin gene in the homozygous or compound heterozygous state. Herein we report the clinical and molecular characterization of a new case of congenital analbuminemia diagnosed in a female newborn of consanguineous (first-degree cousins) parents from Ankara, Turkey, who presented with a low albumin concentration (< 8 g/L) and severe clinical symptoms. Materials and methods: The albumin gene of the index case was screened by single-strand conformation polymorphism, heteroduplex analysis, and direct DNA sequencing. The effect of the splicing mutation was evaluated by examining the cDNA obtained by reverse transcriptase-polymerase chain reaction (RT-PCR) from the albumin mRNA extracted from the proband's leukocytes. Results: DNA sequencing revealed that the proband is homozygous, and both parents are heterozygous, for a novel G>A transition at position c.1652+1, the first base of intron 12, which inactivates the strongly conserved GT dinucleotide at the 5′ splice site consensus sequence of this intron. The splicing defect results in the complete skipping of the preceding exon (exon 12) and in a frame-shift within exon 13 with a premature stop codon after the translation of three mutant amino acid residues. Conclusions: Our results confirm the clinical diagnosis of congenital analbuminemia in the proband and the inheritance of the trait, and help shed light on the molecular genetics of analbuminemia. PMID:24627724

  7. Murine homeobox-containing gene, Msx-1: analysis of genomic organization, promoter structure, and potential autoregulatory cis-acting elements.

    PubMed

    Kuzuoka, M; Takahashi, T; Guron, C; Raghow, R

    1994-05-01

    Detailed molecular organization of the coding and upstream regulatory regions of the murine homeodomain-containing gene, Msx-1, is reported. The protein-encoding portion of the gene is contained in two exons, 590 and 1214 bp in length, separated by a 2107-bp intron; the homeodomain is located in the second exon. The two-exon organization of the murine Msx-1 gene resembles a number of other homeodomain-containing genes. The 5'-(GTAAGT) and 3'-(CCCTAG) splicing junctions and the mRNA polyadenylation signal (UAUAA) of the murine Msx-1 gene are also characteristic of other vertebrate genes. By nuclease protection and primer extension assays, the start of transcription of the Msx-1 gene was located 256 bp upstream of the first AUG. Computer analysis of the promoter proximal 1280-bp sequence revealed a number of potentially important cis-regulatory sequences; these include the recognition elements for Ap-1, Ap-2, Ap-3, Sp-1, a possible binding site for RAR:RXR, and a number of TCF-1 consensus motifs. Importantly, a perfect reverse complement of (C/G)TTAATTG, which was recently shown to be an optimal binding sequence for the homeodomain of Msx-1 protein (K.M. Catron, N. Iler, and C. Abate (1993) Mol. Cell. Biol. 13:2354-2365), was also located in the murine Msx-1 promoter. Binding of bacterially expressed Msx-1 homeodomain polypeptide to Msx-1-specific oligonucleotide was experimentally demonstrated, raising a distinct possibility of autoregulation of this developmentally regulated gene.
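
    The "perfect reverse complement of (C/G)TTAATTG" mentioned in this record can be written out with a short sketch. It assumes the standard IUPAC code S for the ambiguous C/G position; since the complement of C-or-G is again G-or-C, S maps to itself. The function name and the lookup table are illustrative.

```python
# Complement table for the four bases plus the IUPAC code S (C or G).
COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C", "S": "S"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (A, C, G, T, S only)."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


# (C/G)TTAATTG written with S for the ambiguous first position.
binding_site = "STTAATTG"
print(reverse_complement(binding_site))  # CAATTAAS, i.e. CAATTAA(C/G)
```

    Searching a promoter sequence for CAATTAAC or CAATTAAG on the given strand is therefore equivalent to searching for the binding sequence on the opposite strand.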

  8. Three-junction solar cell

    DOEpatents

    Ludowise, Michael J.

    1986-01-01

    A photovoltaic solar cell is formed in a monolithic semiconductor. The cell contains three junctions. In sequence from the light-entering face, the junctions have a high, a medium, and a low energy gap. The lower junctions are connected in series by one or more metallic members connecting the top of the lower junction through apertures to the bottom of the middle junction. The upper junction is connected in voltage opposition to the lower and middle junctions by second metallic electrodes deposited in holes through the upper junction. The second electrodes are connected to an external terminal.

  9. SpliceCenter: A suite of web-based bioinformatic applications for evaluating the impact of alternative splicing on RT-PCR, RNAi, microarray, and peptide-based studies

    PubMed Central

    Ryan, Michael C; Zeeberg, Barry R; Caplen, Natasha J; Cleland, James A; Kahn, Ari B; Liu, Hongfang; Weinstein, John N

    2008-01-01

    Background Over 60% of protein-coding genes in vertebrates express mRNAs that undergo alternative splicing. The resulting collection of transcript isoforms poses significant challenges for contemporary biological assays. For example, RT-PCR validation of gene expression microarray results may be unsuccessful if the two technologies target different splice variants. Effective use of sequence-based technologies requires knowledge of the specific splice variant(s) that are targeted. In addition, the critical roles of alternative splice forms in biological function and in disease suggest that assay results may be more informative if analyzed in the context of the targeted splice variant. Results A number of contemporary technologies are used for analyzing transcripts or proteins. To enable investigation of the impact of splice variation on the interpretation of data derived from those technologies, we have developed SpliceCenter. SpliceCenter is a suite of user-friendly, web-based applications that includes programs for analysis of RT-PCR primer/probe sets, effectors of RNAi, microarrays, and protein-targeting technologies. Both interactive and high-throughput implementations of the tools are provided. The interactive versions of SpliceCenter tools provide visualizations of a gene's alternative transcripts and probe target positions, enabling the user to identify which splice variants are or are not targeted. The high-throughput batch versions accept user query files and provide results in tabular form. When, for example, we used SpliceCenter's batch siRNA-Check to process the Cancer Genome Anatomy Project's large-scale shRNA library, we found that only 59% of the 50,766 shRNAs in the library target all known splice variants of the target gene, 32% target some but not all, and 9% do not target any currently annotated transcript. 
    Conclusion SpliceCenter provides unique, user-friendly applications for assessing the impact of transcript variation on the design and interpretation of RT-PCR, RNAi, gene expression microarrays, antibody-based detection, and mass spectrometry proteomics. The tools are intended for use by bench biologists as well as bioinformaticists. PMID:18638396

  10. The human serotonin 5-HT2C receptor: Complete cDNA, genomic structure, and alternatively spliced variant

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Xie, Enzhong; Zhu, Lingyu; Zhao, Lingyun

    1996-08-01

    The complete 4775-nt cDNA encoding the human serotonin 5-HT2C receptor (5-HT2CR), a G-protein-coupled receptor, has been isolated. It contains a 1377-nt coding region flanked by a 728-nt 5′-untranslated region and a 2670-nt 3′-untranslated region. By using the cloned 5-HT2CR cDNA probe, the complete human gene for this receptor has been isolated and shown to contain six exons and five introns spanning at least 230 kb of DNA. The coding region of the human 5-HT2CR gene is interrupted by three introns, and the positions of the intron/exon junctions are conserved between the human and the rodent genes. In addition, an alternatively spliced 5-HT2CR RNA that contains a 95-nt deletion in the region coding for the second intracellular loop and the fourth transmembrane domain of the receptor has been identified. This deletion leads to a frameshift and premature termination so that the short isoform RNA encodes a putative protein of 248 amino acids. The ratio of the short isoform to the full-length 5-HT2CR RNA was found to be higher in choroid plexus tumor than in normal brain tissue, suggesting the possibility of differential regulation of the 5-HT2CR gene in different neural tissues or during tumorigenesis. Transcription of the human 5-HT2CR gene was found to be initiated at multiple sites. No classical TATA-box sequence was found at the appropriate location, and the 5′-flanking sequence contains many potential transcription factor-binding sites. A 7.3-kb 5′-flanking 5-HT2CR DNA directed the efficient expression of a luciferase reporter gene in SK-N-SH and IMR32 neuroblastoma cells, indicating that it contains a functional promoter. 69 refs., 8 figs., 1 tab.
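
    The lengths quoted in this abstract can be cross-checked with simple arithmetic. The sketch below is illustrative only: the constant and function names are hypothetical, and the codon count assumes that the 1377-nt coding region includes the stop codon.

```python
# Lengths reported in the abstract, in nucleotides.
UTR5_NT = 728
CDS_NT = 1377
UTR3_NT = 2670
DELETION_NT = 95


def causes_frameshift(indel_nt: int) -> bool:
    """An insertion or deletion shifts the reading frame unless its length is a multiple of 3."""
    return indel_nt % 3 != 0


cdna_nt = UTR5_NT + CDS_NT + UTR3_NT
codons = CDS_NT // 3  # includes the stop codon (assumption)

print(cdna_nt)                         # 4775, matching the reported cDNA length
print(codons)                          # 459
print(causes_frameshift(DELETION_NT))  # True: 95 is not divisible by 3
```

    The 95-nt deletion leaves a remainder of 2 when divided by 3, which is consistent with the frameshift and premature termination described for the short isoform.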

  11. Unusual splice site mutations disrupt FANCA exon 8 definition.

    PubMed

    Mattioli, Chiara; Pianigiani, Giulia; De Rocco, Daniela; Bianco, Anna Monica Rosaria; Cappelli, Enrico; Savoia, Anna; Pagani, Franco

    2014-07-01

    The pathological role of mutations that affect non-conserved splicing regulatory sequences can be difficult to determine. In a patient with Fanconi anemia, we identified two unpredictable splicing mutations that act on either side of FANCA exon 8. In patient-derived cells and in a minigene splicing assay, we showed that both an apparently benign intronic c.710-5T>C transition and the nonsense c.790C>T substitution induce almost complete exon 8 skipping. Site-directed mutagenesis experiments indicated that the c.710-5T>C transition affects a polypyrimidine tract where most of the thymidines cannot be compensated by cytidines. The c.790C>T mutation, located at position -3 relative to the donor site, induces exon 8 skipping in an NMD-independent manner, and complementation experiments with modified U1 snRNAs showed that U1 snRNP is only partially involved in the splicing defect. Our results highlight the importance of performing functional splicing assays for correct identification of the disease-causing mechanism of genomic variants and provide mechanistic insights into how these two FANCA mutations affect exon 8 definition. Copyright © 2014 Elsevier B.V. All rights reserved.

  12. Short linear motif acquisition, exon formation and alternative splicing determine a pathway to diversity for NCoR-family co-repressors

    PubMed Central

    Short, Stephen; Peterkin, Tessa; Guille, Matthew; Patient, Roger; Sharpe, Colin

    2015-01-01

    Vertebrate NCoR-family co-repressors play central roles in the timing of embryo and stem cell differentiation by repressing the activity of a range of transcription factors. They interact with nuclear receptors using short linear motifs (SLiMs) termed co-repressor for nuclear receptor (CoRNR) boxes. Here, we identify the pathway leading to increasing co-repressor diversity across the deuterostomes. The final complement of CoRNR boxes arose in an ancestral cephalochordate, and was encoded in one large exon; the urochordates and vertebrates then split this region between 10 and 12 exons. In Xenopus, alternative splicing is prevalent in NCoR2, but absent in NCoR1. We show for one NCoR1 exon that alternative splicing can be recovered by a single point mutation, suggesting NCoR1 lost the capacity for alternative splicing. Analyses in Xenopus and zebrafish identify that cellular context, rather than gene sequence, predominantly determines species differences in alternative splicing. We identify a pathway to diversity for the NCoR family beginning with the addition of a SLiM, followed by gene duplication, the generation of alternatively spliced isoforms and their differential deployment. PMID:26289800

  13. Identification and functional analysis of two alternatively spliced transcripts of ABSCISIC ACID INSENSITIVE3 (ABI3) in linseed flax (Linum usitatissimum L.).

    PubMed

    Wang, Yanyan; Zhang, Tianbao; Song, Xiaxia; Zhang, Jianping; Dang, Zhanhai; Pei, Xinwu; Long, Yan

    2018-01-01

    Alternative splicing is a common phenomenon in different types of plants. It can produce alternatively spliced transcripts that encode proteins with altered functions. Previous studies have shown that one transcription factor, ABSCISIC ACID INSENSITIVE3 (ABI3), which encodes an important component in abscisic acid (ABA) signaling, is subjected to alternative splicing in both mono- and dicotyledons. In the current study, we identified two homologs of ABI3 in the genome of linseed flax. We screened two alternatively spliced flax LuABI3 transcripts, LuABI3-2 and LuABI3-3, and one normal flax LuABI3 transcript, LuABI3-1. Sequence analysis revealed that one of the alternatively spliced transcripts, LuABI3-3, retained a 6 bp intron. RNA accumulation analysis showed that all three transcripts were expressed during seed development, while subcellular localization and transgene experiments showed that LuABI3-3 had no biological function. The two normal transcripts, LuABI3-1 and LuABI3-2, are the important functional isoforms in flax and play significant roles in the ABA regulatory pathway during seed development, germination, and maturation.

  14. Complex alternative splicing of acetylcholinesterase transcripts in Torpedo electric organ; primary structure of the precursor of the glycolipid-anchored dimeric form.

    PubMed Central

    Sikorav, J L; Duval, N; Anselmet, A; Bon, S; Krejci, E; Legay, C; Osterlund, M; Reimund, B; Massoulié, J

    1988-01-01

    In this paper, we show the existence of alternative splicing in the 3' region of the coding sequence of Torpedo acetylcholinesterase (AChE). We describe two cDNA structures which both diverge from the previously described coding sequence of the catalytic subunit of asymmetric (A) forms (Schumacher et al., 1986; Sikorav et al., 1987). They both contain a coding sequence followed by a non-coding sequence and a poly(A) stretch. Both of these structures were shown to exist in poly(A)+ RNAs, by S1 mapping experiments. The divergent region encoded by the first sequence corresponds to the precursor of the globular dimeric form (G2a), since it contains the expected C-terminal amino acids, Ala-Cys. These amino acids are followed by a 29 amino acid extension which contains a hydrophobic segment and must be replaced by a glycolipid in the mature protein. Analyses of intact G2a AChE showed that the common domain of the protein contains intersubunit disulphide bonds. The divergent region of the second type of cDNA consists of an adjacent genomic sequence, which is removed as an intron in A and Ga mRNAs, but may encode a distinct, less abundant catalytic subunit. The structures of the cDNA clones indicate that they are derived from minor mRNAs, shorter than the three major transcripts which have been described previously (14.5, 10.5 and 5.5 kb). Oligonucleotide probes specific for the asymmetric and globular terminal regions hybridize with the three major transcripts, indicating that their size is determined by 3'-untranslated regions which are not related to the differential splicing leading to A and Ga forms. PMID:3181125

  15. Germ line insertion of mtDNA at the breakpoint junction of a reciprocal constitutional translocation.

    PubMed

    Willett-Brozick, J E; Savul, S A; Richey, L E; Baysal, B E

    2001-08-01

    Constitutional chromosomal translocations are relatively common causes of human morbidity, yet the DNA double-strand break (DSB) repair mechanisms that generate them are incompletely understood. We cloned, sequenced and analyzed the breakpoint junctions of a familial constitutional reciprocal translocation t(9;11)(p24;q23). Within the 10-kb region flanking the breakpoints, chromosome 11 had 25% repeat elements, whereas chromosome 9 had 98% repeats, 95% of which were L1-type LINE elements. The breakpoints occurred within an L1-type repeat element at 9p24 and at the 3'-end of an Alu sequence at 11q23. At the breakpoint junction of derivative chromosome 9, we discovered an unusually large 41-bp insertion, which showed 100% identity to 12S mitochondrial DNA (mtDNA) between nucleotides 896 and 936 of the mtDNA sequence. Analysis of the human genome failed to show the preexistence of the inserted sequence at normal chromosomes 9 and 11 breakpoint junctions or elsewhere in the genome, strongly suggesting that the insertion was derived from human mtDNA and captured into the junction during the DSB repair process. To our knowledge, these findings represent the first observation of spontaneous germ line insertion of modern human mtDNA sequences and suggest that DSB repair may play a role in inter-organellar gene transfer in vivo. Our findings also provide evidence for a previously unrecognized insertional mechanism in human, by which non-mobile extra-chromosomal fragments can be inserted into the genome at DSB repair junctions.

  16. Regulation of insulin preRNA splicing by glucose

    PubMed Central

    Wang, Juehu; Shen, Luping; Najafi, Habiba; Kolberg, Janice; Matschinsky, Franz M.; Urdea, Mickey; German, Michael

    1997-01-01

    Glucose tightly regulates the synthesis and secretion of insulin by β cells in the pancreatic islets of Langerhans. To investigate whether glucose regulates insulin synthesis at the level of insulin RNA splicing, we developed a method to detect and quantify a small amount of RNA by using the branched DNA (bDNA) signal-amplification technique. This assay is both sensitive and highly specific: mouse insulin II mRNA can be detected from a single β cell (βTC3 cells or mouse islets), whereas 1 million non-insulin-producing α cells (αTC1.6 cells) give no signal. By using intron and exon sequences, oligonucleotide probes were designed to distinguish the various unspliced and partially spliced insulin preRNAs from mature insulin mRNA. Insulin RNA splicing rates were estimated from the rate of disappearance of insulin preRNA signal from β cells treated with actinomycin D to block transcription. We found that the two introns in mouse insulin II are not spliced with the same efficiency. Intron 2 is spliced out more efficiently than intron 1. As a result, some mRNA retaining intron 1 enters the cytoplasm, making up ≈2-10% of insulin mRNA in the cell. This partially spliced cytoplasmic mRNA is quite stable, with a half-life similar to the completely spliced form. When islets grown in high glucose are shifted to low glucose medium, the level of insulin preRNA and the rate of splicing fall significantly. We conclude that glucose stimulates insulin gene transcription and insulin preRNA splicing. Previous estimates of insulin transcription rates based on insulin preRNA levels that did not consider the rate of splicing may have underestimated the effect of glucose on insulin gene transcription. PMID:9113994
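
    The abstract estimates splicing rates from the disappearance of preRNA signal after transcription is blocked. One common way to turn such a time course into a rate is a log-linear fit, sketched below. This is an illustration rather than the authors' analysis: it assumes first-order (exponential) decay, the function names are hypothetical, and the time course is synthetic, not data from the study.

```python
import math


def decay_rate(times, signals):
    """Least-squares slope of ln(signal) against time, returned as a positive rate constant."""
    logs = [math.log(s) for s in signals]
    n = len(times)
    mean_t = sum(times) / n
    mean_y = sum(logs) / n
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(times, logs))
    sxx = sum((t - mean_t) ** 2 for t in times)
    return -sxy / sxx


def half_life(rate):
    """Half-life of a first-order process with the given rate constant."""
    return math.log(2) / rate


# Synthetic time course (minutes after actinomycin D), generated with k = 0.05 per minute.
times_min = [0, 10, 20, 30]
signal = [100 * math.exp(-0.05 * t) for t in times_min]

k = decay_rate(times_min, signal)
print(round(k, 3))             # 0.05
print(round(half_life(k), 1))  # 13.9
```

    With real measurements the fit would not be exact, and a preRNA species that is both spliced and degraded would decay at the sum of the two rates.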

  17. RNA-Seq of Arabidopsis Pollen Uncovers Novel Transcription and Alternative Splicing

    PubMed Central

    Loraine, Ann E.; McCormick, Sheila; Estrada, April; Patel, Ketan; Qin, Peng

    2013-01-01

    Pollen grains of Arabidopsis (Arabidopsis thaliana) contain two haploid sperm cells enclosed in a haploid vegetative cell. Upon germination, the vegetative cell extrudes a pollen tube that carries the sperm to an ovule for fertilization. Knowing the identity, relative abundance, and splicing patterns of pollen transcripts will improve our understanding of pollen and allow investigation of tissue-specific splicing in plants. Most Arabidopsis pollen transcriptome studies have used the ATH1 microarray, which does not assay splice variants and lacks specific probe sets for many genes. To investigate the pollen transcriptome, we performed high-throughput sequencing (RNA-Seq) of Arabidopsis pollen and seedlings for comparison. Gene expression was more diverse in seedling, and genes involved in cell wall biogenesis were highly expressed in pollen. RNA-Seq detected at least 4,172 protein-coding genes expressed in pollen, including 289 assayed only by nonspecific probe sets. Additional exons and previously unannotated 5′ and 3′ untranslated regions for pollen-expressed genes were revealed. We detected regions in the genome not previously annotated as expressed; 14 were tested and 12 were confirmed by polymerase chain reaction. Gapped read alignments revealed 1,908 high-confidence new splicing events supported by 10 or more spliced read alignments. Alternative splicing patterns in pollen and seedling were highly correlated. For most alternatively spliced genes, the ratio of variants in pollen and seedling was similar, except for some encoding proteins involved in RNA splicing. This study highlights the robustness of splicing patterns in plants and the importance of ongoing annotation and visualization of RNA-Seq data using interactive tools such as Integrated Genome Browser. PMID:23590974
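
    The "10 or more spliced read alignments" criterion for new splicing events can be expressed as a simple support filter. The sketch below is hypothetical and is not the pipeline used in the study: junctions are represented as (chromosome, intron start, intron end) tuples, and the function name and example coordinates are invented for illustration.

```python
from collections import Counter

MIN_SUPPORT = 10  # threshold stated in the abstract


def high_confidence_novel_junctions(spliced_alignments, annotated, min_support=MIN_SUPPORT):
    """Count gapped alignments per junction and keep unannotated junctions with enough support."""
    counts = Counter(spliced_alignments)
    return {
        junction: n
        for junction, n in counts.items()
        if n >= min_support and junction not in annotated
    }


# Invented example: one junction per spliced read alignment.
reads = [("Chr1", 1000, 1200)] * 12 + [("Chr1", 5000, 5300)] * 4 + [("Chr2", 700, 950)] * 15
known = {("Chr2", 700, 950)}

print(high_confidence_novel_junctions(reads, known))
# {('Chr1', 1000, 1200): 12}
```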

  18. Therapeutic strategies based on modified U1 snRNAs and chaperones for Sanfilippo C splicing mutations.

    PubMed

    Matos, Liliana; Canals, Isaac; Dridi, Larbi; Choi, Yoo; Prata, Maria João; Jordan, Peter; Desviat, Lourdes R; Pérez, Belén; Pshezhetsky, Alexey V; Grinberg, Daniel; Alves, Sandra; Vilageliu, Lluïsa

    2014-12-10

    Mutations affecting RNA splicing represent more than 20% of the mutant alleles in Sanfilippo syndrome type C, a rare lysosomal storage disorder that causes severe neurodegeneration. Many of these mutations are localized in the conserved donor or acceptor splice sites, while few are found in the nearby nucleotides. In this study we tested several therapeutic approaches specifically designed for different splicing mutations depending on how the mutations affect mRNA processing. For three mutations that affect the donor site (c.234+1G>A, c.633+1G>A and c.1542+4dupA), different modified U1 snRNAs recognizing the mutated donor sites have been developed in an attempt to rescue the normal splicing process. For another mutation that affects an acceptor splice site (c.372-2A>G) and gives rise to a protein lacking four amino acids, a competitive inhibitor of the HGSNAT protein, glucosamine, was tested as a pharmacological chaperone to correct the aberrant folding and to restore the normal trafficking of the protein to the lysosome. Partial correction of the c.234+1G>A mutation was achieved with a modified U1 snRNA that completely matches the splice donor site, suggesting that these molecules may have therapeutic potential for some splicing mutations. Furthermore, the importance of the splice site sequence context is highlighted as a key factor in the success of this type of therapy. Additionally, glucosamine treatment resulted in an increase in the enzymatic activity, indicating a partial recovery of the correct folding. We have assayed two therapeutic strategies for different splicing mutations with promising results for future applications.

  19. A G-to-A mutation in IVS-3 of the human gamma fibrinogen gene causing afibrinogenemia due to abnormal RNA splicing.

    PubMed

    Margaglione, M; Santacroce, R; Colaizzo, D; Seripa, D; Vecchione, G; Lupone, M R; De Lucia, D; Fortina, P; Grandone, E; Perricone, C; Di Minno, G

    2000-10-01

    Congenital afibrinogenemia is a rare autosomal recessive disorder characterized by a hemorrhagic diathesis of variable severity. Although more than 100 families with this disorder have been described, genetic defects have been characterized in few cases. An investigation of a young propositus, offspring of a consanguineous marriage, with undetectable levels of functional and quantitative fibrinogen, was conducted. Sequence analysis of the fibrinogen genes showed a homozygous G-to-A mutation at the fifth nucleotide (nt 2395) of the third intervening sequence (IVS) of the gamma-chain gene. Her first-degree relatives, who had approximately half the normal fibrinogen values and showed concordance between functional and immunologic levels, were heterozygotes. The G-to-A change predicts the disappearance of a donor splice site. After transfection with a construct containing either the wild-type or the mutated sequence, cells with the mutant construct showed an aberrant messenger RNA (mRNA), consistent with skipping of exon 3, but not the expected mRNA. Sequencing of the abnormal mRNA showed the complete absence of exon 3. Skipping of exon 3 predicts the deletion of amino acid sequence from residue 16 to residue 75 and shifting of reading frame at amino acid 76 with a premature stop codon within exon 4 at position 77. Thus, the truncated gamma-chain gene product would not interact with other chains to form the mature fibrinogen molecule. The current findings show that mutations within highly conserved IVS regions of fibrinogen genes could affect the efficiency of normal splicing, giving rise to congenital afibrinogenemia.

  20. Sequencing of mRNA identifies re-expression of fetal splice variants in cardiac hypertrophy

    PubMed Central

    Ames, EG; Lawson, MJ; Mackey, AJ; Holmes, JW

    2013-01-01

    Cardiac hypertrophy has been well-characterized at the level of transcription. During cardiac hypertrophy, genes normally expressed primarily during fetal heart development are reexpressed, and this fetal gene program is believed to be a critical component of the hypertrophic process. Recently, alternative splicing of mRNA transcripts has been shown to be temporally regulated during heart development, leading us to consider whether fetal patterns of splicing also reappear during hypertrophy. We hypothesized that patterns of alternative splicing occurring during heart development are recapitulated during cardiac hypertrophy. Here we present a study of isoform expression during pressure-overload cardiac hypertrophy induced by 10 days of transverse aortic constriction (TAC) in rats and in developing fetal rat hearts compared to sham-operated adult rat hearts, using high-throughput sequencing of poly(A) tail mRNA. We find a striking degree of overlap between the isoforms expressed differentially in fetal and pressure-overloaded hearts compared to control: forty-four percent of the isoforms with significantly altered expression in TAC hearts are also expressed at significantly different levels in fetal hearts compared to control (P < 0.001). The isoforms that are shared between hypertrophy and fetal heart development are significantly enriched for genes involved in cytoskeletal organization, RNA processing, developmental processes, and metabolic enzymes. Our data strongly support the concept that mRNA splicing patterns normally associated with heart development recur as part of the hypertrophic response to pressure overload. These findings suggest that cardiac hypertrophy shares post-transcriptional as well as transcriptional regulatory mechanisms with fetal heart development. PMID:23688780

  1. Spliceman2: a computational web server that predicts defects in pre-mRNA splicing.

    PubMed

    Cygan, Kamil Jan; Sanford, Clayton Hendrick; Fairbrother, William Guy

    2017-09-15

    Most pre-mRNA transcripts in eukaryotic cells must undergo splicing to remove introns and join exons, and splicing elements present a large mutational target for disease-causing mutations. Splicing elements are strongly position dependent with respect to the transcript annotations. In 2012, we presented Spliceman, an online tool that used positional dependence to predict how likely distant mutations around annotated splice sites were to disrupt splicing. Here, we present an improved version of the previous tool that will be more useful for predicting the likelihood of splicing mutations. We have added industry-standard input options (i.e. Spliceman now accepts variant call format files), which allow much larger inputs than previously available. The tool can also visualize the locations, within exons and introns, of sequence variants to be analyzed and the predicted effects on splicing of the pre-mRNA transcript. In addition, Spliceman2 integrates with RNAcompete motif libraries to provide a prediction of which trans-acting factor binding sites are disrupted or created, and links out to the UCSC genome browser. In summary, the new features in Spliceman2 will allow scientists and physicians to better understand the effects of single nucleotide variations on splicing. Freely available on the web at http://fairbrother.biomed.brown.edu/spliceman2. Website implemented in PHP framework-Laravel 5, PostgreSQL, Apache, and Perl, with all major browsers supported. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved.

  2. Changes in exon–intron structure during vertebrate evolution affect the splicing pattern of exons

    PubMed Central

    Gelfman, Sahar; Burstein, David; Penn, Osnat; Savchenko, Anna; Amit, Maayan; Schwartz, Schraga; Pupko, Tal; Ast, Gil

    2012-01-01

    Exon–intron architecture is one of the major features directing the splicing machinery to the short exons that are located within long flanking introns. However, the evolutionary dynamics of exon–intron architecture and its impact on splicing are largely unknown. Using a comparative genomic approach, we analyzed 17 vertebrate genomes and reconstructed the ancestral motifs of both 3′ and 5′ splice sites, as well as the ancestral length of exons and introns. Our analyses suggest that vertebrate introns increased in length from the shortest ancestral introns to the longest primate introns. An evolutionary analysis of splice sites revealed that weak splice sites act as a restrictive force keeping introns short. In contrast, strong splice sites allow recognition of exons flanked by long introns. Reconstruction of the ancestral state suggests these phenomena were not prevalent in the vertebrate ancestor, but appeared during vertebrate evolution. By calculating evolutionary rate shifts in exons, we identified cis-acting regulatory sequences that became fixed during the transition from early vertebrates to mammals. Experimental validations performed on a selection of these hexamers confirmed their regulatory function. We additionally revealed many features of exons that can discriminate alternative from constitutive exons. These features were integrated into a machine-learning approach to predict whether an exon is alternative. Our algorithm obtains very high predictive power (AUC of 0.91), and using these predictions we have identified and successfully validated novel alternatively spliced exons. Overall, we provide novel insights regarding the evolutionary constraints acting upon exons and their recognition by the splicing machinery. PMID:21974994

  3. Unexpected substrate specificity of T4 DNA ligase revealed by in vitro selection

    NASA Technical Reports Server (NTRS)

    Harada, Kazuo; Orgel, Leslie E.

    1993-01-01

    We have used in vitro selection techniques to characterize DNA sequences that are ligated efficiently by T4 DNA ligase. We find that the ensemble of selected sequences ligates about 50 times as efficiently as the random mixture of sequences used as the input for selection. Surprisingly, many of the selected sequences failed to produce a match at or close to the ligation junction. None of the 20 selected oligomers that we sequenced produced a match two bases upstream from the ligation junction.

  4. Functional Analyses of a Novel Splice Variant in the CHD7 Gene, Found by Next Generation Sequencing, Confirm Its Pathogenicity in a Spanish Patient and Diagnose Him with CHARGE Syndrome.

    PubMed

    Villate, Olatz; Ibarluzea, Nekane; Fraile-Bethencourt, Eugenia; Valenzuela, Alberto; Velasco, Eladio A; Grozeva, Detelina; Raymond, F L; Botella, María P; Tejada, María-Isabel

    2018-01-01

    Mutations in CHD7 have been shown to be a major cause of CHARGE syndrome, which presents many symptoms and features common to other syndromes, making its diagnosis difficult. Next generation sequencing (NGS) of a panel of intellectual disability related genes was performed in an adult patient without molecular diagnosis. A splice donor variant in CHD7 (c.5665+1G>T) was identified. To study its potential pathogenicity, exons and flanking intronic sequences were amplified from patient DNA and cloned into the pSAD® splicing vector. HeLa cells were transfected with this construct and with a wild-type minigene, and functional analyses were performed. The construct with the c.5665+1G>T variant produced an aberrant transcript with an insert of 63 nucleotides of intron 28 creating a premature termination codon (TAG) 25 nucleotides downstream. This would lead to the insertion of 8 new amino acids and therefore a truncated 1896 amino acid protein. As a result of this, the patient was diagnosed with CHARGE syndrome. These functional analyses underline their usefulness for studying the pathogenicity of variants found by NGS, and therefore their application in accurately diagnosing patients.

  5. A splice junction-targeted CRISPR approach (spJCRISPR) reveals human FOXO3B to be a protein-coding gene.

    PubMed

    Santo, Evan E; Paik, Jihye

    2018-06-17

    The rapid development of CRISPR technology is revolutionizing molecular approaches to the dissection of complex biological phenomena. Here we describe an alternative generally applicable implementation of the CRISPR-Cas9 system that allows for selective knockdown of extremely homologous genes. This strategy employs the lentiviral delivery of paired sgRNAs and nickase Cas9 (Cas9D10A) to achieve targeted deletion of splice junctions. This general strategy offers several advantages over standard single-guide exon-targeting CRISPR-Cas9 such as greatly reduced off-target effects, more restricted genomic editing, routine disruption of target gene mRNA expression and the ability to differentiate between closely related genes. Here we demonstrate the utility of this strategy by achieving selective knockdown of the highly homologous human genes FOXO3A and suspected pseudogene FOXO3B. We find the spJCRISPR strategy to efficiently and selectively disrupt FOXO3A and FOXO3B mRNA and protein expression; thus revealing that the human FOXO3B locus encodes a bona fide human gene. Unlike FOXO3A, we find the FOXO3B protein to be cytosolically localized in both the presence and absence of active Akt. The ability to selectively target and efficiently disrupt the expression of the closely-related FOXO3A and FOXO3B genes demonstrates the efficacy of the spJCRISPR approach. Copyright © 2018. Published by Elsevier B.V.

  6. Position-dependent and neuron-specific splicing regulation by the CELF family RNA-binding protein UNC-75 in Caenorhabditis elegans

    PubMed Central

    Kuroyanagi, Hidehito; Watanabe, Yohei; Suzuki, Yutaka; Hagiwara, Masatoshi

    2013-01-01

    A large fraction of protein-coding genes in metazoans undergo alternative pre-mRNA splicing in tissue- or cell-type-specific manners. Recent genome-wide approaches have identified many putative binding sites for some tissue-specific trans-acting splicing regulators. However, the mechanisms of splicing regulation in vivo remain largely unknown. To elucidate the modes of splicing regulation by the neuron-specific CELF family RNA-binding protein UNC-75 in Caenorhabditis elegans, we performed deep sequencing of poly(A)+ RNAs from the unc-75(+)- and unc-75-mutant worms and identified more than 20 cassette and mutually exclusive exons repressed or activated by UNC-75. Motif searches revealed that (G/U)UGUUGUG stretches are enriched in the upstream and downstream introns of the UNC-75-repressed and -activated exons, respectively. Recombinant UNC-75 protein specifically binds to RNA fragments carrying the (G/U)UGUUGUG stretches in vitro. Bi-chromatic fluorescence alternative splicing reporters revealed that the UNC-75-target exons are regulated in tissue-specific and (G/U)UGUUGUG element-dependent manners in vivo. The unc-75 mutation affected the splicing reporter expression specifically in the nervous system. These results indicate that UNC-75 regulates alternative splicing of its target exons in neuron-specific and position-dependent manners through the (G/U)UGUUGUG elements in C. elegans. This study thus reveals the repertoire of target events for the CELF family in the living organism. PMID:23416545
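
    A search for the (G/U)UGUUGUG element in intron sequences can be sketched with a regular expression. This is a minimal illustration, not the motif search used in the study: the function name is hypothetical, DNA input is converted to RNA for convenience, and a lookahead is used so that overlapping occurrences are all reported.

```python
import re

# (G/U)UGUUGUG as a character class; the lookahead allows overlapping matches.
UNC75_MOTIF = re.compile(r"(?=([GU]UGUUGUG))")


def find_motif(sequence: str) -> list[int]:
    """Return 0-based start positions of every (G/U)UGUUGUG occurrence."""
    rna = sequence.upper().replace("T", "U")
    return [match.start() for match in UNC75_MOTIF.finditer(rna)]


print(find_motif("AAGUGUUGUGCC"))   # [2]
print(find_motif("GUGUUGUGUUGUG"))  # [0, 5]
```

    Comparing hit positions in the upstream and downstream introns of each regulated exon is one way to examine the position dependence described in the abstract.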

  7. Misregulation of Alternative Splicing in a Mouse Model of Rett Syndrome

    PubMed Central

    Li, Ronghui; Dong, Qiping; Yuan, Xinni; Zeng, Xin; Gao, Yu; Li, Hongda; Keles, Sunduz; Wang, Zefeng; Chang, Qiang

    2016-01-01

    Mutations in the human MECP2 gene cause Rett syndrome (RTT), a severe neurodevelopmental disorder that predominantly affects girls. Despite decades of work, the molecular function of MeCP2 is not fully understood. Here we report a systematic identification of MeCP2-interacting proteins in the mouse brain. In addition to transcription regulators, we found that MeCP2 physically interacts with several modulators of RNA splicing, including LEDGF and DHX9. These interactions are disrupted by RTT causing mutations, suggesting that they may play a role in RTT pathogenesis. Consistent with the idea, deep RNA sequencing revealed misregulation of hundreds of splicing events in the cortex of Mecp2 knockout mice. To reveal the functional consequence of altered RNA splicing due to the loss of MeCP2, we focused on the regulation of the splicing of the flip/flop exon of Gria2 and other AMPAR genes. We found a significant splicing shift in the flip/flop exon toward the flop inclusion, leading to a faster decay in the AMPAR gated current and altered synaptic transmission. In summary, our study identified direct physical interaction between MeCP2 and splicing factors, a novel MeCP2 target gene, and established functional connection between a specific RNA splicing change and synaptic phenotypes in RTT mice. These results not only help our understanding of the molecular function of MeCP2, but also reveal potential drug targets for future therapies. PMID:27352031

  8. A Presumptive Developmental Role for a Sea Urchin Cyclin B Splice Variant

    PubMed Central

    Lozano, Jean-Claude; Schatt, Philippe; Marquès, François; Peaucellier, Gérard; Fort, Philippe; Féral, Jean-Pierre; Genevière, Anne-Marie; Picard, André

    1998-01-01

    We show that a splice variant–derived cyclin B is produced in sea urchin oocytes and embryos. This splice variant protein lacks highly conserved sequences in the COOH terminus of the protein. It is found strikingly abundant in growing oocytes and cells committed to differentiation during embryogenesis. Cyclin B splice variant (CBsv) protein associates weakly in the cell with Xenopus cdc2 and with budding yeast CDC28p. In contrast to classical cyclin B, CBsv very poorly complements a triple CLN deletion in budding yeast, and its microinjection prevents an initial step in MPF activation, leading to an important delay in oocyte meiosis reinitiation. CBsv microinjection in fertilized eggs induces cell cycle delay and abnormal development. We assume that CBsv is produced in growing oocytes to keep them in prophase, and during embryogenesis to slow down cell cycle in cells that will be committed to differentiation. PMID:9442104

  9. Inference of alternative splicing from RNA-Seq data with probabilistic splice graphs

    PubMed Central

    LeGault, Laura H.; Dewey, Colin N.

    2013-01-01

    Motivation: Alternative splicing and other processes that allow for different transcripts to be derived from the same gene are significant forces in the eukaryotic cell. RNA-Seq is a promising technology for analyzing alternative transcripts, as it does not require prior knowledge of transcript structures or genome sequences. However, analysis of RNA-Seq data in the presence of genes with large numbers of alternative transcripts is currently challenging due to efficiency, identifiability and representation issues. Results: We present RNA-Seq models and associated inference algorithms based on the concept of probabilistic splice graphs, which alleviate these issues. We prove that our models are often identifiable and demonstrate that our inference methods for quantification and differential processing detection are efficient and accurate. Availability: Software implementing our methods is available at http://deweylab.biostat.wisc.edu/psginfer. Contact: cdewey@biostat.wisc.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:23846746
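
    As a rough illustration of the splice-graph idea, the sketch below treats exons as vertices, attaches a transition probability to each edge, and takes the probability of a transcript to be the product of the edge probabilities along its path. The toy graph, its probabilities and the function name are assumptions made for this example; it is not the authors' model or the psginfer software.

```python
from math import prod

# Hypothetical toy graph: cassette exon E2 lies between E1 and E3.
# The probabilities on the edges leaving any one vertex sum to 1.
EDGE_PROBS = {
    ("start", "E1"): 1.0,
    ("E1", "E2"): 0.7,   # include the cassette exon
    ("E1", "E3"): 0.3,   # skip the cassette exon
    ("E2", "E3"): 1.0,
    ("E3", "end"): 1.0,
}


def transcript_probability(path, edge_probs=EDGE_PROBS):
    """Multiply edge probabilities along a start-to-end path."""
    return prod(edge_probs[(a, b)] for a, b in zip(path, path[1:]))


inclusion = ["start", "E1", "E2", "E3", "end"]
skipping = ["start", "E1", "E3", "end"]
p_inclusion = transcript_probability(inclusion)
p_skipping = transcript_probability(skipping)
print(p_inclusion, p_skipping)
```

    A graph with n independent cassette exons needs only on the order of n edge parameters to describe up to 2^n transcripts, which is one way to read the abstract's point about genes with large numbers of alternative transcripts.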

  10. Implementing a rational and consistent nomenclature for serine/arginine-rich protein splicing factors (SR proteins) in plants.

    PubMed

    Barta, Andrea; Kalyna, Maria; Reddy, Anireddy S N

    2010-09-01

    Growing interest in alternative splicing in plants and the extensive sequencing of new plant genomes necessitate more precise definition and classification of genes coding for splicing factors. SR proteins are a family of RNA binding proteins, which function as essential factors for constitutive and alternative splicing. We propose a unified nomenclature for plant SR proteins, taking into account the newly revised nomenclature of the mammalian SR proteins and a number of plant-specific properties of the plant proteins. We identify six subfamilies of SR proteins in Arabidopsis thaliana and rice (Oryza sativa), three of which are plant specific. The proposed subdivision of plant SR proteins into different subfamilies will allow grouping of paralogous proteins and simple assignment of newly discovered SR orthologs from other plant species and will promote functional comparisons in diverse plant species.

  11. DIEGO: detection of differential alternative splicing using Aitchison's geometry.

    PubMed

    Doose, Gero; Bernhart, Stephan H; Wagener, Rabea; Hoffmann, Steve

    2018-03-15

    Alternative splicing is a biological process of fundamental importance in most eukaryotes. It plays a pivotal role in cell differentiation and gene regulation and has been associated with a number of different diseases. The widespread availability of RNA-Sequencing capacities allows an ever closer investigation of differentially expressed isoforms. However, most tools for differential alternative splicing (DAS) analysis do not take split reads, i.e. the most direct evidence for a splice event, into account. Here, we present DIEGO, a compositional data analysis method able to detect DAS between two sets of RNA-Seq samples based on split reads. The python tool DIEGO works without isoform annotations and is fast enough to analyze large experiments while being robust and accurate. We provide python and perl parsers for common formats. The software is available at: www.bioinf.uni-leipzig.de/Software/DIEGO. Contact: steve@bioinf.uni-leipzig.de. Supplementary data are available at Bioinformatics online.
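
    In Aitchison's geometry, compositions are usually compared after a log-ratio transform. The sketch below applies the standard centered log-ratio (clr) transform to split-read counts for the junctions of one gene and measures the Aitchison distance between two samples. The pseudocount, the function names and the counts are assumptions for illustration; the abstract does not describe DIEGO's actual procedure, and this is not its code.

```python
import math


def clr(counts, pseudocount=0.5):
    """Centered log-ratio transform: log of each part over the geometric mean."""
    logs = [math.log(c + pseudocount) for c in counts]
    mean_log = sum(logs) / len(logs)
    return [value - mean_log for value in logs]


def aitchison_distance(a, b, pseudocount=0.5):
    """Euclidean distance between the clr-transformed compositions."""
    clr_a, clr_b = clr(a, pseudocount), clr(b, pseudocount)
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(clr_a, clr_b)))


# Hypothetical split-read counts for three junctions of one gene.
sample_1 = [10, 20, 30]
sample_2 = [30, 20, 10]
print(aitchison_distance(sample_1, sample_2))
```

    Without a pseudocount the distance is unchanged when every count in a sample is multiplied by the same factor, so a difference in sequencing depth alone does not register as a change in junction usage.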

  12. Transcriptome analysis using next generation sequencing reveals molecular signatures of diabetic retinopathy and efficacy of candidate drugs.

    PubMed

    Kandpal, Raj P; Rajasimha, Harsha K; Brooks, Matthew J; Nellissery, Jacob; Wan, Jun; Qian, Jiang; Kern, Timothy S; Swaroop, Anand

    2012-01-01

    To define gene expression changes associated with diabetic retinopathy in a mouse model using next generation sequencing, and to utilize transcriptome signatures to assess molecular pathways by which pharmacological agents inhibit diabetic retinopathy. We applied a high throughput RNA sequencing (RNA-seq) strategy using Illumina GAIIx to characterize the entire retinal transcriptome from nondiabetic and from streptozotocin-treated mice 32 weeks after induction of diabetes. Some of the diabetic mice were treated with inhibitors of receptor for advanced glycation endproducts (RAGE) and p38 mitogen activated protein (MAP) kinase, which have previously been shown to inhibit diabetic retinopathy in rodent models. The transcripts and alternatively spliced variants were determined in all experimental groups. Next generation sequencing-based RNA-seq profiles provided comprehensive signatures of transcripts that are altered in early stages of diabetic retinopathy. These transcripts encoded proteins involved in distinct yet physiologically relevant disease-associated pathways such as inflammation, microvasculature formation, apoptosis, glucose metabolism, Wnt signaling, xenobiotic metabolism, and photoreceptor biology. Significant upregulation of crystallin transcripts was observed in diabetic animals, and the diabetes-induced upregulation of these transcripts was inhibited in diabetic animals treated with inhibitors of either RAGE or p38 MAP kinase. These two therapies also showed dissimilar regulation of some subsets of transcripts that included alternatively spliced versions of arrestin, neutral sphingomyelinase activation associated factor (Nsmaf), SH3-domain GRB2-like interacting protein 1 (Sgip1), and axin. Diabetes alters many transcripts in the retina, and two therapies that inhibit the vascular pathology similarly inhibit a portion of these changes, pointing to possible molecular mechanisms for their beneficial effects. These therapies also changed the abundance of various alternatively spliced versions of signaling transcripts, suggesting a possible role of alternative splicing in disease etiology. Our studies clearly demonstrate RNA-seq as a comprehensive strategy for identifying disease-specific transcripts, and for determining comparative profiles of molecular changes mediated by candidate drugs.

  13. Single-cell full-length total RNA sequencing uncovers dynamics of recursive splicing and enhancer RNAs.

    PubMed

    Hayashi, Tetsutaro; Ozaki, Haruka; Sasagawa, Yohei; Umeda, Mana; Danno, Hiroki; Nikaido, Itoshi

    2018-02-12

    Total RNA sequencing has been used to reveal poly(A) and non-poly(A) RNA expression, RNA processing and enhancer activity. To date, no method for full-length total RNA sequencing of single cells has been developed despite the potential of this technology for single-cell biology. Here we describe random displacement amplification sequencing (RamDA-seq), the first full-length total RNA-sequencing method for single cells. Compared with other methods, RamDA-seq shows high sensitivity to non-poly(A) RNA and near-complete full-length transcript coverage. Using RamDA-seq with differentiation time course samples of mouse embryonic stem cells, we reveal hundreds of dynamically regulated non-poly(A) transcripts, including histone transcripts and long noncoding RNA Neat1. Moreover, RamDA-seq profiles recursive splicing in >300-kb introns. RamDA-seq also detects enhancer RNAs and their cell type-specific activity in single cells. Taken together, we demonstrate that RamDA-seq could help investigate the dynamics of gene expression, RNA-processing events and transcriptional regulation in single cells.

  14. Permanent Neonatal Diabetes Caused by Creation of an Ectopic Splice Site within the INS Gene

    PubMed Central

    Gastaldo, Elena; Harries, Lorna W.; Rubio-Cabezas, Oscar; Castaño, Luis

    2012-01-01

    Background The aim of this study was to characterize the genetic etiology in a patient who presented with permanent neonatal diabetes at 2 months of age. Methodology/Principal Findings Regulatory elements and coding exons 2 and 3 of the INS gene were amplified and sequenced from genomic and complementary DNA samples. A novel heterozygous INS mutation within the terminal intron of the gene was identified in the proband and her affected father. This mutation introduces an ectopic splice site leading to the insertion of 29 nucleotides from the intronic sequence into the mature mRNA, which results in a longer and abnormal transcript. Conclusions/Significance This study highlights the importance of routinely sequencing the exon-intron boundaries and the need to carry out additional studies to confirm the pathogenicity of any identified intronic genetic variants. PMID:22235272
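
    A quick way to reason about an insertion such as the 29 nucleotides reported here is to check its length modulo 3. The sketch below assumes the inserted sequence lies within the coding region, which the abstract does not state, and the function name is hypothetical.

```python
def insertion_frame_effect(inserted_nt):
    """Classify an insertion by its effect on the reading frame.

    Assumes the insertion lies within the coding sequence.
    """
    if inserted_nt <= 0:
        raise ValueError("inserted_nt must be a positive integer")
    remainder = inserted_nt % 3
    if remainder == 0:
        return "in-frame"
    return f"frameshift (+{remainder})"


print(insertion_frame_effect(29))
```

    Because 29 is not a multiple of 3, an insertion of that length inside a coding sequence would shift the downstream reading frame, which is one reason intronic variants near exon boundaries merit the follow-up studies the authors recommend.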

  15. Detection of integrated papillomavirus sequences by ligation-mediated PCR (DIPS-PCR) and molecular characterization in cervical cancer cells.

    PubMed

    Luft, F; Klaes, R; Nees, M; Dürst, M; Heilmann, V; Melsheimer, P; von Knebel Doeberitz, M

    2001-04-01

    Human papillomavirus (HPV) genomes usually persist as episomal molecules in HPV-associated preneoplastic lesions, whereas they are frequently integrated into the host cell genome in HPV-related cancer cells. This suggests that malignant conversion of HPV-infected epithelia is linked to recombination of cellular and viral sequences. Due to technical limitations, precise sequence information on viral-cellular junctions was obtained for only a few cell lines and primary lesions. In order to facilitate the molecular analysis of genomic HPV integration, we established a ligation-mediated PCR assay for the detection of integrated papillomavirus sequences (DIPS-PCR). DIPS-PCR was initially used to amplify genomic viral-cellular junctions from HPV-associated cervical cancer cell lines (C4-I, C4-II, SW756, and HeLa) and HPV-immortalized keratinocyte lines (HPKIA, HPKII). In addition to junctions already reported in public databases, various new fusion fragments were identified. Subsequently, 22 different viral-cellular junctions were amplified from 17 cervical carcinomas and 1 vulval intraepithelial neoplasia (VIN III). Sequence analysis of each junction revealed that the viral E1 open reading frame (ORF) was fused to cellular sequences in 20 of 22 (91%) cases. Chromosomal integration loci mapped to chromosomes 1 (2n), 2 (3n), 7 (2n), 8 (3n), 10 (1n), 14 (5n), 16 (1n), 17 (2n), and mitochondrial DNA (1n), suggesting random distribution of chromosomal integration sites. Precise sequence information obtained by DIPS-PCR was further used to monitor the monoclonal origin of 4 cervical cancers, 1 case of recurrent premalignant lesions and 1 lymph node metastasis. Therefore, DIPS-PCR might allow efficient therapy control and prediction of relapse in patients with HPV-associated anogenital cancers. Copyright 2001 Wiley-Liss, Inc.

  16. Purifying Selection on Exonic Splice Enhancers in Intronless Genes

    PubMed Central

    Savisaar, Rosina; Hurst, Laurence D.

    2016-01-01

    Exonic splice enhancers (ESEs) are short nucleotide motifs, enriched near exon ends, that enhance the recognition of the splice site and thus promote splicing. Are intronless genes under selection to avoid these motifs so as not to attract the splicing machinery to an mRNA that should not be spliced, thereby preventing the production of an aberrant transcript? Consistent with this possibility, we find that ESEs in putative recent retrocopies are at a higher density and evolving faster than those in other intronless genes, suggesting that they are being lost. Moreover, intronless genes are less dense in putative ESEs than intron-containing ones. However, this latter difference is likely due to the skewed base composition of intronless sequences, a skew that is in line with the general GC richness of few exon genes. Indeed, after controlling for such biases, we find that both intronless and intron-containing genes are denser in ESEs than expected by chance. Importantly, nucleotide-controlled analysis of evolutionary rates at synonymous sites in ESEs indicates that the ESEs in intronless genes are under purifying selection in both human and mouse. We conclude that on the loss of introns, some but not all, ESE motifs are lost, the remainder having functions beyond a role in splice promotion. These results have implications for the design of intronless transgenes and for understanding the causes of selection on synonymous sites. PMID:26802218

  17. GETPrime: a gene- or transcript-specific primer database for quantitative real-time PCR.

    PubMed

    Gubelmann, Carine; Gattiker, Alexandre; Massouras, Andreas; Hens, Korneel; David, Fabrice; Decouttere, Frederik; Rougemont, Jacques; Deplancke, Bart

    2011-01-01

    The vast majority of genes in humans and other organisms undergo alternative splicing, yet the biological function of splice variants is still very poorly understood in large part because of the lack of simple tools that can map the expression profiles and patterns of these variants with high sensitivity. High-throughput quantitative real-time polymerase chain reaction (qPCR) is an ideal technique to accurately quantify nucleic acid sequences including splice variants. However, currently available primer design programs do not distinguish between splice variants and also differ substantially in overall quality, functionality or throughput mode. Here, we present GETPrime, a primer database supported by a novel platform that uniquely combines and automates several features critical for optimal qPCR primer design. These include the consideration of all gene splice variants to enable either gene-specific (covering the majority of splice variants) or transcript-specific (covering one splice variant) expression profiling, primer specificity validation, automated best primer pair selection according to strict criteria and graphical visualization of the latter primer pairs within their genomic context. GETPrime primers have been extensively validated experimentally, demonstrating high transcript specificity in complex samples. Thus, the free-access, user-friendly GETPrime database allows fast primer retrieval and visualization for genes or groups of genes of most common model organisms, and is available at http://updepla1srv1.epfl.ch/getprime/. Database URL: http://deplanckelab.epfl.ch.

  18. GETPrime: a gene- or transcript-specific primer database for quantitative real-time PCR

    PubMed Central

    Gubelmann, Carine; Gattiker, Alexandre; Massouras, Andreas; Hens, Korneel; David, Fabrice; Decouttere, Frederik; Rougemont, Jacques; Deplancke, Bart

    2011-01-01

    The vast majority of genes in humans and other organisms undergo alternative splicing, yet the biological function of splice variants is still very poorly understood in large part because of the lack of simple tools that can map the expression profiles and patterns of these variants with high sensitivity. High-throughput quantitative real-time polymerase chain reaction (qPCR) is an ideal technique to accurately quantify nucleic acid sequences including splice variants. However, currently available primer design programs do not distinguish between splice variants and also differ substantially in overall quality, functionality or throughput mode. Here, we present GETPrime, a primer database supported by a novel platform that uniquely combines and automates several features critical for optimal qPCR primer design. These include the consideration of all gene splice variants to enable either gene-specific (covering the majority of splice variants) or transcript-specific (covering one splice variant) expression profiling, primer specificity validation, automated best primer pair selection according to strict criteria and graphical visualization of the latter primer pairs within their genomic context. GETPrime primers have been extensively validated experimentally, demonstrating high transcript specificity in complex samples. Thus, the free-access, user-friendly GETPrime database allows fast primer retrieval and visualization for genes or groups of genes of most common model organisms, and is available at http://updepla1srv1.epfl.ch/getprime/. Database URL: http://deplanckelab.epfl.ch. PMID:21917859

  19. Genome-wide RNA-binding analysis of the trypanosome U1 snRNP proteins U1C and U1-70K reveals cis/trans-spliceosomal network

    PubMed Central

    Preußer, Christian; Rossbach, Oliver; Hung, Lee-Hsueh; Li, Dan; Bindereif, Albrecht

    2014-01-01

    Trans-splicing in trypanosomes adds a 39-nucleotide mini-exon from the spliced leader (SL) RNA to the 5′ end of each protein-coding sequence. On the other hand, cis-splicing of the few intron-containing genes requires the U1 small nuclear ribonucleoprotein (snRNP) particle. To search for potential new functions of the U1 snRNP in Trypanosoma brucei, we applied genome-wide individual-nucleotide resolution crosslinking-immunoprecipitation (iCLIP), focusing on the U1 snRNP-specific proteins U1C and U1-70K. Surprisingly, U1C and U1-70K interact not only with the U1, but also with U6 and SL RNAs. In addition, mapping of crosslinks to the cis-spliced PAP [poly(A) polymerase] pre-mRNA indicates an active role of these proteins in 5′ splice site recognition. In sum, our results demonstrate that the iCLIP approach provides insight into stable and transient RNA–protein contacts within the spliceosomal network. We propose that the U1 snRNP may represent an evolutionary link between the cis- and trans-splicing machineries, playing a dual role in 5′ splice site recognition on the trans-spliceosomal SL RNP as well as on pre-mRNA cis-introns. PMID:24748659

  20. Sequential recognition of the pre-mRNA branch point by U2AF65 and a novel spliceosome-associated 28-kDa protein.

    PubMed Central

    Gaur, R K; Valcárcel, J; Green, M R

    1995-01-01

    Splicing of pre-mRNAs occurs via a lariat intermediate in which an intronic adenosine, embedded within a branch point sequence, forms a 2',5'-phosphodiester bond (RNA branch) with the 5' end of the intron. How the branch point is recognized and activated remains largely unknown. Using site-specific photochemical cross-linking, we have identified two proteins that specifically interact with the branch point during the splicing reaction. U2AF65, an essential splicing factor that binds to the adjacent polypyrimidine tract, crosslinks to the branch point at the earliest stage of spliceosome formation in an ATP-independent manner. A novel 28-kDa protein, which is a constituent of the mature spliceosome, contacts the branch point after the first catalytic step. Our results indicate that the branch point is sequentially recognized by distinct splicing factors in the course of the splicing reaction. PMID:7493318

  1. Next-generation sequencing of translocation renal cell carcinoma reveals novel RNA splicing partners and frequent mutations of chromatin-remodeling genes.

    PubMed

    Malouf, Gabriel G; Su, Xiaoping; Yao, Hui; Gao, Jianjun; Xiong, Liangwen; He, Qiuming; Compérat, Eva; Couturier, Jérôme; Molinié, Vincent; Escudier, Bernard; Camparo, Philippe; Doss, Denaha J; Thompson, Erika J; Khayat, David; Wood, Christopher G; Yu, Willie; Teh, Bin T; Weinstein, John; Tannir, Nizar M

    2014-08-01

    MITF/TFE translocation renal cell carcinoma (TRCC) is a rare subtype of kidney cancer. Its incidence and the genome-wide characterization of its genetic origin have not been fully elucidated. We performed RNA and exome sequencing on an exploratory set of TRCC (n = 7), and validated our findings using The Cancer Genome Atlas (TCGA) clear-cell RCC (ccRCC) dataset (n = 460). Using the TCGA dataset, we identified seven TRCC (1.5%) cases and determined their genomic profile. We discovered three novel partners of MITF/TFE (LUC7L3, KHSRP, and KHDRBS2) that are involved in RNA splicing. TRCC displayed a unique gene expression signature as compared with other RCC types, and showed activation of MITF, the transforming growth factor β1 and the PI3K complex targets. Genes differentially spliced between TRCC and other RCC types were enriched for MITF and ID2 targets. Exome sequencing of TRCC revealed a distinct mutational spectrum as compared with ccRCC, with frequent mutations in chromatin-remodeling genes (six of eight cases, three of which were from the TCGA). In two cases, we identified mutations in INO80D, an ATP-dependent chromatin-remodeling gene, previously shown to control the amplitude of the S phase. Knockdown of INO80D decreased cell proliferation in a novel cell line bearing LUC7L3-TFE3 translocation. This genome-wide study defines the incidence of TRCC within a ccRCC-directed project and expands the genomic spectrum of TRCC by identifying novel MITF/TFE partners involved in RNA splicing and frequent mutations in chromatin-remodeling genes. ©2014 American Association for Cancer Research.

  2. Using a minigene approach to characterize a novel splice site mutation in human F7 gene causing inherited factor VII deficiency in a Chinese pedigree.

    PubMed

    Yu, T; Wang, X; Ding, Q; Fu, Q; Dai, J; Lu, Y; Xi, X; Wang, H

    2009-11-01

    Factor VII deficiency, which is transmitted as an autosomal recessive disorder, is a rare haemorrhagic condition. The aim of this study was to identify the molecular genetic defect and determine its functional consequences in a Chinese pedigree with FVII deficiency. The proband was diagnosed with inherited coagulation FVII deficiency on the basis of reduced plasma levels of FVII activity (4.4%) and antigen (38.5%). All nine exons of the F7 gene and their flanking sequences were amplified by polymerase chain reaction (PCR) for the proband and the PCR products were directly sequenced. Compound heterozygous mutations, F7 (NM_000131.3) c.572-1G>A and F7 (NM_000131.3) c.1165T>G; p.Cys389Gly, were identified in the proband's F7 gene. To investigate the splicing patterns associated with F7 c.572-1G>A, ectopic transcripts in leucocytes of the proband were analyzed. F7 minigenes, spanning from intron 4 to intron 7 and carrying either an A or a G at position -1 of intron 5, were constructed and transiently transfected into human embryonic kidney (HEK) 293T cells, followed by RT-PCR analysis. The aberrant transcripts from the F7 c.572-1G>A mutant allele were not detected by the ectopic transcription study. Sequencing of the RT-PCR products from the mutant transfectant demonstrated the production of an erroneously spliced mRNA with exon 6 skipping, whereas normal splicing occurred in the wild-type transfectant. The aberrant mRNA produced from the F7 c.572-1G>A mutant allele is responsible for the factor VII deficiency in this pedigree.

  3. Analysis of cellulose synthase genes from domesticated apple identifies collinear genes WDR53 and CesA8A: partial co-expression, bicistronic mRNA, and alternative splicing of CESA8A

    PubMed Central

    Guerriero, Gea; Spadiut, Oliver; Kerschbamer, Christine; Giorno, Filomena; Baric, Sanja; Ezcurra, Inés

    2016-01-01

    Cellulose synthase (CesA) genes constitute a complex multigene family with six major phylogenetic clades in angiosperms. The recently sequenced genome of domestic apple, Malus×domestica, was mined for CesA genes, by blasting full-length cellulose synthase protein (CESA) sequences annotated in the apple genome against protein databases from the plant models Arabidopsis thaliana and Populus trichocarpa. Thirteen genes belonging to the six angiosperm CesA clades and coding for proteins with conserved residues typical of processive glycosyltransferases from family 2 were detected. Based on their phylogenetic relationship to Arabidopsis CESAs, as well as expression patterns, a nomenclature is proposed to facilitate further studies. Examination of their genomic organization revealed that MdCesA8-A is closely linked and co-oriented with WDR53, a gene coding for a WD40 repeat protein. The WDR53 and CesA8 genes display conserved collinearity in dicots and are partially co-expressed in the apple xylem. Interestingly, the presence of a bicistronic WDR53–CesA8A transcript was detected in phytoplasma-infected phloem tissues of apple. The bicistronic transcript contains a spliced intergenic sequence that is predicted to fold into hairpin structures typical of internal ribosome entry sites, suggesting its potential cap-independent translation. Surprisingly, the CesA8A cistron is alternatively spliced and lacks the zinc-binding domain. The possible roles of WDR53 and the alternatively spliced CESA8 variant during cellulose biosynthesis in M.×domestica are discussed. PMID:23048131

  4. Isolation of a candidate human telomerase catalytic subunit gene, which reveals complex splicing patterns in different cell types.

    PubMed

    Kilian, A; Bowtell, D D; Abud, H E; Hime, G R; Venter, D J; Keese, P K; Duncan, E L; Reddel, R R; Jefferson, R A

    1997-11-01

    Telomerase is a multicomponent reverse transcriptase enzyme that adds DNA repeats to the ends of chromosomes using its RNA component as a template for synthesis. Telomerase activity is detected in the germline as well as the majority of tumors and immortal cell lines, and at low levels in several types of normal cells. We have cloned a human gene homologous to a protein from Saccharomyces cerevisiae and Euplotes aediculatus that has reverse transcriptase motifs and is thought to be the catalytic subunit of telomerase in those species. This gene is present in the human genome as a single copy sequence with a dominant transcript of approximately 4 kb in a human colon cancer cell line, LIM1215. The cDNA sequence was determined using clones from a LIM1215 cDNA library and by RT-PCR, cRACE and 3'RACE on mRNA from the same source. We show that the gene is expressed in several normal tissues, telomerase-positive post-crisis (immortal) cell lines and various tumors but is not expressed in the majority of normal tissues analyzed, pre-crisis (non-immortal) cells and telomerase-negative immortal (ALT) cell lines. Multiple products were identified by RT-PCR using primers within the reverse transcriptase domain. Sequencing of these products suggests that they arise by alternative splicing. Strikingly, various tumors, cell lines and even normal tissues (colonic crypt and testis) showed considerable differences in the splicing patterns. Alternative splicing of the telomerase catalytic subunit transcript may be important for the regulation of telomerase activity and may give rise to proteins with different biochemical functions.

  5. Splice-mediated Variants of Proteins (SpliVaP) - data and characterization of changes in signatures among protein isoforms due to alternative splicing.

    PubMed

    Floris, Matteo; Orsini, Massimiliano; Thanaraj, Thangavel Alphonse

    2008-10-02

    It is often the case that mammalian genes are alternatively spliced; the resulting alternate transcripts often encode protein isoforms that differ in amino acid sequences. Changes among the protein isoforms can alter the cellular properties of proteins. The effect can range from a subtle modulation to a complete loss of function. (i) We examined human splice-mediated protein isoforms (as extracted from a manually curated data set, and from a computationally predicted data set) for differences in the annotation for protein signatures (Pfam domains and PRINTS fingerprints) and we characterized the differences and their effects on protein functionalities. An important question addressed relates to the extent of protein isoforms that may lack any known function in the cell. (ii) We present a database that reports differences in protein signatures among human splice-mediated protein isoform sequences. (i) Characterization: The work points to distinct sets of alternatively spliced genes with varying degrees of annotation for the splice-mediated protein isoforms. Protein molecular functions seen to be often affected are those that relate to: binding, catalytic, transcription regulation, structural molecule, transporter, motor, and antioxidant; and the processes that are often affected are nucleic acid binding, signal transduction, and protein-protein interactions. Signatures are often included/excluded and truncated in length among protein isoforms; truncation is seen as the predominant type of change. Analysis points to the following novel aspects: (a) Analysis using data from the manually curated Vega indicates that one in 8.9 genes can lead to a protein isoform of no "known" function; and one in 18 expressed protein isoforms can be such an "orphan" isoform; the corresponding numbers as seen with the computationally predicted ASD data set are: one in 4.9 genes and one in 9.8 isoforms. (b) When swapping of signatures occurs, it is often between those of the same functional classification. (c) Pfam domains can occur in varying lengths, and PRINTS fingerprints can occur with varying numbers of constituent motifs among isoforms; since such variation is seen in a large number of genes, it could be a general mechanism to modulate protein function. (ii) The reported resource (at http://www.bioinformatica.crs4.org/tools/dbs/splivap/) gives the community the ability to access data on splice-mediated protein isoforms (with value-added annotation such as association with diseases) through changes in protein signatures.

  6. Cold-dependent alternative splicing of a Jumonji C domain-containing gene MtJMJC5 in Medicago truncatula.

    PubMed

    Shen, Yingfang; Wu, Xiaopei; Liu, Demei; Song, Shengjing; Liu, Dengcai; Wang, Haiqing

    2016-05-27

    Histone methylation is an epigenetic modification mechanism that regulates gene expression in eukaryotic cells. Jumonji C domain-containing demethylases are involved in removal of methyl groups at lysine or arginine residues. The JmjC domain-only member, JMJ30/JMJD5 of Arabidopsis, is a component of the plant circadian clock. Although some plant circadian clock genes undergo alternative splicing in response to external cues, there is no evidence that JMJ30/JMJD5 is regulated by alternative splicing. In this study, the expression of an Arabidopsis JMJ30/JMJD5 ortholog in Medicago truncatula, MtJMJC5, in response to the circadian clock and abiotic stresses was characterized. The results showed that MtJMJC5 oscillates with a circadian rhythm and undergoes alternative splicing that is specifically induced by cold. The cold-induced alternative splicing could be reversed after the ambient temperature returned to normal. Sequencing results revealed four alternatively spliced RNA isoforms, including a full-length authentic protein-encoding variant and three premature termination codon-containing variants due to alternative 3' splice sites at the first and second introns. Under cold treatment, the variants that share a common 3' alternative splicing site at the second intron were intensively up-regulated, while the authentic protein-encoding variant and the premature termination codon-containing variant undergoing only a 3' alternative splicing at the first intron were down-regulated. Although all the premature termination codon-harboring alternative splicing variants were sensitive to nonsense-mediated decay, the premature termination codon-harboring alternative splicing variants sharing the 3' alternative splicing site at the second intron showed less sensitivity than the one containing only the 3' alternative splicing site at the first intron under cold treatment. 
These results suggest that the cold-dependent alternative splicing of MtJMJC5 is likely a species or genus-specific mechanism of gene expression regulation on RNA levels, and might play a role in epigenetic regulation of the link between the circadian clock and ambient temperature fluctuation in Medicago. Copyright © 2016 Elsevier Inc. All rights reserved.

  7. Analysis and recognition of 5′ UTR intron splice sites in human pre-mRNA

    PubMed Central

    Eden, E.; Brunak, S.

    2004-01-01

    Prediction of splice sites in non-coding regions of genes is one of the most challenging aspects of gene structure recognition. We perform a rigorous analysis of such splice sites embedded in human 5′ untranslated regions (UTRs), and investigate correlations between this class of splice sites and other features found in the adjacent exons and introns. By restricting the training of neural network algorithms to ‘pure’ UTRs (not extending partially into protein coding regions), we for the first time investigate the predictive power of the splicing signal proper, in contrast to conventional splice site prediction, which typically relies on the change in sequence at the transition from protein coding to non-coding. By doing so, the algorithms were able to pick up subtler splicing signals that were otherwise masked by ‘coding’ noise, thus enhancing significantly the prediction of 5′ UTR splice sites. For example, the non-coding splice site predicting networks pick up compositional and positional bias in the 3′ ends of non-coding exons and 5′ non-coding intron ends, where cytosine and guanine are over-represented. This compositional bias at the true UTR donor sites is also visible in the synaptic weights of the neural networks trained to identify UTR donor sites. Conventional splice site prediction methods perform poorly in UTRs because the reading frame pattern is absent. The NetUTR method presented here performs 2–3-fold better compared with NetGene2 and GenScan in 5′ UTRs. We also tested the 5′ UTR trained method on protein coding regions, and discovered, surprisingly, that it works quite well (although it cannot compete with NetGene2). This indicates that the local splicing pattern in UTRs and coding regions is largely the same. The NetUTR method is made publicly available at www.cbs.dtu.dk/services/NetUTR. PMID:14960723
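    The compositional and positional bias described in the abstract above can be illustrated with a per-position base frequency table over aligned sequence windows. The following is only a minimal sketch: the function name `position_frequencies` and the toy windows are hypothetical, and it is not the NetUTR neural network method.

```python
from collections import Counter


def position_frequencies(windows):
    """Return per-position base frequencies for equal-length sequence windows.

    `windows` is a list of strings over A/C/G/T, e.g. short windows
    centred on candidate donor sites (hypothetical input format).
    """
    if not windows:
        return []
    length = len(windows[0])
    if any(len(w) != length for w in windows):
        raise ValueError("all windows must have the same length")
    freqs = []
    for pos in range(length):
        # Count the bases observed at this column across all windows.
        counts = Counter(w[pos] for w in windows)
        freqs.append({base: counts[base] / len(windows) for base in "ACGT"})
    return freqs


# Toy, invented windows used only to exercise the function.
toy_windows = ["CGGT", "GCGT", "CAGT", "GGGT"]
toy_freqs = position_frequencies(toy_windows)
```

    A column in which C and G together account for most of the frequency would correspond to the kind of over-representation the abstract reports.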

  8. Analysis of aberrant pre-messenger RNA splicing resulting from mutations in ATP8B1 and efficient in vitro rescue by adapted U1 small nuclear RNA.

    PubMed

    van der Woerd, Wendy L; Mulder, Johanna; Pagani, Franco; Beuers, Ulrich; Houwen, Roderick H J; van de Graaf, Stan F J

    2015-04-01

    ATP8B1 deficiency is a severe autosomal recessive liver disease resulting from mutations in the ATP8B1 gene characterized by a continuous phenotypical spectrum from intermittent (benign recurrent intrahepatic cholestasis; BRIC) to progressive familial intrahepatic cholestasis (PFIC). Current therapeutic options are insufficient, and elucidating the molecular consequences of mutations could lead to personalized mutation-specific therapies. We investigated the effect on pre-messenger RNA splicing of 14 ATP8B1 mutations at exon-intron boundaries using an in vitro minigene system. Eleven mutations, mostly associated with a PFIC phenotype, resulted in aberrant splicing and a complete absence of correctly spliced product. In contrast, three mutations led to partially correct splicing and were associated with a BRIC phenotype. These findings indicate an inverse correlation between the level of correctly spliced product and disease severity. Expression of modified U1 small nuclear RNAs (snRNA) complementary to the splice donor sites strongly improved or completely rescued splicing for several ATP8B1 mutations located at donor, as well as acceptor, splice sites. In one case, we also evaluated exon-specific U1 snRNAs that, by targeting nonconserved intronic sequences, might reduce possible off-target events. Although very effective in correcting exon skipping, they also induced retention of the short downstream intron. We systematically characterized the molecular consequences of 14 ATP8B1 mutations at exon-intron boundaries associated with ATP8B1 deficiency and found that the majority resulted in total exon skipping. The amount of correctly spliced product inversely correlated with disease severity. Compensatory modified U1 snRNAs, complementary to mutated donor splice sites, were able to improve exon definition very efficiently and could be a novel therapeutic strategy in ATP8B1 deficiency as well as other genetic diseases. 
© 2014 by the American Association for the Study of Liver Diseases.

  9. Influence of intron length on interaction characters between post-spliced intron and its CDS in ribosomal protein genes

    NASA Astrophysics Data System (ADS)

    Zhao, Xiaoqing; Li, Hong; Bao, Tonglaga; Ying, Zhiqiang

    2012-09-01

    Much experimental evidence shows that the sequence structure of introns and intron loss/gain can influence gene expression, but currently proposed mechanisms do not directly address the functions of post-spliced introns. We propose that post-spliced introns exert their functions in gene expression by interacting with their mRNA sequences, and that the interaction is characterized by the matched segments between introns and their CDS. In this study, we investigated these interaction characteristics across a series of intron lengths, using improved Smith-Waterman local alignment software, for the ribosomal protein genes in C. elegans and D. melanogaster. Our results showed that RF values of five intron groups are significantly high in the central non-conserved region and very low in the 5'-end and 3'-end splicing regions. Interestingly, the number of optimal matched regions gradually increases with intron length. Distributions of the optimal matched regions differ among the five intron groups. Our study revealed that there are more interaction regions between longer introns and their CDS than between shorter introns and their CDS, and it provides a positive pattern for regulating gene expression.
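    As a rough illustration of the local alignment idea behind the matched-segment search in the abstract above, a minimal Smith-Waterman scoring sketch follows. The function name and the scoring values (match +2, mismatch -1, linear gap -1) are assumptions chosen for illustration; they are not the authors' improved software or its parameters.

```python
def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-1):
    """Return (best local alignment score, (end index in a, end index in b)).

    Standard Smith-Waterman recurrence with a linear gap penalty;
    cell values are floored at zero so alignments can start anywhere.
    """
    rows, cols = len(a) + 1, len(b) + 1
    h = [[0] * cols for _ in range(rows)]
    best, best_pos = 0, (0, 0)
    for i in range(1, rows):
        for j in range(1, cols):
            step = match if a[i - 1] == b[j - 1] else mismatch
            h[i][j] = max(
                0,                    # start a new local alignment
                h[i - 1][j - 1] + step,  # align a[i-1] with b[j-1]
                h[i - 1][j] + gap,    # gap in b
                h[i][j - 1] + gap,    # gap in a
            )
            if h[i][j] > best:
                best, best_pos = h[i][j], (i, j)
    return best, best_pos
```

    Under these assumed scores, an intron and a CDS that share a short identical segment give a score of twice that segment's length, which is one simple way to rank candidate matched regions.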

  10. Selfish DNA: homing endonucleases find a home.

    PubMed

    Edgell, David R

    2009-02-10

    Self-splicing group I introns come in two flavours - those with a homing endonuclease to promote mobility of the intron, and those without an endonuclease. How homing endonucleases and self-splicing introns associate to form a composite selfish genetic element is a question of long-standing interest. Recent work has revealed that a shared characteristic of both introns and endonucleases, the targeting of conserved sequences, may provide the impetus for the evolution of composite mobile genetic elements.

  11. Unexpected dependence of RyR1 splice variant expression in human lower limb muscles on fiber-type composition.

    PubMed

    Willemse, Hermia; Theodoratos, Angelo; Smith, Paul N; Dulhunty, Angela F

    2016-02-01

    The skeletal muscle ryanodine receptor Ca(2+) release channel (RyR1), essential for excitation-contraction (EC) coupling, demonstrates a known developmentally regulated alternative splicing in the ASI region. We now find unexpectedly that the expression of the splice variants is closely related to fiber type in adult human lower limb muscles. We examined the distribution of myosin heavy chain isoforms and ASI splice variants in gluteus minimus, gluteus medius and vastus medialis from patients aged 45 to 85 years. There was a strong positive correlation between ASI(+)RyR1 and the percentage of type 2 fibers in the muscles (r = 0.725), and a correspondingly strong negative correlation between the percentage of ASI(+)RyR1 and the percentage of type 1 fibers. When the type 2 fiber data were separated into type 2X and type 2A, the correlation with ASI(+)RyR1 was stronger in type 2X fibers (r = 0.781) than in type 2A fibers (r = 0.461). There was no significant correlation between age and either fiber-type composition or ASI(+)RyR1/ASI(-)RyR1 ratio. The results suggest that the reduced expression of ASI(-)RyR1 during development may reflect a reduction in type 1 fibers during development. Preferential expression of ASI(-)RyR1, having a higher gain in Ca(2+) release during EC coupling than ASI(+)RyR1, may compensate for the reduced terminal cisternae volume, fewer junctional contacts and reduced charge movement in type 1 fibers.

  12. Crystallization and Preliminary X-ray Analysis of the Human Long Myosin Light-Chain Kinase 1-Specific Domain IgCAM3

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    W Vallen Graham; A Magis; K Bailey

    2011-12-31

    Myosin light-chain kinase-dependent tight junction regulation is a critical event in inflammatory cytokine-induced increases in epithelial paracellular permeability. MLCK is expressed in human intestinal epithelium as two isoforms, long MLCK1 and long MLCK2, and MLCK1 is specifically localized to the tight junction, where it regulates paracellular permeability. The sole difference between these long MLCK splice variants is the presence of an immunoglobulin-like cell-adhesion molecule domain, IgCAM3, in MLCK1. To gain insight into the structure of the IgCAM3 domain, the IgCAM3 domain of MLCK1 has been expressed, purified and crystallized. Preliminary X-ray diffraction data were collected to 2.0 Å resolution and were consistent with the primitive trigonal space group P2₁2₁2₁.

  13. Genotype-specific signal generation based on digestion of 3-way DNA junctions: application to KRAS variation detection.

    PubMed

    Amicarelli, Giulia; Adlerstein, Daniel; Shehi, Erlet; Wang, Fengfei; Makrigiorgos, G Mike

    2006-10-01

    Genotyping methods that reveal single-nucleotide differences are useful for a wide range of applications. We used digestion of 3-way DNA junctions in a novel technology, OneCutEventAmplificatioN (OCEAN) that allows sequence-specific signal generation and amplification. We combined OCEAN with peptide-nucleic-acid (PNA)-based variant enrichment to detect and simultaneously genotype v-Ki-ras2 Kirsten rat sarcoma viral oncogene homolog (KRAS) codon 12 sequence variants in human tissue specimens. We analyzed KRAS codon 12 sequence variants in 106 lung cancer surgical specimens. We conducted a PNA-PCR reaction that suppresses wild-type KRAS amplification and genotyped the product with a set of OCEAN reactions carried out in fluorescence microplate format. The isothermal OCEAN assay enabled a 3-way DNA junction to form between the specific target nucleic acid, a fluorescently labeled "amplifier", and an "anchor". The amplifier-anchor contact contains the recognition site for a restriction enzyme. Digestion produces a cleaved amplifier and generation of a fluorescent signal. The cleaved amplifier dissociates from the 3-way DNA junction, allowing a new amplifier to bind and propagate the reaction. The system detected and genotyped KRAS sequence variants down to approximately 0.3% variant-to-wild-type alleles. PNA-PCR/OCEAN had a concordance rate with PNA-PCR/sequencing of 93% to 98%, depending on the exact implementation. Concordance rate with restriction endonuclease-mediated selective-PCR/sequencing was 89%. OCEAN is a practical and low-cost novel technology for sequence-specific signal generation. Reliable analysis of KRAS sequence alterations in human specimens circumvents the requirement for sequencing. Application is expected in genotyping KRAS codon 12 sequence variants in surgical specimens or in bodily fluids, as well as single-base variations and sequence alterations in other genes.

  14. Exonic Splicing Mutations Are More Prevalent than Currently Estimated and Can Be Predicted by Using In Silico Tools

    PubMed Central

    Soukarieh, Omar; Gaildrat, Pascaline; Hamieh, Mohamad; Drouet, Aurélie; Baert-Desurmont, Stéphanie; Frébourg, Thierry; Tosi, Mario; Martins, Alexandra

    2016-01-01

    The identification of a causal mutation is essential for molecular diagnosis and clinical management of many genetic disorders. However, even if next-generation exome sequencing has greatly improved the detection of nucleotide changes, the biological interpretation of most exonic variants remains challenging. Moreover, particular attention is typically given to protein-coding changes often neglecting the potential impact of exonic variants on RNA splicing. Here, we used the exon 10 of MLH1, a gene implicated in hereditary cancer, as a model system to assess the prevalence of RNA splicing mutations among all single-nucleotide variants identified in a given exon. We performed comprehensive minigene assays and analyzed patient’s RNA when available. Our study revealed a staggering number of splicing mutations in MLH1 exon 10 (77% of the 22 analyzed variants), including mutations directly affecting splice sites and, particularly, mutations altering potential splicing regulatory elements (ESRs). We then used this thoroughly characterized dataset, together with experimental data derived from previous studies on BRCA1, BRCA2, CFTR and NF1, to evaluate the predictive power of 3 in silico approaches recently described as promising tools for pinpointing ESR-mutations. Our results indicate that ΔtESRseq and ΔHZEI-based approaches not only discriminate which variants affect splicing, but also predict the direction and severity of the induced splicing defects. In contrast, the ΔΨ-based approach did not show a compelling predictive power. Our data indicates that exonic splicing mutations are more prevalent than currently appreciated and that they can now be predicted by using bioinformatics methods. These findings have implications for all genetically-caused diseases. PMID:26761715

  15. Additional annotation of the pig transcriptome using integrated Iso-seq and Illumina RNA-seq analysis

    USDA-ARS's Scientific Manuscript database

    Alternative splicing is a well-known phenomenon that dramatically increases eukaryotic transcriptome diversity. The extent of mRNA isoform diversity among porcine tissues was assessed using Pacific Biosciences single-molecule long-read isoform sequencing (Iso-Seq) and Illumina short read sequencing ...

  16. Complete Genome Sequence of Pig-Tailed Macaque Rhadinovirus 2 and Its Evolutionary Relationship with Rhesus Macaque Rhadinovirus and Human Herpesvirus 8/Kaposi's Sarcoma-Associated Herpesvirus

    PubMed Central

    Bruce, A. Gregory; Thouless, Margaret E.; Haines, Anthony S.; Pallen, Mark J.; Grundhoff, Adam

    2015-01-01

    ABSTRACT Two rhadinovirus lineages have been identified in Old World primates. The rhadinovirus 1 (RV1) lineage consists of human herpesvirus 8, Kaposi's sarcoma-associated herpesvirus (KSHV), and closely related rhadinoviruses of chimpanzees, gorillas, macaques and other Old World primates. The RV2 rhadinovirus lineage is distinct and consists of closely related viruses from the same Old World primate species. Rhesus macaque rhadinovirus (RRV) is the RV2 prototype, and two RRV isolates, 26-95 and 17577, were sequenced. We determined that the pig-tailed macaque RV2 rhadinovirus, MneRV2, is highly associated with lymphomas in macaques with simian AIDS. To further study the role of rhadinoviruses in the development of lymphoma, we sequenced the complete genome of MneRV2 and identified 87 protein coding genes and 17 candidate microRNAs (miRNAs). A strong genome colinearity and sequence homology were observed between MneRV2 and RRV26-95, although the open reading frame (ORF) encoding the KSHV ORFK15 homolog was disrupted in RRV26-95. Comparison with MneRV2 revealed several genomic anomalies in RRV17577 that were not present in other rhadinovirus genomes, including an N-terminal duplication in ORF4 and a recombinative exchange of more distantly related homologs of the ORF22/ORF47 interacting glycoprotein genes. The comparison with MneRV2 has revealed novel genes and important conservation of protein coding domains and transcription initiation, termination, and splicing signals, which have added to our knowledge of RV2 rhadinovirus genetics. Further comparisons with KSHV and other RV1 rhadinoviruses will provide important avenues for dissecting the biology, evolution, and pathology of these closely related tumor-inducing viruses in humans and other Old World primates. IMPORTANCE This work provides the sequence characterization of MneRV2, the pig-tailed macaque homolog of rhesus rhadinovirus (RRV). 
MneRV2 and RRV belong to the rhadinovirus 2 (RV2) rhadinovirus lineage of Old World primates and are distinct but related to Kaposi's sarcoma-associated herpesvirus (KSHV), the etiologic agent of Kaposi's sarcoma. Pig-tailed macaques provide important models of human disease, and our previous studies have indicated that MneRV2 plays a causal role in AIDS-related lymphomas in macaques. Delineation of the MneRV2 sequence has allowed a detailed characterization of the genome structure, and evolutionary comparisons with RRV and KSHV have identified conserved promoters, splice junctions, and novel genes. This comparison provides insight into RV2 rhadinovirus biology and sets the groundwork for more intensive next-generation (Next-Gen) transcript and genetic analysis of this class of tumor-inducing herpesvirus. This study supports the use of MneRV2 in pig-tailed macaques as an important model for studying rhadinovirus biology, transmission and pathology. PMID:25609822

  17. Simulation-based comprehensive benchmarking of RNA-seq aligners

    PubMed Central

    Baruzzo, Giacomo; Hayer, Katharina E; Kim, Eun Ji; Di Camillo, Barbara; FitzGerald, Garret A; Grant, Gregory R

    2018-01-01

    Alignment is the first step in most RNA-seq analysis pipelines, and the accuracy of downstream analyses depends heavily on it. Unlike most steps in the pipeline, alignment is particularly amenable to benchmarking with simulated data. We performed a comprehensive benchmarking of 14 common splice-aware aligners for base, read, and exon junction-level accuracy and compared default with optimized parameters. We found that performance varied by genome complexity, and accuracy and popularity were poorly correlated. The most widely cited tool underperforms for most metrics, particularly when using default settings. PMID:27941783
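    To make the junction-level accuracy mentioned in the abstract above more concrete, one simple way to score an aligner's junction calls against simulated truth is sketched below. The junction representation (chromosome, intron start, intron end) and the function name `junction_accuracy` are assumptions for illustration; the benchmark's own metric definitions may differ.

```python
def junction_accuracy(true_junctions, called_junctions):
    """Return (precision, recall) of called junctions against the truth set.

    Each junction is a hashable tuple such as (chrom, intron_start, intron_end);
    a call counts as correct only if it matches a true junction exactly.
    """
    truth, called = set(true_junctions), set(called_junctions)
    true_positives = len(truth & called)
    precision = true_positives / len(called) if called else 0.0
    recall = true_positives / len(truth) if truth else 0.0
    return precision, recall


# Toy, invented coordinates used only to exercise the function.
toy_truth = [("chr1", 100, 200), ("chr1", 300, 400), ("chr2", 50, 90)]
toy_calls = [("chr1", 100, 200), ("chr2", 50, 90), ("chr2", 60, 90), ("chr3", 1, 2)]
toy_precision, toy_recall = junction_accuracy(toy_truth, toy_calls)
```

    Because simulated reads come with known junction coordinates, the truth set is exact, which is what makes alignment amenable to this kind of benchmarking.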

  18. Identification of an Intronic Splicing Enhancer Essential for the Inclusion of FGFR2 Exon IIIc

    PubMed Central

    Seth, Puneet; Miller, Heather B.; Lasda, Erika L.; Pearson, James L.; Garcia-Blanco, Mariano A.

    2008-01-01

    The ligand specificity of fibroblast growth factor receptor 2 (FGFR2) is determined by the alternative splicing of exons 8 (IIIb) or 9 (IIIc). Exon IIIb is included in epithelial cells, whereas exon IIIc is included in mesenchymal cells. Although a number of cis elements and trans factors have been identified that play a role in exon IIIb inclusion in epithelium, little is known about the activation of exon IIIc in mesenchyme. We report here the identification of a splicing enhancer required for IIIc inclusion. This 24-nucleotide (nt) downstream intronic splicing enhancer (DISE) is located within intron 9 immediately downstream of exon IIIc. DISE was able to activate the inclusion of heterologous exons rat FGFR2 IIIb and human β-globin exon 2 in cell lines from different tissues and species and also in HeLa cell nuclear extracts in vitro. DISE was capable of replacing the intronic activator sequence 1 (IAS1), a known IIIb splicing enhancer and vice versa. This fact, together with the requirement for DISE to be close to the 5′-splice site and the ability of DISE to promote binding of U1 snRNP, suggested that IAS1 and DISE belong to the same class of cis-acting elements. PMID:18256031

  19. Human-specific protein isoforms produced by novel splice sites in the human genome after the human-chimpanzee divergence.

    PubMed

    Kim, Dong Seon; Hahn, Yoonsoo

    2012-11-13

    Evolution of splice sites is a well-known phenomenon that results in transcript diversity during human evolution. Many novel splice sites are derived from repetitive elements and may not contribute to protein products. Here, we analyzed annotated human protein-coding exons and identified human-specific splice sites that arose after the human-chimpanzee divergence. We analyzed multiple alignments of the annotated human protein-coding exons and their respective orthologous mammalian genome sequences to identify 85 novel splice sites (50 splice acceptors and 35 donors) in the human genome. The novel protein-coding exons, which are expressed either constitutively or alternatively, produce novel protein isoforms by insertion, deletion, or frameshift. We found three cases in which the human-specific isoform conferred novel molecular function in the human cells: the human-specific IMUP protein isoform induces apoptosis of the trophoblast and is implicated in pre-eclampsia; the intronization of a part of SMOX gene exon produces inactive spermine oxidase; the human-specific NUB1 isoform shows reduced interaction with ubiquitin-like proteins, possibly affecting ubiquitin pathways. Although the generation of novel protein isoforms does not equate to adaptive evolution, we propose that these cases are useful candidates for a molecular functional study to identify proteomic changes that might bring about novel phenotypes during human evolution.

  20. Intron-mediated alternative splicing of WOOD-ASSOCIATED NAC TRANSCRIPTION FACTOR1B regulates cell wall thickening during fiber development in Populus species.

    PubMed

    Zhao, Yunjun; Sun, Jiayan; Xu, Peng; Zhang, Rui; Li, Laigeng

    2014-02-01

    Alternative splicing is an important mechanism involved in regulating the development of multicellular organisms. Although many genes in plants undergo alternative splicing, little is understood of its significance in regulating plant growth and development. In this study, alternative splicing of black cottonwood (Populus trichocarpa) wood-associated NAC domain transcription factor (PtrWNDs), PtrWND1B, is shown to occur exclusively in secondary xylem fiber cells. PtrWND1B is expressed with a normal short-transcript PtrWND1B-s as well as its alternative long-transcript PtrWND1B-l. The intron 2 structure of the PtrWND1B gene was identified as a critical sequence that causes PtrWND1B alternative splicing. Suppression of PtrWND1B expression specifically inhibited fiber cell wall thickening. The two PtrWND1B isoforms play antagonistic roles in regulating cell wall thickening during fiber cell differentiation in Populus spp. PtrWND1B-s overexpression enhanced fiber cell wall thickening, while overexpression of PtrWND1B-l repressed fiber cell wall thickening. Alternative splicing may enable more specific regulation of processes such as fiber cell wall thickening during wood formation.

  1. Intron-Mediated Alternative Splicing of WOOD-ASSOCIATED NAC TRANSCRIPTION FACTOR1B Regulates Cell Wall Thickening during Fiber Development in Populus Species

    PubMed Central

    Zhao, Yunjun; Sun, Jiayan; Xu, Peng; Zhang, Rui; Li, Laigeng

    2014-01-01

    Alternative splicing is an important mechanism involved in regulating the development of multicellular organisms. Although many genes in plants undergo alternative splicing, little is understood of its significance in regulating plant growth and development. In this study, alternative splicing of black cottonwood (Populus trichocarpa) wood-associated NAC domain transcription factor (PtrWNDs), PtrWND1B, is shown to occur exclusively in secondary xylem fiber cells. PtrWND1B is expressed with a normal short-transcript PtrWND1B-s as well as its alternative long-transcript PtrWND1B-l. The intron 2 structure of the PtrWND1B gene was identified as a critical sequence that causes PtrWND1B alternative splicing. Suppression of PtrWND1B expression specifically inhibited fiber cell wall thickening. The two PtrWND1B isoforms play antagonistic roles in regulating cell wall thickening during fiber cell differentiation in Populus spp. PtrWND1B-s overexpression enhanced fiber cell wall thickening, while overexpression of PtrWND1B-l repressed fiber cell wall thickening. Alternative splicing may enable more specific regulation of processes such as fiber cell wall thickening during wood formation. PMID:24394777

  2. A detailed transcript-level probe annotation reveals alternative splicing based microarray platform differences

    PubMed Central

    Lee, Joseph C; Stiles, David; Lu, Jun; Cam, Margaret C

    2007-01-01

    Background Microarrays are a popular tool used in experiments to measure gene expression levels. Improving the reproducibility of microarray results produced by different chips from various manufacturers is important to create comparable and combinable experimental results. Alternative splicing has been cited as a possible cause of differences in expression measurements across platforms, though no study to this point has been conducted to show its influence in cross-platform differences. Results Using probe sequence data, a new microarray probe/transcript annotation was created based on the AceView Aug05 release that allowed for the categorization of genes based on their expression measurements' susceptibility to alternative splicing differences across microarray platforms. Examining gene expression data from multiple platforms in light of the new categorization, genes unsusceptible to alternative splicing differences showed higher signal agreement than those genes most susceptible to alternative splicing differences. The analysis gave rise to a different probe-level visualization method that can highlight probe differences according to transcript specificity. Conclusion The results highlight the need for detailed probe annotation at the transcriptome level. The presence of alternative splicing within a given sample can affect gene expression measurements and is a contributing factor to overall technical differences across platforms. PMID:17708771

  3. Computational Identification of Tissue-Specific Splicing Regulatory Elements in Human Genes from RNA-Seq Data.

    PubMed

    Badr, Eman; ElHefnawi, Mahmoud; Heath, Lenwood S

    2016-01-01

    Alternative splicing is a vital process for regulating gene expression and promoting proteomic diversity. It plays a key role in tissue-specific expressed genes. This specificity is mainly regulated by splicing factors that bind to specific sequences called splicing regulatory elements (SREs). Here, we report a genome-wide analysis to study alternative splicing on multiple tissues, including brain, heart, liver, and muscle. We propose a pipeline to identify differential exons across tissues and hence tissue-specific SREs. In our pipeline, we utilize the DEXSeq package along with our previously reported algorithms. Utilizing the publicly available RNA-Seq data set from the Human BodyMap project, we identified 28,100 differentially used exons across the four tissues. We identified tissue-specific exonic splicing enhancers that overlap with various previously published experimental and computational databases. A complicated exonic enhancer regulatory network was revealed, where multiple exonic enhancers were found across multiple tissues while some were found only in specific tissues. Putative combinatorial exonic enhancers and silencers were discovered as well, which may be responsible for exon inclusion or exclusion across tissues. Some of the exonic enhancers are found to be co-occurring with multiple exonic silencers and vice versa, which demonstrates a complicated relationship between tissue-specific exonic enhancers and silencers.

  4. Alternative splicing at the intersection of biological timing, development, and stress responses.

    PubMed

    Staiger, Dorothee; Brown, John W S

    2013-10-01

    High-throughput sequencing for transcript profiling in plants has revealed that alternative splicing (AS) affects a much higher proportion of the transcriptome than was previously assumed. AS is involved in most plant processes and is particularly prevalent in plants exposed to environmental stress. The identification of mutations in predicted splicing factors and spliceosomal proteins that affect cell fate, the circadian clock, plant defense, and tolerance/sensitivity to abiotic stress all point to a fundamental role of splicing/AS in plant growth, development, and responses to external cues. Splicing factors affect the AS of multiple downstream target genes, thereby transferring signals to alter gene expression via splicing factor/AS networks. The last two to three years have seen an ever-increasing number of examples of functional AS. At a time when the identification of AS in individual genes and at a global level is exploding, this review aims to bring together such examples to illustrate the extent and importance of AS, which are not always obvious from individual publications. It also aims to ensure that plant scientists are aware that AS is likely to occur in the genes that they study and that dynamic changes in AS and its consequences need to be considered routinely.

  5. Alternative Splicing at the Intersection of Biological Timing, Development, and Stress Responses

    PubMed Central

    Staiger, Dorothee; Brown, John W.S.

    2013-01-01

    High-throughput sequencing for transcript profiling in plants has revealed that alternative splicing (AS) affects a much higher proportion of the transcriptome than was previously assumed. AS is involved in most plant processes and is particularly prevalent in plants exposed to environmental stress. The identification of mutations in predicted splicing factors and spliceosomal proteins that affect cell fate, the circadian clock, plant defense, and tolerance/sensitivity to abiotic stress all point to a fundamental role of splicing/AS in plant growth, development, and responses to external cues. Splicing factors affect the AS of multiple downstream target genes, thereby transferring signals to alter gene expression via splicing factor/AS networks. The last two to three years have seen an ever-increasing number of examples of functional AS. At a time when the identification of AS in individual genes and at a global level is exploding, this review aims to bring together such examples to illustrate the extent and importance of AS, which are not always obvious from individual publications. It also aims to ensure that plant scientists are aware that AS is likely to occur in the genes that they study and that dynamic changes in AS and its consequences need to be considered routinely. PMID:24179132

  6. The bromodomain protein BRD4 regulates splicing during heat shock

    PubMed Central

    Hussong, Michelle; Kaehler, Christian; Kerick, Martin; Grimm, Christina; Franz, Alexandra; Timmermann, Bernd; Welzel, Franziska; Isensee, Jörg; Hucho, Tim; Krobitsch, Sylvia; Schweiger, Michal R.

    2017-01-01

    The cellular response to heat stress is an ancient and evolutionarily highly conserved defence mechanism characterised by the transcriptional up-regulation of cyto-protective genes and a partial inhibition of splicing. These features closely resemble the proteotoxic stress response during tumor development. The bromodomain protein BRD4 has been identified as an integral member of the oxidative stress as well as of the inflammatory response, mainly due to its role in the transcriptional regulation process. In addition, there are also several lines of evidence implicating BRD4 in the splicing process. Using RNA-sequencing we found a significant increase in splicing inhibition, in particular intron retentions (IR), following heat treatment in BRD4-depleted cells. This leads to a decrease of mRNA abundancy of the affected transcripts, most likely due to premature termination codons. Subsequent experiments revealed that BRD4 interacts with the heat shock factor 1 (HSF1) such that under heat stress BRD4 is recruited to nuclear stress bodies and non-coding SatIII RNA transcripts are up-regulated. These findings implicate BRD4 as an important regulator of splicing during heat stress. Our data which links BRD4 to the stress induced splicing process may provide novel mechanisms of BRD4 inhibitors in regard to anti-cancer therapies. PMID:27536004

  7. PCR-free Quantification of Multiple Splice Variants in Cancer Gene by Surface Enhanced Raman Spectroscopy

    PubMed Central

    Sun, Lan; Irudayaraj, Joseph

    2009-01-01

    We demonstrate a surface enhanced Raman spectroscopy (SERS) based array platform to monitor gene expression in cancer cells in a multiplex and quantitative format without amplification steps. A strategy comprising DNA/RNA hybridization, S1 nuclease digestion, and alkaline hydrolysis was adopted to obtain DNA targets specific to two splice junction variants Δ(9, 10) and Δ(5) of the breast cancer susceptibility gene 1 (BRCA1) from MCF-7 and MDA-MB-231 breast cancer cell lines. These two targets were identified simultaneously and their absolute quantities were estimated by a SERS strategy utilizing the inherent plasmon-phonon Raman mode of gold nanoparticle probes as a self-referencing standard to correct for variability in surface enhancement. Results were then validated by reverse transcription PCR (RT-PCR). Our proposed methodology could be expanded to a higher level of multiplexing for quantitative gene expression analysis of any gene without any amplification steps. PMID:19780515

  8. SF3B1 mutations constitute a novel therapeutic target in breast cancer

    PubMed Central

    Maguire, Sarah L; Leonidou, Andri; Wai, Patty; Marchiò, Caterina; Ng, Charlotte KY; Sapino, Anna; Salomon, Anne-Vincent; Reis-Filho, Jorge S; Weigelt, Britta; Natrajan, Rachael C

    2015-01-01

    Mutations in genes encoding proteins involved in RNA splicing have been found to occur at relatively high frequencies in several tumour types including myelodysplastic syndromes, chronic lymphocytic leukaemia, uveal melanoma, and pancreatic cancer, and at lower frequencies in breast cancer. To investigate whether dysfunction in RNA splicing is implicated in the pathogenesis of breast cancer, we performed a re-analysis of published exome and whole genome sequencing data. This analysis revealed that mutations in spliceosomal component genes occurred in 5.6% of unselected breast cancers, including hotspot mutations in the SF3B1 gene, which were found in 1.8% of unselected breast cancers. SF3B1 mutations were significantly associated with ER-positive disease, AKT1 mutations, and distinct copy number alterations. Additional profiling of hotspot mutations in a panel of special histological subtypes of breast cancer showed that 16% and 6% of papillary and mucinous carcinomas of the breast harboured the SF3B1 K700E mutation. RNA sequencing identified differentially spliced events expressed in tumours with SF3B1 mutations including the protein coding genes TMEM14C, RPL31, DYNL11, UQCC, and ABCC5, and the long non-coding RNA CRNDE. Moreover, SF3B1 mutant cell lines were found to be sensitive to the SF3b complex inhibitor spliceostatin A and treatment resulted in perturbation of the splicing signature. Albeit rare, SF3B1 mutations result in alternative splicing events, and may constitute drivers and a novel therapeutic target in a subset of breast cancers. © 2014 The Authors. The Journal of Pathology published by John Wiley & Sons Ltd on behalf of Pathological Society of Great Britain and Ireland. PMID:25424858

  9. Alternative splicing of iodothyronine deiodinases in pituitary adenomas. Regulation by oncoprotein SF2/ASF.

    PubMed

    Piekielko-Witkowska, Agnieszka; Kedzierska, Hanna; Poplawski, Piotr; Wojcicka, Anna; Rybicka, Beata; Maksymowicz, Maria; Grajkowska, Wieslawa; Matyja, Ewa; Mandat, Tomasz; Bonicki, Wieslaw; Nauman, Pawel

    2013-06-01

    Pituitary tumors are among the most common neoplasms of the sellar region. Iodothyronine deiodinase types 1 (DIO1) and 2 (DIO2) are enzymes contributing to the levels of locally synthesized T3, a hormone regulating key physiological processes in the pituitary, including its development, cellular proliferation, and hormone secretion. Previous studies revealed that the expression of deiodinases in pituitary tumors is variable and, moreover, that there is no correlation between the mRNA and protein products of a particular gene, suggesting a potential role for posttranscriptional regulatory mechanisms. In this work we hypothesized that one such mechanism could be alternative splicing. Therefore, we analyzed expression and sequences of DIO1 and DIO2 splicing variants in 30 pituitary adenomas and 9 non-tumorous pituitary samples. DIO2 mRNA was expressed as only two mRNA isoforms. In contrast, nine splice variants of DIO1 were identified. Among them, five were devoid of exon 3. In silico sequence analysis of DIO1 revealed multiple putative binding sites for splicing factor SF2/ASF, of which the top-ranked sites were located in exon 3. Silencing of SF2/ASF in pituitary tumor GH3 cells resulted in a change in the ratio between DIO1 isoforms with or without exon 3, favoring the expression of variants without exon 3. The expression of SF2/ASF mRNA in pituitary tumors was increased when compared with non-neoplastic control samples. In conclusion, we provide a new mechanism of posttranscriptional regulation of DIO1 and show deregulation of DIO1 expression in pituitary adenoma, possibly resulting from disturbed expression of SF2/ASF. Copyright © 2013 Elsevier B.V. All rights reserved.

  10. Genome-based identification of spliceosomal proteins in the silk moth Bombyx mori.

    PubMed

    Somarelli, Jason A; Mesa, Annia; Fuller, Myron E; Torres, Jacqueline O; Rodriguez, Carol E; Ferrer, Christina M; Herrera, Rene J

    2010-12-01

    Pre-messenger RNA splicing is a highly conserved eukaryotic cellular function that takes place by way of a large, RNA-protein assembly known as the spliceosome. In the mammalian system, nearly 300 proteins associate with uridine-rich small nuclear (sn)RNAs to form this complex. Some of these splicing factors are ubiquitously present in the spliceosome, whereas others are involved only in the processing of specific transcripts. Several proteomics analyses have delineated the proteins of the spliceosome in several species. In this study, we mine multiple sequence data sets of the silk moth Bombyx mori in an attempt to identify the entire set of known spliceosomal proteins. Five data sets were utilized, including the 3X, 6X, and Build 2.0 genomic contigs as well as the expressed sequence tag and protein libraries. While homologs for 88% of vertebrate splicing factors were delineated in the Bombyx mori genome, there appear to be several spliceosomal polypeptides absent in Bombyx mori and seven additional insect species. This apparent increase in spliceosomal complexity in vertebrates may reflect the tissue-specific and developmental stage-specific alternative pre-mRNA splicing requirements in vertebrates. Phylogenetic analyses of 15 eukaryotic taxa using the core splicing factors suggest that the essential functional units of the pre-mRNA processing machinery have remained highly conserved from yeast to humans. The Sm and LSm proteins are the most conserved, whereas proteins of the U1 small nuclear ribonucleoprotein particle are the most divergent. These data highlight both the differential conservation and relative phylogenetic signals of the essential spliceosomal components throughout evolution. © 2010 Wiley Periodicals, Inc.

  11. An empirical study of ensemble-based semi-supervised learning approaches for imbalanced splice site datasets.

    PubMed

    Stanescu, Ana; Caragea, Doina

    2015-01-01

    Recent biochemical advances have led to inexpensive, time-efficient production of massive volumes of raw genomic data. Traditional machine learning approaches to genome annotation typically rely on large amounts of labeled data. The process of labeling data can be expensive, as it requires domain knowledge and expert involvement. Semi-supervised learning approaches that can make use of unlabeled data, in addition to small amounts of labeled data, can help reduce the costs associated with labeling. In this context, we focus on the problem of predicting splice sites in a genome using semi-supervised learning approaches. This is a challenging problem, due to the highly imbalanced distribution of the data, i.e., small number of splice sites as compared to the number of non-splice sites. To address this challenge, we propose to use ensembles of semi-supervised classifiers, specifically self-training and co-training classifiers. Our experiments on five highly imbalanced splice site datasets, with positive to negative ratios of 1-to-99, showed that the ensemble-based semi-supervised approaches represent a good choice, even when the amount of labeled data consists of less than 1% of all training data. In particular, we found that ensembles of co-training and self-training classifiers that dynamically balance the set of labeled instances during the semi-supervised iterations show improvements over the corresponding supervised ensemble baselines. In the presence of limited amounts of labeled data, ensemble-based semi-supervised approaches can successfully leverage the unlabeled data to enhance supervised ensembles learned from highly imbalanced data distributions. Given that such distributions are common for many biological sequence classification problems, our work can be seen as a stepping stone towards more sophisticated ensemble-based approaches to biological sequence annotation in a semi-supervised framework.

  12. An empirical study of ensemble-based semi-supervised learning approaches for imbalanced splice site datasets

    PubMed Central

    2015-01-01

    Background Recent biochemical advances have led to inexpensive, time-efficient production of massive volumes of raw genomic data. Traditional machine learning approaches to genome annotation typically rely on large amounts of labeled data. The process of labeling data can be expensive, as it requires domain knowledge and expert involvement. Semi-supervised learning approaches that can make use of unlabeled data, in addition to small amounts of labeled data, can help reduce the costs associated with labeling. In this context, we focus on the problem of predicting splice sites in a genome using semi-supervised learning approaches. This is a challenging problem, due to the highly imbalanced distribution of the data, i.e., small number of splice sites as compared to the number of non-splice sites. To address this challenge, we propose to use ensembles of semi-supervised classifiers, specifically self-training and co-training classifiers. Results Our experiments on five highly imbalanced splice site datasets, with positive to negative ratios of 1-to-99, showed that the ensemble-based semi-supervised approaches represent a good choice, even when the amount of labeled data consists of less than 1% of all training data. In particular, we found that ensembles of co-training and self-training classifiers that dynamically balance the set of labeled instances during the semi-supervised iterations show improvements over the corresponding supervised ensemble baselines. Conclusions In the presence of limited amounts of labeled data, ensemble-based semi-supervised approaches can successfully leverage the unlabeled data to enhance supervised ensembles learned from highly imbalanced data distributions. Given that such distributions are common for many biological sequence classification problems, our work can be seen as a stepping stone towards more sophisticated ensemble-based approaches to biological sequence annotation in a semi-supervised framework. PMID:26356316
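
    The self-training idea described in this record can be illustrated with a small sketch. It is not the authors' implementation: it uses a single nearest-centroid classifier rather than an ensemble, and the balancing rule (adding the same number of most-confident pseudo-positives and pseudo-negatives per round), the toy 2-D points, and all function names are assumptions made for illustration. The unlabeled pool keeps the 1-to-99 positive-to-negative ratio mentioned in the abstract.

```python
def centroid(points):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    n = len(points)
    return [sum(p[i] for p in points) / n for i in range(len(points[0]))]


def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def fit(labeled):
    """Return (positive centroid, negative centroid) for (vector, label) pairs."""
    pos_c = centroid([x for x, y in labeled if y == 1])
    neg_c = centroid([x for x, y in labeled if y == 0])
    return pos_c, neg_c


def score(x, model):
    """Positive when x is closer to the positive centroid than to the negative one."""
    pos_c, neg_c = model
    return sq_dist(x, neg_c) - sq_dist(x, pos_c)


def predict(x, model):
    """Label 1 (splice site) or 0 (non-splice site) for a feature vector."""
    return 1 if score(x, model) > 0 else 0


def self_train(labeled, unlabeled, rounds=2, per_class=1):
    """Self-training with a balanced update (an assumed, simplified rule).

    Each round refits the classifier, then moves the `per_class` most
    confident pseudo-negatives and pseudo-positives from the pool into the
    labeled set, so both classes grow at the same rate.
    """
    labeled, pool = list(labeled), list(unlabeled)
    for _ in range(rounds):
        k = min(per_class, len(pool) // 2)
        if k == 0:
            break
        model = fit(labeled)
        ranked = sorted(pool, key=lambda x: score(x, model))
        labeled += [(x, 0) for x in ranked[:k]]    # lowest scores: confident negatives
        labeled += [(x, 1) for x in ranked[-k:]]   # highest scores: confident positives
        pool = ranked[k:-k]
    return fit(labeled), labeled


# Hypothetical toy data: 2 positives hidden among 198 negatives (1-to-99).
seed_labeled = [((3.1, 2.9), 1), ((0.0, 0.0), 0), ((-0.5, -0.5), 0)]
negatives = [(-1 + 0.1 * i, -1 + 0.1 * j) for i in range(18) for j in range(11)]
positives = [(3.0, 3.1), (2.9, 3.0)]

model, final_labeled = self_train(seed_labeled, negatives + positives)
print(predict((3.0, 3.0), model), predict((-1.0, -1.0), model))
```

    With more rounds than there are true positives in the pool, this rule would start pseudo-labeling negatives as positives, which is one reason confidence thresholds and ensembles are used in practice.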

  13. Target gene analyses of 39 amelogenesis imperfecta kindreds

    PubMed Central

    Chan, Hui-Chen; Estrella, Ninna M. R. P.; Milkovich, Rachel N.; Kim, Jung-Wook; Simmer, James P.; Hu, Jan C-C.

    2012-01-01

    Previously, mutational analyses identified six disease-causing mutations in 24 amelogenesis imperfecta (AI) kindreds. We have since expanded the number of AI kindreds to 39, and performed mutation analyses covering the coding exons and adjoining intron sequences for the six proven AI candidate genes [amelogenin (AMELX), enamelin (ENAM), family with sequence similarity 83, member H (FAM83H), WD repeat containing domain 72 (WDR72), enamelysin (MMP20), and kallikrein-related peptidase 4 (KLK4)] and for ameloblastin (AMBN) (a suspected candidate gene). All four of the X-linked AI families (100%) had disease-causing mutations in AMELX, suggesting that AMELX is the only gene involved in the aetiology of X-linked AI. Eighteen families showed an autosomal-dominant pattern of inheritance. Disease-causing mutations were identified in 12 (67%): eight in FAM83H, and four in ENAM. No FAM83H coding-region or splice-junction mutations were identified in three probands with autosomal-dominant hypocalcification AI (ADHCAI), suggesting that a second gene may contribute to the aetiology of ADHCAI. Six families showed an autosomal-recessive pattern of inheritance, and disease-causing mutations were identified in three (50%): two in MMP20, and one in WDR72. No disease-causing mutations were found in 11 families with only one affected member. We conclude that mutation analyses of the current candidate genes for AI have about a 50% chance of identifying the disease-causing mutation in a given kindred. PMID:22243262
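
    The "about a 50% chance" conclusion in this record follows from the per-group counts given in the abstract. A minimal sketch of that arithmetic is shown below; the group labels and variable names are illustrative, and the counts are taken directly from the abstract text.

```python
# (families with a disease-causing mutation identified, families analysed),
# as reported in the abstract for each inheritance pattern.
groups = {
    "X-linked": (4, 4),
    "autosomal dominant": (12, 18),
    "autosomal recessive": (3, 6),
    "single affected member": (0, 11),
}

solved = sum(found for found, _ in groups.values())
total = sum(analysed for _, analysed in groups.values())
overall_pct = round(100 * solved / total)

for name, (found, analysed) in groups.items():
    print(f"{name}: {found}/{analysed} = {round(100 * found / analysed)}%")
print(f"overall: {solved}/{total} = {overall_pct}%")
```

    The four groups sum to the 39 kindreds stated in the abstract, and 19 of 39 solved kindreds is roughly one in two.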

  14. Alternative RNA splicing of leucocyte tissue transglutaminase in coeliac disease.

    PubMed

    Arbildi, P; Sóñora, C; Del Río, N; Marqués, J M; Hernández, A

    2018-05-01

    Tissue transglutaminase is a ubiquitous and multifunctional protein that contributes to several processes such as apoptosis/survival, efferocytosis, inflammation and tissue repair under physiological and pathological conditions. Several activities can be associated with well-established functional domains; in addition, four RNA alternative splice variants have been described, characterized by sequence divergences and residue deletions at the C-terminal domains. Tissue transglutaminase is recognized as the central player in the physiopathology of coeliac disease (CD), mainly through calcium-dependent enzymatic activities. It can be hypothesized that differential regulation of tissue transglutaminase splice variant expression in persons with CD contributes to pathology by altering the protein functionality. We characterized the expression pattern of RNA alternative splice variants by RT-PCR in peripheral cells from patients with CD adhering to a gluten-free diet; we considered inflammatory parameters and specific antibodies as markers of the stage of disease. We found significantly higher expression of both the full-length and the shortest C-truncated splice variants in leucocytes from patients with CD in comparison with healthy individuals. As tissue transglutaminase expression and canonical enzymatic activity are linked to inflammation, we studied the RNA expression of inflammatory cytokines in peripheral leucocytes of persons with CD in relation to splice variant expression; interestingly, we found that recently diagnosed patients showed a significant correlation between both the full-length and the shortest alternatively spliced variants and IL-1 expression. Our results suggest that regulation of alternative splicing of tissue transglutaminase could account for the complex physiopathology of CD. © 2018 The Foundation for the Scandinavian Journal of Immunology.

  15. Splicing of designer exons informs a biophysical model for exon definition

    PubMed Central

    Arias, Mauricio A.; Chasin, Lawrence A.

    2015-01-01

    Pre-mRNA molecules in humans contain mostly short internal exons flanked by longer introns. To explain the removal of such introns, exon recognition instead of intron recognition has been proposed. We studied this exon definition using designer exons (DEs) made up of three prototype modules of our own design: an exonic splicing enhancer (ESE), an exonic splicing silencer (ESS), and a Reference Sequence (R) predicted to be neither. Each DE was examined as the central exon in a three-exon minigene. DEs made of R modules showed a sharp size dependence, with exons shorter than 14 nt and longer than 174 nt splicing poorly. Changing the strengths of the splice sites improved longer exon splicing but worsened shorter exon splicing, effectively displacing the curve to the right. For the ESE we found, unexpectedly, that its enhancement efficiency was independent of its position within the exon. For the ESS we found a step-wise positional increase in its effects; it was most effective at the 3′ end of the exon. To apply these results quantitatively, we developed a biophysical model for exon definition of internal exons undergoing cotranscriptional splicing. This model features commitment to inclusion before the downstream exon is synthesized and competition between skipping and inclusion fates afterward. Collision of both exon ends to form an exon definition complex was incorporated to account for the effect of size; ESE/ESS effects were modeled on the basis of stabilization/destabilization. This model accurately predicted the outcome of independent experiments on more complex DEs that combined ESEs and ESSs. PMID:25492963

  16. Early Clinical Diagnosis of PC1/3 Deficiency in a Patient With a Novel Homozygous PCSK1 Splice-Site Mutation.

    PubMed

    Härter, Bettina; Fuchs, Irene; Müller, Thomas; Akbulut, Ulas Emre; Cakir, Murat; Janecke, Andreas R

    2016-04-01

    Autosomal recessive proprotein convertase 1/3 (PC1/3) deficiency, caused by mutations in the PCSK1 gene, is characterized by severe congenital malabsorptive diarrhea, early-onset obesity, and certain endocrine abnormalities. We suspected PC1/3 deficiency in a 4-month-old girl based on the presence of congenital diarrhea and polyuria. Sequencing the whole coding region and splice sites detected a novel homozygous PCSK1 splice-site mutation, c.544-2A>G, in the patient. The mutation resulted in the skipping of exon 5, the generation of a premature termination codon, and nonsense-mediated PCSK1 messenger ribonucleic acid decay, which was demonstrated in complementary DNA derived from fibroblasts.

  17. E6^E7, a Novel Splice Isoform Protein of Human Papillomavirus 16, Stabilizes Viral E6 and E7 Oncoproteins via HSP90 and GRP78

    PubMed Central

    Ajiro, Masahiko

    2015-01-01

    Transcripts of human papillomavirus 16 (HPV16) E6 and E7 oncogenes undergo alternative RNA splicing to produce multiple splice isoforms. However, the importance of these splice isoforms is poorly understood. Here we report a critical role of E6^E7, a novel isoform containing the 41 N-terminal amino acid (aa) residues of E6 and the 38 C-terminal aa residues of E7, in the regulation of E6 and E7 stability. Through mass spectrometric analysis, we identified that HSP90 and GRP78, which are frequently upregulated in cervical cancer tissues, are two E6^E7-interacting proteins responsible for the stability and function of E6^E7, E6, and E7. Although GRP78 and HSP90 do not bind each other, GRP78, but not HSP90, interacts with E6 and E7. E6^E7 protein, in addition to self-binding, interacts with E6 and E7 in the presence of GRP78 and HSP90, leading to the stabilization of E6 and E7 by prolonging the half-life of each protein. Knocking down E6^E7 expression in HPV16-positive CaSki cells by a splice junction-specific small interfering RNA (siRNA) destabilizes E6 and E7 and prevents cell growth. The same is true for the cells with a GRP78 knockdown or in the presence of an HSP90 inhibitor. Moreover, mapping and alignment analyses for splicing elements in 36 alpha-HPVs (α-HPVs) suggest the possible expression of E6^E7 mostly by other oncogenic or possibly oncogenic α-HPVs (HPV18, -30, -31, -39, -42, -45, -56, -59, -70, and -73). HPV18 E6^E7 is detectable in HPV18-positive HeLa cells and HPV18-infected raft tissues. All together, our data indicate that viral E6^E7 and cellular GRP78 or HSP90 might be novel targets for cervical cancer therapy. PMID:25691589

  18. cDNA sequences and organization of IgM heavy chain genes in two holostean fish.

    PubMed

    Wilson, M R; van Ravenstein, E; Miller, N W; Clem, L W; Middleton, D L; Warr, G W

    1995-01-01

    Immunoglobulin M heavy chain (mu) sequences of two holostean fish, the bowfin, Amia calva, and the longnose gar, Lepisosteus osseus, were amplified from spleen mRNA by RACE-PCR, cloned, and sequenced. Each mu chain showed the conserved four constant domain structure typical of a secreted mu chain. Southern blot analyses with specific heavy chain variable (VH) and constant (CH) region probes suggest that both fish possess an IgH locus that resembles that of the teleosts, amphibians, and mammals in its organization. The overall sequence similarity of gar and bowfin mu chains was 60% and 48% at the nucleotide and amino acid levels, respectively, while similarity to the mu chains of teleosts and elasmobranchs was lower. The bowfin mu chain possesses a distinctive proline-rich sequence at the C mu 1/C mu 2 boundary; a shorter proline-rich sequence is present at this position in the gar mu chain. Both gar and bowfin show, in their C mu 4 sequences, motifs that could serve as cryptic splice donor sites for the production of mRNA encoding the membrane-bound form of the mu chains, and the bowfin also shows a potential cryptic splice donor site in the C mu 3 exon.

  19. Inherited mutations in BRCA1 and BRCA2 in an unselected multiethnic cohort of Asian patients with breast cancer and healthy controls from Malaysia

    PubMed Central

    Wen, Wei Xiong; Allen, Jamie; Lai, Kah Nyin; Mariapun, Shivaani; Hasan, Siti Norhidayu; Ng, Pei Sze; Lee, Daphne Shin-Chi; Lee, Sheau Yee; Yoon, Sook-Yee; Lim, Joanna; Lau, Shao Yan; Decker, Brennan; Pooley, Karen; Dorling, Leila; Luccarini, Craig; Baynes, Caroline; Conroy, Don M; Harrington, Patricia; Simard, Jacques; Yip, Cheng Har; Mohd Taib, Nur Aishah; Ho, Weang Kee; Antoniou, Antonis C; Dunning, Alison M; Easton, Douglas F

    2018-01-01

    Background Genetic testing for BRCA1 and BRCA2 is offered typically to selected women based on age of onset and family history of cancer. However, current internationally accepted genetic testing referral guidelines are built mostly on data from cancer genetics clinics in women of European descent. To evaluate the appropriateness of such guidelines in Asians, we have determined the prevalence of germ line variants in an unselected cohort of Asian patients with breast cancer and healthy controls. Methods Germ line DNA from a hospital-based study of 2575 unselected patients with breast cancer and 2809 healthy controls was subjected to amplicon-based targeted sequencing of exonic and proximal splice site junction regions of BRCA1 and BRCA2 using the Fluidigm Access Array system, with sequencing conducted on an Illumina HiSeq2500 platform. Variants were called with GATK UnifiedGenotyper and validated by Sanger sequencing. Results Fifty-five (2.1%) BRCA1 and 66 (2.6%) BRCA2 deleterious mutations were identified among patients with breast cancer and five (0.18%) BRCA1 and six (0.21%) BRCA2 mutations among controls. One thousand one hundred and eighty-six (46%) patients and 97 (80%) carriers fulfilled the National Comprehensive Cancer Network guidelines for genetic testing. Conclusion Five per cent of unselected Asian patients with breast cancer carry deleterious variants in BRCA1 or BRCA2. While current referral guidelines identified the majority of carriers, one in two patients would be referred for genetic services. Given that such services are largely unavailable in the majority of low-resource settings in Asia, our study highlights the need for more efficient guidelines to identify at-risk individuals in Asia. PMID:28993434
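
    The percentages quoted in this record can be re-derived from the reported counts. The sketch below does so under one stated assumption: that no patient carries deleterious variants in both genes, so the number of carriers is 55 + 66 = 121 (this matches the "97 (80%) carriers" figure). Variable names are illustrative.

```python
patients, controls = 2575, 2809
brca1_cases, brca2_cases = 55, 66
brca1_controls, brca2_controls = 5, 6
patients_meeting_guidelines = 1186
carriers_meeting_guidelines = 97

# Assumption: no patient carries both a BRCA1 and a BRCA2 variant.
carriers = brca1_cases + brca2_cases


def pct(part, whole, digits=1):
    """Percentage of `whole` represented by `part`, rounded to `digits` places."""
    return round(100 * part / whole, digits)


print("BRCA1 in patients:", pct(brca1_cases, patients))
print("BRCA2 in patients:", pct(brca2_cases, patients))
print("BRCA1 or BRCA2 in patients:", pct(carriers, patients))
print("BRCA1 in controls:", pct(brca1_controls, controls, 2))
print("BRCA2 in controls:", pct(brca2_controls, controls, 2))
print("patients meeting guidelines:", pct(patients_meeting_guidelines, patients, 0))
print("carriers meeting guidelines:", pct(carriers_meeting_guidelines, carriers, 0))
```

    The combined carrier frequency of about 4.7% is what the abstract rounds to "five per cent", and 46% of patients meeting the guidelines corresponds to the "one in two patients" referral figure.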

  20. Somatic mutation profiles of clear cell endometrial tumors revealed by whole exome and targeted gene sequencing.

    PubMed

    Le Gallo, Matthieu; Rudd, Meghan L; Urick, Mary Ellen; Hansen, Nancy F; Zhang, Suiyuan; Lozy, Fred; Sgroi, Dennis C; Vidal Bel, August; Matias-Guiu, Xavier; Broaddus, Russell R; Lu, Karen H; Levine, Douglas A; Mutch, David G; Goodfellow, Paul J; Salvesen, Helga B; Mullikin, James C; Bell, Daphne W

    2017-09-01

    The molecular pathogenesis of clear cell endometrial cancer (CCEC), a tumor type with a relatively unfavorable prognosis, is not well defined. We searched exome-wide for novel somatically mutated genes in CCEC and assessed the mutational spectrum of known and candidate driver genes in a large cohort of cases. We conducted whole exome sequencing of paired tumor-normal DNAs from 16 cases of CCEC (12 CCECs and the CCEC components of 4 mixed histology tumors). Twenty-two genes-of-interest were Sanger-sequenced from another 47 cases of CCEC. Microsatellite instability (MSI) and microsatellite stability (MSS) were determined by genotyping 5 mononucleotide repeats. Two tumor exomes had relatively high mutational loads and MSI. The other 14 tumor exomes were MSS and had 236 validated nonsynonymous or splice junction somatic mutations among 222 protein-encoding genes. Among the 63 cases of CCEC in this study, we identified frequent somatic mutations in TP53 (39.7%), PIK3CA (23.8%), PIK3R1 (15.9%), ARID1A (15.9%), PPP2R1A (15.9%), SPOP (14.3%), and TAF1 (9.5%), as well as MSI (11.3%). Five of 8 mutations in TAF1, a gene with no known role in CCEC, localized to the putative histone acetyltransferase domain and included 2 recurrently mutated residues. Based on patterns of MSI and mutations in 7 genes, CCEC subsets molecularly resembled serous endometrial cancer (SEC) or endometrioid endometrial cancer (EEC). Our findings demonstrate molecular similarities between CCEC and SEC and EEC and implicate TAF1 as a novel candidate CCEC driver gene. Cancer 2017;123:3261-8. © 2017 American Cancer Society.

  1. Novel splice site mutation in the growth hormone receptor gene in Turkish patients with Laron-type dwarfism.

    PubMed

    Arman, Ahmet; Ozon, Alev; Isguven, Pinar S; Coker, Ajda; Peker, Ismail; Yordam, Nursen

    2008-01-01

    Growth hormone (GH) is involved in growth, and fat and carbohydrate metabolism. Interaction of GH with the GH receptor (GHR) is necessary for systemic and local production of insulin-like growth factor-I (IGF-I) which mediates GH actions. Mutations in the GHR cause severe postnatal growth failure; the disorder is an autosomal recessive genetic disease resulting in GH insensitivity, called Laron syndrome. It is characterized by dwarfism with elevated serum GH and low levels of IGF-I. We analyzed the GHR gene for mutations and polymorphisms in eight patients with Laron-type dwarfism from six families. We found three missense mutations (S40L, V125A, I526L), one nonsense mutation (W157X), and one splice site mutation in the extracellular domain of GHR. Furthermore, G168G and exon 3 deletion polymorphisms were detected in patients with Laron syndrome. The splice site mutation, which is a novel mutation, was located at the donor splice site of exon 2/intron 2 within GHR. Although this mutation changed the highly conserved donor splice site consensus sequence GT to GGT by insertion of a G residue, the intron splicing between exon 2 and exon 3 was detected in the patient. These results imply that the splicing occurs at the GT site in intron 2, leaving the extra inserted G residue at the end of exon 2, thus changing the open reading frame of GHR and resulting in a premature termination codon in exon 3.

  2. Pre-mRNA splicing repression triggers abiotic stress signaling in plants.

    PubMed

    Ling, Yu; Alshareef, Sahar; Butt, Haroon; Lozano-Juste, Jorge; Li, Lixin; Galal, Aya A; Moustafa, Ahmed; Momin, Afaque A; Tashkandi, Manal; Richardson, Dale N; Fujii, Hiroaki; Arold, Stefan; Rodriguez, Pedro L; Duque, Paula; Mahfouz, Magdy M

    2017-01-01

    Alternative splicing (AS) of precursor RNAs enhances transcriptome plasticity and proteome diversity in response to diverse growth and stress cues. Recent work has shown that AS is pervasive across plant species, with more than 60% of intron-containing genes producing different isoforms. Mammalian cell-based assays have discovered various inhibitors of AS. Here, we show that the macrolide pladienolide B (PB) inhibits constitutive splicing and AS in plants. Also, our RNA sequencing (RNA-seq) data revealed that PB mimics abiotic stress signals including salt, drought and abscisic acid (ABA). PB activates the abiotic stress- and ABA-responsive reporters RD29A::LUC and MAPKKK18::uidA in Arabidopsis thaliana and mimics the effects of ABA on stomatal aperture. Genome-wide analysis of AS by RNA-seq revealed that PB perturbs the splicing machinery and leads to a striking increase in intron retention and a reduction in other forms of AS. Interestingly, PB treatment activates the ABA signaling pathway by inhibiting the splicing of clade A PP2C phosphatases while still maintaining to some extent the splicing of ABA-activated SnRK2 kinases. Taken together, our data establish PB as an inhibitor and modulator of splicing and a mimic of abiotic stress signals in plants. Thus, PB reveals the molecular underpinnings of the interplay between stress responses, ABA signaling and post-transcriptional regulation in plants. © 2016 The Authors The Plant Journal © 2016 John Wiley & Sons Ltd.

  3. Fine-Scale Variation and Genetic Determinants of Alternative Splicing across Individuals

    PubMed Central

    Coulombe-Huntington, Jasmin; Lam, Kevin C. L.; Dias, Christel; Majewski, Jacek

    2009-01-01

    Recently, thanks to the increasing throughput of new technologies, we have begun to explore the full extent of alternative pre-mRNA splicing (AS) in the human transcriptome. This is unveiling a vast layer of complexity in isoform-level expression differences between individuals. We used previously published splicing sensitive microarray data from lymphoblastoid cell lines to conduct an in-depth analysis on splicing efficiency of known and predicted exons. By combining publicly available AS annotation with a novel algorithm designed to search for AS, we show that many real AS events can be detected within the usually unexploited, speculative majority of the array and at significance levels much below standard multiple-testing thresholds, demonstrating that the extent of cis-regulated differential splicing between individuals is potentially far greater than previously reported. Specifically, many genes show subtle but significant genetically controlled differences in splice-site usage. PCR validation shows that 42 out of 58 (72%) candidate gene regions undergo detectable AS, amounting to the largest scale validation of isoform eQTLs to date. Targeted sequencing revealed a likely causative SNP in most validated cases. In all 17 incidences where a SNP affected a splice-site region, in silico splice-site strength modeling correctly predicted the direction of the micro-array and PCR results. In 13 other cases, we identified likely causative SNPs disrupting predicted splicing enhancers. Using Fst and REHH analysis, we uncovered significant evidence that 2 putative causative SNPs have undergone recent positive selection. We verified the effect of five SNPs using in vivo minigene assays. This study shows that splicing differences between individuals, including quantitative differences in isoform ratios, are frequent in human populations and that causative SNPs can be identified using in silico predictions. Several cases affected disease-relevant genes and it is likely some of these differences are involved in phenotypic diversity and susceptibility to complex diseases. PMID:20011102

  4. CEP78 is mutated in a distinct type of Usher syndrome.

    PubMed

    Fu, Qing; Xu, Mingchu; Chen, Xue; Sheng, Xunlun; Yuan, Zhisheng; Liu, Yani; Li, Huajin; Sun, Zixi; Li, Huiping; Yang, Lizhu; Wang, Keqing; Zhang, Fangxia; Li, Yumei; Zhao, Chen; Sui, Ruifang; Chen, Rui

    2017-03-01

    Usher syndrome is a genetically heterogeneous disorder characterized by combined visual impairment and hearing loss. Although about a dozen genes involved in Usher syndrome have been identified, the genetic basis remains unknown in 20-30% of patients. In this study, we aimed to identify the novel disease-causing gene of a distinct subtype of Usher syndrome. Ophthalmic examinations and hearing tests were performed on patients with Usher syndrome in two consanguineous families. Target capture sequencing was initially performed to screen causative mutations in known retinal disease-causing loci. Whole exome sequencing (WES) and whole genome sequencing (WGS) were applied for identifying novel disease-causing genes. RT-PCR and Sanger sequencing were performed to evaluate the splicing-altering effect of identified CEP78 variants. Patients from the two independent families show a mild Usher syndrome phenotype characterized by juvenile or adult-onset cone-rod dystrophy and sensorineural hearing loss. WES and WGS identified two homozygous rare variants that affect mRNA splicing of the ciliary gene CEP78. RT-PCR confirmed that the two variants indeed lead to abnormal splicing, resulting in premature stop of protein translation due to frameshift. Our results provide evidence that CEP78 is a novel disease-causing gene for Usher syndrome, demonstrating an additional link between ciliopathy and Usher protein network in photoreceptor cells and inner ear hair cells. Published by the BMJ Publishing Group Limited.

  5. High-throughput sequencing of the entire genomic regions of CCM1/KRIT1, CCM2 and CCM3/PDCD10 to search for pathogenic deep-intronic splice mutations in cerebral cavernous malformations.

    PubMed

    Rath, Matthias; Jenssen, Sönke E; Schwefel, Konrad; Spiegler, Stefanie; Kleimeier, Dana; Sperling, Christian; Kaderali, Lars; Felbor, Ute

    2017-09-01

    Cerebral cavernous malformations (CCM) are vascular lesions of the central nervous system that can cause headaches, seizures and hemorrhagic stroke. Disease-associated mutations have been identified in three genes: CCM1/KRIT1, CCM2 and CCM3/PDCD10. The precise proportion of deep-intronic variants in these genes and their clinical relevance is yet unknown. Here, a long-range PCR (LR-PCR) approach for target enrichment of the entire genomic regions of the three genes was combined with next generation sequencing (NGS) to screen for coding and non-coding variants. NGS detected all six CCM1/KRIT1, two CCM2 and four CCM3/PDCD10 mutations that had previously been identified by Sanger sequencing. Two of the pathogenic variants presented here are novel. Additionally, 20 stringently selected CCM index cases that had remained mutation-negative after conventional sequencing and exclusion of copy number variations were screened for deep-intronic mutations. The combination of bioinformatics filtering and transcript analyses did not reveal any deep-intronic splice mutations in these cases. Our results demonstrate that target enrichment by LR-PCR combined with NGS can be used for a comprehensive analysis of the entire genomic regions of the CCM genes in a research context. However, its clinical utility is limited as deep-intronic splice mutations in CCM1/KRIT1, CCM2 and CCM3/PDCD10 seem to be rather rare. Copyright © 2017 Elsevier Masson SAS. All rights reserved.

  6. Transcriptome Sequencing Revealed Significant Alteration of Cortical Promoter Usage and Splicing in Schizophrenia

    PubMed Central

    Wu, Jing Qin; Wang, Xi; Beveridge, Natalie J.; Tooney, Paul A.; Scott, Rodney J.; Carr, Vaughan J.; Cairns, Murray J.

    2012-01-01

    Background While hybridization based analysis of the cortical transcriptome has provided important insight into the neuropathology of schizophrenia, it represents a restricted view of disease-associated gene activity based on predetermined probes. By contrast, sequencing technology can provide un-biased analysis of transcription at nucleotide resolution. Here we use this approach to investigate schizophrenia-associated cortical gene expression. Methodology/Principal Findings The data was generated from 76 bp reads of RNA-Seq, aligned to the reference genome and assembled into transcripts for quantification of exons, splice variants and alternative promoters in postmortem superior temporal gyrus (STG/BA22) from 9 male subjects with schizophrenia and 9 matched non-psychiatric controls. Differentially expressed genes were then subjected to further sequence and functional group analysis. The output, amounting to more than 38 Gb of sequence, revealed significant alteration of gene expression including many previously shown to be associated with schizophrenia. Gene ontology enrichment analysis followed by functional map construction identified three functional clusters highly relevant to schizophrenia including neurotransmission related functions, synaptic vesicle trafficking, and neural development. Significantly, more than 2000 genes displayed schizophrenia-associated alternative promoter usage and more than 1000 genes showed differential splicing (FDR<0.05). Both types of transcriptional isoforms were exemplified by reads aligned to the neurodevelopmentally significant doublecortin-like kinase 1 (DCLK1) gene. Conclusions This study provided the first deep and un-biased analysis of schizophrenia-associated transcriptional diversity within the STG, and revealed variants with important implications for the complex pathophysiology of schizophrenia. PMID:22558445

  7. Validation of Splicing Events in Transcriptome Sequencing Data

    PubMed Central

    Kaisers, Wolfgang; Ptok, Johannes; Schwender, Holger; Schaal, Heiner

    2017-01-01

    Genomic alignments of sequenced cellular messenger RNA contain gapped alignments which are interpreted as consequence of intron removal. The resulting gap-sites, genomic locations of alignment gaps, are landmarks representing potential splice-sites. As alignment algorithms report gap-sites with a considerable false discovery rate, validations are required. We describe two quality scores, gap quality score (gqs) and weighted gap information score (wgis), developed for validation of putative splicing events: While gqs solely relies on alignment data wgis additionally considers information from the genomic sequence. FASTQ files obtained from 54 human dermal fibroblast samples were aligned against the human genome (GRCh38) using TopHat and STAR aligner. Statistical properties of gap-sites validated by gqs and wgis were evaluated by their sequence similarity to known exon-intron borders. Within the 54 samples, TopHat identifies 1,000,380 and STAR reports 6,487,577 gap-sites. Due to the lack of strand information, however, the percentage of identified GT-AG gap-sites is rather low. While gap-sites from TopHat contain ≈89% GT-AG, gap-sites from STAR only contain ≈42% GT-AG dinucleotide pairs in merged data from 54 fibroblast samples. Validation with gqs yields 156,251 gap-sites from TopHat alignments and 166,294 from STAR alignments. Validation with wgis yields 770,327 gap-sites from TopHat alignments and 1,065,596 from STAR alignments. Both alignment algorithms, TopHat and STAR, report gap-sites with considerable false discovery rate, which can drastically be reduced by validation with gqs and wgis. PMID:28545234
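The remark in the abstract above about missing strand information can be illustrated with a minimal sketch. It is not the gqs or wgis score, whose definitions are not given here; it only checks the terminal dinucleotides of each alignment gap. The function names and the 0-based, half-open coordinate convention are assumptions made for illustration. A canonical intron on the reverse strand appears as CT...AC in forward-strand coordinates, so counting only GT...AG understates the canonical fraction when strand is unknown.

```python
def classify_gap_site(genome: str, start: int, end: int) -> str:
    """Classify an alignment gap by its terminal dinucleotides.

    Coordinates are 0-based, half-open [start, end) on the forward strand
    (an assumed convention). Returns "GT-AG" for a canonical forward-strand
    intron, "CT-AC" for the reverse complement of GT-AG (a canonical intron
    on the reverse strand), or "other".
    """
    gap = genome[start:end].upper()
    if len(gap) < 4:
        return "other"
    ends = (gap[:2], gap[-2:])
    if ends == ("GT", "AG"):
        return "GT-AG"
    if ends == ("CT", "AC"):
        return "CT-AC"
    return "other"


def canonical_fraction(genome, gap_sites, count_reverse_strand):
    """Fraction of gap-sites with canonical splice dinucleotides.

    With count_reverse_strand=False only GT...AG is counted, which mimics
    an analysis that ignores strand.
    """
    if not gap_sites:
        return 0.0
    accepted = {"GT-AG", "CT-AC"} if count_reverse_strand else {"GT-AG"}
    hits = sum(classify_gap_site(genome, s, e) in accepted for s, e in gap_sites)
    return hits / len(gap_sites)


if __name__ == "__main__":
    # Toy sequence with one forward-strand and one reverse-strand canonical gap.
    toy_genome = "AAAGTCCCCAGTTTCTGGGGACAA"
    toy_sites = [(3, 11), (14, 22), (0, 5)]
    print(canonical_fraction(toy_genome, toy_sites, count_reverse_strand=False))
    print(canonical_fraction(toy_genome, toy_sites, count_reverse_strand=True))
```

On the toy input, one of three gaps is GT-AG when strand is ignored and two of three are canonical once the reverse-strand pattern is also accepted.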

  8. Calcium Activated K+ Channels in The Electroreceptor of the Skate Confirmed by Cloning. Details of Subunits and Splicing

    PubMed Central

    King, Benjamin L.; Shi, Ling Fang; Kao, Peter; Clusin, William T.

    2015-01-01

    Elasmobranchs detect small potentials using excitable cells of the ampulla of Lorenzini which have calcium-activated K+ channels, first described in 1974. A distinctive feature of the outward current in voltage clamped ampullae is its apparent insensitivity to voltage. The sequence of a BK channel α isoform expressed in the ampulla of the skate was characterized. A signal peptide is present at the beginning of the gene. When compared to human isoform 1 (the canonical sequence), the largest difference was absence of a 59 amino acid region from the S8-S9 intracellular linker that contains the strex regulatory domain. The ampulla isoform was also compared with the isoform predicted in late skate embryos where strex was also absent. The BK voltage sensors were conserved in both skate isoforms. Differences between the skate and human BK channel included alternative splicing. Alternative splicing occurs at seven previously defined sites that are characteristic for BK channels in general and hair cells in particular. Skate BK sequences were highly similar to the Australian ghost shark and several other vertebrate species. Based on alignment of known BK sequences with the skate genome and transcriptome, there are at least two isoforms of Kcnma1α expressed in the skate. One of the β subunits (β4), which is known to decrease voltage sensitivity, was also identified in the skate genome and transcriptome and in the ampulla. These studies advance our knowledge of BK channels and suggest further studies in the ampulla and other excitable tissues. PMID:26687710

  9. Genome-wide CRISPR screen identifies HNRNPL as a prostate cancer dependency regulating RNA splicing.

    PubMed

    Fei, Teng; Chen, Yiwen; Xiao, Tengfei; Li, Wei; Cato, Laura; Zhang, Peng; Cotter, Maura B; Bowden, Michaela; Lis, Rosina T; Zhao, Shuang G; Wu, Qiu; Feng, Felix Y; Loda, Massimo; He, Housheng Hansen; Liu, X Shirley; Brown, Myles

    2017-06-27

    Alternative RNA splicing plays an important role in cancer. To determine which factors involved in RNA processing are essential in prostate cancer, we performed a genome-wide CRISPR/Cas9 knockout screen to identify the genes that are required for prostate cancer growth. Functional annotation defined a set of essential spliceosome and RNA binding protein (RBP) genes, including most notably heterogeneous nuclear ribonucleoprotein L (HNRNPL). We defined the HNRNPL-bound RNA landscape by RNA immunoprecipitation coupled with next-generation sequencing and linked these RBP-RNA interactions to changes in RNA processing. HNRNPL directly regulates the alternative splicing of a set of RNAs, including those encoding the androgen receptor, the key lineage-specific prostate cancer oncogene. HNRNPL also regulates circular RNA formation via back splicing. Importantly, both HNRNPL and its RNA targets are aberrantly expressed in human prostate tumors, supporting their clinical relevance. Collectively, our data reveal HNRNPL and its RNA clients as players in prostate cancer growth and potential therapeutic targets.

  10. DBATE: database of alternative transcripts expression.

    PubMed

    Bianchi, Valerio; Colantoni, Alessio; Calderone, Alberto; Ausiello, Gabriele; Ferrè, Fabrizio; Helmer-Citterich, Manuela

    2013-01-01

    The use of high-throughput RNA sequencing technology (RNA-seq) allows whole transcriptome analysis, providing an unbiased and unabridged view of alternative transcript expression. Coupling splicing variant-specific expression with its functional inference is still an open and difficult issue for which we created the DataBase of Alternative Transcripts Expression (DBATE), a web-based repository storing expression values and functional annotation of alternative splicing variants. We processed 13 large RNA-seq panels from human healthy tissues and in disease conditions, reporting expression levels and functional annotations gathered and integrated from different sources for each splicing variant, using a variant-specific annotation transfer pipeline. The possibility to perform complex queries by cross-referencing different functional annotations permits the retrieval of desired subsets of splicing variant expression values that can be visualized in several ways, from simple to more informative. DBATE is intended as a novel tool to help appreciate how, and possibly why, the transcriptome expression is shaped. DATABASE URL: http://bioinformatica.uniroma2.it/DBATE/.

  11. APPRIS: annotation of principal and alternative splice isoforms

    PubMed Central

    Rodriguez, Jose Manuel; Maietta, Paolo; Ezkurdia, Iakes; Pietrelli, Alessandro; Wesselink, Jan-Jaap; Lopez, Gonzalo; Valencia, Alfonso; Tress, Michael L.

    2013-01-01

    Here, we present APPRIS (http://appris.bioinfo.cnio.es), a database that houses annotations of human splice isoforms. APPRIS has been designed to provide value to manual annotations of the human genome by adding reliable protein structural and functional data and information from cross-species conservation. The visual representation of the annotations provided by APPRIS for each gene allows annotators and researchers alike to easily identify functional changes brought about by splicing events. In addition to collecting, integrating and analyzing reliable predictions of the effect of splicing events, APPRIS also selects a single reference sequence for each gene, here termed the principal isoform, based on the annotations of structure, function and conservation for each transcript. APPRIS identifies a principal isoform for 85% of the protein-coding genes in the GENCODE 7 release for ENSEMBL. Analysis of the APPRIS data shows that at least 70% of the alternative (non-principal) variants would lose important functional or structural information relative to the principal isoform. PMID:23161672

  12. Whole Exome Sequencing identifies a splicing mutation in NSUN2 as a cause of a Dubowitz-like syndrome

    PubMed Central

    Martinez, Fernando; Lee, Jeong Ho; Lee, Ji Eun; Blanco, Sandra; Nickerson, Elizabeth; Gabriel, Stacey; Frye, Michaela; Al-Gazali, Lihadh; Gleeson, Joseph G.

    2016-01-01

    Dubowitz Syndrome is an autosomal recessive disorder characterized by the constellation of mild microcephaly, growth and mental retardation, eczema and peculiar facies, but causes are still unknown. We studied a multiplex consanguineous family with many features of Dubowitz syndrome using whole exome sequencing and identified a splice mutation in NSUN2, encoding a conserved RNA methyltransferase. NSUN2 has been implicated in Myc-induced cell proliferation and mitotic spindle stability, which might help explain the varied clinical presentations that can include chromosomal instability and immunological defects. Patient cells displayed loss of NSUN2-specific methylation at two residues of the aspartate tRNA. Our findings establish NSUN2 as the first causal gene with relationship to the Dubowitz syndrome spectrum phenotype. PMID:22577224

  13. A universal genomic coordinate translator for comparative genomics

    PubMed Central

    2014-01-01

    Background Genomic duplications constitute major events in the evolution of species, allowing paralogous copies of genes to take on fine-tuned biological roles. Unambiguously identifying the orthology relationship between copies across multiple genomes can be resolved by synteny, i.e. the conserved order of genomic sequences. However, a comprehensive analysis of duplication events and their contributions to evolution would require all-to-all genome alignments, which increases as N^2 with the number of available genomes, N. Results Here, we introduce Kraken, software that omits the all-to-all requirement by recursively traversing a graph of pairwise alignments and dynamically re-computing orthology. Kraken scales linearly with the number of targeted genomes, N, which allows for including large numbers of genomes in analyses. We first evaluated the method on the set of 12 Drosophila genomes, finding that orthologous correspondence computed indirectly through a graph of multiple synteny maps comes at minimal cost in terms of sensitivity, but reduces overall computational runtime by an order of magnitude. We then used the method on three well-annotated mammalian genomes, human, mouse, and rat, and show that up to 93% of protein coding transcripts have unambiguous pairwise orthologous relationships across the genomes. On a nucleotide level, 70 to 83% of exons match exactly at both splice junctions, and up to 97% on at least one junction. We last applied Kraken to an RNA-sequencing dataset from multiple vertebrates and diverse tissues, where we confirmed that brain-specific gene family members, i.e. one-to-many or many-to-many homologs, are more highly correlated across species than single-copy (i.e. one-to-one homologous) genes. Not limited to protein coding genes, Kraken also identifies thousands of newly identified transcribed loci, likely non-coding RNAs that are consistently transcribed in human, chimpanzee and gorilla, and maintain significant correlation of expression levels across species. Conclusions Kraken is a computational genome coordinate translator that facilitates cross-species comparisons, distinguishes orthologs from paralogs, and does not require costly all-to-all whole genome mappings. Kraken is freely available under LGPL from http://github.com/nedaz/kraken. PMID:24976580
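The idea of translating coordinates through a graph of pairwise alignments, rather than through all-to-all alignments, can be sketched as follows. This is only an illustration under stated assumptions and is not Kraken's implementation: the block format (source start, source end, target start), the function names, and the use of ungapped same-orientation blocks are all hypothetical simplifications.

```python
from collections import deque


def map_position(blocks, pos):
    """Map a position through one pairwise synteny map.

    blocks is a list of (src_start, src_end, tgt_start) tuples describing
    ungapped, same-orientation aligned blocks (a simplifying assumption).
    Returns the mapped position, or None if pos is not covered.
    """
    for src_start, src_end, tgt_start in blocks:
        if src_start <= pos < src_end:
            return tgt_start + (pos - src_start)
    return None


def translate(pairwise, source, target, pos):
    """Breadth-first search over a graph of pairwise maps.

    pairwise maps (genome_a, genome_b) -> blocks. The search follows any
    chain of pairwise maps that covers the position, so a direct
    source-to-target alignment is not required.
    """
    queue = deque([(source, pos)])
    seen = {source}
    while queue:
        genome, current = queue.popleft()
        if genome == target:
            return current
        for (a, b), blocks in pairwise.items():
            if a != genome or b in seen:
                continue
            mapped = map_position(blocks, current)
            if mapped is not None:
                seen.add(b)
                queue.append((b, mapped))
    return None


if __name__ == "__main__":
    # Hypothetical maps: human->mouse and mouse->rat, but no human->rat.
    maps = {
        ("human", "mouse"): [(100, 200, 1100)],
        ("mouse", "rat"): [(1000, 1300, 50)],
    }
    print(translate(maps, "human", "rat", 150))
```

With N genomes, a chain or tree of N-1 pairwise maps is enough for this kind of traversal, whereas all-to-all alignment needs on the order of N^2 maps.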

  14. A universal genomic coordinate translator for comparative genomics.

    PubMed

    Zamani, Neda; Sundström, Görel; Meadows, Jennifer R S; Höppner, Marc P; Dainat, Jacques; Lantz, Henrik; Haas, Brian J; Grabherr, Manfred G

    2014-06-30

    Genomic duplications constitute major events in the evolution of species, allowing paralogous copies of genes to take on fine-tuned biological roles. Unambiguously identifying the orthology relationship between copies across multiple genomes can be resolved by synteny, i.e. the conserved order of genomic sequences. However, a comprehensive analysis of duplication events and their contributions to evolution would require all-to-all genome alignments, which increases as N^2 with the number of available genomes, N. Here, we introduce Kraken, software that omits the all-to-all requirement by recursively traversing a graph of pairwise alignments and dynamically re-computing orthology. Kraken scales linearly with the number of targeted genomes, N, which allows for including large numbers of genomes in analyses. We first evaluated the method on the set of 12 Drosophila genomes, finding that orthologous correspondence computed indirectly through a graph of multiple synteny maps comes at minimal cost in terms of sensitivity, but reduces overall computational runtime by an order of magnitude. We then used the method on three well-annotated mammalian genomes, human, mouse, and rat, and show that up to 93% of protein coding transcripts have unambiguous pairwise orthologous relationships across the genomes. On a nucleotide level, 70 to 83% of exons match exactly at both splice junctions, and up to 97% on at least one junction. We last applied Kraken to an RNA-sequencing dataset from multiple vertebrates and diverse tissues, where we confirmed that brain-specific gene family members, i.e. one-to-many or many-to-many homologs, are more highly correlated across species than single-copy (i.e. one-to-one homologous) genes. Not limited to protein coding genes, Kraken also identifies thousands of newly identified transcribed loci, likely non-coding RNAs that are consistently transcribed in human, chimpanzee and gorilla, and maintain significant correlation of expression levels across species. Kraken is a computational genome coordinate translator that facilitates cross-species comparisons, distinguishes orthologs from paralogs, and does not require costly all-to-all whole genome mappings. Kraken is freely available under LGPL from http://github.com/nedaz/kraken.

  15. Impaired Spermatogenesis, Muscle, and Erythrocyte Function in U12 Intron Splicing-Defective Zrsr1 Mutant Mice.

    PubMed

    Horiuchi, Keiko; Perez-Cerezales, Serafín; Papasaikas, Panagiotis; Ramos-Ibeas, Priscila; López-Cardona, Angela Patricia; Laguna-Barraza, Ricardo; Fonseca Balvís, Noelia; Pericuesta, Eva; Fernández-González, Raul; Planells, Benjamín; Viera, Alberto; Suja, Jose Angel; Ross, Pablo Juan; Alén, Francisco; Orio, Laura; Rodriguez de Fonseca, Fernando; Pintado, Belén; Valcárcel, Juan; Gutiérrez-Adán, Alfonso

    2018-04-03

    The U2AF35-like ZRSR1 has been implicated in the recognition of 3' splice site during spliceosome assembly, but ZRSR1 knockout mice do not show abnormal phenotypes. To analyze ZRSR1 function and its precise role in RNA splicing, we generated ZRSR1 mutant mice containing truncating mutations within its RNA-recognition motif. Homozygous mutant mice exhibited severe defects in erythrocytes, muscle stretch, and spermatogenesis, along with germ cell sloughing and apoptosis, ultimately leading to azoospermia and male sterility. Testis RNA sequencing (RNA-seq) analyses revealed increased intron retention of both U2- and U12-type introns, including U12-type intron events in genes with key functions in spermatogenesis and spermatid development. Affected U2 introns were commonly found flanking U12 introns, suggesting functional cross-talk between the two spliceosomes. The splicing and tissue defects observed in mutant mice attributed to ZRSR1 loss of function suggest a physiological role for this factor in U12 intron splicing. Copyright © 2018 The Author(s). Published by Elsevier Inc. All rights reserved.

  16. A mechanism underlying position-specific regulation of alternative splicing

    PubMed Central

    Hamid, Fursham M.

    2017-01-01

    Many RNA-binding proteins including a master regulator of splicing in developing brain and muscle, polypyrimidine tract-binding protein 1 (PTBP1), can either activate or repress alternative exons depending on the pre-mRNA recruitment position. When bound upstream of or within regulated exons, PTBP1 tends to promote their skipping, whereas binding to downstream sites often stimulates inclusion. How this switch is orchestrated at the molecular level is poorly understood. Using bioinformatics and biochemical approaches we show that interaction of PTBP1 with downstream intronic sequences can activate natural cassette exons by promoting productive docking of the spliceosomal U1 snRNP to a suboptimal 5′ splice site. Strikingly, introducing upstream PTBP1 sites to this circuitry leads to a potent splicing repression accompanied by the assembly of an exonic ribonucleoprotein complex with a tightly bound U1 but not U2 snRNP. Our data suggest a molecular mechanism underlying the transition between a better-known repressive function of PTBP1 and its role as a bona fide splicing activator. More generally, we argue that the functional outcome of individual RNA contacts made by an RNA-binding protein is subject to extensive context-specific modulation.

  17. NCLscan: accurate identification of non-co-linear transcripts (fusion, trans-splicing and circular RNA) with a good balance between sensitivity and precision.

    PubMed

    Chuang, Trees-Juen; Wu, Chan-Shuo; Chen, Chia-Ying; Hung, Li-Yuan; Chiang, Tai-Wei; Yang, Min-Yu

    2016-02-18

    Analysis of RNA-seq data often detects numerous 'non-co-linear' (NCL) transcripts, which comprise sequence segments that are topologically inconsistent with their corresponding DNA sequences in the reference genome. However, detection of NCL transcripts involves two major challenges: removal of false positives arising from alignment artifacts and discrimination between different types of NCL transcripts (trans-spliced, circular or fusion transcripts). Here, we developed a new NCL-transcript-detecting method ('NCLscan'), which utilized a stepwise alignment strategy to almost completely eliminate false calls (>98% precision) without sacrificing true positives, enabling NCLscan to outperform 18 other publicly-available tools (including fusion- and circular-RNA-detecting tools) in terms of sensitivity and precision, regardless of the generation strategy of simulated dataset, type of intragenic or intergenic NCL event, read depth of coverage, read length or expression level of NCL transcript. With the high accuracy, NCLscan was applied to distinguishing between trans-spliced, circular and fusion transcripts on the basis of poly(A)- and nonpoly(A)-selected RNA-seq data. We showed that circular RNAs were expressed more ubiquitously, more abundantly and less cell type-specifically than trans-spliced and fusion transcripts. Our study thus describes a robust pipeline for the discovery of NCL transcripts, and sheds light on the fundamental biology of these non-canonical RNA events in human transcriptome. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  18. Alteration of the SETBP1 gene and splicing pathway genes SF3B1, U2AF1, and SRSF2 in childhood acute myeloid leukemia.

    PubMed

    Choi, Hyun-Woo; Kim, Hye-Ran; Baek, Hee-Jo; Kook, Hoon; Cho, Duck; Shin, Jong-Hee; Suh, Soon-Pal; Ryang, Dong-Wook; Shin, Myung-Geun

    2015-01-01

    Recurrent somatic SET-binding protein 1 (SETBP1) and splicing pathway gene mutations have recently been found in atypical chronic myeloid leukemia and other hematologic malignancies. These mutations have been comprehensively analyzed in adult AML, but not in childhood AML. We investigated possible alteration of the SETBP1, splicing factor 3B subunit 1 (SF3B1), U2 small nuclear RNA auxiliary factor 1 (U2AF1), and serine/arginine-rich splicing factor 2 (SRSF2) genes in childhood AML. Cytogenetic and molecular analyses were performed to reveal chromosomal and genetic alterations. Sequence alterations in the SETBP1, SF3B1, U2AF1, and SRSF2 genes were examined by using direct sequencing in a cohort of 53 childhood AML patients. Childhood AML patients did not harbor any recurrent SETBP1 gene mutations, although our study did identify a synonymous mutation in one patient. None of the previously reported aberrations in the mutational hotspot of SF3B1, U2AF1, and SRSF2 were identified in any of the 53 patients. Alterations of the SETBP1 gene or SF3B1, U2AF1, and SRSF2 genes are not common genetic events in childhood AML, implying that the mutations are unlikely to exert a driver effect in myeloid leukemogenesis during childhood.

  19. rMATS: robust and flexible detection of differential alternative splicing from replicate RNA-Seq data.

    PubMed

    Shen, Shihao; Park, Juw Won; Lu, Zhi-xiang; Lin, Lan; Henry, Michael D; Wu, Ying Nian; Zhou, Qing; Xing, Yi

    2014-12-23

    Ultra-deep RNA sequencing (RNA-Seq) has become a powerful approach for genome-wide analysis of pre-mRNA alternative splicing. We previously developed multivariate analysis of transcript splicing (MATS), a statistical method for detecting differential alternative splicing between two RNA-Seq samples. Here we describe a new statistical model and computer program, replicate MATS (rMATS), designed for detection of differential alternative splicing from replicate RNA-Seq data. rMATS uses a hierarchical model to simultaneously account for sampling uncertainty in individual replicates and variability among replicates. In addition to the analysis of unpaired replicates, rMATS also includes a model specifically designed for paired replicates between sample groups. The hypothesis-testing framework of rMATS is flexible and can assess the statistical significance over any user-defined magnitude of splicing change. The performance of rMATS is evaluated by the analysis of simulated and real RNA-Seq data. rMATS outperformed two existing methods for replicate RNA-Seq data in all simulation settings, and RT-PCR yielded a high validation rate (94%) in an RNA-Seq dataset of prostate cancer cell lines. Our data also provide guiding principles for designing RNA-Seq studies of alternative splicing. We demonstrate that it is essential to incorporate biological replicates in the study design. Of note, pooling RNAs or merging RNA-Seq data from multiple replicates is not an effective approach to account for variability, and the result is particularly sensitive to outliers. The rMATS source code is freely available at rnaseq-mats.sourceforge.net/. As the popularity of RNA-Seq continues to grow, we expect rMATS will be useful for studies of alternative splicing in diverse RNA-Seq projects.
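The point in the abstract above about pooling replicates can be illustrated with a small sketch. It does not implement the rMATS hierarchical model; it uses a plain count ratio (inclusion reads over inclusion plus exclusion reads) as an assumed, simplified inclusion level, and the function names and read counts are hypothetical.

```python
from statistics import median


def inclusion_level(inclusion_reads, exclusion_reads):
    """Simplified exon inclusion level: inclusion / (inclusion + exclusion).

    This ignores effective-length normalization and sampling uncertainty.
    """
    total = inclusion_reads + exclusion_reads
    if total == 0:
        raise ValueError("no reads support either isoform")
    return inclusion_reads / total


def pooled_inclusion_level(replicates):
    """Merge read counts from all replicates, then compute one ratio."""
    inclusion = sum(inc for inc, _ in replicates)
    exclusion = sum(exc for _, exc in replicates)
    return inclusion_level(inclusion, exclusion)


def per_replicate_inclusion_levels(replicates):
    """Compute one ratio per replicate, preserving between-replicate spread."""
    return [inclusion_level(inc, exc) for inc, exc in replicates]


if __name__ == "__main__":
    # Hypothetical (inclusion, exclusion) counts; the third replicate is a
    # deeply sequenced outlier.
    replicates = [(10, 90), (12, 88), (900, 100)]
    print(per_replicate_inclusion_levels(replicates))
    print(median(per_replicate_inclusion_levels(replicates)))
    print(pooled_inclusion_level(replicates))
```

In this toy case two replicates sit near 0.1 while the pooled estimate is pulled to roughly 0.77 by the single deep outlier, and the pooled value carries no information about how much the replicates disagree.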

  20. RNA-Seq profiling reveals aberrant RNA splicing in patient with adult acute myeloid leukemia during treatment.

    PubMed

    Li, X-y; Yao, X; Li, S-n; Suo, A-l; Ruan, Z-p; Liang, X; Kong, Y; Zhang, W-g; Yao, Y

    2014-01-01

    Multiple genetic alterations that affect the process of acute myeloid leukemia (AML) have been discovered, and more evidence also indicates that aberrant splicing plays an important role in cancer. We present RNA-Seq profiling of an AML patient with complete remission after treatment, to analyze the aberrant splicing of genes during treatment. We sequenced 3.97 and 3.32 Gbp of clean data from the AML and remission samples, respectively. Firstly, by analyzing biomarkers associated with AML, to assist normal clinical tests, we confirmed that the patient had a normal karyotype, with NPM1 and IDH2 mutations and deregulation patterns of related genes, such as BAALC, ERG, MN1 and the HOX family. Then, we performed alternative splicing detection on the AML and remission samples. We detected 91 differential splicing events in 68 differentially spliced genes (DSGs) by mixture of isoforms (MISO). Considering Psi values (Ψ) and confidence intervals, 25 differentially expressed isoforms were identified as more confident isoforms, which were associated with RNA processing, cellular macromolecule catabolic process and DNA binding according to GO enrichment analysis. An exon2-skipping event in the oncogene FOS (FBJ murine osteosarcoma viral oncogene homolog) was detected and validated in this study. FOS has a critical function in regulating cell proliferation, differentiation and transformation. The exon2-skipping isoform of FOS was increased significantly after treatment. Together, the RNA-Seq data and information provide highly accurate and comprehensive supplements to conventional clinical tests of AML. Moreover, the splicing aberrations would be another source for biomarker and even therapeutic target discovery. More information on splicing may also assist the better understanding of leukemogenesis.
