Science.gov

Sample records for rna instability sequences

  1. The role of topoisomerase I in suppressing genome instability associated with a highly transcribed guanine-rich sequence is not restricted to preventing RNA:DNA hybrid accumulation

    PubMed Central

    Yadav, Puja; Owiti, Norah; Kim, Nayun

    2016-01-01

    Highly transcribed guanine-run containing sequences in Saccharomyces cerevisiae become unstable when topoisomerase I (Top1) is disrupted. Topological changes, such as the formation of extended RNA:DNA hybrids or R-loops, or non-canonical DNA structures including G-quadruplexes, have been proposed as the major underlying cause of the transcription-linked genome instability. Here, we report that R-loop accumulation at a guanine-rich sequence, which is capable of assembling into the four-stranded G4 DNA structure, is dependent on the level and the orientation of transcription. In the absence of Top1 or RNase Hs, R-loops accumulated to a substantially higher extent when guanine-runs were located on the non-transcribed strand. This coincides with the orientation where higher genome instability was observed. However, we further report that there are significant differences between the disruption of RNase Hs and Top1 with regard to the orientation-specific elevation in genome instability at the guanine-rich sequence. Additionally, genome instability in Top1-deficient yeasts is not completely suppressed by removal of negative supercoils and is further aggravated by expression of mutant Top1. Together, our data provide strong support for a function of Top1 in suppressing genome instability at the guanine-run containing sequence that goes beyond preventing the transcription-associated RNA:DNA hybrid formation. PMID:26527723
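
The strand orientation discussed in this abstract can be illustrated with a small sketch that reports which strand of a transcribed region carries a putative G4-forming guanine-run motif. The motif definition (four runs of at least three guanines separated by loops of 1 to 7 nucleotides) is a commonly used heuristic and an assumption here, not the authors' criterion; the function names are hypothetical.

```python
import re

# Assumed heuristic motif: four runs of >=3 G separated by loops of 1-7 nt.
# This is NOT taken from the abstract; it only illustrates the idea.
G4_PATTERN = re.compile(
    r"G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}"
)

def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    comp = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(comp[base] for base in reversed(seq.upper()))

def g4_strands(non_transcribed_seq):
    """List the strand(s) carrying a putative G4 motif.

    The input is the non-transcribed (coding) strand, written 5'->3'.
    The transcribed (template) strand is its reverse complement.
    """
    seq = non_transcribed_seq.upper()
    hits = []
    if G4_PATTERN.search(seq):
        hits.append("non-transcribed")
    if G4_PATTERN.search(reverse_complement(seq)):
        hits.append("transcribed")
    return hits
```

Calling `g4_strands` on a sequence and on its reverse complement models the two transcription orientations of the same insert.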

  2. The role of topoisomerase I in suppressing genome instability associated with a highly transcribed guanine-rich sequence is not restricted to preventing RNA:DNA hybrid accumulation.

    PubMed

    Yadav, Puja; Owiti, Norah; Kim, Nayun

    2016-01-29

    Highly transcribed guanine-run containing sequences in Saccharomyces cerevisiae become unstable when topoisomerase I (Top1) is disrupted. Topological changes, such as the formation of extended RNA:DNA hybrids or R-loops, or non-canonical DNA structures including G-quadruplexes, have been proposed as the major underlying cause of the transcription-linked genome instability. Here, we report that R-loop accumulation at a guanine-rich sequence, which is capable of assembling into the four-stranded G4 DNA structure, is dependent on the level and the orientation of transcription. In the absence of Top1 or RNase Hs, R-loops accumulated to a substantially higher extent when guanine-runs were located on the non-transcribed strand. This coincides with the orientation where higher genome instability was observed. However, we further report that there are significant differences between the disruption of RNase Hs and Top1 with regard to the orientation-specific elevation in genome instability at the guanine-rich sequence. Additionally, genome instability in Top1-deficient yeasts is not completely suppressed by removal of negative supercoils and is further aggravated by expression of mutant Top1. Together, our data provide strong support for a function of Top1 in suppressing genome instability at the guanine-run containing sequence that goes beyond preventing the transcription-associated RNA:DNA hybrid formation. PMID:26527723

  3. RNA Sequencing in Schizophrenia

    PubMed Central

    Li, Xin; Teng, Shaolei

    2015-01-01

    Schizophrenia (SCZ) is a serious psychiatric disorder that affects 1% of the general population and places a heavy burden worldwide. The underlying genetic mechanism of SCZ remains unknown, but studies indicate that the disease is associated with a global gene expression disturbance across many genes. Next-generation sequencing, particularly RNA sequencing (RNA-Seq), provides a powerful genome-scale technology to investigate the pathological processes of SCZ. RNA-Seq has been used to analyze gene expression and to identify novel splice isoforms and rare transcripts associated with SCZ. This paper provides an overview of the genetics of SCZ, the advantages of RNA-Seq for transcriptome analysis, the accomplishments of RNA-Seq in SCZ cohorts, and the applications of induced pluripotent stem cells and RNA-Seq in SCZ research. PMID:27053919

  4. AMPLIFICATION OF RIBOSOMAL RNA SEQUENCES

    EPA Science Inventory

    This book chapter offers an overview of the use of ribosomal RNA sequences. A history of the technology traces the evolution of techniques to measure bacterial phylogenetic relationships and recent advances in obtaining rRNA sequence information. The manual also describes procedu...

  5. Deciphering the RNA landscape by RNAome sequencing

    PubMed Central

    Derks, Kasper WJ; Misovic, Branislav; van den Hout, Mirjam CGN; Kockx, Christel EM; Payan Gomez, Cesar; Brouwer, Rutger WW; Vrieling, Harry; Hoeijmakers, Jan HJ; van IJcken, Wilfred FJ; Pothof, Joris

    2015-01-01

    Current RNA expression profiling methods rely on enrichment steps for specific RNA classes, thereby not detecting all RNA species in an unperturbed manner. We report strand-specific RNAome sequencing that determines expression of small and large RNAs from rRNA-depleted total RNA in a single sequence run. Since current analysis pipelines cannot reliably analyze small and large RNAs simultaneously, we developed TRAP, Total Rna Analysis Pipeline, a robust interface that is also compatible with existing RNA sequencing protocols. RNAome sequencing quantitatively preserved all RNA classes, allowing cross-class comparisons that facilitate the identification of relationships between different RNA classes. We demonstrate the strength of RNAome sequencing in mouse embryonic stem cells treated with cisplatin. MicroRNA and mRNA expression in RNAome sequencing significantly correlated between replicates and was in concordance with both existing RNA sequencing methods and gene expression arrays generated from the same samples. Moreover, RNAome sequencing also detected additional RNA classes such as enhancer RNAs, anti-sense RNAs, novel RNA species and numerous differentially expressed RNAs undetectable by other methods. At the level of complete RNA classes, RNAome sequencing also identified a specific global repression of the microRNA and microRNA isoform classes after cisplatin treatment, whereas all other classes such as mRNAs were unchanged. These characteristics of RNAome sequencing will significantly improve expression analysis as well as studies on RNA biology not covered by existing methods. PMID:25826412

  6. Deciphering the RNA landscape by RNAome sequencing.

    PubMed

    Derks, Kasper W J; Misovic, Branislav; van den Hout, Mirjam C G N; Kockx, Christel E M; Gomez, Cesar Payan; Brouwer, Rutger W W; Vrieling, Harry; Hoeijmakers, Jan H J; van IJcken, Wilfred F J; Pothof, Joris

    2015-01-01

    Current RNA expression profiling methods rely on enrichment steps for specific RNA classes, thereby not detecting all RNA species in an unperturbed manner. We report strand-specific RNAome sequencing that determines expression of small and large RNAs from rRNA-depleted total RNA in a single sequence run. Since current analysis pipelines cannot reliably analyze small and large RNAs simultaneously, we developed TRAP, Total Rna Analysis Pipeline, a robust interface that is also compatible with existing RNA sequencing protocols. RNAome sequencing quantitatively preserved all RNA classes, allowing cross-class comparisons that facilitate the identification of relationships between different RNA classes. We demonstrate the strength of RNAome sequencing in mouse embryonic stem cells treated with cisplatin. MicroRNA and mRNA expression in RNAome sequencing significantly correlated between replicates and was in concordance with both existing RNA sequencing methods and gene expression arrays generated from the same samples. Moreover, RNAome sequencing also detected additional RNA classes such as enhancer RNAs, anti-sense RNAs, novel RNA species and numerous differentially expressed RNAs undetectable by other methods. At the level of complete RNA classes, RNAome sequencing also identified a specific global repression of the microRNA and microRNA isoform classes after cisplatin treatment, whereas all other classes such as mRNAs were unchanged. These characteristics of RNAome sequencing will significantly improve expression analysis as well as studies on RNA biology not covered by existing methods. PMID:25826412

  7. RNAome sequencing delineates the complete RNA landscape.

    PubMed

    Derks, Kasper W J; Pothof, Joris

    2015-09-01

    Standard RNA expression profiling methods rely on enrichment steps for specific RNA classes, thereby not detecting all RNA species. For example, small and large RNAs from the same sample cannot be sequenced in a single sequence run. We designed RNAome sequencing, which is a strand-specific method to determine the expression of small and large RNAs from ribosomal RNA-depleted total RNA in a single sequence run. RNAome sequencing quantitatively preserves all RNA classes. This characteristic allows comparisons between RNA classes, thereby facilitating the identification of relationships between different RNA classes. Here, we describe in detail the experimental procedure associated with RNAome sequencing published by Derks and colleagues in RNA Biology (2015) [1]. We also provide the R code for the developed Total Rna Analysis Pipeline (TRAP), an algorithm to analyze RNAome sequencing datasets (deposited at the Gene Expression Omnibus data repository, accession number GSE48084). PMID:26484291

  8. RNA sequence analysis using covariance models.

    PubMed Central

    Eddy, S R; Durbin, R

    1994-01-01

    We describe a general approach to several RNA sequence analysis problems using probabilistic models that flexibly describe the secondary structure and primary sequence consensus of an RNA sequence family. We call these models 'covariance models'. A covariance model of tRNA sequences is an extremely sensitive and discriminative tool for searching for additional tRNAs and tRNA-related sequences in sequence databases. A model can be built automatically from an existing sequence alignment. We also describe an algorithm for learning a model and hence a consensus secondary structure from initially unaligned example sequences and no prior structural information. Models trained on unaligned tRNA examples correctly predict tRNA secondary structure and produce high-quality multiple alignments. The approach may be applied to any family of small RNA sequences. PMID:8029015
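
As a rough illustration of the kind of signal a covariance model exploits, the sketch below computes the mutual information between two columns of a multiple sequence alignment; columns that form a conserved base pair tend to covary and score high. This is only a minimal sketch of the covariation statistic, assuming a toy alignment and a hypothetical function name, and it is not the model-building algorithm described in the abstract.

```python
from collections import Counter
from math import log2

def mutual_information(alignment, i, j):
    """Mutual information (in bits) between alignment columns i and j.

    `alignment` is a list of equal-length aligned sequences. Rows with a
    gap ('-') in either column are skipped (a simplifying assumption).
    """
    pairs = [(s[i], s[j]) for s in alignment if s[i] != "-" and s[j] != "-"]
    n = len(pairs)
    if n == 0:
        return 0.0
    freq_i = Counter(a for a, _ in pairs)   # single-column counts
    freq_j = Counter(b for _, b in pairs)
    freq_ij = Counter(pairs)                # joint counts
    return sum(
        (c / n) * log2((c / n) / ((freq_i[a] / n) * (freq_j[b] / n)))
        for (a, b), c in freq_ij.items()
    )
```

In a toy alignment where the first and last columns always form a Watson-Crick pair but vary between sequences, those two columns give the maximum score, while two invariant columns give zero.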

  9. Next generation sequencing of viral RNA genomes

    PubMed Central

    2013-01-01

    Background: With the advent of Next Generation Sequencing (NGS) technologies, the ability to generate large amounts of sequence data has revolutionized the genomics field. Most RNA viruses have relatively small genomes in comparison to other organisms and as such, would appear to be an obvious success story for the use of NGS technologies. However, due to the relatively low abundance of viral RNA in relation to host RNA, RNA viruses have proved relatively difficult to sequence using NGS technologies. Here we detail a simple, robust methodology, without the use of ultra-centrifugation, filtration or viral enrichment protocols, to prepare RNA from diagnostic clinical tissue samples, cell monolayers and tissue culture supernatant, for subsequent sequencing on the Roche 454 platform. Results: As representative RNA viruses, full genome sequence was successfully obtained from known lyssaviruses belonging to recognized species and a novel lyssavirus species using these protocols and assembling the reads using de novo algorithms. Furthermore, genome sequences were generated from considerably less than 200 ng RNA, indicating that manufacturers’ minimum template guidance is conservative. In addition to obtaining genome consensus sequence, a high proportion of SNPs (Single Nucleotide Polymorphisms) were identified in the majority of samples analyzed. Conclusions: The approaches reported clearly facilitate successful full genome lyssavirus sequencing and can be universally applied to discovering and obtaining consensus genome sequences of RNA viruses from a variety of sources. PMID:23822119

  10. antaRNA: ant colony-based RNA sequence design

    PubMed Central

    Kleinkauf, Robert; Mann, Martin; Backofen, Rolf

    2015-01-01

    Motivation: RNA sequence design has been studied at least as long as the classical folding problem. Whereas for the latter the functional fold of an RNA molecule is to be found, inverse folding tries to identify RNA sequences that fold into a function-specific target structure. In combination with RNA-based biotechnology and synthetic biology, reliable RNA sequence design becomes a crucial step to generate novel biochemical components. Results: In this article, the computational tool antaRNA is presented. It is capable of compiling RNA sequences for a given structure that comply in addition with an adjustable full range objective GC-content distribution, specific sequence constraints and additional fuzzy structure constraints. antaRNA applies ant colony optimization meta-heuristics and its superior performance is shown on biological datasets. Availability and implementation: http://www.bioinf.uni-freiburg.de/Software/antaRNA Contact: backofen@informatik.uni-freiburg.de Supplementary information: Supplementary data are available at Bioinformatics online. PMID:26023105
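
Two of the constraints named in this abstract, a target structure and a GC-content objective, can be checked for a candidate sequence with a few lines of code. The sketch below is not antaRNA and uses no ant colony optimization; it only tests whether a sequence can form every base pair of a dot-bracket target (a necessary condition, not proof that it folds that way) and reports its GC fraction. All function names are hypothetical.

```python
# Pairs allowed in RNA helices: Watson-Crick plus G-U wobble.
VALID_PAIRS = {("A", "U"), ("U", "A"), ("G", "C"),
               ("C", "G"), ("G", "U"), ("U", "G")}

def base_pairs(structure):
    """Return (i, j) index pairs from a dot-bracket string."""
    stack, pairs = [], []
    for pos, ch in enumerate(structure):
        if ch == "(":
            stack.append(pos)
        elif ch == ")":
            if not stack:
                raise ValueError("unbalanced structure")
            pairs.append((stack.pop(), pos))
    if stack:
        raise ValueError("unbalanced structure")
    return pairs

def is_compatible(seq, structure):
    """True if every paired position holds a valid base pair."""
    if len(seq) != len(structure):
        return False
    return all((seq[i], seq[j]) in VALID_PAIRS
               for i, j in base_pairs(structure))

def gc_content(seq):
    """Fraction of G and C nucleotides in a non-empty sequence."""
    return sum(base in "GC" for base in seq) / len(seq)
```

A design loop would propose sequences, keep only compatible ones, and score them against the GC target and a folding prediction; those steps are outside this sketch.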

  11. Mechanisms of genome instability induced by RNA processing defects

    PubMed Central

    Chan, Yujia A.; Hieter, Philip

    2014-01-01

    The role of normal transcription and RNA processing in maintaining genome integrity is becoming increasingly appreciated in organisms ranging from bacteria to humans. Several mutations in RNA biogenesis factors have been implicated in human cancers, but the mechanisms and potential connections to tumor genome instability are not clear. Here we discuss how RNA processing defects could destabilize genomes through mutagenic R-loop structures and by altering expression of genes required for genome stability. A compelling body of evidence now suggests that researchers should be directly testing these mechanisms in models of human cancer. PMID:24794811

  12. Mechanisms of genome instability induced by RNA-processing defects.

    PubMed

    Chan, Yujia A; Hieter, Philip; Stirling, Peter C

    2014-06-01

    The role of normal transcription and RNA processing in maintaining genome integrity is becoming increasingly appreciated in organisms ranging from bacteria to humans. Several mutations in RNA biogenesis factors have been implicated in human cancers, but the mechanisms and potential connections to tumor genome instability are not clear. Here, we discuss how RNA-processing defects could destabilize genomes through mutagenic R-loop structures and by altering expression of genes required for genome stability. A compelling body of evidence now suggests that researchers should be directly testing these mechanisms in models of human cancer. PMID:24794811

  13. Sequence Dependence of Viral RNA Encapsidation.

    PubMed

    Kelly, Joshua; Grosberg, Alexander Y; Bruinsma, Robijn

    2016-07-01

    We develop a Flory mean-field theory for viral RNA (vRNA) molecules that extends the current RNA folding algorithms to include interactions between different sections of the secondary structure. The theory is applied to sequence-selective vRNA encapsidation. The dependence on sequence enters through a single parameter: the largest eigenvalue of the Kramers matrix of the branched polymer obtained by coarse graining the secondary structure. Differences between the work of encapsidation of vRNA molecules and of randomized isomers are found to be in the range of 20 k_B T, more than sufficient to provide a strong bias in favor of vRNA encapsidation. The method is applied to a packaging competition experiment where large vRNA molecules compete for encapsidation with two smaller RNA species that together have the same nucleotide sequence as the large molecule. We encounter a substantial, generic free energy bias, also of the order of 20 k_B T, in favor of encapsidating the single large RNA molecule. The bias is mainly the consequence of the fact that dividing up a large vRNA molecule involves the release of stored elastic energy. This provides an important, nonspecific mechanism for preferential encapsidation of single larger vRNA molecules over multiple smaller mRNA molecules with the same total number of nucleotides. The result is also consistent with recent RNA packaging competition experiments by Comas-Garcia et al. [1]. Finally, the Flory method leads to the result that when two RNA molecules are copackaged, they are expected to remain segregated inside the capsid. PMID:27116641
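
To give a sense of scale for the 20 k_B T figure, the following worked estimate converts the energy difference into a relative statistical weight. It assumes simple equilibrium Boltzmann weighting between the two packaging outcomes, which the abstract does not state, and the symbol \Delta W for the difference in work of encapsidation is introduced here for illustration only.

```latex
\frac{P_{\text{vRNA}}}{P_{\text{random}}}
  \approx \exp\!\left(\frac{\Delta W}{k_B T}\right)
  = e^{20}
  \approx 4.9 \times 10^{8}
```

Under that assumption, a 20 k_B T difference corresponds to a preference of several hundred million to one, which is the sense in which the bias is described as strong.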

  14. Experimental investigation of an RNA sequence space

    NASA Technical Reports Server (NTRS)

    Lee, Youn-Hyung; Dsouza, Lisa; Fox, George E.

    1993-01-01

    Modern rRNAs are the historic consequence of an ongoing evolutionary exploration of a sequence space. These extant sequences belong to a special subset of the sequence space that is comprised only of those primary sequences that can validly perform the biological function(s) required of the particular RNA. If it were possible to readily identify all such valid sequences, stochastic predictions could be made about the relative likelihood of various evolutionary pathways available to an RNA. Herein an experimental system which can assess whether a particular sequence is likely to have validity as a eubacterial 5S rRNA is described. A total of ten naturally occurring, and hence known to be valid, sequences and two point mutants of unknown validity were used to test the usefulness of the approach. Nine of the ten valid sequences tested positive whereas both mutants tested as clearly defective. The tenth valid sequence gave results that would be interpreted as reflecting a borderline status were the answer not known. These results demonstrate that it is possible to experimentally determine which sequences in local regions of the sequence space are potentially valid 5S rRNAs.

  15. RCARE: RNA Sequence Comparison and Annotation for RNA Editing

    PubMed Central

    2015-01-01

    The post-transcriptional sequence modification of transcripts through RNA editing is an important mechanism for regulating protein function and is associated with human disease phenotypes. The identification of RNA editing or RNA-DNA difference (RDD) sites is a fundamental step in the study of RNA editing. However, a substantial number of false-positive RDD sites have been identified recently. A major challenge in identifying RDD sites is to distinguish between the true RNA editing sites and the false positives. Furthermore, determining the location of condition-specific RDD sites and elucidating their functional roles will help toward understanding various biological phenomena that are mediated by RNA editing. The present study developed RNA-sequence comparison and annotation for RNA editing (RCARE) for searching, annotating, and visualizing RDD sites using thousands of previously known editing sites, which can be used for comparative analyses between multiple samples. RCARE also provides evidence for improving the reliability of identified RDD sites. RCARE is a web-based comparison, annotation, and visualization tool, which provides rich biological annotations and useful summary plots. The developers of previous tools that identify or annotate RNA-editing sites seldom mention the reliability of their respective tools. To address this issue, RCARE draws on a number of scientific publications and databases to find documentation specific to a particular RNA-editing site, and from this it generates evidence levels to convey the reliability of RCARE. Sequence-based alignment files can be converted into VCF files using a Python script and uploaded to the RCARE server for further analysis. RCARE is available for free at http://www.snubi.org/software/rcare/. PMID:26043858

  16. Alternative applications for distinct RNA sequencing strategies.

    PubMed

    Han, Leng; Vickers, Kasey C; Samuels, David C; Guo, Yan

    2015-07-01

    Recent advances in RNA library preparation methods, platform accessibility and cost efficiency have allowed high-throughput RNA sequencing (RNAseq) to replace conventional hybridization microarray platforms as the method of choice for mRNA profiling and transcriptome analyses. RNAseq is a powerful technique to profile both long and short RNA expression, and the depth of information gained from distinct RNAseq methods is striking and facilitates discovery. In addition to expression analysis, distinct RNAseq approaches also allow investigators the ability to assess transcriptional elongation, DNA variance and exogenous RNA content. Here we review the current state of the art in transcriptome sequencing and address epigenetic regulation, quantification of transcription activation, RNAseq output and a diverse set of applications for RNAseq data. We detail how RNAseq can be used to identify allele-specific expression, single-nucleotide polymorphisms and somatic mutations and discuss the benefits and limitations of using RNAseq to monitor DNA characteristics. Moreover, we highlight the power of combining RNA- and DNAseq methods for genomic analysis. In summary, RNAseq provides the opportunity to gain greater insight into transcriptional regulation and output than simply miRNA and mRNA profiling. PMID:25246237

  17. Alternative applications for distinct RNA sequencing strategies

    PubMed Central

    Han, Leng; Vickers, Kasey C.; Samuels, David C.

    2015-01-01

    Recent advances in RNA library preparation methods, platform accessibility and cost efficiency have allowed high-throughput RNA sequencing (RNAseq) to replace conventional hybridization microarray platforms as the method of choice for mRNA profiling and transcriptome analyses. RNAseq is a powerful technique to profile both long and short RNA expression, and the depth of information gained from distinct RNAseq methods is striking and facilitates discovery. In addition to expression analysis, distinct RNAseq approaches also allow investigators the ability to assess transcriptional elongation, DNA variance and exogenous RNA content. Here we review the current state of the art in transcriptome sequencing and address epigenetic regulation, quantification of transcription activation, RNAseq output and a diverse set of applications for RNAseq data. We detail how RNAseq can be used to identify allele-specific expression, single-nucleotide polymorphisms and somatic mutations and discuss the benefits and limitations of using RNAseq to monitor DNA characteristics. Moreover, we highlight the power of combining RNA- and DNAseq methods for genomic analysis. In summary, RNAseq provides the opportunity to gain greater insight into transcriptional regulation and output than simply miRNA and mRNA profiling. PMID:25246237

  18. Transcriptional profiling of Dictyostelium with RNA sequencing

    PubMed Central

    Miranda, Edward Roshan; Rot, Gregor; Toplak, Marko; Santhanam, Balaji; Curk, Tomaz; Shaulsky, Gad; Zupan, Blaz

    2014-01-01

    Summary: Transcriptional profiling methods have been utilized in the analysis of various biological processes in Dictyostelium. Recent advances in high-throughput sequencing have increased the resolution and the dynamic range of transcriptional profiling. Here we describe the utility of RNA-sequencing with the Illumina technology for production of transcriptional profiles. We also describe methods for data mapping and storage as well as common and specialized tools for data analysis, both online and offline. PMID:23494306

  19. Ribosomal RNA sequence suggests microsporidia are extremely ancient eukaryotes

    NASA Technical Reports Server (NTRS)

    Vossbrinck, C. R.; Maddox, J. V.; Friedman, S.; Debrunner-Vossbrinck, B. A.; Woese, C. R.

    1987-01-01

    A comparative sequence analysis of the 18S small subunit ribosomal RNA (rRNA) of the microsporidium Vairimorpha necatrix is presented. The results show that this rRNA sequence is more unlike those of other eukaryotes than any known eukaryote rRNA sequence. It is concluded that the lineage leading to microsporidia branched very early from that leading to other eukaryotes.

  20. Advanced Applications of RNA Sequencing and Challenges

    PubMed Central

    Han, Yixing; Gao, Shouguo; Muegge, Kathrin; Zhang, Wei; Zhou, Bing

    2015-01-01

    Next-generation sequencing technologies have revolutionized sequence-based research with the advantages of high throughput, high sensitivity, and high speed. RNA-seq is now widely used to uncover multiple facets of the transcriptome and to facilitate biological applications. However, the large-scale data analyses associated with RNA-seq pose challenges. In this study, we present a detailed overview of the applications of this technology and the challenges that need to be addressed, including data preprocessing, differential gene expression analysis, alternative splicing analysis, variant detection and allele-specific expression, pathway analysis, co-expression network analysis, and applications combining various experimental procedures beyond the achievements that have been made. Specifically, we discuss essential principles of computational methods that are required to meet the key challenges of RNA-seq data analyses, development of various bioinformatics tools, challenges associated with RNA-seq applications, and examples that represent the advances made so far in the characterization of the transcriptome. PMID:26609224

  1. Dis3- and exosome subunit-responsive 3′ mRNA instability elements

    SciTech Connect

    Kiss, Daniel L.; Hou, Dezhi; Gross, Robert H.; Andrulis, Erik D.

    2012-07-06

    Highlights: ► Successful use of a novel RNA-specific bioinformatic tool, RNA SCOPE. ► Identified novel 3′ UTR cis-acting element that destabilizes a reporter mRNA. ► Show exosome subunits are required for cis-acting element-mediated mRNA instability. ► Define precise sequence requirements of novel cis-acting element. ► Show that microarray-defined exosome subunit-regulated mRNAs have novel element. Abstract: Eukaryotic RNA turnover is regulated in part by the exosome, a nuclear and cytoplasmic complex of ribonucleases (RNases) and RNA-binding proteins. The major RNase of the complex is thought to be Dis3, a multi-functional 3′-5′ exoribonuclease and endoribonuclease. Although it is known that Dis3 and core exosome subunits are recruited to transcriptionally active genes and to messenger RNA (mRNA) substrates, this recruitment is thought to occur indirectly. We sought to discover cis-acting elements that recruit Dis3 or other exosome subunits. Using a bioinformatic tool called RNA SCOPE to screen the 3′ untranslated regions of up-regulated transcripts from our published Dis3 depletion-derived transcriptomic data set, we identified several motifs as candidate instability elements. Secondary screening using a luciferase reporter system revealed that one cassette, harboring four elements, destabilized the reporter transcript. RNAi-based depletion of Dis3, Rrp6, Rrp4, Rrp40, or Rrp46 diminished the efficacy of cassette-mediated destabilization. Truncation analysis of the cassette showed that two exosome subunit-sensitive elements (ESSEs) destabilized the reporter. Point-directed mutagenesis of ESSE abrogated the destabilization effect. An examination of the transcriptomic data from exosome subunit depletion-based microarrays revealed that mRNAs with ESSEs are found in every up-regulated mRNA data set but are

  2. De novo assembly of a bell pepper endornavirus genome sequence using RNA sequencing data.

    PubMed

    Jo, Yeonhwa; Choi, Hoseng; Cho, Won Kyong

    2015-01-01

    Members of the genus Endornavirus are double-stranded RNA viruses that infect a wide range of hosts. In this study, we report on the de novo assembly of a bell pepper endornavirus genome sequence by RNA sequencing (RNA-Seq). Our result demonstrates the successful application of RNA-Seq to obtain a complete viral genome sequence from the transcriptome data. PMID:25792042

  3. Nucleotide sequence of Neurospora crassa cytoplasmic initiator tRNA.

    PubMed Central

    Gillum, A M; Hecker, L I; Silberklang, M; Schwartzbach, S D; RajBhandary, U L; Barnett, W E

    1977-01-01

    Initiator methionine tRNA from the cytoplasm of Neurospora crassa has been purified and sequenced. The sequence is: pAGCUGCAUm1GGCGCAGCGGAAGCGCM22GCY*GGGCUCAUt6AACCCGGAGm7GU (or D) - CACUCGAUCGm1AAACGAG*UUGCAGCUACCAOH. Similar to initiator tRNAs from the cytoplasm of other eukaryotes, this tRNA also contains the sequence -AUCG- instead of the usual -TψCG (or A)- found in loop IV of other tRNAs. The sequence of the N. crassa cytoplasmic initiator tRNA is quite different from that of the corresponding mitochondrial initiator tRNA. Comparison of the sequence of N. crassa cytoplasmic initiator tRNA to those of yeast, wheat germ and vertebrate cytoplasmic initiator tRNA indicates that the sequences of the two fungal tRNAs are no more similar to each other than they are to those of other initiator tRNAs. PMID:146192

  4. Empirical insights into the stochasticity of small RNA sequencing

    PubMed Central

    Qin, Li-Xuan; Tuschl, Thomas; Singer, Samuel

    2016-01-01

    The choice of stochasticity distribution for modeling the noise distribution is a fundamental assumption for the analysis of sequencing data and consequently is critical for the accurate assessment of biological heterogeneity and differential expression. The stochasticity of RNA sequencing has been assumed to follow Poisson distributions. We collected microRNA sequencing data and observed that its stochasticity is better approximated by gamma distributions, likely because of the stochastic nature of exponential PCR amplification. We validated our findings with two independent datasets, one for microRNA sequencing and another for RNA sequencing. Motivated by the gamma distributed stochasticity, we provided a simple method for the analysis of RNA sequencing data and showed its superiority to three existing methods for differential expression analysis using three data examples of technical replicate data and biological replicate data. PMID:27052356
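
The Poisson-versus-gamma question raised in this abstract can be probed with a simple moment check on replicate counts. A Poisson model predicts a variance-to-mean ratio near 1, while a ratio well above 1 indicates overdispersion that a gamma model can absorb. The sketch below is only a method-of-moments summary under that reading, with a hypothetical function name; it is not the analysis method the authors propose.

```python
from statistics import mean, variance

def dispersion_summary(counts):
    """Summarize replicate counts for one feature.

    Returns the sample mean and variance, the variance-to-mean ratio
    (about 1 under a Poisson model), and method-of-moments gamma
    parameters: shape = mean^2 / variance, scale = variance / mean.
    """
    m = mean(counts)
    v = variance(counts)  # sample variance (n - 1 denominator)
    if m == 0 or v == 0:
        raise ValueError("need non-zero mean and variance")
    return {
        "mean": m,
        "variance": v,
        "variance_to_mean": v / m,
        "gamma_shape": m * m / v,
        "gamma_scale": v / m,
    }
```

For example, replicate counts of 80, 90, 100, 110 and 120 have mean 100 and sample variance 250, so the variance is 2.5 times what a Poisson model with that mean would predict.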

  5. Empirical insights into the stochasticity of small RNA sequencing

    NASA Astrophysics Data System (ADS)

    Qin, Li-Xuan; Tuschl, Thomas; Singer, Samuel

    2016-04-01

    The choice of stochasticity distribution for modeling the noise distribution is a fundamental assumption for the analysis of sequencing data and consequently is critical for the accurate assessment of biological heterogeneity and differential expression. The stochasticity of RNA sequencing has been assumed to follow Poisson distributions. We collected microRNA sequencing data and observed that its stochasticity is better approximated by gamma distributions, likely because of the stochastic nature of exponential PCR amplification. We validated our findings with two independent datasets, one for microRNA sequencing and another for RNA sequencing. Motivated by the gamma distributed stochasticity, we provided a simple method for the analysis of RNA sequencing data and showed its superiority to three existing methods for differential expression analysis using three data examples of technical replicate data and biological replicate data.

  6. DSAP: deep-sequencing small RNA analysis pipeline.

    PubMed

    Huang, Po-Jung; Liu, Yi-Chung; Lee, Chi-Ching; Lin, Wei-Chen; Gan, Richie Ruei-Chi; Lyu, Ping-Chiang; Tang, Petrus

    2010-07-01

    DSAP is an automated multiple-task web service designed to provide a total solution to analyzing deep-sequencing small RNA datasets generated by next-generation sequencing technology. DSAP uses a tab-delimited file as an input format, which holds the unique sequence reads (tags) and their corresponding number of copies generated by the Solexa sequencing platform. The input data will go through four analysis steps in DSAP: (i) cleanup: removal of adaptors and poly-A/T/C/G/N nucleotides; (ii) clustering: grouping of cleaned sequence tags into unique sequence clusters; (iii) non-coding RNA (ncRNA) matching: sequence homology mapping against a transcribed sequence library from the ncRNA database Rfam (http://rfam.sanger.ac.uk/); and (iv) known miRNA matching: detection of known miRNAs in miRBase (http://www.mirbase.org/) based on sequence homology. The expression levels corresponding to matched ncRNAs and miRNAs are summarized in multi-color clickable bar charts linked to external databases. DSAP is also capable of displaying miRNA expression levels from different jobs using a log(2)-scaled color matrix. Furthermore, a cross-species comparative function is also provided to show the distribution of identified miRNAs in different species as deposited in miRBase. DSAP is available at http://dsap.cgu.edu.tw. PMID:20478825
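    Steps (i) and (ii) above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not DSAP's code: the adaptor sequence, the function names, and the rule that a tag made of a single repeated nucleotide is discarded are all hypothetical choices. The input follows the abstract's description of a tab-delimited file of unique tags and copy numbers.

```python
from collections import Counter

# Hypothetical 3' adaptor, used only for this illustration.
ADAPTER = "TCGTATGCCG"


def clean_tag(seq, adapter=ADAPTER):
    """Trim the adaptor and reject empty or single-nucleotide (poly-A/T/C/G/N) tags."""
    seq = seq.upper()
    idx = seq.find(adapter)
    if idx != -1:
        seq = seq[:idx]
    if not seq or len(set(seq)) == 1:
        return None
    return seq


def cluster_tags(lines):
    """Group cleaned tags into unique sequences and sum their copy numbers."""
    clusters = Counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue
        seq, count = line.split("\t")
        cleaned = clean_tag(seq)
        if cleaned is not None:
            clusters[cleaned] += int(count)
    return clusters


example = [
    "ACGTACGTTCGTATGCCG\t5",  # tag followed by the adaptor
    "ACGTACGT\t3",            # same tag without the adaptor
    "AAAAAAAA\t10",           # poly-A, discarded
    "NNNNTCGTATGCCG\t2",      # poly-N after trimming, discarded
]
print(dict(cluster_tags(example)))
```

    Summing copy numbers after trimming means that tags differing only in adaptor carry-over collapse into one cluster before any database matching is attempted.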

  7. Unbiased Deep Sequencing of RNA Viruses from Clinical Samples.

    PubMed

    Matranga, Christian B; Gladden-Young, Adrianne; Qu, James; Winnicki, Sarah; Nosamiefan, Dolo; Levin, Joshua Z; Sabeti, Pardis C

    2016-01-01

    Here we outline a next-generation RNA sequencing protocol that enables de novo assemblies and intra-host variant calls of viral genomes collected from clinical and biological sources. The method is unbiased and universal; it uses random primers for cDNA synthesis and requires no prior knowledge of the viral sequence content. Before library construction, selective RNase H-based digestion is used to deplete unwanted RNA, including poly(rA) carrier and ribosomal RNA, from the viral RNA sample. Selective depletion improves both the data quality and the number of unique reads in viral RNA sequencing libraries. Moreover, a transposase-based 'tagmentation' step is used in the protocol as it reduces overall library construction time. The protocol has enabled rapid deep sequencing of over 600 Lassa and Ebola virus samples, including collections from both blood and tissue isolates, and is broadly applicable to other microbial genomics studies. PMID:27403729

  8. RNase P-Mediated Sequence-Specific Cleavage of RNA by Engineered External Guide Sequences

    PubMed Central

    Derksen, Merel; Mertens, Vicky; Pruijn, Ger J.M.

    2015-01-01

    The RNA cleavage activity of RNase P can be employed to decrease the levels of specific RNAs and to study their function or even to eradicate pathogens. Two different technologies have been developed to use RNase P as a tool for RNA knockdown. In one of these, an external guide sequence, which mimics a tRNA precursor, a well-known natural RNase P substrate, is used to target an RNA molecule for cleavage by endogenous RNase P. Alternatively, a guide sequence can be attached to M1 RNA, the (catalytic) RNase P RNA subunit of Escherichia coli. The guide sequence is specific for an RNA target, which is subsequently cleaved by the bacterial M1 RNA moiety. These approaches are applicable in both bacteria and eukaryotes. In this review, we will discuss the two technologies in which RNase P is used to reduce RNA expression levels. PMID:26569326

  9. RNase P-Mediated Sequence-Specific Cleavage of RNA by Engineered External Guide Sequences.

    PubMed

    Derksen, Merel; Mertens, Vicky; Pruijn, Ger J M

    2015-01-01

    The RNA cleavage activity of RNase P can be employed to decrease the levels of specific RNAs and to study their function or even to eradicate pathogens. Two different technologies have been developed to use RNase P as a tool for RNA knockdown. In one of these, an external guide sequence, which mimics a tRNA precursor, a well-known natural RNase P substrate, is used to target an RNA molecule for cleavage by endogenous RNase P. Alternatively, a guide sequence can be attached to M1 RNA, the (catalytic) RNase P RNA subunit of Escherichia coli. The guide sequence is specific for an RNA target, which is subsequently cleaved by the bacterial M1 RNA moiety. These approaches are applicable in both bacteria and eukaryotes. In this review, we will discuss the two technologies in which RNase P is used to reduce RNA expression levels. PMID:26569326

  10. Efficient prediction methods for selecting effective siRNA sequences.

    PubMed

    Takasaki, Shigeru

    2010-02-01

    Although short interfering RNA (siRNA) has been widely used for studying gene functions in mammalian cells, its gene silencing efficacy varies markedly and there are only a few consistencies among the recently reported design rules/guidelines for selecting siRNA sequences effective for mammalian genes. Another shortcoming of the previously reported methods is that they cannot estimate the probability that a candidate sequence will silence the target gene. This paper first reviewed the recently reported siRNA design guidelines and clarified the problems concerning the guidelines. It then proposed two prediction methods, a Radial Basis Function (RBF) network and decision tree learning, and their combined method for selecting effective siRNA target sequences from many possible candidate sequences. They are quite different from the previous score-based siRNA design techniques and can predict the probability that a candidate siRNA sequence will be effective. The methods imply high estimation accuracy for selecting candidate siRNA sequences. PMID:20022002

  11. The primary nucleotide sequence of U4 RNA.

    PubMed

    Reddy, R; Henning, D; Busch, H

    1981-04-10

    U4 RNA is one of the "capped" nuclear snRNAs recently found to be precipitable by anti-Sm antibodies as ribonucleoprotein particles. U4 RNA, along with other snRNAs, has been implicated in hnRNA processing, mRNA transport, or both (Lerner, M. R., Boyle, J., Mount, S., Wolin, S., and Steitz, J. A. (1980) Nature 283, 220-224). Since the proteins bound to different snRNAs appear to be the same, the functions of different snRNPs might be dependent on the RNA components. To help understand the function of U4 RNP, the nucleotide sequence of U4 RNA was determined. The sequence is (formula see text) In addition to the modified nucleotides in the "cap," U4 RNA contains Am at position 63 and m6A at position 98. It also exhibited A-C microheterogeneity at position 97. PMID:6162848

  12. DNA Instability Maintains the Repeat Length of the Yeast RNA Polymerase II C-terminal Domain.

    PubMed

    Morrill, Summer A; Exner, Alexandra E; Babokhov, Michael; Reinfeld, Bradley I; Fuchs, Stephen M

    2016-05-27

    The C-terminal domain (CTD) of RNA polymerase II in eukaryotes is comprised of tandemly repeating units of a conserved seven-amino acid sequence. The number of repeats is, however, quite variable across different organisms. Furthermore, previous studies have identified evidence of rearrangements within the CTD coding region, suggesting that DNA instability may play a role in regulating or maintaining CTD repeat number. The work described here establishes a clear connection between DNA instability and CTD repeat number in Saccharomyces cerevisiae. First, analysis of 36 diverse S. cerevisiae isolates revealed evidence of numerous past rearrangements within the DNA sequence that encodes the CTD. Interestingly, the total number of CTD repeats was relatively static (24-26 repeats in all strains), suggesting a balancing act between repeat expansion and contraction. In an effort to explore the genetic plasticity within this region, we measured the rates of repeat expansion and contraction using novel reporters and a doxycycline-regulated expression system for RPB1. In efforts to determine the mechanisms leading to CTD repeat variability, we identified the presence of DNA secondary structures, specifically G-quadruplex-like DNA, within the CTD coding region. Furthermore, we demonstrated that mutating PIF1, a G-quadruplex-specific helicase, results in increased CTD repeat length polymorphisms. We also determined that RAD52 is necessary for CTD repeat expansion but not contraction, identifying a role for recombination in repeat expansion. Results from these DNA rearrangements may help explain the CTD copy number variation seen across eukaryotes, as well as support a model of CTD expansion and contraction to maintain CTD integrity and overall length. PMID:27026700
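    Counting heptad repeats in a CTD protein sequence is the basic measurement behind statements such as "24-26 repeats in all strains". The sketch below is a rough illustration, not the authors' procedure. It assumes the widely cited consensus heptad YSPTSPS (not stated in the abstract), assumes the input begins at the first residue of a repeat, and uses a hypothetical mismatch tolerance.

```python
# Widely cited CTD heptad consensus; an assumption here, not taken from the abstract.
CONSENSUS = "YSPTSPS"


def count_heptads(protein, max_mismatches=1):
    """Count in-frame 7-residue units that match the consensus within a tolerance.

    Assumes `protein` starts at the first residue of a repeat.
    """
    count = 0
    for i in range(0, len(protein) - 6, 7):
        unit = protein[i:i + 7]
        mismatches = sum(a != b for a, b in zip(unit, CONSENSUS))
        if mismatches <= max_mismatches:
            count += 1
    return count


# Two near-consensus units followed by one divergent unit (made-up sequence).
print(count_heptads("YSPTSPSYSPTSPAYAAAAAA"))
```

    Comparing this count between strains, or before and after selection, gives a simple readout of repeat expansion or contraction.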

  13. DNA Instability Maintains the Repeat Length of the Yeast RNA Polymerase II C-terminal Domain

    PubMed Central

    Morrill, Summer A.; Exner, Alexandra E.; Babokhov, Michael; Reinfeld, Bradley I.

    2016-01-01

    The C-terminal domain (CTD) of RNA polymerase II in eukaryotes is comprised of tandemly repeating units of a conserved seven-amino acid sequence. The number of repeats is, however, quite variable across different organisms. Furthermore, previous studies have identified evidence of rearrangements within the CTD coding region, suggesting that DNA instability may play a role in regulating or maintaining CTD repeat number. The work described here establishes a clear connection between DNA instability and CTD repeat number in Saccharomyces cerevisiae. First, analysis of 36 diverse S. cerevisiae isolates revealed evidence of numerous past rearrangements within the DNA sequence that encodes the CTD. Interestingly, the total number of CTD repeats was relatively static (24–26 repeats in all strains), suggesting a balancing act between repeat expansion and contraction. In an effort to explore the genetic plasticity within this region, we measured the rates of repeat expansion and contraction using novel reporters and a doxycycline-regulated expression system for RPB1. In efforts to determine the mechanisms leading to CTD repeat variability, we identified the presence of DNA secondary structures, specifically G-quadruplex-like DNA, within the CTD coding region. Furthermore, we demonstrated that mutating PIF1, a G-quadruplex-specific helicase, results in increased CTD repeat length polymorphisms. We also determined that RAD52 is necessary for CTD repeat expansion but not contraction, identifying a role for recombination in repeat expansion. Results from these DNA rearrangements may help explain the CTD copy number variation seen across eukaryotes, as well as support a model of CTD expansion and contraction to maintain CTD integrity and overall length. PMID:27026700

  14. RNAcentral: an international database of ncRNA sequences

    PubMed Central

    2015-01-01

    The field of non-coding RNA biology has been hampered by the lack of availability of a comprehensive, up-to-date collection of accessioned RNA sequences. Here we present the first release of RNAcentral, a database that collates and integrates information from an international consortium of established RNA sequence databases. The initial release contains over 8.1 million sequences, including representatives of all major functional classes. A web portal (http://rnacentral.org) provides free access to data, search functionality, cross-references, source code and an integrated genome browser for selected species. PMID:25352543

  15. RNAcentral: an international database of ncRNA sequences.

    PubMed

    2015-01-01

    The field of non-coding RNA biology has been hampered by the lack of availability of a comprehensive, up-to-date collection of accessioned RNA sequences. Here we present the first release of RNAcentral, a database that collates and integrates information from an international consortium of established RNA sequence databases. The initial release contains over 8.1 million sequences, including representatives of all major functional classes. A web portal (http://rnacentral.org) provides free access to data, search functionality, cross-references, source code and an integrated genome browser for selected species. PMID:25352543

  16. RNAcentral: an international database of ncRNA sequences

    DOE PAGESBeta

    Williams, Kelly Porter

    2014-10-28

    The field of non-coding RNA biology has been hampered by the lack of availability of a comprehensive, up-to-date collection of accessioned RNA sequences. Here we present the first release of RNAcentral, a database that collates and integrates information from an international consortium of established RNA sequence databases. The initial release contains over 8.1 million sequences, including representatives of all major functional classes. A web portal (http://rnacentral.org) provides free access to data, search functionality, cross-references, source code and an integrated genome browser for selected species.

  17. Nucleotide sequence of a human tRNA gene heterocluster

    SciTech Connect

    Chang, Y.N.; Pirtle, I.L.; Pirtle, R.M.

    1986-05-01

    Leucine tRNA from bovine liver was used as a hybridization probe to screen a human gene library harbored in Charon-4A of bacteriophage lambda. The human DNA inserts from plaque-pure clones were characterized by restriction endonuclease mapping and Southern hybridization techniques, using both (3'-³²P)-labeled bovine liver leucine tRNA and total tRNA as hybridization probes. An 8-kb Hind III fragment of one of these λ-clones was subcloned into the Hind III site of pBR322. Subsequent fine restriction mapping and DNA sequence analysis of this plasmid DNA indicated the presence of four tRNA genes within the 8-kb DNA fragment. A leucine tRNA gene with an anticodon of AAG and a proline tRNA gene with an anticodon of AGG are in a 1.6-kb subfragment. A threonine tRNA gene with an anticodon of UGU and an as yet unidentified tRNA gene are located in a 1.1-kb subfragment. These two different subfragments are separated by 2.8 kb. The coding regions of the three sequenced genes contain characteristic internal split promoter sequences and do not have intervening sequences. The 3'-flanking regions of these three genes have typical RNA polymerase III termination sites of at least four consecutive T residues.
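    Two details in this abstract lend themselves to a quick computational check: the codon that each reported anticodon pairs with, and the location of candidate RNA polymerase III terminators (runs of at least four T residues). The sketch below is illustrative only. The function names and the example flanking sequence are hypothetical, and the codon calculation uses strict Watson-Crick pairing without wobble or base modifications.

```python
import re

RNA_COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def codon_for_anticodon(anticodon):
    """Return the mRNA codon (5'->3') that pairs with an anticodon (5'->3')."""
    return "".join(RNA_COMPLEMENT[base] for base in reversed(anticodon.upper()))


def find_t_runs(dna, min_len=4):
    """Return (start, length) for every run of at least `min_len` consecutive T residues."""
    pattern = re.compile("T{%d,}" % min_len)
    return [(m.start(), m.end() - m.start()) for m in pattern.finditer(dna.upper())]


for anticodon in ("AAG", "AGG", "UGU"):
    print(anticodon, "->", codon_for_anticodon(anticodon))

# Made-up 3'-flanking sequence; positions are 0-based.
print(find_t_runs("ACGTTTTGCATTTGGTTTTTT"))
```

    Under strict pairing the three anticodons correspond to CUU, CCU, and ACA, which are leucine, proline, and threonine codons in the standard genetic code, consistent with the gene assignments in the abstract.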

  18. Compilation of 5S rRNA and 5S rRNA gene sequences

    PubMed Central

    Specht, Thomas; Wolters, Jörn; Erdmann, Volker A.

    1990-01-01

    The BERLIN RNA DATABANK as of December 31, 1989, contains a total of 667 sequences of 5S rRNAs or their genes, which is an increase of 114 new sequence entries over the last compilation (1). It covers sequences from 44 archaebacteria, 267 eubacteria, 20 plastids, 6 mitochondria, 319 eukaryotes and 11 eukaryotic pseudogenes. The hardcopy shows only the list (Table 1) of those organisms whose sequences have been determined. The BERLIN RNA DATABANK uses the format of the EMBL Nucleotide Sequence Data Library complemented by a Sequence Alignment (SA) field including secondary structure information. PMID:1692116

  19. Complete sequence of RNA1 and subgenomic RNA3 of Atlantic halibut nodavirus (AHNV).

    PubMed

    Sommerset, Ingunn; Nerland, Audun H

    2004-03-10

    The Nodaviridae are divided into the alphanodavirus genus, which infects insects, and the betanodavirus genus, which infects fishes. Betanodaviruses are the causative agent of viral encephalopathy and retinopathy (VER) in a number of cultivated marine fish species. The Nodaviridae are small non-enveloped RNA viruses that contain a genome consisting of 2 single-stranded positive-sense RNA segments: RNA1 (3.1 kb), which encodes the viral part of the RNA-dependent RNA polymerase (RdRp); and RNA2 (1.4 kb), which encodes the capsid protein. In addition to RNA1 and RNA2, a subgenomic transcript of RNA1, RNA3, is present in infected cells. We have cloned and sequenced RNA1 from the Atlantic halibut Hippoglossus hippoglossus nodavirus (AHNV), and for the first time, the sequence of a betanodaviral subgenomic RNA3 has been determined. AHNV RNA1 was 3100 nucleotides in length and contained a main open reading frame encoding a polypeptide of 981 amino acids. Conserved motifs for RdRp were found in the deduced amino acid sequence. RNA3 was 371 nucleotides in length, and contained an open reading frame encoding a peptide of 75 amino acids corresponding to a hypothetical B2 protein, although sequence alignments with the alphanodavirus B2 proteins showed only marginal similarities. AHNV RNA replication in the fish cell-line SSN-1 (derived from striped snakehead) was analysed by Northern blot analysis, which indicated that RNA3 was synthesised in large amounts (compared to RNA1) at an early point in time post-infection. PMID:15109133

  20. Small RNA Deep Sequencing Reveals Role for Arabidopsis thaliana RNA-Dependent RNA Polymerases in Viral siRNA Biogenesis

    PubMed Central

    Qi, Xiaopeng; Bao, Forrest Sheng; Xie, Zhixin

    2009-01-01

    RNA silencing functions as an important antiviral defense mechanism in a broad range of eukaryotes. In plants, biogenesis of several classes of endogenous small interfering RNAs (siRNAs) requires RNA-dependent RNA Polymerase (RDR) activities. Members of the RDR family proteins, including RDR1 and RDR6, have also been implicated in antiviral defense, although a direct role for RDRs in viral siRNA biogenesis has yet to be demonstrated. Using a crucifer-infecting strain of Tobacco Mosaic Virus (TMV-Cg) and Arabidopsis thaliana as a model system, we analyzed the viral small RNA profile in wild-type plants as well as rdr mutants by applying small RNA deep sequencing technology. Over 100,000 TMV-Cg-specific small RNA reads, mostly of 21- (78.4%) and 22-nucleotide (12.9%) in size and originating predominately (79.9%) from the genomic sense RNA strand, were captured at an early infection stage, yielding the first high-resolution small RNA map for a plant virus. The TMV-Cg genome harbored multiple, highly reproducible small RNA-generating hot spots that corresponded to regions with no apparent local hairpin-forming capacity. Significantly, both the rdr1 and rdr6 mutants exhibited globally reduced levels of viral small RNA production as well as reduced strand bias in viral small RNA population, revealing an important role for these host RDRs in viral siRNA biogenesis. In addition, an informatics analysis showed that a large set of host genes could be potentially targeted by TMV-Cg-derived siRNAs for posttranscriptional silencing. Two such predicted host targets, which encode a cleavage and polyadenylation specificity factor (CPSF30) and an unknown protein similar to translocon-associated protein alpha (TRAP α), respectively, yielded a positive result in cleavage validation by 5′RACE assays. Our data raised the interesting possibility for viral siRNA-mediated virus-host interactions that may contribute to viral pathogenicity and host specificity. PMID:19308254
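    The size and strand percentages quoted above are the kind of summary that can be computed directly from mapped reads. The sketch below is a minimal illustration and not the authors' pipeline: the function name and the input layout (a list of `(sequence, strand)` pairs, with `"+"` meaning the genomic sense strand) are assumptions made for the example.

```python
from collections import Counter


def summarize_small_rnas(reads):
    """Return (percent of reads by length, percent of reads from the sense strand).

    `reads` is an iterable of (sequence, strand) pairs; strand is "+" or "-".
    Hypothetical helper for illustration only.
    """
    reads = list(reads)
    total = len(reads)
    if total == 0:
        raise ValueError("no reads supplied")
    length_counts = Counter(len(seq) for seq, _ in reads)
    length_pct = {n: 100.0 * c / total for n, c in length_counts.items()}
    sense_pct = 100.0 * sum(1 for _, strand in reads if strand == "+") / total
    return length_pct, sense_pct


# Made-up reads: three 21-nt sense reads and one 22-nt antisense read.
toy_reads = [("A" * 21, "+")] * 3 + [("A" * 22, "-")]
length_pct, sense_pct = summarize_small_rnas(toy_reads)
print(length_pct, sense_pct)
```

    Running the same summary on wild-type and mutant libraries would expose the reduced strand bias described for the rdr1 and rdr6 mutants.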

  1. FLDS: A Comprehensive dsRNA Sequencing Method for Intracellular RNA Virus Surveillance

    PubMed Central

    Urayama, Syun-ichi; Takaki, Yoshihiro; Nunoura, Takuro

    2016-01-01

    Knowledge of the distribution and diversity of RNA viruses is still limited in spite of their possible environmental and epidemiological impacts because RNA virus-specific metagenomic methods have not yet been developed. We herein constructed an effective metagenomic method for RNA viruses by targeting long double-stranded (ds)RNA in cellular organisms, which is a hallmark of infection, or the replication of dsRNA and single-stranded (ss)RNA viruses, except for retroviruses. This novel dsRNA targeting metagenomic method is characterized by an extremely high recovery rate of viral RNA sequences, the retrieval of terminal sequences, and uniform read coverage, which has not previously been reported in other metagenomic methods targeting RNA viruses. This method revealed a previously unidentified viral RNA diversity of more than 20 complete RNA viral genomes including dsRNA and ssRNA viruses associated with an environmental diatom colony. Our approach will be a powerful tool for cataloging RNA viruses associated with organisms of interest. PMID:26877136

  2. Deep Sequencing Insights in Therapeutic shRNA Processing and siRNA Target Cleavage Precision

    PubMed Central

    Denise, Hubert; Moschos, Sterghios A.; Sidders, Benjamin; Burden, Frances; Perkins, Hannah; Carter, Nikki; Stroud, Tim; Kennedy, Michael; Fancy, Sally-Ann; Lapthorn, Cris; Lavender, Helen; Kinloch, Ross; Suhy, David; Corbau, Romu

    2014-01-01

    TT-034 (PF-05095808) is a recombinant adeno-associated virus serotype 8 (AAV8) agent expressing three short hairpin RNA (shRNA) pro-drugs that target the hepatitis C virus (HCV) RNA genome. The cytosolic enzyme Dicer cleaves each shRNA into multiple, potentially active small interfering RNA (siRNA) drugs. Using next-generation sequencing (NGS) to identify and characterize active shRNA maturation products, we observed that each TT-034–encoded shRNA could be processed into as many as 95 separate siRNA strands. Few of these appeared active as determined by Sanger 5′ RNA Ligase-Mediated Rapid Amplification of cDNA Ends (5-RACE) and through synthetic shRNA and siRNA analogue studies. Moreover, NGS scrutiny applied on 5-RACE products (RACE-seq) suggested that synthetic siRNAs could direct cleavage in not one, but up to five separate positions on targeted RNA, in a sequence-dependent manner. These data support an on-target mechanism of action for TT-034 without cytotoxicity and question the accepted precision of substrate processing by the key RNA interference (RNAi) enzymes Dicer and siRNA-induced silencing complex (siRISC). PMID:24496437

  3. FLDS: A Comprehensive dsRNA Sequencing Method for Intracellular RNA Virus Surveillance.

    PubMed

    Urayama, Syun-Ichi; Takaki, Yoshihiro; Nunoura, Takuro

    2016-03-26

    Knowledge of the distribution and diversity of RNA viruses is still limited in spite of their possible environmental and epidemiological impacts because RNA virus-specific metagenomic methods have not yet been developed. We herein constructed an effective metagenomic method for RNA viruses by targeting long double-stranded (ds)RNA in cellular organisms, which is a hallmark of infection, or the replication of dsRNA and single-stranded (ss)RNA viruses, except for retroviruses. This novel dsRNA targeting metagenomic method is characterized by an extremely high recovery rate of viral RNA sequences, the retrieval of terminal sequences, and uniform read coverage, which has not previously been reported in other metagenomic methods targeting RNA viruses. This method revealed a previously unidentified viral RNA diversity of more than 20 complete RNA viral genomes including dsRNA and ssRNA viruses associated with an environmental diatom colony. Our approach will be a powerful tool for cataloging RNA viruses associated with organisms of interest. PMID:26877136

  4. Complete Genome Sequence of the WHO International Standard for HIV-2 RNA Determined by Deep Sequencing

    PubMed Central

    Ham, Claire; Morris, Clare

    2016-01-01

    The World Health Organization (WHO) International Standard for HIV-2 RNA nucleic acid assays was characterized by complete genome deep sequencing. The entire coding sequence and flanking long terminal repeats (LTRs), including minority species, were assigned subtype A. This information will aid design, development, and evaluation of HIV-2 RNA amplification assays. PMID:26847885

  5. Sequence-non-specific effects of RNA interference triggers and microRNA regulators

    PubMed Central

    Olejniczak, Marta; Galka, Paulina; Krzyzosiak, Wlodzimierz J.

    2010-01-01

    RNA reagents of diverse lengths and structures, unmodified or containing various chemical modifications are powerful tools of RNA interference and microRNA technologies. These reagents which are either delivered to cells using appropriate carriers or are expressed in cells from suitable vectors often cause unintended sequence-non-specific immune responses besides triggering intended sequence-specific silencing effects. This article reviews the present state of knowledge regarding the cellular sensors of foreign RNA, the signaling pathways these sensors mobilize and shows which specific features of the RNA reagents set the responsive systems on alert. The representative examples of toxic effects caused in the investigated cell lines and tissues by the RNAs of specific types and structures are collected and may be instructive for further studies of sequence-non-specific responses to foreign RNA in human cells. PMID:19843612

  6. The chemical structure of DNA sequence signals for RNA transcription

    NASA Technical Reports Server (NTRS)

    George, D. G.; Dayhoff, M. O.

    1982-01-01

    The proposed recognition sites for RNA transcription for E. coli RNA polymerase, bacteriophage T7 RNA polymerase, and eukaryotic RNA polymerase Pol II are evaluated in the light of the requirements for efficient recognition. It is shown that although there is good experimental evidence that specific nucleic acid sequence patterns are involved in transcriptional regulation in bacteria and bacterial viruses, among the sequences now available, only in the case of the promoters recognized by bacteriophage T7 polymerase does it seem likely that the pattern is sufficient. It is concluded that the eukaryotic pattern that is investigated is not restrictive enough to serve as a recognition site.

  7. Comparison of ribosomal RNA removal methods for transcriptome sequencing workflows in teleost fish

    Technology Transfer Automated Retrieval System (TEKTRAN)

    RNA sequencing (RNA-Seq) is becoming the standard for transcriptome analysis. Removal of contaminating ribosomal RNA (rRNA) is a priority in the preparation of libraries suitable for sequencing. rRNAs are commonly removed from total RNA via either mRNA selection or rRNA depletion. These methods have...

  8. Spliced synthetic genes as internal controls in RNA sequencing experiments.

    PubMed

    Hardwick, Simon A; Chen, Wendy Y; Wong, Ted; Deveson, Ira W; Blackburn, James; Andersen, Stacey B; Nielsen, Lars K; Mattick, John S; Mercer, Tim R

    2016-09-01

    RNA sequencing (RNA-seq) can be used to assemble spliced isoforms, quantify expressed genes and provide a global profile of the transcriptome. However, the size and diversity of the transcriptome, the wide dynamic range in gene expression and inherent technical biases confound RNA-seq analysis. We have developed a set of spike-in RNA standards, termed 'sequins' (sequencing spike-ins), that represent full-length spliced mRNA isoforms. Sequins have an entirely artificial sequence with no homology to natural reference genomes, but they align to gene loci encoded on an artificial in silico chromosome. The combination of multiple sequins across a range of concentrations emulates alternative splicing and differential gene expression, and it provides scaling factors for normalization between samples. We demonstrate the use of sequins in RNA-seq experiments to measure sample-specific biases and determine the limits of reliable transcript assembly and quantification in accompanying human RNA samples. In addition, we have designed a complementary set of sequins that represent fusion genes arising from rearrangements of the in silico chromosome to aid in cancer diagnosis. RNA sequins provide a qualitative and quantitative reference with which to navigate the complexity of the human transcriptome. PMID:27502218
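    One way to picture how spike-ins of known concentration yield a per-sample scaling factor is shown below. This is a hedged sketch and not the sequins software or the authors' normalization. The function name is hypothetical, and the estimator, the geometric mean of known-to-measured ratios, is an assumption chosen for simplicity.

```python
import math


def spike_in_scale_factor(known_conc, measured):
    """Geometric mean of known/measured ratios across spike-ins.

    Multiplying a sample's measured abundances by this factor places them on
    the scale of the known spike-in concentrations. Illustrative estimator only.
    """
    log_ratios = [
        math.log(k / m)
        for k, m in zip(known_conc, measured)
        if k > 0 and m > 0
    ]
    if not log_ratios:
        raise ValueError("no usable spike-in measurements")
    return math.exp(sum(log_ratios) / len(log_ratios))


# Made-up example: every spike-in is measured at twice its known concentration.
factor = spike_in_scale_factor([1.0, 10.0, 100.0], [2.0, 20.0, 200.0])
print(round(factor, 3))
```

    Working on the log scale keeps highly abundant spike-ins from dominating the estimate, which matters given the wide dynamic range the abstract describes.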

  9. The nucleotide sequence of cowpea mosaic virus B RNA

    PubMed Central

    Lomonossoff, G.P.; Shanks, M.

    1983-01-01

    The complete sequence of the bottom component RNA (B RNA) of cowpea mosaic virus (CPMV) has been determined. Restriction enzyme fragments of double-stranded cDNA were cloned in M13 and the sequence of the inserts was determined by a combination of enzymatic and chemical sequencing techniques. Additional sequence information was obtained by primed synthesis on first strand cDNA. The complete sequence deduced is 5889 nucleotides long excluding the 3' poly(A), and contains an open reading frame sufficient to code for a polypeptide of mol. wt. 207 760. The coding region is flanked by a 5' leader sequence of 206 nucleotides and a 3' non-coding region of 82 residues which does not contain a polyadenylation signal. PMID:16453487
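    The figures in this abstract can be cross-checked with simple arithmetic. The sketch below assumes that the region between the 5' leader and the 3' non-coding region is a single open reading frame that includes one stop codon; that reading is an assumption of this example rather than something the abstract states explicitly.

```python
GENOME_NT = 5889          # complete sequence, excluding the 3' poly(A)
LEADER_NT = 206           # 5' leader
TRAILER_NT = 82           # 3' non-coding region
POLYPEPTIDE_MW = 207_760  # reported molecular weight

# Nucleotides left for the open reading frame (assumed to include the stop codon).
coding_nt = GENOME_NT - LEADER_NT - TRAILER_NT
codons, remainder = divmod(coding_nt, 3)
residues = codons - 1  # subtract the assumed stop codon

# Average mass per amino acid residue implied by the reported molecular weight.
mean_residue_mass = POLYPEPTIDE_MW / residues

print(coding_nt, codons, remainder, residues, round(mean_residue_mass, 1))
```

    The coding span divides evenly into codons, and the implied average residue mass of roughly 111 is in the range typical for proteins, so the reported lengths and molecular weight are mutually consistent.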

  10. Identifying novel sequence variants of RNA 3D motifs

    PubMed Central

    Zirbel, Craig L.; Roll, James; Sweeney, Blake A.; Petrov, Anton I.; Pirrung, Meg; Leontis, Neocles B.

    2015-01-01

    Predicting RNA 3D structure from sequence is a major challenge in biophysics. An important sub-goal is accurately identifying recurrent 3D motifs from RNA internal and hairpin loop sequences extracted from secondary structure (2D) diagrams. We have developed and validated new probabilistic models for 3D motif sequences based on hybrid Stochastic Context-Free Grammars and Markov Random Fields (SCFG/MRF). The SCFG/MRF models are constructed using atomic-resolution RNA 3D structures. To parameterize each model, we use all instances of each motif found in the RNA 3D Motif Atlas and annotations of pairwise nucleotide interactions generated by the FR3D software. Isostericity relations between non-Watson–Crick basepairs are used in scoring sequence variants. SCFG techniques model nested pairs and insertions, while MRF ideas handle crossing interactions and base triples. We use test sets of randomly-generated sequences to set acceptance and rejection thresholds for each motif group and thus control the false positive rate. Validation was carried out by comparing results for four motif groups to RMDetect. The software developed for sequence scoring (JAR3D) is structured to automatically incorporate new motifs as they accumulate in the RNA 3D Motif Atlas when new structures are solved and is available free for download. PMID:26130723
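    The idea of using randomly generated sequences to set an acceptance threshold, and thereby control the false positive rate, can be sketched independently of any particular scoring model. The code below is not JAR3D. The function names are hypothetical, and the background scores are assumed to come from scoring random sequences against a motif model with whatever scorer is in use.

```python
def acceptance_threshold(background_scores, false_positive_rate):
    """Pick a cutoff so that at most the given fraction of background scores exceed it.

    A sequence is accepted when its score is strictly greater than the cutoff.
    """
    ordered = sorted(background_scores)
    if not ordered:
        raise ValueError("no background scores supplied")
    allowed = int(false_positive_rate * len(ordered))  # scores permitted above the cutoff
    cut_index = len(ordered) - allowed - 1
    return ordered[cut_index]


def observed_false_positive_rate(background_scores, threshold):
    """Fraction of background scores that would be accepted at this threshold."""
    accepted = sum(1 for s in background_scores if s > threshold)
    return accepted / len(background_scores)


# Stand-in background: scores 1..100 in place of real model scores.
background = list(range(1, 101))
cutoff = acceptance_threshold(background, 0.05)
print(cutoff, observed_false_positive_rate(background, cutoff))
```

    In practice the background set would be far larger than 100 sequences and would be generated separately for each motif group, since score distributions differ between models.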

  11. Identifying novel sequence variants of RNA 3D motifs.

    PubMed

    Zirbel, Craig L; Roll, James; Sweeney, Blake A; Petrov, Anton I; Pirrung, Meg; Leontis, Neocles B

    2015-09-01

    Predicting RNA 3D structure from sequence is a major challenge in biophysics. An important sub-goal is accurately identifying recurrent 3D motifs from RNA internal and hairpin loop sequences extracted from secondary structure (2D) diagrams. We have developed and validated new probabilistic models for 3D motif sequences based on hybrid Stochastic Context-Free Grammars and Markov Random Fields (SCFG/MRF). The SCFG/MRF models are constructed using atomic-resolution RNA 3D structures. To parameterize each model, we use all instances of each motif found in the RNA 3D Motif Atlas and annotations of pairwise nucleotide interactions generated by the FR3D software. Isostericity relations between non-Watson-Crick basepairs are used in scoring sequence variants. SCFG techniques model nested pairs and insertions, while MRF ideas handle crossing interactions and base triples. We use test sets of randomly-generated sequences to set acceptance and rejection thresholds for each motif group and thus control the false positive rate. Validation was carried out by comparing results for four motif groups to RMDetect. The software developed for sequence scoring (JAR3D) is structured to automatically incorporate new motifs as they accumulate in the RNA 3D Motif Atlas when new structures are solved and is available free for download. PMID:26130723

  12. Discovering common stem–loop motifs in unaligned RNA sequences

    PubMed Central

    Gorodkin, Jan; Stricklin, Shawn L.; Stormo, Gary D.

    2001-01-01

    Post-transcriptional regulation of gene expression is often accomplished by proteins binding to specific sequence motifs in mRNA molecules, to affect their translation or stability. The motifs are often composed of a combination of sequence and structural constraints such that the overall structure is preserved even though much of the primary sequence is variable. While several methods exist to discover transcriptional regulatory sites in the DNA sequences of coregulated genes, the RNA motif discovery problem is much more difficult because of covariation in the positions. We describe the combined use of two approaches for RNA structure prediction, FOLDALIGN and COVE, that together can discover and model stem–loop RNA motifs in unaligned sequences, such as UTRs from post-transcriptionally coregulated genes. We evaluate the method on two datasets, one a section of rRNA genes with randomly truncated ends so that a global alignment is not possible, and the other a hyper-variable collection of IRE-like elements that were inserted into randomized UTR sequences. In both cases the combined method identified the motifs correctly, and in the rRNA example we show that it is capable of determining the structure, which includes bulge and internal loops as well as a variable length hairpin loop. Those automated results are quantitatively evaluated and found to agree closely with structures contained in curated databases, with correlation coefficients up to 0.9. A basic server, Stem–Loop Align SearcH (SLASH), which will perform stem–loop searches in unaligned RNA sequences, is available at http://www.bioinf.au.dk/slash/. PMID:11353083

  13. Dinoflagellate 17S rRNA sequence inferred from the gene sequence: Evolutionary implications.

    PubMed

    Herzog, M; Maroteaux, L

    1986-11-01

    We present the complete sequence of the nuclear-encoded small-ribosomal-subunit RNA inferred from the cloned gene sequence of the dinoflagellate Prorocentrum micans. The dinoflagellate 17S rRNA sequence of 1798 nucleotides is contained in a family of 200 tandemly repeated genes per haploid genome. A tentative model of the secondary structure of P. micans 17S rRNA is presented. This sequence is compared with the small-ribosomal-subunit rRNA of Xenopus laevis (Animalia), Saccharomyces cerevisiae (Fungi), Zea mays (Planta), Dictyostelium discoideum (Protoctista), and Halobacterium volcanii (Monera). Although the secondary structure of the dinoflagellate 17S rRNA presents most of the eukaryotic characteristics, it contains sufficient archaeobacterial-like structural features to reinforce the view that dinoflagellates branch off very early from the eukaryotic lineage. PMID:16578795

  14. Dinoflagellate 17S rRNA sequence inferred from the gene sequence: Evolutionary implications

    PubMed Central

    Herzog, Michel; Maroteaux, Luc

    1986-01-01

    We present the complete sequence of the nuclear-encoded small-ribosomal-subunit RNA inferred from the cloned gene sequence of the dinoflagellate Prorocentrum micans. The dinoflagellate 17S rRNA sequence of 1798 nucleotides is contained in a family of 200 tandemly repeated genes per haploid genome. A tentative model of the secondary structure of P. micans 17S rRNA is presented. This sequence is compared with the small-ribosomal-subunit rRNA of Xenopus laevis (Animalia), Saccharomyces cerevisiae (Fungi), Zea mays (Planta), Dictyostelium discoideum (Protoctista), and Halobacterium volcanii (Monera). Although the secondary structure of the dinoflagellate 17S rRNA presents most of the eukaryotic characteristics, it contains sufficient archaeobacterial-like structural features to reinforce the view that dinoflagellates branch off very early from the eukaryotic lineage. PMID:16578795

  15. Evaluation of Commercially Available RNA Amplification Kits for RNA Sequencing Using Very Low Input Amounts of Total RNA

    PubMed Central

    Shanker, Savita; Paulson, Ariel; Edenberg, Howard J.; Peak, Allison; Perera, Anoja; Alekseyev, Yuriy O.; Beckloff, Nicholas; Bivens, Nathan J.; Donnelly, Robert; Gillaspy, Allison F.; Grove, Deborah; Gu, Weikuan; Jafari, Nadereh; Kerley-Hamilton, Joanna S.; Lyons, Robert H.; Tepper, Clifford

    2015-01-01

    This article includes supplemental data. Please visit http://www.fasebj.org to obtain this information. Multiple recent publications on RNA sequencing (RNA-seq) have demonstrated the power of next-generation sequencing technologies in whole-transcriptome analysis. Vendor-specific protocols used for RNA library construction often require at least 100 ng total RNA. However, under certain conditions, much less RNA is available for library construction. In these cases, effective transcriptome profiling requires amplification of subnanogram amounts of RNA. Several commercial RNA amplification kits are available for amplification prior to library construction for next-generation sequencing, but these kits have not been comprehensively field evaluated for accuracy and performance of RNA-seq for picogram amounts of RNA. To address this, 4 types of amplification kits were tested with 3 different concentrations, from 5 ng to 50 pg, of a commercially available RNA. Kits were tested at multiple sites to assess reproducibility and ease of use. The human total reference RNA used was spiked with a control pool of RNA molecules in order to further evaluate quantitative recovery of input material. Additional control data sets were generated from libraries constructed following polyA selection or ribosomal depletion using established kits and protocols. cDNA was collected from the different sites, and libraries were synthesized at a single site using established protocols. Sequencing runs were carried out on the Illumina platform. Numerous metrics were compared among the kits and dilutions used. Overall, no single kit appeared to meet all the challenges of small input material. However, it is encouraging that excellent data can be recovered with even the 50 pg input total RNA. PMID:25649271

  16. repRNA: a web server for generating various feature vectors of RNA sequences.

    PubMed

    Liu, Bin; Liu, Fule; Fang, Longyun; Wang, Xiaolong; Chou, Kuo-Chen

    2016-02-01

    With the rapid growth of RNA sequences generated in the postgenomic age, it is highly desired to develop a flexible method that can generate various kinds of vectors to represent these sequences by focusing on their different features. This is because nearly all the existing machine-learning methods, such as SVM (support vector machine) and KNN (k-nearest neighbor), can only handle vectors but not sequences. To meet the increasing demands and speed up the genome analyses, we have developed a new web server, called "representations of RNA sequences" (repRNA). Compared with the existing methods, repRNA is much more comprehensive, flexible and powerful, as reflected by the following facts: (1) it can generate 11 different modes of feature vectors for users to choose according to their investigation purposes; (2) it allows users to select the features from 22 built-in physicochemical properties and even those defined by users' own; (3) the resultant feature vectors and the secondary structures of the corresponding RNA sequences can be visualized. The repRNA web server is freely accessible to the public at http://bioinformatics.hitsz.edu.cn/repRNA/ . PMID:26085220
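
    To make the idea of turning a sequence into a fixed-length vector concrete, here is a minimal sketch of one common representation, k-mer composition. It is an assumption that this resembles any of the 11 modes offered by repRNA; the abstract does not list them, and the function name `kmer_vector` is hypothetical.

```python
from itertools import product

def kmer_vector(seq, k=2, alphabet="ACGU"):
    """Return normalized k-mer frequencies, one entry per possible k-mer,
    in lexicographic order of the alphabet (4**k entries for RNA)."""
    seq = seq.upper().replace("T", "U")  # accept DNA-style input as well
    kmers = ["".join(p) for p in product(alphabet, repeat=k)]
    counts = dict.fromkeys(kmers, 0)
    total = 0
    for i in range(len(seq) - k + 1):
        window = seq[i:i + k]
        if window in counts:  # skip windows with ambiguous bases such as N
            counts[window] += 1
            total += 1
    if total == 0:
        return [0.0] * len(kmers)
    return [counts[km] / total for km in kmers]
```

    Vectors of this kind have the same length for every input sequence, which is what lets classifiers such as SVM or KNN consume them directly.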

  17. RNAcentral: A vision for an international database of RNA sequences

    PubMed Central

    Bateman, Alex; Agrawal, Shipra; Birney, Ewan; Bruford, Elspeth A.; Bujnicki, Janusz M.; Cochrane, Guy; Cole, James R.; Dinger, Marcel E.; Enright, Anton J.; Gardner, Paul P.; Gautheret, Daniel; Griffiths-Jones, Sam; Harrow, Jen; Herrero, Javier; Holmes, Ian H.; Huang, Hsien-Da; Kelly, Krystyna A.; Kersey, Paul; Kozomara, Ana; Lowe, Todd M.; Marz, Manja; Moxon, Simon; Pruitt, Kim D.; Samuelsson, Tore; Stadler, Peter F.; Vilella, Albert J.; Vogel, Jan-Hinnerk; Williams, Kelly P.; Wright, Mathew W.; Zwieb, Christian

    2011-01-01

    During the last decade there has been a great increase in the number of noncoding RNA genes identified, including new classes such as microRNAs and piRNAs. There is also a large growth in the amount of experimental characterization of these RNA components. Despite this growth in information, it is still difficult for researchers to access RNA data, because key data resources for noncoding RNAs have not yet been created. The most pressing omission is the lack of a comprehensive RNA sequence database, much like UniProt, which provides a comprehensive set of protein knowledge. In this article we propose the creation of a new open public resource that we term RNAcentral, which will contain a comprehensive collection of RNA sequences and fill an important gap in the provision of biomedical databases. We envision RNA researchers from all over the world joining a federated RNAcentral network, contributing specialized knowledge and databases. RNAcentral would centralize key data that are currently held across a variety of databases, allowing researchers instant access to a single, unified resource. This resource would facilitate the next generation of RNA research and help drive further discoveries, including those that improve food production and human and animal health. We encourage additional RNA database resources and research groups to join this effort. We aim to obtain international network funding to further this endeavor. PMID:21940779

  18. RNAcentral: A vision for an international database of RNA sequences.

    PubMed

    Bateman, Alex; Agrawal, Shipra; Birney, Ewan; Bruford, Elspeth A; Bujnicki, Janusz M; Cochrane, Guy; Cole, James R; Dinger, Marcel E; Enright, Anton J; Gardner, Paul P; Gautheret, Daniel; Griffiths-Jones, Sam; Harrow, Jen; Herrero, Javier; Holmes, Ian H; Huang, Hsien-Da; Kelly, Krystyna A; Kersey, Paul; Kozomara, Ana; Lowe, Todd M; Marz, Manja; Moxon, Simon; Pruitt, Kim D; Samuelsson, Tore; Stadler, Peter F; Vilella, Albert J; Vogel, Jan-Hinnerk; Williams, Kelly P; Wright, Mathew W; Zwieb, Christian

    2011-11-01

    During the last decade there has been a great increase in the number of noncoding RNA genes identified, including new classes such as microRNAs and piRNAs. There is also a large growth in the amount of experimental characterization of these RNA components. Despite this growth in information, it is still difficult for researchers to access RNA data, because key data resources for noncoding RNAs have not yet been created. The most pressing omission is the lack of a comprehensive RNA sequence database, much like UniProt, which provides a comprehensive set of protein knowledge. In this article we propose the creation of a new open public resource that we term RNAcentral, which will contain a comprehensive collection of RNA sequences and fill an important gap in the provision of biomedical databases. We envision RNA researchers from all over the world joining a federated RNAcentral network, contributing specialized knowledge and databases. RNAcentral would centralize key data that are currently held across a variety of databases, allowing researchers instant access to a single, unified resource. This resource would facilitate the next generation of RNA research and help drive further discoveries, including those that improve food production and human and animal health. We encourage additional RNA database resources and research groups to join this effort. We aim to obtain international network funding to further this endeavor. PMID:21940779

  19. Method for rapid base sequencing in DNA and RNA

    DOEpatents

    Jett, J.H.; Keller, R.A.; Martin, J.C.; Moyzis, R.K.; Ratliff, R.L.; Shera, E.B.; Stewart, C.C.

    1987-10-07

    A method is provided for the rapid base sequencing of DNA or RNA fragments wherein a single fragment of DNA or RNA is provided with identifiable bases and suspended in a moving flow stream. An exonuclease sequentially cleaves individual bases from the end of the suspended fragment. The moving flow stream maintains the cleaved bases in an orderly train for subsequent detection and identification. In a particular embodiment, individual bases forming the DNA or RNA fragments are individually tagged with a characteristic fluorescent dye. The train of bases is then excited to fluorescence with an output spectrum characteristic of the individual bases. Accordingly, the base sequence of the original DNA or RNA fragment can be reconstructed. 2 figs.

  20. Method for rapid base sequencing in DNA and RNA

    DOEpatents

    Jett, James H.; Keller, Richard A.; Martin, John C.; Moyzis, Robert K.; Ratliff, Robert L.; Shera, E. Brooks; Stewart, Carleton C.

    1990-01-01

    A method is provided for the rapid base sequencing of DNA or RNA fragments wherein a single fragment of DNA or RNA is provided with identifiable bases and suspended in a moving flow stream. An exonuclease sequentially cleaves individual bases from the end of the suspended fragment. The moving flow stream maintains the cleaved bases in an orderly train for subsequent detection and identification. In a particular embodiment, individual bases forming the DNA or RNA fragments are individually tagged with a characteristic fluorescent dye. The train of bases is then excited to fluorescence with an output spectrum characteristic of the individual bases. Accordingly, the base sequence of the original DNA or RNA fragment can be reconstructed.

  1. Method for rapid base sequencing in DNA and RNA

    DOEpatents

    Jett, J.H.; Keller, R.A.; Martin, J.C.; Moyzis, R.K.; Ratliff, R.L.; Shera, E.B.; Stewart, C.C.

    1990-10-09

    A method is provided for the rapid base sequencing of DNA or RNA fragments wherein a single fragment of DNA or RNA is provided with identifiable bases and suspended in a moving flow stream. An exonuclease sequentially cleaves individual bases from the end of the suspended fragment. The moving flow stream maintains the cleaved bases in an orderly train for subsequent detection and identification. In a particular embodiment, individual bases forming the DNA or RNA fragments are individually tagged with a characteristic fluorescent dye. The train of bases is then excited to fluorescence with an output spectrum characteristic of the individual bases. Accordingly, the base sequence of the original DNA or RNA fragment can be reconstructed. 2 figs.

  2. Reentrant Melting of RNA with Quenched Sequence Randomness

    NASA Astrophysics Data System (ADS)

    Hayrapetyan, G. N.; Iannelli, F.; Lekscha, J.; Morozov, V. F.; Netz, R. R.; Mamasakhlisov, Y. Sh.

    2014-08-01

    The effect of quenched sequence disorder on the thermodynamics of RNA secondary structure formation is investigated for two- and four-letter alphabet models using the constrained annealing approach, from which the temperature behavior of the free energy, specific heat, and helicity is analytically obtained. For competing base pairing energies, the calculations reveal reentrant melting at low temperatures, in excellent agreement with numerical results. Our results suggest an additional mechanism for the experimental phenomenon of RNA cold denaturation.

  3. Sequence determinants of improved CRISPR sgRNA design

    PubMed Central

    Xu, Han; Xiao, Tengfei; Chen, Chen-Hao; Li, Wei; Meyer, Clifford A.; Wu, Qiu; Wu, Di; Cong, Le; Zhang, Feng; Liu, Jun S.; Brown, Myles; Liu, X. Shirley

    2015-01-01

    The CRISPR/Cas9 system has revolutionized mammalian somatic cell genetics. Genome-wide functional screens using CRISPR/Cas9-mediated knockout or dCas9 fusion-mediated inhibition/activation (CRISPRi/a) are powerful techniques for discovering phenotype-associated gene function. We systematically assessed the DNA sequence features that contribute to single guide RNA (sgRNA) efficiency in CRISPR-based screens. Leveraging the information from multiple designs, we derived a new sequence model for predicting sgRNA efficiency in CRISPR/Cas9 knockout experiments. Our model confirmed known features and suggested new features including a preference for cytosine at the cleavage site. The model was experimentally validated for sgRNA-mediated mutation rate and protein knockout efficiency. Tested on independent data sets, the model achieved significant results in both positive and negative selection conditions and outperformed existing models. We also found that the sequence preference for CRISPRi/a is substantially different from that for CRISPR/Cas9 knockout and propose a new model for predicting sgRNA efficiency in CRISPRi/a experiments. These results facilitate the genome-wide design of improved sgRNA for both knockout and CRISPRi/a studies. PMID:26063738

  4. Quantifying sequence and structural features of protein-RNA interactions.

    PubMed

    Li, Songling; Yamashita, Kazuo; Amada, Karlou Mar; Standley, Daron M

    2014-09-01

    Increasing awareness of the importance of protein-RNA interactions has motivated many approaches to predict residue-level RNA binding sites in proteins based on sequence or structural characteristics. Sequence-based predictors are usually high in sensitivity but low in specificity; conversely structure-based predictors tend to have high specificity, but lower sensitivity. Here we quantified the contribution of both sequence- and structure-based features as indicators of RNA-binding propensity using a machine-learning approach. In order to capture structural information for proteins without a known structure, we used homology modeling to extract the relevant structural features. Several novel and modified features enhanced the accuracy of residue-level RNA-binding propensity beyond what has been reported previously, including by meta-prediction servers. These features include: hidden Markov model-based evolutionary conservation, surface deformations based on the Laplacian norm formalism, and relative solvent accessibility partitioned into backbone and side chain contributions. We constructed a web server called aaRNA that implements the proposed method and demonstrate its use in identifying putative RNA binding sites. PMID:25063293

  5. Phylogenetic relationships of Cryptosporidium determined by ribosomal RNA sequence comparison.

    PubMed

    Johnson, A M; Fielke, R; Lumb, R; Baverstock, P R

    1990-04-01

    Reverse transcription of total cellular RNA was used to obtain a partial sequence of the small subunit ribosomal RNA of Cryptosporidium, a protist currently placed in the phylum Apicomplexa. The semi-conserved regions were aligned with homologous sequences in a range of other eukaryotes, and the evolutionary relationships of Cryptosporidium were determined by two different methods of phylogenetic analysis. The prokaryotes Escherichia coli and Halobacterium cuti were included as outgroups. The results do not show an especially close relationship of Cryptosporidium to other members of the phylum Apicomplexa. PMID:2332273

  6. Probing dimensionality beyond the linear sequence of mRNA.

    PubMed

    Del Campo, Cristian; Ignatova, Zoya

    2016-05-01

    mRNA is a nexus entity between DNA and translating ribosomes. Recent developments in deep sequencing technologies coupled with structural probing have revealed new insights beyond the classic role of mRNA and place it more centrally as a direct effector of a variety of processes, including translation, cellular localization, and mRNA degradation. Here, we highlight emerging approaches to probe mRNA secondary structure on a global transcriptome-wide level and compare their potential and resolution. Combined approaches deliver a richer and more complex picture. While our understanding on the effect of secondary structure for various cellular processes is quite advanced, the next challenge is to unravel more complex mRNA architectures and tertiary interactions. PMID:26650615

  7. Statistical mechanics of secondary structures formed by random RNA sequences

    NASA Astrophysics Data System (ADS)

    Bundschuh, Ralf

    2003-03-01

    In addition to its importance for the biological function of RNA molecules, RNA secondary structure formation is an interesting system from the statistical physics point of view. The ensemble of secondary structures of random RNA sequences shows a rich phase diagram with distinct native, denatured, molten, and glassy phases separated by thermodynamical phase transitions. These phase transitions are driven by the competition between thermal fluctuations, the disorder frozen into the specific sequence of a given RNA molecule, and the evolutionary bias towards the formation of some biologically relevant structure. Yet, in contrast to the protein folding problem, which is driven by very similar principles and shows a similar phase diagram, RNA secondary structure formation can be represented by a simple diagrammatic language which allows the application of various analytical and numerical methods. This makes RNA secondary structure formation an ideal model system for heteropolymer folding. In the talk, I will characterize and explain the complex behaviour of RNA folding using several simple models and discuss possible implications to biological processes.

  8. High-throughput RNA interference screening using pooled shRNA libraries and next generation sequencing

    PubMed Central

    2011-01-01

    RNA interference (RNAi) screening is a state-of-the-art technology that enables the dissection of biological processes and disease-related phenotypes. The commercial availability of genome-wide, short hairpin RNA (shRNA) libraries has fueled interest in this area but the generation and analysis of these complex data remain a challenge. Here, we describe complete experimental protocols and novel open source computational methodologies, shALIGN and shRNAseq, that allow RNAi screens to be rapidly deconvoluted using next generation sequencing. Our computational pipeline offers efficient screen analysis and the flexibility and scalability to quickly incorporate future developments in shRNA library technology. PMID:22018332

  9. Studying RNA Homology and Conservation with Infernal: From Single Sequences to RNA Families.

    PubMed

    Barquist, Lars; Burge, Sarah W; Gardner, Paul P

    2016-01-01

    Emerging high-throughput technologies have led to a deluge of putative non-coding RNA (ncRNA) sequences identified in a wide variety of organisms. Systematic characterization of these transcripts will be a tremendous challenge. Homology detection is critical to making maximal use of functional information gathered about ncRNAs: identifying homologous sequence allows us to transfer information gathered in one organism to another quickly and with a high degree of confidence. ncRNA presents a challenge for homology detection, as the primary sequence is often poorly conserved and de novo secondary structure prediction and search remain difficult. This unit introduces methods developed by the Rfam database for identifying "families" of homologous ncRNAs starting from single "seed" sequences, using manually curated sequence alignments to build powerful statistical models of sequence and structure conservation known as covariance models (CMs), implemented in the Infernal software package. We provide a step-by-step iterative protocol for identifying ncRNA homologs and then constructing an alignment and corresponding CM. We also work through an example for the bacterial small RNA MicA, discovering a previously unreported family of divergent MicA homologs in genus Xenorhabdus in the process. © 2016 by John Wiley & Sons, Inc. PMID:27322404

  10. Transcriptome Profiling of Developing Murine Lens Through RNA Sequencing

    PubMed Central

    Khan, Shahid Y.; Hackett, Sean F.; Lee, Mei-Chong W.; Pourmand, Nader; Talbot, C. Conover; Riazuddin, S. Amer

    2015-01-01

    Purpose: Transcriptome is the entire repertoire of transcripts present in a cell at any particular time. We undertook a next-generation whole transcriptome sequencing approach to gain insight into the transcriptional landscape of the developing mouse lens. Methods: We ascertained mouse lenses at six developmental time points including two embryonic (E15 and E18) and four postnatal stages (P0, P3, P6, and P9). The ocular tissue at each time point was maintained as two distinct pools serving as biological replicates for each developmental stage. The mRNA and small RNA libraries were paired-end sequenced on Illumina HiSeq 2000 and subsequently analyzed using bioinformatics tools. Results: Mapping of mRNA and small RNA libraries generated 187.56 and 154.22 million paired-end reads, respectively. We detected a total of 14,465 genes in the mouse ocular lens at the above-mentioned six developmental stages. Of these, 46 genes exhibited a 40-fold differential (higher or lower) expression at one of the five developmental stages (E18, P0, P3, P6, and P9) compared with their expression level at E15. Likewise, small RNA profiling identified 379 microRNAs (miRNAs) expressed in mouse lens at six developmental time points. Of these, 49 miRNAs manifested an 8-fold differential (higher or lower) expression at one of the five developmental stages, as mentioned above, compared with their expression level at E15. Conclusions: We report a comprehensive profile of developing murine lens transcriptome including both mRNA and miRNA through next-generation RNA sequencing. A complete repository of the lens transcriptome of six developmental time points will be monumental in elucidating processes essential for the development of the ocular lens and maintenance of its transparency. PMID:26225632
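
    The fold-change criterion in this abstract (expression at a later stage at least 40-fold higher or lower than at E15) can be expressed as a small filter. The sketch below rests on assumptions: the abstract does not say how expression values were normalized or how zero values were handled, so the pseudocount and the function name `differential_genes` are illustrative choices, not the authors' pipeline.

```python
def differential_genes(expr, baseline="E15", fold=40.0, pseudo=1.0):
    """Return genes whose expression at any non-baseline stage differs from
    the baseline stage by at least `fold`, in either direction.

    expr maps gene name -> {stage name -> expression value}.
    `pseudo` is an assumed pseudocount that avoids division by zero.
    """
    hits = []
    for gene, by_stage in expr.items():
        base = by_stage[baseline] + pseudo
        for stage, value in by_stage.items():
            if stage == baseline:
                continue
            ratio = (value + pseudo) / base
            if ratio >= fold or ratio <= 1.0 / fold:
                hits.append(gene)
                break  # one qualifying stage is enough
    return hits
```

    Using 8.0 for `fold` gives the corresponding filter for the miRNA comparison mentioned in the same abstract.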

  11. MicroRNA Expression Profile in Penile Cancer Revealed by Next-Generation Small RNA Sequencing

    PubMed Central

    Zhang, Yuanwei; Xu, Bo; Zhou, Jun; Fan, Song; Hao, Zongyao; Shi, Haoqiang; Zhang, Xiansheng; Kong, Rui; Xu, Lingfan; Gao, Jingjing; Zou, Duohong; Liang, Chaozhao

    2015-01-01

    Penile cancer (PeCa) is a relatively rare tumor entity but possesses higher morbidity and mortality rates especially in developing countries. To date, the concrete pathogenic signaling pathways and core machineries involved in tumorigenesis and progression of PeCa remain to be elucidated. Several studies suggested miRNAs, which modulate gene expression at posttranscriptional level, were frequently mis-regulated and aberrantly expressed in human cancers. However, the miRNA profile in human PeCa has not been reported before. In the present study, the miRNA profile was obtained from 10 fresh penile cancerous tissues and matched adjacent non-cancerous tissues via next-generation sequencing. As a result, a total of 751 and 806 annotated miRNAs were identified in normal and cancerous penile tissues, respectively. Among these, 56 miRNAs with significantly different expression levels between paired tissues were identified. Subsequently, several annotated miRNAs were selected randomly and validated using quantitative real-time PCR. Compared with previous publications regarding altered miRNA expression in various cancers and especially genitourinary (prostate, bladder, kidney, testis) cancers, the vast majority of deregulated miRNAs showed a similar expression pattern in penile cancer. Moreover, the bioinformatics analyses suggested that the putative target genes of differentially expressed miRNAs between cancerous and matched normal penile tissues were tightly associated with cell junction, proliferation, growth as well as genomic instability and so on, by modulating Wnt, MAPK, p53, PI3K-Akt, Notch and TGF-β signaling pathways, which were all well-established to participate in cancer initiation and progression. Our work presents a global view of the differentially expressed miRNAs and potentially regulatory networks of their target genes for clarifying the pathogenic transformation of normal penis to PeCa, which research resource also provides new insights

  12. Using RNA Sequencing to Classify Organisms into Three Primary Kingdoms.

    ERIC Educational Resources Information Center

    Evans, Robert H.

    1983-01-01

    Using the biochemical record to class archaebacteria, eukaryotes, and eubacteria involves abstractions difficult for the concrete learner. Therefore, a method is provided in which students discover some basic tenets of biochemical classification and apply them in a "hands-on" classification problem. The method involves use of RNA sequencing. (JN)

  13. RNA sequencing of the nephron transcriptome: a technical note

    PubMed Central

    Lee, Jae Wook

    2015-01-01

    To understand the functions of the kidney, the transcriptome of each part of the nephron needs to be profiled using a highly sensitive and unbiased tool. RNA sequencing (RNA-seq) has revolutionized transcriptomic research, enabling researchers to define transcription activity and functions of genomic elements with unprecedented sensitivity and precision. Recently, RNA-seq for polyadenylated messenger RNAs [poly(A)′-mRNAs] and classical microdissection were successfully combined to investigate the transcriptome of glomeruli and 14 different renal tubule segments. A rat kidney is perfused with and incubated in collagenase solution, and the digested kidney is manually dissected under a stereomicroscope. Individual glomeruli and renal tubule segments are identified by their anatomical and morphological characteristics and collected in phosphate-buffered saline. Poly(A)′-tailed mRNAs are released from cell lysate, captured by oligo-dT primers, and made into complementary DNAs (cDNAs) using a highly sensitive reverse transcription method. These cDNAs are sheared by sonication and prepared into adapter-ligated cDNA libraries for Illumina sequencing. Nucleotide sequences reported from the sequencing reaction are mapped to the rat reference genome for gene expression analysis. These RNA-seq transcriptomic data were highly consistent with prior knowledge of gene expression along the nephron. The gene expression data obtained in this work are available as a public Web page (https://helixweb.nih.gov/ESBL/Database/NephronRNAseq/) and can be used to explore the transcriptomic landscape of the nephron. PMID:26779425

  14. Sequence specificity of mRNA N6-adenosine methyltransferase.

    PubMed

    Csepany, T; Lin, A; Baldick, C J; Beemon, K

    1990-11-25

    The sequence specificity of chicken mRNA N6-adenosine methyltransferase has been investigated in vivo. Localization of six new N6-methyladenosine sites on Rous sarcoma virus (RSV) virion RNA has confirmed our extended consensus sequence for methylation: RGACU, where R is usually a G (7/12). We have also observed A (2/12) and U (3/12) at the -2 position (relative to m6A at +1) but never a C. At the +3 position, the U was observed 10/12 times; an A and a C were observed once each in weakly methylated sequences. The extent of methylation varied between the different sites up to a maximum of about 90%. To test the significance of this consensus sequence, it was altered by site-specific mutagenesis, and methylation was assayed after transfection of mutated RSV DNA into chicken embryo fibroblasts. We found that changing the G at -1 or the U at +3 to any other residue inhibited methylation. However, inhibition of methylation at all four of the major sites in the RSV src gene did not detectably alter the steady-state levels of the three viral RNA species or viral infectivity. Additional mutants that inactivated the src protein kinase activity produced less virus and exhibited relatively less src mRNA in infected cells. PMID:2173695
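
    The consensus reported in this abstract lends itself to a simple sequence scan. The sketch below looks for the strict form of the motif: G, A, or U at -2 (never C), G at -1, the methylated A at +1, C at +2, and U at +3. Requiring U at +3 is a simplification, since the abstract notes that A and C were each seen once there in weakly methylated sequences. The function name `find_m6a_candidates` is hypothetical, and a match is only a candidate site, not evidence of methylation.

```python
import re

# Zero-width lookahead so that overlapping motif occurrences are all reported.
M6A_SITE = re.compile(r"(?=([GAU]GACU))")

def find_m6a_candidates(rna):
    """Return (index, motif) pairs, where index is the 0-based position of
    the candidate methylated adenosine within the input sequence."""
    rna = rna.upper().replace("T", "U")  # accept DNA-style input as well
    # The A sits two bases after the start of the five-base motif.
    return [(m.start() + 2, m.group(1)) for m in M6A_SITE.finditer(rna)]
```

    For example, `find_m6a_candidates("CCGGACUCC")` reports the single site GGACU with the adenosine at index 4, while a sequence with C at the -2 position, such as "CCGACU", yields no candidates.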

  15. SRP-RNA sequence alignment and secondary structure.

    PubMed Central

    Larsen, N; Zwieb, C

    1991-01-01

    The secondary structures of the RNAs from the signal recognition particle, termed SRP-RNA, were derived by comparative analyses of an alignment of 39 sequences. The models are minimal in that only base pairs are included for which there is comparative evidence. The structures represent refinements of earlier versions and include a new short helix. PMID:1707519

  16. siRNA release from pri-miRNA scaffolds is controlled by the sequence and structure of RNA.

    PubMed

    Galka-Marciniak, Paulina; Olejniczak, Marta; Starega-Roslan, Julia; Szczesniak, Michal W; Makalowska, Izabela; Krzyzosiak, Wlodzimierz J

    2016-04-01

    shmiRs are pri-miRNA-based RNA interference triggers from which exogenous siRNAs are expressed in cells to silence target genes. These reagents are very promising tools in RNAi in vivo applications due to their good activity profile and lower toxicity than observed for other vector-based reagents such as shRNAs. In this study, using high-resolution northern blotting and small RNA sequencing, we investigated the precision with which RNases Drosha and Dicer process shmiRs. The fidelity of siRNA release from the commonly used pri-miRNA shuttles was found to depend on both the siRNA insert and the pri-miR scaffold. Then, we searched for specific factors that may affect the precision of siRNA release and found that both the structural features of shmiR hairpins and the nucleotide sequence at Drosha and Dicer processing sites contribute to cleavage site selection and cleavage precision. An analysis of multiple shRNA intermediates generated from several reagents revealed the complexity of shmiR processing by Drosha and demonstrated that Dicer selects substrates for further processing. Aside from providing new basic knowledge regarding the specificity of nucleases involved in miRNA biogenesis, our results facilitate the rational design of more efficient genetic reagents for RNAi technology. PMID:26921501

  17. Toward Rare Blood Cell Preservation for RNA Sequencing.

    PubMed

    Vickovic, Sanja; Ahmadian, Afshin; Lewensohn, Rolf; Lundeberg, Joakim

    2015-07-01

    Cancer is driven by various events leading to cell differentiation and disease progression. Molecular tools are powerful approaches for describing how and why these events occur. With the growing field of next-generation DNA sequencing, there is an increasing need for high-quality nucleic acids derived from human cells and tissues, a prerequisite for successful cell profiling. Although advances in RNA preservation have been made, some of the largest biobanks still do not employ RNA blood preservation as standard because of limitations in low blood-input volume and RNA stability over the whole gene body. Therefore, we have developed a robust protocol for blood preservation and long-term storage while maintaining RNA integrity. Furthermore, we explored the possibility of using the protocol for preserving rare cell samples, such as circulating tumor cells. The results of our study confirmed that gene expression was not impacted by the preservation procedure (r² > 0.88) or by long-term storage (r² = 0.95), with RNA integrity number values averaging over 8. Similarly, cell surface antigens were still available for antibody selection (r² = 0.95). Lastly, data mining for fusion events showed that it was possible to detect rare tumor cells among a background of other cells present in blood irrespective of fixation. Thus, the developed protocol would be suitable for rare blood cell preservation followed by RNA sequencing analysis. PMID:25989392

  18. Learning to Predict miRNA-mRNA Interactions from AGO CLIP Sequencing and CLASH Data.

    PubMed

    Lu, Yuheng; Leslie, Christina S

    2016-07-01

    Recent technologies like AGO CLIP sequencing and CLASH enable direct transcriptome-wide identification of AGO binding and miRNA target sites, but the most widely used miRNA target prediction algorithms do not exploit these data. Here we use discriminative learning on AGO CLIP and CLASH interactions to train a novel miRNA target prediction model. Our method combines two SVM classifiers, one to predict miRNA-mRNA duplexes and a second to learn a binding model of AGO's local UTR sequence preferences and positional bias in 3'UTR isoforms. The duplex SVM model enables the prediction of non-canonical target sites and more accurately resolves miRNA interactions from AGO CLIP data than previous methods. The binding model is trained using a multi-task strategy to learn context-specific and common AGO sequence preferences. The duplex and common AGO binding models together outperform existing miRNA target prediction algorithms on held-out binding data. Open source code is available at https://bitbucket.org/leslielab/chimiric. PMID:27438777

  19. Learning to Predict miRNA-mRNA Interactions from AGO CLIP Sequencing and CLASH Data

    PubMed Central

    Lu, Yuheng; Leslie, Christina S.

    2016-01-01

    Recent technologies like AGO CLIP sequencing and CLASH enable direct transcriptome-wide identification of AGO binding and miRNA target sites, but the most widely used miRNA target prediction algorithms do not exploit these data. Here we use discriminative learning on AGO CLIP and CLASH interactions to train a novel miRNA target prediction model. Our method combines two SVM classifiers, one to predict miRNA-mRNA duplexes and a second to learn a binding model of AGO’s local UTR sequence preferences and positional bias in 3’UTR isoforms. The duplex SVM model enables the prediction of non-canonical target sites and more accurately resolves miRNA interactions from AGO CLIP data than previous methods. The binding model is trained using a multi-task strategy to learn context-specific and common AGO sequence preferences. The duplex and common AGO binding models together outperform existing miRNA target prediction algorithms on held-out binding data. Open source code is available at https://bitbucket.org/leslielab/chimiric. PMID:27438777

  20. Structurally complex and highly active RNA ligases derived from random RNA sequences

    NASA Technical Reports Server (NTRS)

    Ekland, E. H.; Szostak, J. W.; Bartel, D. P.

    1995-01-01

    Seven families of RNA ligases, previously isolated from random RNA sequences, fall into three classes on the basis of secondary structure and regiospecificity of ligation. Two of the three classes of ribozymes have been engineered to act as true enzymes, catalyzing the multiple-turnover transformation of substrates into products. The most complex of these ribozymes has a minimal catalytic domain of 93 nucleotides. An optimized version of this ribozyme has a kcat exceeding one per second, a value far greater than that of most natural RNA catalysts and approaching that of comparable protein enzymes. The fact that such a large and complex ligase emerged from a very limited sampling of sequence space implies the existence of a large number of distinct RNA structures of equivalent complexity and activity.

  1. Integrated microRNA-mRNA analyses reveal OPLL specific microRNA regulatory network using high-throughput sequencing.

    PubMed

    Xu, Chen; Chen, Yu; Zhang, Hao; Chen, Yuanyuan; Shen, Xiaolong; Shi, Changgui; Liu, Yang; Yuan, Wen

    2016-01-01

    Ossification of the posterior longitudinal ligament (OPLL) is a genetic disorder which involves pathological heterotopic ossification of the spinal ligaments. Although studies have identified several genes that correlated with OPLL, the underlying regulation network is far from clear. Through small RNA sequencing, we compared the microRNA expressions of primary posterior longitudinal ligament cells from OPLL patients with normal patients (PLL) and identified 218 dysregulated miRNAs (FDR < 0.01). Furthermore, assessing the miRNA profiling data of multiple cell types, we found these dysregulated miRNAs were mostly OPLL specific. In order to decipher the regulation network of these OPLL specific miRNAs, we integrated mRNA expression profiling data with miRNA sequencing data. Through computational approaches, we showed the pivotal roles of these OPLL specific miRNAs in heterotopic ossification of longitudinal ligament by discovering highly correlated miRNA/mRNA pairs that associated with skeletal system development, collagen fibril organization, and extracellular matrix organization. These results provide strong evidence that the miRNA regulatory networks we established may indeed play vital roles in OPLL onset and progression. To date, this is the first systematic analysis of the micronome in OPLL, and thus may provide valuable resources in finding novel treatment and diagnostic targets of OPLL. PMID:26868491

  2. Using Small RNA Deep Sequencing Data to Detect Human Viruses

    PubMed Central

    Wang, Fang; Sun, Yu; Ruan, Jishou; Chen, Rui; Chen, Xin; Chen, Chengjie; Kreuze, Jan F.; Fei, ZhangJun; Zhu, Xiao

    2016-01-01

    Small RNA sequencing (sRNA-seq) can be used to detect viruses in infected hosts without the necessity to have any prior knowledge or specialized sample preparation. The sRNA-seq method was initially used for viral detection and identification in plants and then in invertebrates and fungi. However, it is still controversial to use sRNA-seq in the detection of mammalian or human viruses. In this study, we used 931 sRNA-seq runs of data from the NCBI SRA database to detect and identify viruses in human cells or tissues, particularly from some clinical samples. Six viruses including HPV-18, HBV, HCV, HIV-1, SMRV, and EBV were detected from 36 runs of data. Four viruses were consistent with the annotations from the previous studies. HIV-1 was found in clinical samples without the HIV-positive reports, and SMRV was found in Diffuse Large B-Cell Lymphoma cells for the first time. In conclusion, these results suggest the sRNA-seq can be used to detect viruses in mammals and humans. PMID:27066498

  3. Using Small RNA Deep Sequencing Data to Detect Human Viruses.

    PubMed

    Wang, Fang; Sun, Yu; Ruan, Jishou; Chen, Rui; Chen, Xin; Chen, Chengjie; Kreuze, Jan F; Fei, ZhangJun; Zhu, Xiao; Gao, Shan

    2016-01-01

    Small RNA sequencing (sRNA-seq) can be used to detect viruses in infected hosts without the necessity to have any prior knowledge or specialized sample preparation. The sRNA-seq method was initially used for viral detection and identification in plants and then in invertebrates and fungi. However, it is still controversial to use sRNA-seq in the detection of mammalian or human viruses. In this study, we used 931 sRNA-seq runs of data from the NCBI SRA database to detect and identify viruses in human cells or tissues, particularly from some clinical samples. Six viruses including HPV-18, HBV, HCV, HIV-1, SMRV, and EBV were detected from 36 runs of data. Four viruses were consistent with the annotations from the previous studies. HIV-1 was found in clinical samples without the HIV-positive reports, and SMRV was found in Diffuse Large B-Cell Lymphoma cells for the first time. In conclusion, these results suggest the sRNA-seq can be used to detect viruses in mammals and humans. PMID:27066498

  4. Experimental design, preprocessing, normalization and differential expression analysis of small RNA sequencing experiments

    PubMed Central

    2011-01-01

    Prior to the advent of new, deep sequencing methods, small RNA (sRNA) discovery was dependent on Sanger sequencing, which was time-consuming and limited knowledge to only the most abundant sRNA. The innovation of large-scale, next-generation sequencing has exponentially increased knowledge of the biology, diversity and abundance of sRNA populations. In this review, we discuss issues involved in the design of sRNA sequencing experiments, including choosing a sequencing platform, inherent biases that affect sRNA measurements and replication. We outline the steps involved in preprocessing sRNA sequencing data and review both the principles behind and the current options for normalization. Finally, we discuss differential expression analysis in the absence and presence of biological replicates. While our focus is on sRNA sequencing experiments, many of the principles discussed are applicable to the sequencing of other RNA populations. PMID:21356093
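
    As a concrete illustration of what normalization means for sRNA count data, the sketch below scales raw read counts to reads per million (RPM). Total-count scaling is chosen here only because it is the simplest option to show; it is an assumption of this example, not a recommendation taken from the review, which covers several alternatives. The function name and the sRNA labels are hypothetical.

```python
def reads_per_million(counts: dict[str, int]) -> dict[str, float]:
    """Scale raw per-sRNA read counts by library size (total-count normalization)."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("library has no counted reads")
    return {name: n * 1_000_000 / total for name, n in counts.items()}


# Hypothetical library with 10,000 counted reads in total.
library = {"sRNA-a": 500, "sRNA-b": 1500, "sRNA-c": 8000}
print(reads_per_million(library))
```

    Total-count scaling makes libraries of different depth comparable, but a few very abundant sRNAs can dominate the total and distort every other value, which is one reason alternative normalization schemes exist.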

  5. HLA typing from RNA-Seq sequence reads.

    PubMed

    Boegel, Sebastian; Löwer, Martin; Schäfer, Michael; Bukur, Thomas; de Graaf, Jos; Boisguérin, Valesca; Türeci, Ozlem; Diken, Mustafa; Castle, John C; Sahin, Ugur

    2012-01-01

    We present a method, seq2HLA, for obtaining an individual's human leukocyte antigen (HLA) class I and II type and expression using standard next generation sequencing RNA-Seq data. RNA-Seq reads are mapped against a reference database of HLA alleles, and HLA type, confidence score and locus-specific expression level are determined. We successfully applied seq2HLA to 50 individuals included in the HapMap project, yielding 100% specificity and 94% sensitivity at a P-value of 0.1 for two-digit HLA types. We determined HLA type and expression for previously un-typed Illumina Body Map tissues and a cohort of Korean patients with lung cancer. Because the algorithm uses standard RNA-Seq reads and requires no change to laboratory protocols, it can be used for both existing datasets and future studies, thus adding a new dimension for HLA typing and biomarker studies. PMID:23259685

  6. Using small RNA (sRNA) deep sequencing to understand global virus distribution in plants

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Small RNAs (sRNAs), a class of regulatory RNAs, have been used to serve as the specificity determinants of suppressing gene expression in plants and animals. Next generation sequencing (NGS) uncovered the sRNA landscape in most organisms including their associated microbes. In the current study, w...

  7. Dis3- and exosome subunit-responsive 3' mRNA instability elements.

    PubMed

    Kiss, Daniel L; Hou, Dezhi; Gross, Robert H; Andrulis, Erik D

    2012-07-01

    Eukaryotic RNA turnover is regulated in part by the exosome, a nuclear and cytoplasmic complex of ribonucleases (RNases) and RNA-binding proteins. The major RNase of the complex is thought to be Dis3, a multi-functional 3'-5' exoribonuclease and endoribonuclease. Although it is known that Dis3 and core exosome subunits are recruited to transcriptionally active genes and to messenger RNA (mRNA) substrates, this recruitment is thought to occur indirectly. We sought to discover cis-acting elements that recruit Dis3 or other exosome subunits. Using a bioinformatic tool called RNA SCOPE to screen the 3' untranslated regions of up-regulated transcripts from our published Dis3 depletion-derived transcriptomic data set, we identified several motifs as candidate instability elements. Secondary screening using a luciferase reporter system revealed that one cassette, harboring four elements, destabilized the reporter transcript. RNAi-based depletion of Dis3, Rrp6, Rrp4, Rrp40, or Rrp46 diminished the efficacy of cassette-mediated destabilization. Truncation analysis of the cassette showed that two exosome subunit-sensitive elements (ESSEs) destabilized the reporter. Point-directed mutagenesis of ESSE abrogated the destabilization effect. An examination of the transcriptomic data from exosome subunit depletion-based microarrays revealed that mRNAs with ESSEs are found in every up-regulated mRNA data set but are underrepresented or missing from the down-regulated data sets. Taken together, our findings imply a potentially novel mechanism of mRNA turnover that involves direct Dis3 and other exosome subunit recruitment to and/or regulation on mRNA substrates. PMID:22668878

  8. tRNA-Related Sequences Trigger Systemic mRNA Transport in Plants.

    PubMed

    Zhang, Wenna; Thieme, Christoph J; Kollwig, Gregor; Apelt, Federico; Yang, Lei; Winter, Nikola; Andresen, Nadine; Walther, Dirk; Kragler, Friedrich

    2016-06-01

    In plants, protein-coding mRNAs can move via the phloem vasculature to distant tissues, where they may act as non-cell-autonomous signals. Emerging work has identified many phloem-mobile mRNAs, but little is known regarding RNA motifs triggering mobility, the extent of mRNA transport, and the potential of transported mRNAs to be translated into functional proteins after transport. To address these aspects, we produced reporter transcripts harboring tRNA-like structures (TLSs) that were found to be enriched in the phloem stream and in mRNAs moving over chimeric graft junctions. Phenotypic and enzymatic assays on grafted plants indicated that mRNAs harboring a distinctive TLS can move from transgenic roots into wild-type leaves and from transgenic leaves into wild-type flowers or roots; these mRNAs can also be translated into proteins after transport. In addition, we provide evidence that dicistronic mRNA:tRNA transcripts are frequently produced in Arabidopsis thaliana and are enriched in the population of graft-mobile mRNAs. Our results suggest that tRNA-derived sequences with predicted stem-bulge-stem-loop structures are sufficient to mediate mRNA transport and seem to be necessary for the mobility of a large number of endogenous transcripts that can move through graft junctions. PMID:27268430

  9. tRNA-Related Sequences Trigger Systemic mRNA Transport in Plants

    PubMed Central

    Zhang, Wenna; Kollwig, Gregor; Apelt, Federico; Walther, Dirk

    2016-01-01

    In plants, protein-coding mRNAs can move via the phloem vasculature to distant tissues, where they may act as non-cell-autonomous signals. Emerging work has identified many phloem-mobile mRNAs, but little is known regarding RNA motifs triggering mobility, the extent of mRNA transport, and the potential of transported mRNAs to be translated into functional proteins after transport. To address these aspects, we produced reporter transcripts harboring tRNA-like structures (TLSs) that were found to be enriched in the phloem stream and in mRNAs moving over chimeric graft junctions. Phenotypic and enzymatic assays on grafted plants indicated that mRNAs harboring a distinctive TLS can move from transgenic roots into wild-type leaves and from transgenic leaves into wild-type flowers or roots; these mRNAs can also be translated into proteins after transport. In addition, we provide evidence that dicistronic mRNA:tRNA transcripts are frequently produced in Arabidopsis thaliana and are enriched in the population of graft-mobile mRNAs. Our results suggest that tRNA-derived sequences with predicted stem-bulge-stem-loop structures are sufficient to mediate mRNA transport and seem to be necessary for the mobility of a large number of endogenous transcripts that can move through graft junctions. PMID:27268430

  10. Divergent RNA editing frequencies in hornwort mitochondrial nad5 sequences.

    PubMed

    Duff, R Joel

    2006-02-01

    Hornwort mitochondrial genomes have some of the highest rates of RNA editing among plants. Comparison of eleven partial mitochondrial nad5 genomic and cDNA sequences from diverse taxa of hornworts reveals 125 edited sites in only 1107 nt. No single sample exhibits more than half of these sites. Ten of the 11 hornwort taxa have between 35 and 54 edited sites each, whereas the eleventh taxon, Leiosporoceros, which represents a potential sister taxon to all other hornworts, has only eight sites. Comparison of multiple cDNA sequences from several individuals reveals the presence of many immature transcripts, showing the heterogeneous nature of the progression of editing. Phylogenetic analyses of hornwort genomic and cDNA sequences reveal that 65 of the 94 phylogenetically informative sites within the hornwort clade are edited positions. PMID:16376027
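
    Edited sites of the kind counted above are found by comparing a genomic sequence with the cDNA derived from its transcript. The sketch below shows that comparison in its simplest form; it assumes the two sequences are already aligned to equal length (with `-` for gaps) and uses short made-up sequences, not nad5 data. The function name `edited_sites` is hypothetical.

```python
def edited_sites(genomic: str, cdna: str) -> list[tuple[int, str, str]]:
    """List (1-based position, genomic base, cDNA base) wherever the two differ.

    Assumes both sequences are pre-aligned to the same length; gap columns
    ('-') are skipped because they are alignment artifacts, not edits.
    """
    if len(genomic) != len(cdna):
        raise ValueError("sequences must be pre-aligned to equal length")
    sites = []
    for pos, (g, c) in enumerate(zip(genomic.upper(), cdna.upper()), start=1):
        if g != c and "-" not in (g, c):
            sites.append((pos, g, c))
    return sites


# Made-up example: one C-to-T and one T-to-C difference (DNA alphabet, as in cDNA).
print(edited_sites("ATGCCATTA", "ATGTCATCA"))
```

    For scale, the 125 edited sites in 1107 nt reported in the abstract correspond to roughly 11% of positions being edited in at least one taxon.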

  11. SimFuse: A Novel Fusion Simulator for RNA Sequencing (RNA-Seq) Data.

    PubMed

    Tan, Yuxiang; Tambouret, Yann; Monti, Stefano

    2015-01-01

    The performance evaluation of fusion detection algorithms from high-throughput sequencing data crucially relies on the availability of data with known positive and negative cases of gene rearrangements. The use of simulated data circumvents some shortcomings of real data by generation of an unlimited number of true and false positive events, and the consequent robust estimation of accuracy measures, such as precision and recall. Although a few simulated fusion datasets from RNA Sequencing (RNA-Seq) are available, they are of limited sample size. This makes it difficult to systematically evaluate the performance of RNA-Seq based fusion-detection algorithms. Here, we present SimFuse to address this problem. SimFuse utilizes real sequencing data as the fusions' background to closely approximate the distribution of reads from a real sequencing library and uses a reference genome as the template from which to simulate fusions' supporting reads. To assess the supporting read-specific performance, SimFuse generates multiple datasets with various numbers of fusion supporting reads. Compared to an extant simulated dataset, SimFuse gives users control over the supporting read features and the sample size of the simulated library, based on which the performance metrics needed for the validation and comparison of alternative fusion-detection algorithms can be rigorously estimated. PMID:26839886
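
    The accuracy measures named above are straightforward to compute once the simulated fusions are known. The sketch below is a generic illustration, not part of SimFuse: it assumes each fusion is identified by a string label, and the gene names in the example are placeholders.

```python
def precision_recall(predicted: set[str], truth: set[str]) -> tuple[float, float]:
    """Precision and recall of predicted fusion calls against the simulated truth set."""
    true_positives = len(predicted & truth)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(truth) if truth else 0.0
    return precision, recall


# Placeholder labels: four simulated fusions, three calls, two of them correct.
truth = {"GENE1-GENE2", "GENE3-GENE4", "GENE5-GENE6", "GENE7-GENE8"}
predicted = {"GENE1-GENE2", "GENE3-GENE4", "GENE9-GENE10"}
precision, recall = precision_recall(predicted, truth)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

    Repeating this calculation over datasets simulated with different numbers of supporting reads would show how a detector's recall depends on read support, which is the kind of comparison the abstract describes.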

  12. Sequence and expression of ferredoxin mRNA in barley

    SciTech Connect

    Zielinski, R.; Funder, P.M.; Ling, V.

    1990-05-01

    We have isolated and structurally characterized a full-length cDNA clone encoding ferredoxin from a λgt10 cDNA library prepared from barley leaf mRNA. The ferredoxin clone (pBFD-1) was fused head-to-head with a partial-length cDNA clone encoding calmodulin, and was fortuitously isolated by screening the library with a calmodulin-specific oligonucleotide probe. The mRNA sequence from which pBFD-1 was derived is expressed exclusively in the leaf tissues of 7-d old barley seedlings. Barley pre-ferredoxin has a predicted size of 15.3 kDal, of which 4.6 kDal are accounted for by the transit peptide. The polypeptide encoded by pBFD-1 is identical to wheat ferredoxin, and shares slightly more amino acid sequence similarity with spinach ferredoxin I than with ferredoxin II. Ferredoxin mRNA levels are rapidly increased 10-fold by white light in etiolated barley leaves.

  13. Chaining sequence/structure seeds for computing RNA similarity.

    PubMed

    Bourgeade, Laetitia; Chauve, Cédric; Allali, Julien

    2015-03-01

    We describe a new method to compare a query RNA with a static set of target RNAs. Our method is based on (i) a static indexing of the sequence/structure seeds of the target RNAs; (ii) searching the target RNAs by detecting seeds of the query present in the target, chaining these seeds in promising candidate homologs; and then (iii) completing the alignment using an anchor-based exact alignment algorithm. We apply our method on the benchmark Bralibase2.1 and compare its accuracy and efficiency with the exact method LocARNA and its recent seeds-based speed-up ExpLoc-P. Our pipeline RNA-unchained greatly improves computation time of LocARNA and is comparable to the one of ExpLoc-P, while improving the overall accuracy of the final alignments. PMID:25768236
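
    Step (ii) above, chaining seeds into candidate homologs, can be illustrated with a textbook collinear-chaining routine. The sketch below is a generic quadratic-time dynamic program and makes no claim to match the authors' implementation: it treats a seed as a plain (query start, target start, length) match and ignores the structural component of the sequence/structure seeds. The names `Seed` and `best_chain` are hypothetical.

```python
from typing import NamedTuple


class Seed(NamedTuple):
    q: int       # start position in the query
    t: int       # start position in the target
    length: int  # number of matched positions


def best_chain(seeds: list[Seed]) -> list[Seed]:
    """Return the collinear, non-overlapping chain of seeds with maximal total length."""
    if not seeds:
        return []
    ordered = sorted(seeds)                 # by query start, then target start
    score = [s.length for s in ordered]     # best chain score ending at each seed
    prev = [-1] * len(ordered)              # back-pointers for traceback
    for i, cur in enumerate(ordered):
        for j in range(i):
            p = ordered[j]
            # p may precede cur only if it ends before cur starts on both sequences
            if p.q + p.length <= cur.q and p.t + p.length <= cur.t:
                if score[j] + cur.length > score[i]:
                    score[i] = score[j] + cur.length
                    prev[i] = j
    i = max(range(len(ordered)), key=score.__getitem__)
    chain = []
    while i != -1:
        chain.append(ordered[i])
        i = prev[i]
    return chain[::-1]


seeds = [Seed(0, 0, 4), Seed(5, 20, 3), Seed(6, 6, 5), Seed(12, 13, 4)]
print(best_chain(seeds))
```

    In this example the seed at target position 20 is dropped because it cannot be placed in order with the later seeds. The surviving chain would then serve as the set of anchors between which an exact alignment is completed, as in step (iii).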

  14. [Nucleotide sequence determination of yeast mitochondrial phenylalanine-tRNA].

    PubMed

    Martin, R; Sibler, A P; Schneller, J M; Keith, G; Stahl, A J; Dirheimer, G

    1978-10-01

    The primary structure of mitochondrial tRNAPhe from Saccharomyces cerevisiae, purified by two-dimensional polyacrylamide gel electrophoresis, was determined using standard procedures on in vivo 32P-labeled tRNA, as well as the new 5'-end postlabeling techniques. We propose a cloverleaf model which allows for tertiary interaction between cytosine in position 46 and guanine in position 15 and maximizes base pairing in the psi C stem, thus excluding the uracil in position 50 from base pairing in the psi C stem. Comparison of the primary structure of this tRNA with all other known procaryotic, chloroplastic or cytoplasmic tRNAsPhe sequences does not lead to any conclusion about the endosymbiotic theory of mitochondria evolution. PMID:103657

  15. DNA slip-outs cause RNA polymerase II arrest in vitro: potential implications for genetic instability

    PubMed Central

    Salinas-Rios, Viviana; Belotserkovskii, Boris P.; Hanawalt, Philip C.

    2011-01-01

    The abnormal number of repeats found in triplet repeat diseases arises from ‘repeat instability’, in which the repetitive section of DNA is subject to a change in copy number. Recent studies implicate transcription in a mechanism for repeat instability proposed to involve RNA polymerase II (RNAPII) arrest caused by a CTG slip-out, triggering transcription-coupled repair (TCR), futile cycles of which may lead to repeat expansion or contraction. In the present study, we use defined DNA constructs to directly test whether the structures formed by CAG and CTG repeat slip-outs can cause transcription arrest in vitro. We found that a slip-out of (CAG)20 or (CTG)20 repeats on either strand causes RNAPII arrest in HeLa cell nuclear extracts. Perfect hairpins and loops on either strand also cause RNAPII arrest. These findings are consistent with a transcription-induced repeat instability model in which transcription arrest in mammalian cells may initiate a ‘gratuitous’ TCR event leading to a change in repeat copy number. An understanding of the underlying mechanism of repeat instability could lead to intervention to slow down expansion and delay the onset of many neurodegenerative diseases in which triplet repeat expansion is implicated. PMID:21666257
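
    Repeat tracts such as (CAG)20 or (CTG)20 are easy to locate and size in a sequence. The sketch below only counts uninterrupted repeat units; it says nothing about whether a tract forms a slip-out, hairpin, or loop. The function name, the `min_units` threshold, and the example sequence are assumptions made for illustration.

```python
import re


def repeat_tracts(dna: str, unit: str, min_units: int = 5) -> list[tuple[int, int]]:
    """Return (0-based start, number of units) for each uninterrupted run of `unit`."""
    pattern = re.compile(f"(?:{re.escape(unit.upper())}){{{min_units},}}")
    return [
        (m.start(), (m.end() - m.start()) // len(unit))
        for m in pattern.finditer(dna.upper())
    ]


# Made-up construct: a (CAG)20 tract flanked by two bases on each side.
construct = "AT" + "CAG" * 20 + "GG"
print(repeat_tracts(construct, "CAG"))
```

    Comparing the unit counts returned for the same locus in different samples is the simplest way to express the change in copy number that the abstract calls repeat instability.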

  16. Assessing long-distance RNA sequence connectivity via RNA-templated DNA–DNA ligation

    PubMed Central

    Roy, Christian K; Olson, Sara; Graveley, Brenton R; Zamore, Phillip D; Moore, Melissa J

    2015-01-01

    Many RNAs, including pre-mRNAs and long non-coding RNAs, can be thousands of nucleotides long and undergo complex post-transcriptional processing. Multiple sites of alternative splicing within a single gene exponentially increase the number of possible spliced isoforms, with most human genes currently estimated to express at least ten. To understand the mechanisms underlying these complex isoform expression patterns, methods are needed that faithfully maintain long-range exon connectivity information in individual RNA molecules. In this study, we describe SeqZip, a methodology that uses RNA-templated DNA–DNA ligation to retain and compress connectivity between distant sequences within single RNA molecules. Using this assay, we test proposed coordination between distant sites of alternative exon utilization in mouse Fn1, and we characterize the extraordinary exon diversity of Drosophila melanogaster Dscam1. DOI: http://dx.doi.org/10.7554/eLife.03700.001 PMID:25866926
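
    The exponential growth in isoform number mentioned above follows from simple counting: if splicing choices at different sites were independent, the number of possible isoforms would be the product of the number of options at each site. The sketch below works through that arithmetic. Independence is an assumption of the example, and the Dscam1 exon-cluster sizes (12, 48, 33, and 2 alternatives) are the widely cited figures, which are not stated in this abstract.

```python
from math import prod


def isoform_count(variants_per_site: list[int]) -> int:
    """Upper bound on isoforms if each site's choice is independent of the others."""
    return prod(variants_per_site)


# Ten cassette exons, each either included or skipped: 2**10 combinations.
print(isoform_count([2] * 10))

# Dscam1, using the commonly cited sizes of its four mutually exclusive exon clusters.
print(isoform_count([12, 48, 33, 2]))
```

    Short reads usually cover only one of these sites at a time, so they cannot tell which combinations actually occur in single molecules. That is the gap a connectivity-preserving method such as the one described above is meant to fill.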

  17. Legume genomics: understanding biology through DNA and RNA sequencing

    PubMed Central

    O'Rourke, Jamie A.; Bolon, Yung-Tsi; Bucciarelli, Bruna; Vance, Carroll P.

    2014-01-01

    Background The legume family (Leguminosae) consists of approx. 17 000 species. A few of these species, including, but not limited to, Phaseolus vulgaris, Cicer arietinum and Cajanus cajan, are important dietary components, providing protein for approx. 300 million people worldwide. Additional species, including soybean (Glycine max) and alfalfa (Medicago sativa), are important crops utilized mainly in animal feed. In addition, legumes are important contributors to biological nitrogen, forming symbiotic relationships with rhizobia to fix atmospheric N2 and providing up to 30 % of available nitrogen for the next season of crops. The application of high-throughput genomic technologies including genome sequencing projects, genome re-sequencing (DNA-seq) and transcriptome sequencing (RNA-seq) by the legume research community has provided major insights into genome evolution, genomic architecture and domestication. Scope and Conclusions This review presents an overview of the current state of legume genomics and explores the role that next-generation sequencing technologies play in advancing legume genomics. The adoption of next-generation sequencing and implementation of associated bioinformatic tools has allowed researchers to turn each species of interest into their own model organism. To illustrate the power of next-generation sequencing, an in-depth overview of the transcriptomes of both soybean and white lupin (Lupinus albus) is provided. The soybean transcriptome focuses on analysing seed development in two near-isogenic lines, examining the role of transporters, oil biosynthesis and nitrogen utilization. The white lupin transcriptome analysis examines how phosphate deficiency alters gene expression patterns, inducing the formation of cluster roots. Such studies illustrate the power of next-generation sequencing and bioinformatic analyses in elucidating the gene networks underlying biological processes. PMID:24769535

  18. Long Non-Coding RNA and Alternative Splicing Modulations in Parkinson's Leukocytes Identified by RNA Sequencing

    PubMed Central

    Soreq, Lilach; Guffanti, Alessandro; Salomonis, Nathan; Simchovitz, Alon; Israel, Zvi; Bergman, Hagai; Soreq, Hermona

    2014-01-01

    The continuously prolonged human lifespan is accompanied by an increase in the incidence of neurodegenerative diseases, calling for the development of inexpensive blood-based diagnostics. Analyzing blood cell transcripts by RNA-Seq is a robust means to identify novel biomarkers that is rapidly becoming commonplace. However, there is a lack of tools to discover novel exons, junctions and splicing events and to precisely and sensitively assess differential splicing through RNA-Seq data analysis and across RNA-Seq platforms. Here, we present a new and comprehensive computational workflow for whole-transcriptome RNA-Seq analysis, using an updated version of the software AltAnalyze, to identify both known and novel high-confidence alternative splicing events, and to integrate them with both protein-domains and microRNA binding annotations. We applied the novel workflow on RNA-Seq data from Parkinson's disease (PD) patients' leukocytes pre- and post- Deep Brain Stimulation (DBS) treatment and compared to healthy controls. Disease-mediated changes included decreased usage of alternative promoters and N-termini, 5′-end variations and mutually-exclusive exons. The PD regulated FUS and HNRNP A/B included prion-like domains regulated regions. We also present here a workflow to identify and analyze long non-coding RNAs (lncRNAs) via RNA-Seq data. We identified reduced lncRNA expression and selective PD-induced changes in 13 of over 6,000 detected leukocyte lncRNAs, four of which were inversely altered post-DBS. These included the U1 spliceosomal lncRNA and RP11-462G22.1, each entailing sequence complementarity to numerous microRNAs. Analysis of RNA-Seq from PD and unaffected controls brains revealed over 7,000 brain-expressed lncRNAs, of which 3,495 were co-expressed in the leukocytes including U1, which showed both leukocyte and brain increases. Furthermore, qRT-PCR validations confirmed these co-increases in PD leukocytes and two brain regions, the amygdala and substantia

  19. Use of Unamplified RNA/cDNA–Hybrid Nanopore Sequencing for Rapid Detection and Characterization of RNA Viruses

    PubMed Central

    Kilianski, Andy; Roth, Pierce A.; Liem, Alvin T.; Hill, Jessica M.; Willis, Kristen L.; Rossmaier, Rebecca D.; Marinich, Andrew V.; Maughan, Michele N.; Karavis, Mark A.; Kuhn, Jens H.; Honko, Anna N.

    2016-01-01

    Nanopore sequencing, a novel genomics technology, has potential applications for routine biosurveillance, clinical diagnosis, and outbreak investigation of virus infections. Using rapid sequencing of unamplified RNA/cDNA hybrids, we identified Venezuelan equine encephalitis virus and Ebola virus in 3 hours from sample receipt to data acquisition, demonstrating a fieldable technique for RNA virus characterization. PMID:27191483

  20. Use of Unamplified RNA/cDNA-Hybrid Nanopore Sequencing for Rapid Detection and Characterization of RNA Viruses.

    PubMed

    Kilianski, Andy; Roth, Pierce A; Liem, Alvin T; Hill, Jessica M; Willis, Kristen L; Rossmaier, Rebecca D; Marinich, Andrew V; Maughan, Michele N; Karavis, Mark A; Kuhn, Jens H; Honko, Anna N; Rosenzweig, C Nicole

    2016-08-01

    Nanopore sequencing, a novel genomics technology, has potential applications for routine biosurveillance, clinical diagnosis, and outbreak investigation of virus infections. Using rapid sequencing of unamplified RNA/cDNA hybrids, we identified Venezuelan equine encephalitis virus and Ebola virus in 3 hours from sample receipt to data acquisition, demonstrating a fieldable technique for RNA virus characterization. PMID:27191483

  1. PlantMirnaT: miRNA and mRNA integrated analysis fully utilizing characteristics of plant sequencing data.

    PubMed

    Rhee, S; Chae, H; Kim, S

    2015-07-15

    miRNA is known to regulate up to several hundred coding genes, thus the integrated analysis of miRNA and mRNA expression data is an important problem. Unfortunately, the integrated analysis is challenging since it needs to consider expression data of two different types, miRNA and mRNA, and the target relationship between miRNA and mRNA is not clear, especially when microarray data is used. Fortunately, due to the low sequencing cost, small RNA and RNA sequencing are routinely processed and we may be able to infer regulation relationships between miRNAs and mRNAs more accurately by using sequencing data. However, no method has been developed specifically for sequencing data. Thus we developed PlantMirnaT, a new miRNA-mRNA integrated analysis system. To fully leverage the power of sequencing data, three major features are developed and implemented in PlantMirnaT. First, we implemented a plant-specific short read mapping tool based on recent discoveries on the miRNA target relationship in plants. Second, we designed and implemented an algorithm considering miRNA targets in the full intragenic region, not just the 3' UTR. Lastly but most importantly, our algorithm is designed to consider the quantity of miRNA expression and its distribution on target mRNAs. The new algorithm was used to characterize rice under drought condition using our proprietary data. Our algorithm successfully discovered two miRNAs, miRNA1425-5p and miRNA 398b, that are involved in suppression of the glucose pathway in a naturally drought resistant rice, Vandana. The system can be downloaded at https://sites.google.com/site/biohealthinformaticslab/resources. PMID:25863133

  2. Globin mRNA contains a sequence complementary to double-stranded region of nuclear pre-mRNA.

    PubMed Central

    Ryskov, A P; Tokarskaya, O V; Georgiev, G P; Coutelle, C; Thiele, B

    1976-01-01

    Melted ds RNA isolated from rabbit bone marrow pre-mRNA was hybridized with excess of globin mRNA which was prepared from rabbit reticulocytes. 7-9% of ds sequences became RNAase-stable and about 30% of the sequences could be bound to poly(U)-Sepharose through poly (A) of mRNA. The size of RNAase-stable hybrid is about 30 nucleotides, that is one fourth of the length of one strand of the ds RNA. PMID:986644

  3. A method for clustering of miRNA sequences using fragmented programming.

    PubMed

    Ivashchenko, Anatoly; Pyrkova, Anna; Niyazova, Raigul

    2016-01-01

    Clustering of miRNA sequences is an important problem in molecular genetics and associated cellular biology. Thousands of such sequences are known today through advancement in sophisticated molecular tools, sequencing techniques, computational resources and rule based mathematical models. Analysis of such large-scale miRNA sequences for inferring patterns towards deducing cellular function is a great challenge in modern molecular biology. Therefore, it is of interest to develop mathematical models specific for miRNA sequences. The process is to group (cluster) such miRNA sequences using well-defined known features. We describe a method for clustering of miRNA sequences using fragmented programming. Subsequently, we illustrated the utility of the model using a dendrogram (a tree diagram) for publicly known A. thaliana miRNA nucleotide sequences towards the inference of observed conserved patterns. PMID:27212839

  4. Improved definition of the mouse transcriptome via targeted RNA sequencing

    PubMed Central

    Clark, Michael B.; Mercer, Tim R.; Crawford, Joanna; Malquori, Lorenzo; Notredame, Cedric; Dinger, Marcel E.; Mattick, John S.

    2016-01-01

    Targeted RNA sequencing (CaptureSeq) uses oligonucleotide probes to capture RNAs for sequencing, providing enriched read coverage, accurate measurement of gene expression, and quantitative expression data. We applied CaptureSeq to refine transcript annotations in the current murine GRCm38 assembly. More than 23,000 regions corresponding to putative or annotated long noncoding RNAs (lncRNAs) and 154,281 known splicing junction sites were selected for targeted sequencing across five mouse tissues and three brain subregions. The results illustrate that the mouse transcriptome is considerably more complex than previously thought. We assemble more complete transcript isoforms than GENCODE, expand transcript boundaries, and connect interspersed islands of mapped reads. We describe a novel filtering pipeline that identifies previously unannotated but high-quality transcript isoforms. In this set, 911 GENCODE neighboring genes are condensed into 400 expanded gene models. Additionally, 594 GENCODE lncRNAs acquire an open reading frame (ORF) when their structure is extended with CaptureSeq. Finally, we validate our observations using current FANTOM and Mouse ENCODE resources. PMID:27197243

  5. Sequence of the 16S ribosomal RNA from Halobacterium volcanii, an archaebacterium

    NASA Technical Reports Server (NTRS)

    Gupta, R.; Lanter, J. M.; Woese, C. R.

    1983-01-01

    The sequence of the 16S ribosomal RNA (rRNA) from the archaebacterium Halobacterium volcanii has been determined by DNA sequencing methods. The archaebacterial rRNA is similar to its eubacterial counterpart in secondary structure. Although it is closer in sequence to the eubacterial 16S rRNA than to the eukaryotic 16S-like rRNA, the H. volcanii sequence also shows certain points of specific similarity to its eukaryotic counterpart. Since the H. volcanii sequence is closer to both the eubacterial and the eukaryotic sequences than these two are to one another, it follows that the archaebacterial sequence resembles their common ancestral sequence more closely than does either of the other two versions.

  6. High-throughput illumina strand-specific RNA sequencing library preparation

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Conventional Illumina RNA-Seq does not have the resolution to decode the complex eukaryote transcriptome due to the lack of RNA polarity information. Strand-specific RNA sequencing (ssRNA-Seq) can overcome these limitations and as such is better suited for genome annotation, de novo transcriptome as...

  7. Comparative RNA sequencing reveals substantial genetic variation in endangered primates

    PubMed Central

    Perry, George H.; Melsted, Páll; Marioni, John C.; Wang, Ying; Bainer, Russell; Pickrell, Joseph K.; Michelini, Katelyn; Zehr, Sarah; Yoder, Anne D.; Stephens, Matthew; Pritchard, Jonathan K.; Gilad, Yoav

    2012-01-01

    Comparative genomic studies in primates have yielded important insights into the evolutionary forces that shape genetic diversity and revealed the likely genetic basis for certain species-specific adaptations. To date, however, these studies have focused on only a small number of species. For the majority of nonhuman primates, including some of the most critically endangered, genome-level data are not yet available. In this study, we have taken the first steps toward addressing this gap by sequencing RNA from the livers of multiple individuals from each of 16 mammalian species, including humans and 11 nonhuman primates. Of the nonhuman primate species, five are lemurs and two are lorisoids, for which little or no genomic data were previously available. To analyze these data, we developed a method for de novo assembly and alignment of orthologous gene sequences across species. We assembled an average of 5721 gene sequences per species and characterized diversity and divergence of both gene sequences and gene expression levels. We identified patterns of variation that are consistent with the action of positive or directional selection, including an 18-fold enrichment of peroxisomal genes among genes whose regulation likely evolved under directional selection in the ancestral primate lineage. Importantly, we found no relationship between genetic diversity and endangered status, with the two most endangered species in our study, the black and white ruffed lemur and the Coquerel's sifaka, having the highest genetic diversity among all primates. Our observations imply that many endangered lemur populations still harbor considerable genetic variation. Timely efforts to conserve these species alongside their habitats have, therefore, strong potential to achieve long-term success. PMID:22207615

  8. Research Techniques Made Simple: Methodology and Clinical Applications of RNA Sequencing.

    PubMed

    Whitley, Sarah K; Horne, William T; Kolls, Jay K

    2016-08-01

    RNA sequencing is a method of transcriptome profiling that utilizes next-generation sequencing technology. It offers several distinct advantages over hybridization-based approaches, most notably superior sensitivity and the capacity for de novo transcript discovery. This article describes RNA sequencing methodology, summarizes important technological advances and challenges, and discusses applications for this technique in the field of dermatology. PMID:27450500

  9. Modulations of RNA sequences by cytokinin in pumpkin cotyledons

    SciTech Connect

    Chang, C.; Ertl, J.; Chen, C.

    1987-04-01

    Polyadenylated mRNAs from excised pumpkin cotyledons treated with or without 10⁻⁴ M benzyladenine (BA) for various time periods in suspension culture were assayed by in vitro translation in the presence of [³⁵S]methionine. The radioactive polypeptides were analyzed by one- and two-dimensional polyacrylamide gel electrophoresis. Specific sequences of mRNAs were enhanced, reduced, induced, or suppressed by the hormone within 60 min of the application of BA to the cotyledons. Four independent cDNA clones of cytokinin-modulated mRNAs have been selected and characterized. RNA blot hybridization using the four cDNA probes also indicates that the levels of specific mRNAs are modulated upward or downward by the hormone.

  10. The Microglial Sensome Revealed by Direct RNA Sequencing

    PubMed Central

    Hickman, Suzanne E.; Kingery, Nathan D.; Ohsumi, Toshiro; Borowsky, Mark; Wang, Li-chong; Means, Terry K.; Khoury, Joseph El

    2013-01-01

    Microglia, the principal neuroimmune sentinels of the brain, continuously sense changes in their environment and respond to invading pathogens, toxins and cellular debris. Microglia exhibit plasticity and can assume neurotoxic or neuroprotective priming states that determine their responses to danger. We used direct RNA sequencing, without amplification or cDNA synthesis, to determine the quantitative transcriptomes of microglia of healthy adult and aged mice. We validated our findings by fluorescent dual in-situ hybridization, unbiased proteomic analysis and quantitative PCR. We report here that microglia have a distinct transcriptomic signature and express a unique cluster of transcripts encoding proteins for sensing endogenous ligands and microbes that we term the “sensome”. With aging, sensome transcripts for endogenous ligand recognition are downregulated, whereas those involved in microbe recognition and host defense are upregulated. In addition, aging is associated with an overall increase in expression of microglial genes involved in neuroprotection. PMID:24162652

  11. Copying of RNA Sequences without Pre-Activation

    PubMed Central

    Jauker, Mario; Griesser, Helmut; Richert, Clemens

    2015-01-01

    Template-directed incorporation of nucleotides at the terminus of a growing complementary strand is the basis of replication. For RNA, this process can occur in the absence of enzymes, if the ribonucleotides are first converted to an active species with a leaving group. Thus far, the activation required a separate chemical step, complicating prebiotically plausible scenarios. Here we show that a combination of a carbodiimide and an organocatalyst induces near-quantitative incorporation of any of the four ribonucleotides. Upon in situ activation, adenosine monophosphate was found to also form oligomers in aqueous solution. So, both de novo strand formation and sequence-specific copying can occur without an artificial synthetic step. PMID:26435291

  12. FASTR: A novel data format for concomitant representation of RNA sequence and secondary structure information.

    PubMed

    Bose, Tungadri; Dutta, Anirban; Mh, Mohammed; Gandhi, Hemang; Mande, Sharmila S

    2015-09-01

    Given the importance of RNA secondary structures in defining their biological role, it would be convenient for researchers seeking RNA data if both sequence and structural information pertaining to RNA molecules were made available together. Current nucleotide data repositories archive only RNA sequence data. Furthermore, storage formats which can frugally represent RNA sequence as well as structure data in a single file are currently unavailable. This article proposes a novel storage format, 'FASTR', for concomitant representation of RNA sequence and structure. The storage efficiency of the proposed FASTR format has been evaluated using RNA data from various microorganisms. Results indicate that the size of FASTR-formatted files (containing both RNA sequence as well as structure information) is equivalent to that of FASTA-format files, which contain only RNA sequence information. RNA secondary structure is typically represented using a combination of a string of nucleotide characters along with the corresponding dot-bracket notation indicating structural attributes. 'FASTR', the novel storage format proposed in the present study, enables a frugal representation of both RNA sequence and structural information in the form of a single string. In spite of having a relatively smaller storage footprint, the resultant 'fastr' string(s) retain all sequence as well as secondary structural information that could be stored using a dot-bracket notation. An implementation of the 'FASTR' methodology is available for download at http://metagenomics.atc.tcs.com/compression/fastr. PMID:26333403
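
    The dot-bracket notation mentioned in this abstract can be illustrated with a short sketch. The code below is not the FASTR encoding, which the abstract does not specify; it only shows, assuming a pseudoknot-free structure written with '(', ')' and '.', how a structure string is resolved into base-pair indices alongside its sequence. The function name and the 9-nt hairpin are hypothetical.

```python
def parse_dot_bracket(sequence, structure):
    """Return sorted 0-based (i, j) index pairs for each base pair.

    Assumes plain dot-bracket notation without pseudoknots:
    '(' opens a pair, ')' closes the most recent open pair, '.' is unpaired.
    """
    if len(sequence) != len(structure):
        raise ValueError("sequence and structure must have the same length")
    open_positions = []  # stack of indices of unmatched '('
    pairs = []
    for index, symbol in enumerate(structure):
        if symbol == "(":
            open_positions.append(index)
        elif symbol == ")":
            if not open_positions:
                raise ValueError(f"unmatched ')' at position {index}")
            pairs.append((open_positions.pop(), index))
        elif symbol != ".":
            raise ValueError(f"unexpected symbol {symbol!r} at position {index}")
    if open_positions:
        raise ValueError(f"unmatched '(' at position {open_positions[-1]}")
    return sorted(pairs)


# Hypothetical hairpin used only for illustration (not from the paper).
example_sequence = "GGGAAACCC"
example_structure = "(((...)))"
for i, j in parse_dot_bracket(example_sequence, example_structure):
    # Report pairs with 1-based positions, as is usual for sequences.
    print(f"{example_sequence[i]}{i + 1}-{example_sequence[j]}{j + 1}")
```

    Keeping the sequence and structure as two equal-length strings is the conventional two-line representation that, according to the abstract, FASTR condenses into a single string.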

  13. New wheat microRNA using whole-genome sequence.

    PubMed

    Kurtoglu, Kuaybe Yucebilgili; Kantar, Melda; Budak, Hikmet

    2014-06-01

    MicroRNAs are post-transcriptional regulators of gene expression, taking roles in a variety of fundamental biological processes. Hence, their identification, annotation and characterization are of great significance, especially in bread wheat, one of the main food sources for humans. The recent availability of 5× coverage Triticum aestivum L. whole-genome sequence provided us with the opportunity to perform a systematic prediction of a complete catalogue of wheat microRNAs. Using an in silico homology-based approach, stem-loop coding regions were derived from two assemblies, constructed from wheat 454 reads. To avoid the presence of pseudo-microRNAs in the final data set, transposable element related stem-loops were eliminated by repeat analysis. Overall, 52 putative wheat microRNAs were predicted, including seven, which have not been previously published. Moreover, with distinct analysis of the two different assemblies, both variety and representation of putative microRNA-coding stem-loops were found to be predominant in the intergenic regions. By searching available expressed sequences and small RNA library databases, expression evidence for 39 (out of 52) putative wheat microRNAs was provided. Expression of three of the predicted microRNAs (miR166, miR396 and miR528) was also comparatively quantified with real-time quantitative reverse transcription PCR. This is the first report on in silico prediction of a whole repertoire of bread wheat microRNAs, supported by the wet-lab validation. PMID:24395439

  14. RNA Sequencing Identifies Novel Translational Biomarkers of Kidney Fibrosis.

    PubMed

    Craciun, Florin L; Bijol, Vanesa; Ajay, Amrendra K; Rao, Poornima; Kumar, Ramya K; Hutchinson, John; Hofmann, Oliver; Joshi, Nikita; Luyendyk, James P; Kusebauch, Ulrike; Moss, Christopher L; Srivastava, Anand; Himmelfarb, Jonathan; Waikar, Sushrut S; Moritz, Robert L; Vaidya, Vishal S

    2016-06-01

    CKD is the gradual, asymptomatic loss of kidney function, but current tests only identify CKD when significant loss has already happened. Several potential biomarkers of CKD have been reported, but none have been approved for preclinical or clinical use. Using RNA sequencing in a mouse model of folic acid-induced nephropathy, we identified ten genes that track kidney fibrosis development, the common pathologic finding in patients with CKD. The gene expression of all ten candidates was confirmed to be significantly higher (approximately ten- to 150-fold) in three well established, mechanistically distinct mouse models of kidney fibrosis than in models of nonfibrotic AKI. Protein expression of these genes was also high in the folic acid model and in patients with biopsy-proven kidney fibrosis. mRNA expression of the ten genes increased with increasing severity of kidney fibrosis, decreased in response to therapeutic intervention, and increased only modestly (approximately two- to five-fold) with liver fibrosis in mice and humans, demonstrating specificity for kidney fibrosis. Using targeted selected reaction monitoring mass spectrometry, we detected three of the ten candidates in human urine: cadherin 11 (CDH11), macrophage mannose receptor C1 (MRC1), and phospholipid transfer protein (PLTP). Furthermore, urinary levels of each of these three proteins distinguished patients with CKD (n=53) from healthy individuals (n=53; P<0.05). In summary, we report the identification of urinary CDH11, MRC1, and PLTP as novel noninvasive biomarkers of CKD. PMID:26449608

  15. Targeted RNA Sequencing Assay to Characterize Gene Expression and Genomic Alterations.

    PubMed

    Martin, Dorrelyn P; Miya, Jharna; Reeser, Julie W; Roychowdhury, Sameek

    2016-01-01

    RNA sequencing (RNAseq) is a versatile method that can be utilized to detect and characterize gene expression, mutations, gene fusions, and noncoding RNAs. Standard RNAseq requires 30 - 100 million sequencing reads and can include multiple RNA products such as mRNA and noncoding RNAs. We demonstrate how targeted RNAseq (capture) permits a focused study on selected RNA products using a desktop sequencer. RNAseq capture can characterize unannotated, low, or transiently expressed transcripts that may otherwise be missed using traditional RNAseq methods. Here we describe the extraction of RNA from cell lines, ribosomal RNA depletion, cDNA synthesis, preparation of barcoded libraries, hybridization and capture of targeted transcripts and multiplex sequencing on a desktop sequencer. We also outline the computational analysis pipeline, which includes quality control assessment, alignment, fusion detection, gene expression quantification and identification of single nucleotide variants. This assay allows for targeted transcript sequencing to characterize gene expression, gene fusions, and mutations. PMID:27585245

  16. Complete sequence and gene organization of the Nosema spodopterae rRNA gene.

    PubMed

    Tsai, Shu-Jen; Huang, Wei-Fone; Wang, Chung-Hsiung

    2005-01-01

    By sequencing the entire ribosomal RNA (rRNA) gene of Nosema spodopterae, we show here that its gene organization follows a pattern similar to the Nosema type species, Nosema bombycis, i.e. 5'-large subunit rRNA (2,497 bp)-internal transcribed spacer (185 bp)-small subunit rRNA (1,232 bp)-intergenic spacer (277 bp)-5S rRNA (114 bp)-3'. Gene sequences and the secondary structures of large subunit rRNA, small subunit rRNA, and 5S rRNA are compared with the known corresponding sequences and structures of closely related microsporidia. The results suggest that the Nosema genus may be heterogeneous and that the rRNA gene organization may be a useful characteristic for determining which species are closely related to the type species. PMID:15702980
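
    As a worked illustration of the gene organization listed in this abstract, the sketch below lays the five reported elements end to end and derives coordinates for each. It assumes the elements are strictly contiguous with no extra flanking sequence; the coordinates and the summed length are derived here and are not stated in the abstract.

```python
# Element lengths (bp) as reported in the abstract, in 5'->3' order.
ELEMENTS = [
    ("large subunit rRNA", 2497),
    ("internal transcribed spacer", 185),
    ("small subunit rRNA", 1232),
    ("intergenic spacer", 277),
    ("5S rRNA", 114),
]


def layout(elements):
    """Assign 1-based inclusive coordinates to consecutive, contiguous elements."""
    coordinates = []
    start = 1
    for name, length in elements:
        end = start + length - 1
        coordinates.append((name, start, end))
        start = end + 1
    return coordinates


coords = layout(ELEMENTS)
for name, start, end in coords:
    print(f"{name:<28} {start:>5}-{end:<5}")

# Under the contiguity assumption, the last end coordinate is the total length.
total_length = coords[-1][2]
print("total:", total_length, "bp")
```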

  17. International interlaboratory study comparing single organism 16S rRNA gene sequencing data: Beyond consensus sequence comparisons.

    PubMed

    Olson, Nathan D; Lund, Steven P; Zook, Justin M; Rojas-Cornejo, Fabiola; Beck, Brian; Foy, Carole; Huggett, Jim; Whale, Alexandra S; Sui, Zhiwei; Baoutina, Anna; Dobeson, Michael; Partis, Lina; Morrow, Jayne B

    2015-03-01

    This study presents the results from an interlaboratory sequencing study for which we developed a novel high-resolution method for comparing data from different sequencing platforms for a multi-copy, paralogous gene. The combination of PCR amplification and 16S ribosomal RNA gene (16S rRNA) sequencing has revolutionized bacteriology by enabling rapid identification, frequently without the need for culture. To assess variability between laboratories in sequencing 16S rRNA, six laboratories sequenced the gene encoding the 16S rRNA from Escherichia coli O157:H7 strain EDL933 and Listeria monocytogenes serovar 4b strain NCTC11994. Participants performed sequencing methods and protocols available in their laboratories: Sanger sequencing, Roche 454 pyrosequencing®, or Ion Torrent PGM®. The sequencing data were evaluated on three levels: (1) identity of biologically conserved positions, (2) ratio of 16S rRNA gene copies featuring identified variants, and (3) the collection of variant combinations in a set of 16S rRNA gene copies. The same set of biologically conserved positions was identified for each sequencing method. Analytical methods using Bayesian and maximum likelihood statistics were developed to estimate variant copy ratios, which describe the ratio of nucleotides at each identified biologically variable position, as well as the likely set of variant combinations present in 16S rRNA gene copies. Our results indicate that estimated variant copy ratios at biologically variable positions were only reproducible for high throughput sequencing methods. Furthermore, the likely variant combination set was only reproducible with increased sequencing depth and longer read lengths. We also demonstrate novel methods for evaluating variable positions when comparing multi-copy gene sequence data from multiple laboratories generated using multiple sequencing technologies. PMID:27077030

  18. Sequence analysis of a cluster of twenty-one tRNA genes in Bacillus subtilis.

    PubMed Central

    Green, C J; Vold, B S

    1983-01-01

    The DNA sequence of a cluster of twenty-one tRNA genes distal to a rRNA gene set in B. subtilis was determined. None of the tRNA genes are repeated in the sequence. The only classes of tRNAs that are not represented are those for cysteine, glutamine, tryptophan, and tyrosine. Three of the tRNA genes in this cluster do not have the 3'-CCA sequence encoded in the gene. There is no RNA polymerase terminator sequence in the region between the 5S gene and the first tRNA gene or within the tRNA gene cluster. A terminator sequence was found directly after the last tRNA gene. This rRNA and tRNA gene cluster probably represents one transcriptional unit. However, there may be an RNA polymerase promoter site within this sequence, which raises some interesting questions concerning the regulation of transcription for these tRNA genes. PMID:6310512

  19. Sequences far downstream from the classical tRNA promoter elements bind RNA polymerase III transcription factors.

    PubMed Central

    Young, L S; Rivier, D H; Sprague, K U

    1991-01-01

    We have examined the interaction of transcription factors TFIIIC and TFIIID with a silkworm alanine tRNA gene. Previous functional analysis showed that the promoter for this gene is unusually large compared with the classical tRNA promoter elements (the A and B boxes) and includes sequences downstream from the transcription termination site. The goal of the experiments reported here was to determine which sequences within the full promoter make stable contacts with transcription factors. We show that when TFIIIC and TFIIID are combined, a complex is formed with the tRNA(Ala)C gene. Neither factor alone can form this complex. DNase I digestion of gene-factor complexes reveals that most of the tRNA(Ala)C promoter is in contact with factors. The protected region extends from -1 to at least +136 and includes both the A and B boxes and the previously identified downstream promoter sequences. Analysis of mutant promoters shows that sequence-specific contacts throughout the protected region are required for binding. The role of 3'-flanking sequences in transcription factor binding explains the contribution of these sequences to the tRNA(Ala)C promoter. We discuss the possibility that such sequences affect promoter strength in other tRNA genes. PMID:1996100

  20. The contribution of co-transcriptional RNA:DNA hybrid structures to DNA damage and genome instability

    PubMed Central

    Hamperl, Stephan; Cimprich, Karlene A.

    2014-01-01

    Accurate DNA replication and DNA repair are crucial for the maintenance of genome stability, and it is generally accepted that failure of these processes is a major source of DNA damage in cells. Intriguingly, recent evidence suggests that DNA damage is more likely to occur at genomic loci with high transcriptional activity. Furthermore, loss of certain RNA processing factors in eukaryotic cells is associated with increased formation of co-transcriptional RNA:DNA hybrid structures known as R-loops, resulting in double-strand breaks (DSBs) and DNA damage. However, the molecular mechanisms by which R-loop structures ultimately lead to DNA breaks and genome instability are not well understood. In this review, we summarize the current knowledge about the formation, recognition and processing of RNA:DNA hybrids, and discuss possible mechanisms by which these structures contribute to DNA damage and genome instability in the cell. PMID:24746923

  1. Optimal terminal sequences for continuous or serial isothermal amplification of dsRNA with norovirus RNA replicase.

    PubMed

    Arai, Hidenao; Nishigaki, Koichi; Nemoto, Naoto; Suzuki, Miho; Husimi, Yuzuru

    2014-01-01

    The norovirus RNA replicase (NV3Dpol, 56 kDa, single chain monomeric protein) can amplify double-stranded (ds) RNA isothermally. It will play an alternative role in the in vitro evolution against traditional Qβ RNA replicase, which cannot amplify dsRNA and consists of four subunits, three of which are borrowed from host E. coli. In order to identify the optimal 3'-terminal sequence of the RNA template for NV3Dpol, an in vitro selection using the serial transfer was performed for a random library having the 3'-terminal sequence of ---UUUUUUNNNN-3'. The population landscape on the 4-dimensional sequence space of the 17th round of transfer gave a main peak around ---CAAC-3'. In the preceding studies on the batch amplification reaction starting from a single-stranded RNA, a template with 3'-terminal C-stretch was amplified effectively. It was confirmed that in the batch amplification the ---CCC-3' was much more effective than the ---CAAC-3', but in the serial transfer condition in which the ---CAAC-3' was sustained stably, the ---CCC-3' was washed out. Based on these results we proposed the existence of the "shuttle mode" replication of dsRNA. We also proposed the optimal terminal sequences of RNA for in vitro evolution with NV3Dpol. PMID:27493494
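
    To make the "4-dimensional sequence space" of the NNNN library concrete, the sketch below enumerates every possible four-nucleotide terminus and groups the sequences by Hamming distance from the reported peak, CAAC. It describes only the combinatorics of the library; it contains no selection data, and the variable names are illustrative.

```python
from collections import Counter
from itertools import product

BASES = "ACGU"


def hamming(a, b):
    """Number of positions at which two equal-length sequences differ."""
    return sum(x != y for x, y in zip(a, b))


# All possible NNNN termini: 4 positions, 4 bases each.
library = ["".join(p) for p in product(BASES, repeat=4)]

peak = "CAAC"  # main peak reported in the abstract
by_distance = Counter(hamming(seq, peak) for seq in library)
neighbours = [seq for seq in library if hamming(seq, peak) == 1]

print("library size:", len(library))
for distance in sorted(by_distance):
    print(f"distance {distance}: {by_distance[distance]} sequences")
print("one-step neighbours of the peak:", " ".join(neighbours))
```

    A population "landscape" in this space would assign an observed frequency to each of these sequences; the one-step neighbours are the variants reachable from the peak by a single substitution.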

  2. Optimal terminal sequences for continuous or serial isothermal amplification of dsRNA with norovirus RNA replicase

    PubMed Central

    Arai, Hidenao; Nishigaki, Koichi; Nemoto, Naoto; Suzuki, Miho; Husimi, Yuzuru

    2014-01-01

    The norovirus RNA replicase (NV3Dpol, 56 kDa, single chain monomeric protein) can amplify double-stranded (ds) RNA isothermally. It will play an alternative role in the in vitro evolution against traditional Qβ RNA replicase, which cannot amplify dsRNA and consists of four subunits, three of which are borrowed from host E. coli. In order to identify the optimal 3′-terminal sequence of the RNA template for NV3Dpol, an in vitro selection using the serial transfer was performed for a random library having the 3′-terminal sequence of ---UUUUUUNNNN-3′. The population landscape on the 4-dimensional sequence space of the 17th round of transfer gave a main peak around ---CAAC-3′. In the preceding studies on the batch amplification reaction starting from a single-stranded RNA, a template with 3′-terminal C-stretch was amplified effectively. It was confirmed that in the batch amplification the ---CCC-3′ was much more effective than the ---CAAC-3′, but in the serial transfer condition in which the ---CAAC-3′ was sustained stably, the ---CCC-3′ was washed out. Based on these results we proposed the existence of the “shuttle mode” replication of dsRNA. We also proposed the optimal terminal sequences of RNA for in vitro evolution with NV3Dpol. PMID:27493494

  3. Equally parsimonious pathways through an RNA sequence space are not equally likely

    NASA Technical Reports Server (NTRS)

    Lee, Y. H.; DSouza, L. M.; Fox, G. E.

    1997-01-01

    An experimental system for determining the potential ability of sequences resembling 5S ribosomal RNA (rRNA) to perform as functional 5S rRNAs in vivo in the Escherichia coli cellular environment was devised previously. Presumably, the only 5S rRNA sequences that would have been fixed by ancestral populations are ones that were functionally valid, and hence the actual historical paths taken through RNA sequence space during 5S rRNA evolution would have most likely utilized valid sequences. Herein, we examine the potential validity of all sequence intermediates along alternative equally parsimonious trajectories through RNA sequence space which connect two pairs of sequences that had previously been shown to behave as valid 5S rRNAs in E. coli. The first trajectory requires a total of four changes. The 14 sequence intermediates provide 24 apparently equally parsimonious paths by which the transition could occur. The second trajectory involves three changes, six intermediate sequences, and six potentially equally parsimonious paths. In total, only eight of the 20 sequence intermediates were found to be clearly invalid. As a consequence of the position of these invalid intermediates in the sequence space, seven of the 30 possible paths consisted of exclusively valid sequences. In several cases, the apparent validity/invalidity of the intermediate sequences could not be anticipated on the basis of current knowledge of the 5S rRNA structure. This suggests that the interdependencies in RNA sequence space may be more complex than currently appreciated. If ancestral sequences predicted by parsimony are to be regarded as actual historical sequences, then the present results would suggest that they should also satisfy a validity requirement and that, in at least limited cases, this conjecture can be tested experimentally.
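
    The path and intermediate counts quoted in this abstract follow from simple combinatorics, which the sketch below reproduces. It assumes each change is an independent single-site substitution, so a path is an ordering of the changes and an intermediate is the set of changes applied so far. The `valid_paths` helper and the example invalid set are hypothetical; the abstract does not say which intermediates were invalid.

```python
from itertools import permutations


def enumerate_paths(n_changes):
    """List every equally parsimonious path between two sequences.

    Each path is a tuple of intermediates; an intermediate is the frozenset
    of site indices already changed. Start and end sequences are excluded.
    """
    paths = []
    for order in permutations(range(n_changes)):
        state = frozenset()
        path = []
        for site in order[:-1]:  # the final change reaches the endpoint
            state = state | {site}
            path.append(state)
        paths.append(tuple(path))
    return paths


def count_intermediates(paths):
    """Number of distinct intermediate sequences across all paths."""
    return len({state for path in paths for state in path})


def valid_paths(paths, invalid):
    """Keep only paths that avoid every intermediate in `invalid`."""
    return [path for path in paths if not any(s in invalid for s in path)]


for n in (4, 3):
    paths = enumerate_paths(n)
    print(f"{n} changes: {len(paths)} paths, "
          f"{count_intermediates(paths)} intermediates")

# Hypothetical example: suppose the intermediate with only site 0 changed
# is invalid; every path that begins with that change is then excluded.
example = valid_paths(enumerate_paths(3), {frozenset({0})})
print("paths avoiding the hypothetical invalid intermediate:", len(example))
```

    With four changes this gives 4! = 24 paths through 2^4 - 2 = 14 intermediates, and with three changes 3! = 6 paths through 2^3 - 2 = 6 intermediates, matching the totals of 30 paths and 20 intermediates in the abstract.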

  4. High-throughput-sequencing-based identification of a grapevine fanleaf virus satellite RNA in Vitis vinifera.

    PubMed

    Chiumenti, Michela; Mohorianu, Irina; Roseti, Vincenzo; Saldarelli, Pasquale; Dalmay, Tamas; Minafra, Angelantonio

    2016-05-01

    A new satellite RNA (satRNA) of grapevine fanleaf virus (GFLV) was identified by high-throughput sequencing of high-definition (HD) adapter libraries from grapevine plants of the cultivar Panse precoce (PPE) affected by enation disease. The complete nucleotide sequence was obtained by automatic sequencing using primers designed based on next-generation sequencing (NGS) data. The full-length sequence, named satGFLV-PPE, consisted of 1119 nucleotides with a single open reading frame from position 15 to 1034. This satRNA showed maximum nucleotide sequence identity of 87 % to satArMV-86 and satGFLV-R6. Symptomatic grapevines were surveyed for the presence of the satRNA, and no correlation was found between detection of the satRNA and enation symptom expression. PMID:26873812
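
    The coordinates given in this abstract allow a quick arithmetic check of the open reading frame. The sketch below assumes the positions are 1-based and inclusive; whether the reported end position includes the stop codon is not stated, so the codon count is given without converting it to a protein length.

```python
GENOME_LENGTH = 1119            # nucleotides, from the abstract
ORF_START, ORF_END = 15, 1034   # assumed 1-based, inclusive

orf_length = ORF_END - ORF_START + 1
codons, remainder = divmod(orf_length, 3)
upstream = ORF_START - 1             # nucleotides before the ORF
downstream = GENOME_LENGTH - ORF_END # nucleotides after the ORF

print("ORF length:", orf_length, "nt")
print("codons:", codons, "remainder:", remainder)
print("upstream:", upstream, "nt; downstream:", downstream, "nt")
```

    A remainder of zero confirms that the stated interval is a whole number of codons, as expected for an open reading frame.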

  5. Nucleotide sequence of 5S ribosomal RNA from Aspergillus nidulans and Neurospora crassa.

    PubMed Central

    Piechulla, B; Hahn, U; McLaughlin, L W; Küntzel, H

    1981-01-01

    The nucleotide sequences of 5S rRNA molecules isolated from the cytosol and the mitochondria of the ascomycetes A. nidulans and N. crassa were determined by partial chemical cleavage of 3'-terminally labelled RNA. The sequence identity of the cytosolic and mitochondrial RNA preparations confirms the absence of mitochondrion-specific 5S rRNA in these fungi. The sequences of the two organisms differ in 35 positions, and each sequence differs from yeast 5S rRNA in 44 positions. Both molecules contain the sequence GCUC in place of GAAC or GAUY found in all other 5S rRNAs, indicating that this region is not universally involved in base-pairing to the invariant GTpsiC sequence of tRNAs. PMID:6453331

  6. Detection of mRNA sequences in nuclear 30S ribonucleoprotein subcomplexes.

    PubMed Central

    Kinniburgh, A J; Martin, T E

    1976-01-01

    RNA from nuclear 30S ribonucleoprotein (RNP) complexes of mouse ascites cells has been shown to contain sequences homologous to poly(A)+ mRNA by its ability to hybridize with complementary DNA prepared from a poly(A)+ mRNA template. Analysis of the hybridization kinetics of poly(A)+ mRNA with its own complementary DNA revealed several abundance classes. The total complexity of poly(A)+ mRNA from ascites cells was estimated to be approximately 30,000 sequences of average molecular weight 6 × 10^5. When the hybridization reaction of 30S RNP-RNA with mRNA-specific cDNA was compared to the homologous reaction, the majority, and most probably all, of the poly(A)+ mRNA sequences were found to be present in the RNA. The kinetics of hybridization suggest that 10-15% of the RNA in this RNP complex is homologous to poly(A)+ mRNA. The 30S RNP subcomplexes therefore contain nuclear poly(A)+ mRNA sequences as well as the bulk of heterogeneous RNA. PMID:1066686

  7. 5S rRNA sequences from four marine invertebrates and implications for base pairing models of metazoan sequences.

    PubMed

    Walker, W F; Doolittle, W F

    1983-08-11

    The nucleotide sequences of 5S rRNAs from the starfish Asterias vulgaris, the squid Illex illecebrosus, the sipunculid Phascolopsis gouldii and the jellyfish Aurelia aurita were determined. The sequence from Asterias lends support for one of two previous base pairing models for helix E in metazoan sequences. The Aurelia sequence differs by five nucleotides from that previously reported and does not violate the consensus secondary structure model for eukaryotic 5S rRNA. PMID:6136024

  8. Maternal Plasma DNA and RNA Sequencing for Prenatal Testing.

    PubMed

    Tamminga, Saskia; van Maarle, Merel; Henneman, Lidewij; Oudejans, Cees B M; Cornel, Martina C; Sistermans, Erik A

    2016-01-01

    Cell-free DNA (cfDNA) testing has recently become indispensable in diagnostic testing and screening. In the prenatal setting, this type of testing is often called noninvasive prenatal testing (NIPT). With a number of techniques, using either next-generation sequencing or single nucleotide polymorphism-based approaches, fetal cfDNA in maternal plasma can be analyzed to screen for rhesus D genotype, common chromosomal aneuploidies, and increasingly for testing other conditions, including monogenic disorders. With regard to screening for common aneuploidies, challenges arise when implementing NIPT in current prenatal settings. Depending on the method used (targeted or nontargeted), chromosomal anomalies other than trisomy 21, 18, or 13 can be detected, either of fetal or maternal origin, also referred to as unsolicited or incidental findings. For various biological reasons, there is a small chance of having either a false-positive or false-negative NIPT result, or no result, also referred to as a "no-call." Both pre- and posttest counseling for NIPT should include discussing potential discrepancies. Since NIPT remains a screening test, a positive NIPT result should be confirmed by invasive diagnostic testing (either by chorionic villus biopsy or by amniocentesis). As the scope of NIPT is widening, professional guidelines need to discuss the ethics of what to offer and how to offer. In this review, we discuss the current biochemical, clinical, and ethical challenges of cfDNA testing in the prenatal setting and its future perspectives including novel applications that target RNA instead of DNA. PMID:27117661

  9. Distinct tmRNA sequence elements facilitate RNase R engagement on rescued ribosomes for selective nonstop mRNA decay

    PubMed Central

    Venkataraman, Krithika; Zafar, Hina; Karzai, A. Wali

    2014-01-01

    trans-Translation, orchestrated by SmpB and tmRNA, is the principal eubacterial pathway for resolving stalled translation complexes. RNase R, the leading nonstop mRNA surveillance factor, is recruited to stalled ribosomes in a trans-translation dependent process. To elucidate the contributions of SmpB and tmRNA to RNase R recruitment, we evaluated Escherichia coli–Francisella tularensis chimeric variants of tmRNA and SmpB. This evaluation showed that while the hybrid tmRNA supported nascent polypeptide tagging and ribosome rescue, it suffered defects in facilitating RNase R recruitment to stalled ribosomes. To gain further insights, we used established tmRNA and SmpB variants that impact distinct stages of the trans-translation process. Analysis of select tmRNA variants revealed that the sequence composition and positioning of the ultimate and penultimate codons of the tmRNA ORF play a crucial role in recruiting RNase R to rescued ribosomes. Evaluation of defined SmpB C-terminal tail variants highlighted the importance of establishing the tmRNA reading frame, and provided valuable clues into the timing of RNase R recruitment to rescued ribosomes. Taken together, these studies demonstrate that productive RNase R-ribosome engagement requires active trans-translation, and suggest that RNase R captures the emerging nonstop mRNA at an early stage after establishment of the tmRNA ORF as the surrogate mRNA template. PMID:25200086

  10. Characterising the Canine Oral Microbiome by Direct Sequencing of Reverse-Transcribed rRNA Molecules

    PubMed Central

    McDonald, James E.; Larsen, Niels; Pennington, Andrea; Connolly, John; Wallis, Corrin; Rooks, David J.; Hall, Neil; McCarthy, Alan J.; Allison, Heather E.

    2016-01-01

    PCR amplification and sequencing of phylogenetic markers, primarily Small Sub-Unit ribosomal RNA (SSU rRNA) genes, has been the paradigm for defining the taxonomic composition of microbiomes. However, ‘universal’ SSU rRNA gene PCR primer sets are likely to miss much of the diversity therein. We sequenced a library comprising purified and reverse-transcribed SSU rRNA (RT-SSU rRNA) molecules from the canine oral microbiome and compared it to a general bacterial 16S rRNA gene PCR amplicon library generated from the same biological sample. In addition, we have developed BIONmeta, a novel, open-source, computer package for the processing and taxonomic classification of the randomly fragmented RT-SSU rRNA reads produced. Direct RT-SSU rRNA sequencing revealed that 16S rRNA molecules belonging to the bacterial phyla Actinobacteria, Bacteroidetes, Firmicutes, Proteobacteria and Spirochaetes, were most abundant in the canine oral microbiome (92.5% of total bacterial SSU rRNA). The direct rRNA sequencing approach detected greater taxonomic diversity (1 additional phylum, 2 classes, 1 order, 10 families and 61 genera) when compared with general bacterial 16S rRNA amplicons from the same sample, simultaneously provided SSU rRNA gene inventories of Bacteria, Archaea and Eukarya, and detected significant numbers of sequences not recognised by ‘universal’ primer sets. Proteobacteria and Spirochaetes were found to be under-represented by PCR-based analysis of the microbiome, and this was due to primer mismatches and taxon-specific variations in amplification efficiency, validated by qPCR analysis of 16S rRNA amplicons from a mock community. This demonstrated the veracity of direct RT-SSU rRNA sequencing for molecular microbial ecology. PMID:27276347

  11. Characterising the Canine Oral Microbiome by Direct Sequencing of Reverse-Transcribed rRNA Molecules.

    PubMed

    McDonald, James E; Larsen, Niels; Pennington, Andrea; Connolly, John; Wallis, Corrin; Rooks, David J; Hall, Neil; McCarthy, Alan J; Allison, Heather E

    2016-01-01

    PCR amplification and sequencing of phylogenetic markers, primarily Small Sub-Unit ribosomal RNA (SSU rRNA) genes, has been the paradigm for defining the taxonomic composition of microbiomes. However, 'universal' SSU rRNA gene PCR primer sets are likely to miss much of the diversity therein. We sequenced a library comprising purified and reverse-transcribed SSU rRNA (RT-SSU rRNA) molecules from the canine oral microbiome and compared it to a general bacterial 16S rRNA gene PCR amplicon library generated from the same biological sample. In addition, we have developed BIONmeta, a novel, open-source, computer package for the processing and taxonomic classification of the randomly fragmented RT-SSU rRNA reads produced. Direct RT-SSU rRNA sequencing revealed that 16S rRNA molecules belonging to the bacterial phyla Actinobacteria, Bacteroidetes, Firmicutes, Proteobacteria and Spirochaetes, were most abundant in the canine oral microbiome (92.5% of total bacterial SSU rRNA). The direct rRNA sequencing approach detected greater taxonomic diversity (1 additional phylum, 2 classes, 1 order, 10 families and 61 genera) when compared with general bacterial 16S rRNA amplicons from the same sample, simultaneously provided SSU rRNA gene inventories of Bacteria, Archaea and Eukarya, and detected significant numbers of sequences not recognised by 'universal' primer sets. Proteobacteria and Spirochaetes were found to be under-represented by PCR-based analysis of the microbiome, and this was due to primer mismatches and taxon-specific variations in amplification efficiency, validated by qPCR analysis of 16S rRNA amplicons from a mock community. This demonstrated the veracity of direct RT-SSU rRNA sequencing for molecular microbial ecology. PMID:27276347

  12. Tetrathiobacter kashmirensis Strain CA-1 16S rRNA gene complete sequence.

    Technology Transfer Automated Retrieval System (TEKTRAN)

    This study used 1326 base pair 16S rRNA gene sequence methods to confirm the identification of a bacterium as Tetrathiobacter kashmirensis. Morphological, biochemical characteristics, and fatty acid profiles are consistent with the 16S rRNA gene sequence identification of the bacterium. The isolate...

  13. In silico detection of tRNA sequence features characteristic to aminoacyl-tRNA synthetase class membership

    PubMed Central

    Jakó, Éena; Ittzés, Péter; Szenes, Áron; Kun, Ádám; Szathmáry, Eörs; Pál, Gábor

    2007-01-01

    Aminoacyl tRNA synthetases (aaRS) are grouped into Class I and II based on primary and tertiary structure and enzyme properties suggesting two independent phylogenetic lineages. Analogously, tRNA molecules can also form two respective classes, based on the class membership of their corresponding aaRS. Although some aaRS–tRNA interactions are not extremely specific and require editing mechanisms to avoid misaminoacylation, most aaRS–tRNA interactions are rather stereospecific. Thus, class-specific aaRS features could be mirrored by class-specific tRNA features. However, previous investigations failed to detect conserved class-specific nucleotides. Here we introduce a discrete mathematical approach that evaluates not only class-specific ‘strictly present’, but also ‘strictly absent’ nucleotides. The disjoint subsets of these elements compose a unique partition, named extended consensus partition (ECP). By analyzing the ECP for both Class I and II tDNA sets from 50 (13 archaeal, 30 bacterial and 7 eukaryotic) species, we could demonstrate that class-specific tRNA sequence features do exist, although not in terms of strictly conserved nucleotides as it had previously been anticipated. This finding demonstrates that important information was hidden in tRNA sequences inaccessible for traditional statistical methods. The ECP analysis might contribute to the understanding of tRNA evolution and could enrich the sequence analysis tool repertoire. PMID:17704131
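
    As a rough illustration of the idea of looking at "strictly absent" as well as "strictly present" nucleotides, the sketch below scans aligned sequences column by column. It is not the authors' ECP algorithm, whose details are not given in this abstract: the definitions used (present = the single nucleotide shared by every sequence at a position; absent = nucleotides never seen at that position), the function names, and the toy sequences are all assumptions made for this example.

```python
ALPHABET = "ACGT"  # tDNA alphabet; gaps and modified bases are ignored in this sketch


def strict_features(aligned):
    """Return per-position sets of strictly present and strictly absent nucleotides.

    `aligned` is a list of equal-length sequences from one class.
    """
    length = len(aligned[0])
    if any(len(seq) != length for seq in aligned):
        raise ValueError("sequences must be aligned to the same length")
    present, absent = [], []
    for i in range(length):
        column = {seq[i] for seq in aligned}
        # Strictly present: every sequence carries the same nucleotide here.
        present.append(set(column) if len(column) == 1 else set())
        # Strictly absent: nucleotides that never occur at this position.
        absent.append(set(ALPHABET) - column)
    return present, absent


def class_specific_absent(class_a, class_b):
    """Nucleotides never seen in class_a at a position but observed there in class_b."""
    _, absent_a = strict_features(class_a)
    observed_b = [{seq[i] for seq in class_b} for i in range(len(class_b[0]))]
    return [ab & ob for ab, ob in zip(absent_a, observed_b)]


if __name__ == "__main__":
    class_one = ["GCA", "GCT", "GTA"]  # toy data, not real tDNA sequences
    class_two = ["GAA", "GAT", "GCA"]
    print(strict_features(class_one))
    print(class_specific_absent(class_one, class_two))
```

    In the toy data no position distinguishes the classes by a strictly conserved nucleotide, yet position 2 does so by absence (A never occurs there in the first class), which is the kind of signal the abstract says conserved-nucleotide searches miss.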

  14. Profiling miRNA Expression in Bovine Tissues by Deep Sequencing

    Technology Transfer Automated Retrieval System (TEKTRAN)

    miRNAs are short RNA sequences (~21 nt long) that have been recently identified and were found to play an important role in gene regulation and in controlling major cellular processes. Several miRNAs are evolutionarily conserved among mammalian species. Some miRNAs are even conserved be...

  15. Transcription profile of boar spermatozoa as revealed by RNA-sequencing

    Technology Transfer Automated Retrieval System (TEKTRAN)

    High-throughput RNA sequencing (RNA-Seq) overcomes the limitations of the current hybridization-based techniques to detect the actual pool of RNA transcripts in spermatozoa. The application of this technology in livestock can speed the discovery of potential predictors of male fertility. As a first ...

  16. HIGH SEQUENCE DIVERSITY IN THE RNA SYNTHESIZED AT THE LAMPBRUSH STAGE OF OÖGENESIS*

    PubMed Central

    Davidson, Eric H.; Hough, Barbara R.

    1969-01-01

    Many diverse RNA's are synthesized in the lampbrush stage oöcyte of Xenopus, as shown by the presence of different nucleotide sequences in the RNA population. This fact has been established by hybridizing lampbrush stage oöcyte RNA with an isolated nonrepetitive fraction of Xenopus DNA. PMID:5257126

  17. High sequence diversity in the RNA synthesized at the lampbrush stage of oögenesis.

    PubMed

    Davidson, E H; Hough, B R

    1969-06-01

    Many diverse RNA's are synthesized in the lampbrush stage oöcyte of Xenopus, as shown by the presence of different nucleotide sequences in the RNA population. This fact has been established by hybridizing lampbrush stage oöcyte RNA with an isolated nonrepetitive fraction of Xenopus DNA. PMID:5257126

  18. Strand-specific libraries for high throughput RNA sequencing (RNA-Seq) prepared without poly(A) selection

    PubMed Central

    2012-01-01

    Background: High throughput DNA sequencing technology has enabled quantification of all the RNAs in a cell or tissue, a method widely known as RNA sequencing (RNA-Seq). However, non-coding RNAs such as rRNA are highly abundant and can consume >70% of sequencing reads. A common approach is to extract only polyadenylated mRNA; however, such approaches are blind to RNAs with short or no poly(A) tails, leading to an incomplete view of the transcriptome. Another challenge of preparing RNA-Seq libraries is to preserve the strand information of the RNAs. Design: Here, we describe a procedure for preparing RNA-Seq libraries from 1 to 4 μg total RNA without poly(A) selection. Our method combines the deoxyuridine triphosphate (dUTP)/uracil-DNA glycosylase (UDG) strategy to achieve strand specificity with AMPure XP magnetic beads to perform size selection. Together, these steps eliminate gel purification, allowing a library to be made in less than two days. We barcode each library during the final PCR amplification step, allowing several samples to be sequenced in a single lane without sacrificing read length. Libraries prepared using this protocol are compatible with Illumina GAII, GAIIx and HiSeq 2000 platforms. Discussion: The RNA-Seq protocol described here yields strand-specific transcriptome libraries without poly(A) selection, which provide approximately 90% mappable sequences. Typically, more than 85% of mapped reads correspond to protein-coding genes and only 6% derive from non-coding RNAs. The protocol has been used to measure RNA transcript identity and abundance in tissues from flies, mice, rats, chickens, and frogs, demonstrating its general applicability. PMID:23273270

  19. Changes in nuclear and polysomal polyadenylated RNA sequences during rat-liver regeneration.

    PubMed Central

    Wilkes, P R; Birnie, G D; Paul, J

    1979-01-01

    Nuclear and polysomal polyadenylated RNA populations of normal and 16 hour regenerating rat liver have been compared by mRNA-cDNA hybridisations and by unique DNA saturation experiments. It was found that nuclear polyadenylated RNA hybridises to 6.8% of unique DNA in both normal and 16 hour regenerating rat liver. However, cross-hybridisation experiments using cDNA have shown that 10-15% by weight of nuclear polyadenylated RNA sequences are specific to 16 hour regenerating rat liver. Since both unique DNA and cDNA hybridisation have shown that normal and 16 hour regenerating rat-liver polysomal polyadenylated RNA populations are qualitatively very similar, the sequences specific to 16 hour regenerating rat-liver nuclear polyadenylated RNA are confined to the nucleus. Polysomal RNA sequences which were abundant in normal rat liver have become less abundant in regenerating rat liver. PMID:461186

  20. JAR3D Webserver: Scoring and aligning RNA loop sequences to known 3D motifs.

    PubMed

    Roll, James; Zirbel, Craig L; Sweeney, Blake; Petrov, Anton I; Leontis, Neocles

    2016-07-01

    Many non-coding RNAs have been identified and may function by forming 2D and 3D structures. RNA hairpin and internal loops are often represented as unstructured on secondary structure diagrams, but RNA 3D structures show that most such loops are structured by non-Watson-Crick basepairs and base stacking. Moreover, different RNA sequences can form the same RNA 3D motif. JAR3D finds possible 3D geometries for hairpin and internal loops by matching loop sequences to motif groups from the RNA 3D Motif Atlas, by exact sequence match when possible, and by probabilistic scoring and edit distance for novel sequences. The scoring gauges the ability of the sequences to form the same pattern of interactions observed in 3D structures of the motif. The JAR3D webserver at http://rna.bgsu.edu/jar3d/ takes one or many sequences of a single loop as input, or else one or many sequences of longer RNAs with multiple loops. Each sequence is scored against all current motif groups. The output shows the ten best-matching motif groups. Users can align input sequences to each of the motif groups found by JAR3D. JAR3D will be updated with every release of the RNA 3D Motif Atlas, and so its performance is expected to improve over time. PMID:27235417
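
    The abstract mentions edit distance as one ingredient for matching novel loop sequences to motif groups. The sketch below shows only that ingredient, using a standard Levenshtein distance and a nearest-group lookup. It is not JAR3D's implementation or API: the function names, the group names and the example sequences are hypothetical, and JAR3D's probabilistic scoring is not modelled at all.

```python
def edit_distance(a, b):
    """Levenshtein distance: minimum insertions, deletions and substitutions."""
    prev = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        curr = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            curr.append(min(prev[j] + 1,          # delete char_a
                            curr[j - 1] + 1,      # insert char_b
                            prev[j - 1] + cost))  # match or substitute
        prev = curr
    return prev[-1]


def closest_group(query, groups):
    """Return (group name, distance) for the group holding the nearest known sequence.

    `groups` maps a group name to a list of loop sequences (hypothetical data).
    """
    scored = ((name, min(edit_distance(query, seq) for seq in seqs))
              for name, seqs in groups.items())
    return min(scored, key=lambda item: item[1])


if __name__ == "__main__":
    groups = {
        "hypothetical_group_A": ["GAAA", "GCAA"],
        "hypothetical_group_B": ["UUCG"],
    }
    print(closest_group("GGAA", groups))
```

    A plain nearest-neighbour lookup like this treats every substitution equally; the abstract indicates that JAR3D additionally scores how well a sequence can form the interactions observed in the 3D motif, which is why the distance alone should be read as a simplification.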

  1. Replicating satellite RNA induces sequence-specific DNA methylation and truncated transcripts in plants.

    PubMed Central

    Wang, M B; Wesley, S V; Finnegan, E J; Smith, N A; Waterhouse, P M

    2001-01-01

    Tobacco plants were transformed with a chimeric transgene comprising sequences encoding beta-glucuronidase (GUS) and the satellite RNA (satRNA) of cereal yellow dwarf luteovirus. When transgenic plants were infected with potato leafroll luteovirus (PLRV), which replicated the transgene-derived satRNA to a high level, the satellite sequence of the GUS:Sat transgene became densely methylated. Within the satellite region, all 86 cytosines in the upper strand and 73 of the 75 cytosines in the lower strand were either partially or fully methylated. In contrast, very low levels of DNA methylation were detected in the satellite sequence of the transgene in uninfected plants and in the flanking nonsatellite sequences in both infected and uninfected plants. Substantial amounts of truncated GUS:Sat RNA accumulated in the satRNA-replicating plants, and most of the molecules terminated at nucleotides within the first 60 bp of the satellite sequence. Whereas this RNA truncation was associated with high levels of satRNA replication, it appeared to be independent of the levels of DNA methylation in the satellite sequence, suggesting that it is not caused by methylation. All the sequenced GUS:Sat DNA molecules were hypermethylated in plants with replicating satRNA despite the phloem restriction of the helper PLRV. Also, small, sense and antisense approximately 22 nt RNAs, derived from the satRNA, were associated with the replicating satellite. These results suggest that the sequence-specific DNA methylation spread into cells in which no satRNA replication occurred and that this was mediated by the spread of unamplified satRNA and/or its associated 22 nt RNA molecules. PMID:11214177

  2. Integration of Expressed Sequence Tag Data Flanking Predicted RNA Secondary Structures Facilitates Novel Non-Coding RNA Discovery

    PubMed Central

    Krzyzanowski, Paul M.; Price, Feodor D.; Muro, Enrique M.; Rudnicki, Michael A.; Andrade-Navarro, Miguel A.

    2011-01-01

    Many computational methods have been used to predict novel non-coding RNAs (ncRNAs), but none, to our knowledge, have explicitly investigated the impact of integrating existing cDNA-based Expressed Sequence Tag (EST) data that flank structural RNA predictions. To determine whether flanking EST data can assist in microRNA (miRNA) prediction, we identified genomic sites encoding putative miRNAs by combining functional RNA predictions with flanking EST data in a model consistent with miRNAs undergoing cleavage during maturation. In both human and mouse genomes, we observed that the inclusion of flanking ESTs adjacent to and not overlapping predicted miRNAs significantly improved the performance of various methods of miRNA prediction, including direct high-throughput sequencing of small RNA libraries. We analyzed the expression of hundreds of miRNAs predicted to be expressed during myogenic differentiation using a customized microarray and identified several known and predicted myogenic miRNA hairpins. Our results indicate that integrating ESTs flanking structural RNA predictions improves the quality of cleaved miRNA predictions and suggest that this strategy can be used to predict other non-coding RNAs undergoing cleavage during maturation. PMID:21698286

  3. Integration of expressed sequence tag data flanking predicted RNA secondary structures facilitates novel non-coding RNA discovery.

    PubMed

    Krzyzanowski, Paul M; Price, Feodor D; Muro, Enrique M; Rudnicki, Michael A; Andrade-Navarro, Miguel A

    2011-01-01

    Many computational methods have been used to predict novel non-coding RNAs (ncRNAs), but none, to our knowledge, have explicitly investigated the impact of integrating existing cDNA-based Expressed Sequence Tag (EST) data that flank structural RNA predictions. To determine whether flanking EST data can assist in microRNA (miRNA) prediction, we identified genomic sites encoding putative miRNAs by combining functional RNA predictions with flanking EST data in a model consistent with miRNAs undergoing cleavage during maturation. In both human and mouse genomes, we observed that the inclusion of flanking ESTs adjacent to and not overlapping predicted miRNAs significantly improved the performance of various methods of miRNA prediction, including direct high-throughput sequencing of small RNA libraries. We analyzed the expression of hundreds of miRNAs predicted to be expressed during myogenic differentiation using a customized microarray and identified several known and predicted myogenic miRNA hairpins. Our results indicate that integrating ESTs flanking structural RNA predictions improves the quality of cleaved miRNA predictions and suggest that this strategy can be used to predict other non-coding RNAs undergoing cleavage during maturation. PMID:21698286

  4. The use of exome capture RNA-seq for highly degraded RNA with application to clinical cancer sequencing.

    PubMed

    Cieslik, Marcin; Chugh, Rashmi; Wu, Yi-Mi; Wu, Ming; Brennan, Christine; Lonigro, Robert; Su, Fengyun; Wang, Rui; Siddiqui, Javed; Mehra, Rohit; Cao, Xuhong; Lucas, David; Chinnaiyan, Arul M; Robinson, Dan

    2015-09-01

    RNA-seq by poly(A) selection is currently the most common protocol for whole transcriptome sequencing as it provides a broad, detailed, and accurate view of the RNA landscape. Unfortunately, the utility of poly(A) libraries is greatly limited when the input RNA is degraded, which is the norm for research tissues and clinical samples, especially when specimens are formalin-fixed. To facilitate the use of RNA sequencing beyond cell lines and in the clinical setting, we developed an exome-capture transcriptome protocol with greatly improved performance on degraded RNA. Capture transcriptome libraries enable measuring absolute and differential gene expression, calling genetic variants, and detecting gene fusions. Through validation against gold-standard poly(A) and Ribo-Zero libraries from intact RNA, we show that capture RNA-seq provides accurate and unbiased estimates of RNA abundance, uniform transcript coverage, and broad dynamic range. Unlike poly(A) selection and Ribo-Zero depletion, capture libraries retain these qualities regardless of RNA quality and provide excellent data from clinical specimens including formalin-fixed paraffin-embedded (FFPE) blocks. Systematic improvements across key applications of RNA-seq are shown on a cohort of prostate cancer patients and a set of clinical FFPE samples. Further, we demonstrate the utility of capture RNA-seq libraries in a patient with a highly malignant solitary fibrous tumor (SFT) enrolled in our clinical sequencing program called MI-ONCOSEQ. Capture transcriptome profiling from FFPE revealed two oncogenic fusions: the pathognomonic NAB2-STAT6 inversion and a therapeutically actionable BRAF fusion, which may drive this specific cancer's aggressive phenotype. PMID:26253700

  5. AB053. MicroRNA expression profile in penile cancer revealed by next-generation small RNA sequencing

    PubMed Central

    Zhang, Li; Wei, Pengfei

    2016-01-01

    Objective: Penile cancer (PeCa) is a relatively rare tumor entity but has higher morbidity and mortality rates, especially in developing countries. To date, the concrete pathogenic signaling pathways and core machineries involved in tumorigenesis and progression of PeCa remain to be elucidated. Several studies suggested that miRNAs, which modulate gene expression at the posttranscriptional level, are frequently mis-regulated and aberrantly expressed in human cancers. However, the miRNA profile in human PeCa has not been reported before. Methods: In the present study, the miRNA profile was obtained from 10 fresh penile cancerous tissues and matched adjacent non-cancerous tissues via next-generation sequencing. Results: A total of 751 and 806 annotated miRNAs were identified in normal and cancerous penile tissues, respectively. Among these, 56 miRNAs with significantly different expression levels between paired tissues were identified. Subsequently, several annotated miRNAs were randomly selected and validated using quantitative real-time PCR. Compared with previous publications on altered miRNA expression in various cancers, especially genitourinary (prostate, bladder, kidney, testis) cancers, the vast majority of deregulated miRNAs showed a similar expression pattern in penile cancer. Moreover, the bioinformatics analyses suggested that the putative target genes of miRNAs differentially expressed between cancerous and matched normal penile tissues were tightly associated with cell junction, proliferation, growth and genomic instability, among others, by modulating the Wnt, MAPK, p53, PI3K-Akt, Notch and TGF-β signaling pathways, all of which are well established to participate in cancer initiation and progression. Conclusions: Our work presents a global view of the differentially expressed miRNAs and potentially regulatory networks of their target genes for clarifying the pathogenic transformation of normal penis to PeCa, which research resource

  6. Globin mRNA reduction for whole-blood transcriptome sequencing

    PubMed Central

    Krjutškov, Kaarel; Koel, Mariann; Roost, Anne Mari; Katayama, Shintaro; Einarsdottir, Elisabet; Jouhilahti, Eeva-Mari; Söderhäll, Cilla; Jaakma, Ülle; Plaas, Mario; Vesterlund, Liselotte; Lohi, Hannes; Salumets, Andres; Kere, Juha

    2016-01-01

    The transcriptome analysis of whole-blood RNA by sequencing holds promise for the identification and tracking of biomarkers; however, the high globin mRNA (gmRNA) content of erythrocytes hampers whole-blood and buffy coat analyses. We introduce a novel gmRNA locking assay (GlobinLock, GL) as a robust and simple gmRNA reduction tool to preserve RNA quality, save time and cost. GL consists of a pair of gmRNA-specific oligonucleotides in RNA initial denaturation buffer that is effective immediately after RNA denaturation and adds only ten minutes of incubation to the whole cDNA synthesis procedure when compared to non-blood RNA analysis. We show that GL is fully effective not only for human samples but also for mouse and rat, and so far incompletely studied cow, dog and zebrafish. PMID:27515369

  7. Globin mRNA reduction for whole-blood transcriptome sequencing.

    PubMed

    Krjutškov, Kaarel; Koel, Mariann; Roost, Anne Mari; Katayama, Shintaro; Einarsdottir, Elisabet; Jouhilahti, Eeva-Mari; Söderhäll, Cilla; Jaakma, Ülle; Plaas, Mario; Vesterlund, Liselotte; Lohi, Hannes; Salumets, Andres; Kere, Juha

    2016-01-01

    The transcriptome analysis of whole-blood RNA by sequencing holds promise for the identification and tracking of biomarkers; however, the high globin mRNA (gmRNA) content of erythrocytes hampers whole-blood and buffy coat analyses. We introduce a novel gmRNA locking assay (GlobinLock, GL) as a robust and simple gmRNA reduction tool to preserve RNA quality, save time and cost. GL consists of a pair of gmRNA-specific oligonucleotides in RNA initial denaturation buffer that is effective immediately after RNA denaturation and adds only ten minutes of incubation to the whole cDNA synthesis procedure when compared to non-blood RNA analysis. We show that GL is fully effective not only for human samples but also for mouse and rat, and so far incompletely studied cow, dog and zebrafish. PMID:27515369

  8. Sequence-specific cleavage of dsRNA by Mini-III RNase

    PubMed Central

    Głów, Dawid; Pianka, Dariusz; Sulej, Agata A.; Kozłowski, Łukasz P.; Czarnecka, Justyna; Chojnowski, Grzegorz; Skowronek, Krzysztof J.; Bujnicki, Janusz M.

    2015-01-01

    Ribonucleases (RNases) play a critical role in RNA processing and degradation by hydrolyzing phosphodiester bonds (exo- or endonucleolytically). Many RNases that cut RNA internally exhibit substrate specificity, but their target sites are usually limited to one or a few specific nucleotides in single-stranded RNA and often in a context of a particular three-dimensional structure of the substrate. Thus far, no RNase counterparts of restriction enzymes have been identified which could cleave double-stranded RNA (dsRNA) in a sequence-specific manner. Here, we present evidence for a sequence-dependent cleavage of long dsRNA by RNase Mini-III from Bacillus subtilis (BsMiniIII). Analysis of the sites cleaved by this enzyme in limited digest of bacteriophage Φ6 dsRNA led to the identification of a consensus target sequence. We defined nucleotide residues within the preferred cleavage site that affected the efficiency of the cleavage and were essential for the discrimination of cleavable versus non-cleavable dsRNA sequences. We have also determined that the loop α5b-α6, a distinctive structural element in Mini-III RNases, is crucial for the specific cleavage, but not for dsRNA binding. Our results suggest that BsMiniIII may serve as a prototype of a sequence-specific dsRNase that could possibly be used for targeted cleavage of dsRNA. PMID:25634891

  9. Sequence of instability processes triggered by heavy rainfall in the northern Italy

    NASA Astrophysics Data System (ADS)

    Luino, Fabio

    2005-03-01

    Northern Italy is a geomorphologically heterogeneous region: high mountains, wide valleys, gentle hills and a large plain form a very varied landscape and influence the temperate climate of the area. The Alps region has harsh winters and moderately warm summers with abundant rainfall. The Po Plain has harsh winters with long periods of subfreezing temperatures and warm sultry summers, with rainfall more common in winter. Geomorphic instability processes are very common. Almost every year, landslides, mud flows and debris flows in the Alpine areas and flooding in the Po flood plain cause severe damage to structures and infrastructure and often claim human lives. Analyses of major events that have struck northern Italy over the last 35 years have provided numerous useful data for the recognition of various rainfall-triggering processes and their sequence of development in relation to the intensity and duration of rainfall. Findings acquired during and after these events emphasise that the quantity and typology of instability processes triggered by rainfall are related not only to an area's morphological and geological characteristics but also to intense rainfall distribution during meteorological disturbances. Moreover, critical rainfall thresholds can vary from place to place in relation to the climatic and geomorphological conditions of the area. Once the threshold has been exceeded, which is about 10% of the local mean annual rainfall (MAR), the instability processes on the slopes and along the hydrographic networks follow a sequence that can be reconstructed in three different phases. In the first phase, the initial instability processes that can usually be observed are soil slips on steep slopes, mud-debris flows in small basins of less than 20 km² in area, while discharge increases substantially in larger stream basins of up to 500 km². In continuous precipitation, in the second phase, first mud-debris flows can be triggered also in basins larger than 20 km²

  10. In vitro selection of an RNA sequence that interacts with high affinity with thymidylate synthase

    PubMed Central

    Lin, Xiukun; Mizunuma, Nobuyuki; Chen, Tian-men; Copur, Sitki M.; Maley, Gladys F.; Liu, Jun; Maley, Frank; Chu, Edward

    2000-01-01

    Previous studies have shown that the repressive effect of thymidylate synthase (TS) mRNA translation is mediated by direct binding of TS itself to two cis-acting elements on its cognate mRNA. To identify the optimal RNA nucleotides that interact with TS, we in vitro synthesized a completely degenerate, linear RNA pool of 25 nt and employed in vitro selection to isolate high affinity RNA ligands that bind human TS protein. After 10 rounds of selection and amplification, a single RNA molecule was selected that bound TS protein with nearly 20-fold greater affinity than native, wild-type TS RNA sequences. Secondary structure analysis of this RNA sequence predicted it to possess a stem–loop structure. Deletion and/or modification of the UGU loop element within the RNA sequence decreased binding to TS by up to 1000-fold. In vivo transfection experiments revealed that the presence of the selected RNA sequence resulted in a significant increase in the expression of a heterologous luciferase reporter construct in human colon cancer H630 and TS-overexpressing HCT-C:His-TS+ cells, but not in HCT-C18 cells expressing a functionally inactive TS. In addition, the presence of this element in H630 cells leads to induced expression of TS protein. An immunoprecipitation method using RT–PCR confirmed a direct interaction between human TS protein and the selected RNA sequence in transfected human cancer H630 cells. This study identified a novel RNA sequence from a degenerate RNA library that specifically interacts with TS. PMID:11058126
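
    As a rough illustration of what "completely degenerate" means for a 25-nt pool, the sketch below counts the distinct sequences such a pool could in principle contain. The function name `pool_complexity` is hypothetical, and the abstract does not say how much of this theoretical space the synthesized pool actually covered.

```python
# Theoretical sequence space of a fully degenerate RNA pool.
# Assumption: each position is drawn independently from A, C, G, U.
ALPHABET = "ACGU"


def pool_complexity(length: int, alphabet: str = ALPHABET) -> int:
    """Return the number of distinct sequences of `length` over `alphabet`."""
    if length < 0:
        raise ValueError("length must be non-negative")
    return len(alphabet) ** length


n = pool_complexity(25)
print(f"{n:,} possible 25-nt sequences (about {n:.2e})")
```

    With four bases at each of 25 positions, the space is 4^25 sequences, which is the upper bound on the diversity that selection starts from.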

  11. In vitro selection of an RNA sequence that interacts with high affinity with thymidylate synthase.

    PubMed

    Lin, X; Mizunuma, N; Chen, T; Copur, S M; Maley, G F; Liu, J; Maley, F; Chu, E

    2000-11-01

    Previous studies have shown that the repressive effect of thymidylate synthase (TS) mRNA translation is mediated by direct binding of TS itself to two cis-acting elements on its cognate mRNA. To identify the optimal RNA nucleotides that interact with TS, we in vitro synthesized a completely degenerate, linear RNA pool of 25 nt and employed in vitro selection to isolate high affinity RNA ligands that bind human TS protein. After 10 rounds of selection and amplification, a single RNA molecule was selected that bound TS protein with nearly 20-fold greater affinity than native, wild-type TS RNA sequences. Secondary structure analysis of this RNA sequence predicted it to possess a stem-loop structure. Deletion and/or modification of the UGU loop element within the RNA sequence decreased binding to TS by up to 1000-fold. In vivo transfection experiments revealed that the presence of the selected RNA sequence resulted in a significant increase in the expression of a heterologous luciferase reporter construct in human colon cancer H630 and TS-overexpressing HCT-C:His-TS+ cells, but not in HCT-C18 cells expressing a functionally inactive TS. In addition, the presence of this element in H630 cells leads to induced expression of TS protein. An immunoprecipitation method using RT-PCR confirmed a direct interaction between human TS protein and the selected RNA sequence in transfected human cancer H630 cells. This study identified a novel RNA sequence from a degenerate RNA library that specifically interacts with TS. PMID:11058126

  12. Nucleotide sequence of 3' untranslated portion of human alpha globin mRNA.

    PubMed Central

    Wilson, J T; deRiel, J K; Forget, B G; Marotta, C A; Weissman, S M

    1977-01-01

    We have determined the nucleotide sequence of 75 nucleotides of the 3'-untranslated portion of normal human alpha globin mRNA which corresponds to the elongated amino acid sequence of the chain termination mutant Hb Constant Spring. This was accomplished by sequence analysis of cDNA fragments obtained by restriction endonuclease or T4 endonuclease IV cleavage of human globin cDNA synthesized from globin mRNA by use of viral reverse transcriptase. Analysis of cRNA synthesized from cDNA by use of RNA polymerase provided additional confirmatory sequence information. Possible polymorphism has been identified at one site of the sequence. Our sequence overlaps with, and extends, the sequence of 43 nucleotides determined by Proudfoot and coworkers for the very 3'-terminal portion of human alpha globin mRNA. The complete 3'-untranslated sequence of human alpha globin mRNA (112 nucleotides including termination codon) shows little homology to that of the human or rabbit beta globin mRNAs except for the presence of the hexanucleotide sequence AAUAAA which is found in most eukaryotic mRNAs near the 3'-terminal poly(A). PMID:909779
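
    The AAUAAA hexanucleotide mentioned above can be located in a sequence with a simple scan. This is a minimal sketch; the function name `find_polya_signals` and the example fragment are made up for illustration and are not the sequence reported in the record.

```python
def find_polya_signals(rna: str, signal: str = "AAUAAA") -> list[int]:
    """Return 0-based start positions of every occurrence of `signal`."""
    rna = rna.upper().replace("T", "U")  # accept DNA-style input too
    positions = []
    start = rna.find(signal)
    while start != -1:
        positions.append(start)
        start = rna.find(signal, start + 1)  # step by one to allow overlaps
    return positions


# Hypothetical 3'-terminal fragment, invented for this example only.
example = "GCCUGAGUAAUAAAGUCUGAGUGGGCAAAAAAAA"
print(find_polya_signals(example))
```

    Converting T to U lets the same function scan cDNA-derived sequence, which is how the original data were obtained.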

  13. Molecular basis of sequence-specific recognition of pre-ribosomal RNA by nucleolin

    PubMed Central

    Allain, Frédéric H.-T.; Bouvet, Philippe; Dieckmann, Thorsten; Feigon, Juli

    2000-01-01

    The structure of the 28 kDa complex of the first two RNA binding domains (RBDs) of nucleolin (RBD12) with an RNA stem–loop that includes the nucleolin recognition element UCCCGA in the loop was determined by NMR spectroscopy. The structure of nucleolin RBD12 with the nucleolin recognition element (NRE) reveals that the two RBDs bind on opposite sides of the RNA loop, forming a molecular clamp that brings the 5′ and 3′ ends of the recognition sequence close together and stabilizing the stem–loop. The specific interactions observed in the structure explain the sequence specificity for the NRE sequence. Binding studies of mutant proteins and analysis of conserved residues support the proposed interactions. The mode of interaction of the protein with the RNA and the location of the putative NRE sites suggest that nucleolin may function as an RNA chaperone to prevent improper folding of the nascent pre-rRNA. PMID:11118222

  14. Interspersion of sequences in avian myeloblastosis virus RNA that rapidly hybridize with leukemic chicken cell DNA.

    PubMed Central

    Drohan, W N; Shoyab, M; Wall, R; Baluda, M A

    1975-01-01

    Liquid hybridization of progressively smaller fragments (35S, 27S, 15.5S, 12.5S, and 8S) of poly(A)-selected avian myeloblastosis virus RNA with excess DNA from leukemic chicken myeloblasts revealed that all sizes of RNA contained sequences complementary to both slowly and rapidly hybridizing cellular DNA sequences. Apparently, the RNA sequences which hybridize rapidly with excesses of cellular DNA are not restricted to any one region of the avian myeloblastosis virus 35S RNA. Instead, they appear to be randomly distributed over the entire 35S avian myeloblastosis virus RNA molecule with some positioned within 200 nucleotides of the poly(A) tract at the 3' end of the RNA. PMID:163372

  15. Sequence-specific RNA Photocleavage by Single-stranded DNA in Presence of Riboflavin.

    PubMed

    Zhao, Yongyun; Chen, Gangyi; Yuan, Yi; Li, Na; Dong, Juan; Huang, Xin; Cui, Xin; Tang, Zhuo

    2015-01-01

    Constant efforts have been made to develop new methods for sequence-specific RNA degradation, which could inhibit the expression of a targeted gene. Herein, by using an unmodified short DNA oligonucleotide for sequence recognition and an endogenous small molecule, vitamin B2 (riboflavin), as photosensitizer, we report a simple strategy to realize the sequence-specific photocleavage of targeted RNA. The DNA strand is complementary to the target sequence and forms a DNA/RNA duplex containing a G•U wobble in the middle. The cleavage reaction proceeds through an oxidative elimination mechanism at the nucleoside downstream of the U of the G•U wobble in the duplex to yield an unnatural RNA terminus, and the whole process is under tight control by using light as a switch, which means the cleavage can be carried out according to specific spatial and temporal requirements. The biocompatibility of this method makes the DNA strand in combination with riboflavin a promising molecular tool for RNA manipulation. PMID:26461456

  16. Sequence-specific RNA Photocleavage by Single-stranded DNA in Presence of Riboflavin

    NASA Astrophysics Data System (ADS)

    Zhao, Yongyun; Chen, Gangyi; Yuan, Yi; Li, Na; Dong, Juan; Huang, Xin; Cui, Xin; Tang, Zhuo

    2015-10-01

    Constant efforts have been made to develop new methods for sequence-specific RNA degradation, which could inhibit the expression of a targeted gene. Herein, by using an unmodified short DNA oligonucleotide for sequence recognition and an endogenous small molecule, vitamin B2 (riboflavin), as photosensitizer, we report a simple strategy to realize the sequence-specific photocleavage of targeted RNA. The DNA strand is complementary to the target sequence and forms a DNA/RNA duplex containing a G•U wobble in the middle. The cleavage reaction proceeds through an oxidative elimination mechanism at the nucleoside downstream of the U of the G•U wobble in the duplex to yield an unnatural RNA terminus, and the whole process is under tight control by using light as a switch, which means the cleavage can be carried out according to specific spatial and temporal requirements. The biocompatibility of this method makes the DNA strand in combination with riboflavin a promising molecular tool for RNA manipulation.

  17. Plant RNA virus sequences identified in kimchi by microbial metatranscriptome analysis.

    PubMed

    Kim, Dong Seon; Jung, Ji Young; Wang, Yao; Oh, Hye Ji; Choi, Dongjin; Jeon, Che Ok; Hahn, Yoonsoo

    2014-07-01

    Plant pathogenic RNA viruses are present in a variety of plant-based foods. When ingested by humans, these viruses can survive the passage through the digestive tract, and are frequently detected in human feces. Kimchi is a traditional fermented Korean food made from cabbage or vegetables, with a variety of other plant-based ingredients, including ground red pepper and garlic paste. We analyzed microbial metatranscriptome data from kimchi at five fermentation stages to identify plant RNA virus-derived sequences. We successfully identified a substantial amount of plant RNA virus sequences, especially during the early stages of fermentation: 23.47% and 16.45% of total clean reads on days 7 and 13, respectively. The most abundant plant RNA virus sequences were from pepper mild mottle virus, a major pathogen of red peppers; this constituted 95% of the total RNA virus sequences identified throughout the fermentation period. We observed distinct sequencing read-depth distributions for plant RNA virus genomes, possibly implying intrinsic and/or technical biases during the metatranscriptome generation procedure. We also identified RNA virus sequences in publicly available microbial metatranscriptome data sets. We propose that metatranscriptome data may serve as a valuable resource for RNA virus detection, and a systematic screening of the ingredients may help prevent the use of virus-infected low-quality materials for food production. PMID:24836186

  18. Common 5S rRNA variants are likely to be accepted in many sequence contexts

    NASA Technical Reports Server (NTRS)

    Zhang, Zhengdong; D'Souza, Lisa M.; Lee, Youn-Hyung; Fox, George E.

    2003-01-01

    Over evolutionary time RNA sequences which are successfully fixed in a population are selected from among those that satisfy the structural and chemical requirements imposed by the function of the RNA. These sequences together comprise the structure space of the RNA. In principle, a comprehensive understanding of RNA structure and function would make it possible to enumerate which specific RNA sequences belong to a particular structure space and which do not. We are using bacterial 5S rRNA as a model system to attempt to identify principles that can be used to predict which sequences do or do not belong to the 5S rRNA structure space. One promising idea is the very intuitive notion that frequently seen sequence changes in an aligned data set of naturally occurring 5S rRNAs would be widely accepted in many other 5S rRNA sequence contexts. To test this hypothesis, we first developed well-defined operational definitions for a Vibrio region of the 5S rRNA structure space and what is meant by a highly variable position. Fourteen sequence variants (10 point changes and 4 base-pair changes) were identified in this way, which, by the hypothesis, would be expected to incorporate successfully in any of the known sequences in the Vibrio region. All 14 of these changes were constructed and separately introduced into the Vibrio proteolyticus 5S rRNA sequence where they are not normally found. Each variant was evaluated for its ability to function as a valid 5S rRNA in an E. coli cellular context. It was found that 93% (13/14) of the variants tested are likely valid 5S rRNAs in this context. In addition, seven variants were constructed that, although present in the Vibrio region, did not meet the stringent criteria for a highly variable position. In this case, 86% (6/7) are likely valid. As a control we also examined seven variants that are seldom or never seen in the Vibrio region of 5S rRNA sequence space. In this case only two of seven were found to be potentially valid. 
The

  19. Preparation of cDNA libraries for high-throughput RNA sequencing analysis of RNA 5′ ends

    PubMed Central

    Vvedenskaya, Irina O.; Goldman, Seth R.; Nickels, Bryce E.

    2015-01-01

    We provide a detailed protocol for preparing cDNA libraries suitable for high throughput sequencing that are derived specifically from the 5′ ends of RNA (5′ specific RNA-seq). The protocol describes how cDNA libraries for 5′ specific RNA-seq can be tailored to analyze specific classes of RNAs based upon the phosphorylation status of the 5′ end. Thus, the analysis of cDNA libraries generated by these methods provides information regarding both the sequence and phosphorylation status of the 5′ ends of RNAs. 5′ specific RNA-seq can be used to analyze transcription initiation and post-transcriptional processing of RNAs with single base pair resolution on a genome-wide level. PMID:25665566

  20. 5S RNA sequence from the Philosamia silkworm: evidence for variable evolutionary rates in insect 5S RNA.

    PubMed Central

    Xian-Rong, G; Nicoghosian, K; Cedergren, R J

    1982-01-01

    The primary structure of 5S RNA isolated from the posterior silkgland of Philosamia cynthia ricini was determined using three in vitro labelling techniques. The derived sequence consists of 119 nucleotides and can be folded into the secondary structure model proposed for eukaryotic 5S RNAs. This 5S RNA differs from the Bombyx mori molecule in 9 positions and from the Drosophila melanogaster sequence in 14 positions. The comparison of evolutionary rates in insect 5S RNA with inferred rates in other eukaryotic phyla leads to the conclusion that 5S RNA evolution is not constant in different eukaryotic branches, a condition which must be taken into account in phylogenetic tree constructions. PMID:7145713
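
    The position counts quoted above can be turned into percent differences. The sketch below assumes, as a simplification not stated in the abstract, that each comparison spans all 119 positions of an ungapped alignment; the helper names are hypothetical.

```python
def count_differences(a: str, b: str) -> int:
    """Count mismatched positions between two aligned, equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to equal length")
    return sum(x != y for x, y in zip(a, b))


def percent_difference(n_diff: int, length: int) -> float:
    """Express a number of differing positions as a percentage of `length`."""
    return 100.0 * n_diff / length


LENGTH = 119  # nucleotides in the Philosamia 5S RNA, from the abstract
for species, n_diff in [("Bombyx mori", 9), ("Drosophila melanogaster", 14)]:
    print(f"{species}: {percent_difference(n_diff, LENGTH):.1f}% of positions differ")
```

    `count_differences` is included for use on real aligned sequences; the loop only reuses the counts reported in the abstract.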

  1. Sequencing 16S rRNA gene fragments using the PacBio SMRT DNA sequencing system

    PubMed Central

    Jenior, Matthew L.; Koumpouras, Charles C.; Westcott, Sarah L.; Highlander, Sarah K.

    2016-01-01

    Over the past 10 years, microbial ecologists have largely abandoned sequencing 16S rRNA genes by the Sanger sequencing method and have instead adopted highly parallelized sequencing platforms. These new platforms, such as 454 and Illumina’s MiSeq, have allowed researchers to obtain millions of high quality but short sequences. The result of the added sequencing depth has been significant improvements in experimental design. The tradeoff has been the decline in the number of full-length reference sequences that are deposited into databases. To overcome this problem, we tested the ability of the PacBio Single Molecule, Real-Time (SMRT) DNA sequencing platform to generate sequence reads from the 16S rRNA gene. We generated sequencing data from the V4, V3–V5, V1–V3, V1–V5, V1–V6, and V1–V9 variable regions from within the 16S rRNA gene using DNA from a synthetic mock community and natural samples collected from human feces, mouse feces, and soil. The mock community allowed us to assess the actual sequencing error rate and how that error rate changed when different curation methods were applied. We developed a simple method based on sequence characteristics and quality scores to reduce the observed error rate for the V1–V9 region from 0.69 to 0.027%. This error rate is comparable to what has been observed for the shorter reads generated by 454 and Illumina’s MiSeq sequencing platforms. Although the per base sequencing cost is still significantly more than that of MiSeq, the prospect of supplementing reference databases with full-length sequences from organisms below the limit of detection from the Sanger approach is exciting. PMID:27069806
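
    To make the error-rate figures concrete, the sketch below computes the fold reduction implied by the two reported percentages and shows one simple way a per-base error rate could be measured against a known mock-community reference. It is an illustration only: the function name is hypothetical, it counts substitutions in pre-aligned equal-length strings, and it is not the curation method the authors developed.

```python
def substitution_error_rate(read: str, reference: str) -> float:
    """Percent of positions where a pre-aligned read differs from its reference."""
    if len(read) != len(reference):
        raise ValueError("sequences must be pre-aligned to equal length")
    if not reference:
        raise ValueError("sequences must be non-empty")
    mismatches = sum(r != t for r, t in zip(read, reference))
    return 100.0 * mismatches / len(reference)


# Error rates for the V1-V9 region, taken from the abstract (in percent).
RAW_ERROR_PCT = 0.69
CURATED_ERROR_PCT = 0.027
fold_reduction = RAW_ERROR_PCT / CURATED_ERROR_PCT
print(f"fold reduction: {fold_reduction:.1f}")

# Toy read with one substitution in ten bases (made-up sequences).
print(substitution_error_rate("ACGTACGTAC", "ACGTACGTAA"))
```

    A real assessment would also align each read to its reference and count insertions and deletions, which this sketch leaves out.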

  2. Sequencing 16S rRNA gene fragments using the PacBio SMRT DNA sequencing system.

    PubMed

    Schloss, Patrick D; Jenior, Matthew L; Koumpouras, Charles C; Westcott, Sarah L; Highlander, Sarah K

    2016-01-01

    Over the past 10 years, microbial ecologists have largely abandoned sequencing 16S rRNA genes by the Sanger sequencing method and have instead adopted highly parallelized sequencing platforms. These new platforms, such as 454 and Illumina's MiSeq, have allowed researchers to obtain millions of high quality but short sequences. The result of the added sequencing depth has been significant improvements in experimental design. The tradeoff has been the decline in the number of full-length reference sequences that are deposited into databases. To overcome this problem, we tested the ability of the PacBio Single Molecule, Real-Time (SMRT) DNA sequencing platform to generate sequence reads from the 16S rRNA gene. We generated sequencing data from the V4, V3-V5, V1-V3, V1-V5, V1-V6, and V1-V9 variable regions from within the 16S rRNA gene using DNA from a synthetic mock community and natural samples collected from human feces, mouse feces, and soil. The mock community allowed us to assess the actual sequencing error rate and how that error rate changed when different curation methods were applied. We developed a simple method based on sequence characteristics and quality scores to reduce the observed error rate for the V1-V9 region from 0.69 to 0.027%. This error rate is comparable to what has been observed for the shorter reads generated by 454 and Illumina's MiSeq sequencing platforms. Although the per base sequencing cost is still significantly more than that of MiSeq, the prospect of supplementing reference databases with full-length sequences from organisms below the limit of detection from the Sanger approach is exciting. PMID:27069806

  3. A Simple Symmetric Algorithm Using a Likeness with Introns Behavior in RNA Sequences

    NASA Astrophysics Data System (ADS)

    Regoli, Massimo

    2009-02-01

    The RNA-Crypto System (RCS for short) is a symmetric-key algorithm for enciphering data. The idea for this new algorithm comes from the observation of nature, in particular of RNA behavior and some of its properties. RNA sequences have sections called introns. Introns, a term derived from "intragenic regions", are non-coding sections of precursor mRNA (pre-mRNA) or other RNAs that are removed (spliced out of the RNA) before the mature RNA is formed. Once the introns have been spliced out of a pre-mRNA, the resulting mRNA sequence is ready to be translated into a protein. The corresponding parts of a gene are known as introns as well. The nature and role of introns in pre-mRNA are not clear and are the subject of extensive research by biologists but, in our case, we use the presence of introns in the RNA-Crypto System output as a strong method to add chaotic non-coding information and an unnecessary behaviour in the access to the secret key used to code the messages. In the RNA-Crypto System algorithm, the introns are sections of the ciphered message carrying non-coding information, as in precursor mRNA.

  4. Deep Sequencing of RNA from Ancient Maize Kernels

    PubMed Central

    Rasmussen, Morten; Cappellini, Enrico; Romero-Navarro, J. Alberto; Wales, Nathan; Alquezar-Planas, David E.; Penfield, Steven; Brown, Terence A.; Vielle-Calzada, Jean-Philippe; Montiel, Rafael; Jørgensen, Tina; Odegaard, Nancy; Jacobs, Michael; Arriaza, Bernardo; Higham, Thomas F. G.; Ramsey, Christopher Bronk; Willerslev, Eske; Gilbert, M. Thomas P.

    2013-01-01

    The characterization of biomolecules from ancient samples can shed otherwise unobtainable insights into the past. Despite the fundamental role of transcriptomal change in evolution, the potential of ancient RNA remains unexploited – perhaps due to dogma associated with the fragility of RNA. We hypothesize that seeds offer a plausible refuge for long-term RNA survival, due to the fundamental role of RNA during seed germination. Using RNA-Seq on cDNA synthesized from nucleic acid extracts, we validate this hypothesis through demonstration of partial transcriptomal recovery from two sources of ancient maize kernels. The results suggest that ancient seed transcriptomics may offer a powerful new tool with which to study plant domestication. PMID:23326310

  5. Enzymatic aminoacylation of sequence-specific RNA minihelices and hybrid duplexes with methionine.

    PubMed Central

    Martinis, S A; Schimmel, P

    1992-01-01

    RNA hairpin helices whose sequences are based on the acceptor stems of alanine and histidine tRNAs are specifically aminoacylated with their cognate amino acids. In these examples, major determinants for the identities of the respective tRNAs reside in the acceptor stem; the anticodon and other parts of the tRNA are dispensable for aminoacylation. In contrast, the anticodon is a major determinant for the identity of a methionine tRNA. RNA hairpin helices and hybrid duplexes that reconstruct the acceptor-TψC stem and the acceptor stem, respectively, of methionine tRNA were investigated here for aminoacylation with methionine. Direct visualization of the aminoacylated RNA product on an acidic polyacrylamide gel by phosphor imaging demonstrated specific aminoacylation with substrates that contained as few as 7 base pairs. No aminoacylation with methionine was detected with several analogous RNA substrates whose sequences were based on noncognate tRNAs. While the efficiency of aminoacylation is reduced by orders of magnitude relative to methionine tRNA, the results establish that specific aminoacylation with methionine of small duplex substrates can be achieved without the anticodon or other domains of the tRNA. The results, combined with earlier studies, suggest a highly specific adaptation of the structures of aminoacyl-tRNA synthetases to the acceptor stems of their cognate tRNAs, resulting in a relationship between the nucleotide sequences/structures of small RNA duplexes and specific amino acids. PMID:1729719

  6. Combined DECS Analysis and Next-Generation Sequencing Enable Efficient Detection of Novel Plant RNA Viruses

    PubMed Central

    Yanagisawa, Hironobu; Tomita, Reiko; Katsu, Koji; Uehara, Takuya; Atsumi, Go; Tateda, Chika; Kobayashi, Kappei; Sekine, Ken-Taro

    2016-01-01

    The presence of high molecular weight double-stranded RNA (dsRNA) within plant cells is an indicator of infection with RNA viruses as these possess genomic or replicative dsRNA. DECS (dsRNA isolation, exhaustive amplification, cloning, and sequencing) analysis has been shown to be capable of detecting unknown viruses. We postulated that a combination of DECS analysis and next-generation sequencing (NGS) would improve detection efficiency and usability of the technique. Here, we describe a model case in which we efficiently detected the presumed genome sequence of Blueberry shoestring virus (BSSV), a member of the genus Sobemovirus, which has not so far been reported. dsRNAs were isolated from BSSV-infected blueberry plants using the dsRNA-binding protein, reverse-transcribed, amplified, and sequenced using NGS. A contig of 4,020 nucleotides (nt) that shared similarities with sequences from other Sobemovirus species was obtained as a candidate of the BSSV genomic sequence. Reverse transcription (RT)-PCR primer sets based on sequences from this contig enabled the detection of BSSV in all BSSV-infected plants tested but not in healthy controls. A recombinant protein encoded by the putative coat protein gene was bound by the BSSV-antibody, indicating that the candidate sequence was that of BSSV itself. Our results suggest that a combination of DECS analysis and NGS, designated here as “DECS-C,” is a powerful method for detecting novel plant viruses. PMID:27072419

  7. Combined DECS Analysis and Next-Generation Sequencing Enable Efficient Detection of Novel Plant RNA Viruses.

    PubMed

    Yanagisawa, Hironobu; Tomita, Reiko; Katsu, Koji; Uehara, Takuya; Atsumi, Go; Tateda, Chika; Kobayashi, Kappei; Sekine, Ken-Taro

    2016-03-01

    The presence of high molecular weight double-stranded RNA (dsRNA) within plant cells is an indicator of infection with RNA viruses as these possess genomic or replicative dsRNA. DECS (dsRNA isolation, exhaustive amplification, cloning, and sequencing) analysis has been shown to be capable of detecting unknown viruses. We postulated that a combination of DECS analysis and next-generation sequencing (NGS) would improve detection efficiency and usability of the technique. Here, we describe a model case in which we efficiently detected the presumed genome sequence of Blueberry shoestring virus (BSSV), a member of the genus Sobemovirus, which has not so far been reported. dsRNAs were isolated from BSSV-infected blueberry plants using the dsRNA-binding protein, reverse-transcribed, amplified, and sequenced using NGS. A contig of 4,020 nucleotides (nt) that shared similarities with sequences from other Sobemovirus species was obtained as a candidate of the BSSV genomic sequence. Reverse transcription (RT)-PCR primer sets based on sequences from this contig enabled the detection of BSSV in all BSSV-infected plants tested but not in healthy controls. A recombinant protein encoded by the putative coat protein gene was bound by the BSSV-antibody, indicating that the candidate sequence was that of BSSV itself. Our results suggest that a combination of DECS analysis and NGS, designated here as "DECS-C," is a powerful method for detecting novel plant viruses. PMID:27072419

  8. Sequence and functional characterization of RNase P RNA from the chl a/b containing cyanobacterium Prochlorothrix hollandica.

    PubMed

    Fingerhut, C; Schön, A

    1998-05-29

    Only a few complete sequences and very limited functional data are available for the catalytic RNA component of cyanobacterial RNase P. The RNase P RNA from the chl a/b containing cyanobacterium Prochlorothrix hollandica belongs to a rarely found structural subtype with an extended P15/16 domain. We have established conditions for optimal in vitro ribozyme activity, and determined the kinetic parameters for cleavage of pre-tRNA(Tyr). Analysis of pre-tRNA mutants revealed that the T-stem sequence only plays a modulating role, whereas the CCA end is essential for efficient product formation. PMID:9654127

  9. Predicting RNA secondary structures from sequence and probing data.

    PubMed

    Lorenz, Ronny; Wolfinger, Michael T; Tanzer, Andrea; Hofacker, Ivo L

    2016-07-01

    RNA secondary structures have proven essential for understanding the regulatory functions performed by RNA such as microRNAs, bacterial small RNAs, or riboswitches. This success is in part due to the availability of efficient computational methods for predicting RNA secondary structures. Recent advances focus on dealing with the inherent uncertainty of prediction by considering the ensemble of possible structures rather than the single most stable one. Moreover, the advent of high-throughput structural probing has spurred the development of computational methods that incorporate such experimental data as auxiliary information. PMID:27064083
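
    For readers new to the topic, the sketch below shows the simplest classic form of secondary structure prediction: Nussinov-style dynamic programming that maximizes the number of base pairs. This is a deliberately simpler technique than the thermodynamic and ensemble-based methods the review describes, and it uses no probing data; the function name and the `min_loop` default are choices made for this example.

```python
# Allowed pairs: Watson-Crick plus the G-U wobble.
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}


def max_base_pairs(seq: str, min_loop: int = 3) -> int:
    """Maximum number of nested base pairs in `seq` (Nussinov-style DP).

    `min_loop` is the minimum number of unpaired bases enclosed by any pair.
    """
    seq = seq.upper().replace("T", "U")
    n = len(seq)
    if n == 0:
        return 0
    # dp[i][j] holds the best pair count for the subsequence seq[i..j].
    dp = [[0] * n for _ in range(n)]
    for span in range(min_loop + 1, n):
        for i in range(n - span):
            j = i + span
            best = dp[i][j - 1]  # case 1: base j stays unpaired
            for k in range(i, j - min_loop):  # case 2: base j pairs with k
                if (seq[k], seq[j]) in PAIRS:
                    left = dp[i][k - 1] if k > i else 0
                    best = max(best, left + 1 + dp[k + 1][j - 1])
            dp[i][j] = best
    return dp[0][n - 1]


print(max_base_pairs("GGGAAAUCC"))  # small hairpin-like toy sequence
```

    Counting pairs ignores stacking energies and loop penalties, which is why practical tools score structures by free energy instead and can report an ensemble rather than a single optimum.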

  10. The nucleotide sequence of the large ribosomal RNA gene and the adjacent tRNA genes from rat mitochondria.

    PubMed Central

    Saccone, C; Cantatore, P; Gadaleta, G; Gallerani, R; Lanave, C; Pepe, G; Kroon, A M

    1981-01-01

    We have sequenced the EcoRI fragment D from rat mitochondrial DNA. It contains one third of the tRNA(Val) gene (the remaining part has been sequenced from the 3' end of the EcoRI fragment A), the complete gene for the large mt 16S rRNA, the tRNA(Leu) gene and the 5' end of an unidentified reading frame. The mt gene for the large rRNA from rat has been aligned with the homologous genes from mouse and human using graphic computer programs. Hypervariable regions at the center of the molecule and highly conserved regions toward the 3' end have been detected. The mt gene for tRNA(Leu) is of the conventional type and its primary structure is highly conserved among mammals. The mt gene for tRNA(Val) shows characteristics similar to those of other mt tRNA genes but the degree of homology is lower. Comparative studies confirm that AGA and AGG are read as stop codons in mammalian mitochondria. PMID:6913863
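
    The practical effect of reading AGA and AGG as stop codons can be seen by scanning the same reading frame under two stop-codon sets. This is a minimal sketch: the function name and the example sequence are hypothetical, and the mitochondrial set follows the conventional vertebrate mitochondrial code, in which UGA encodes tryptophan rather than a stop.

```python
STANDARD_STOPS = {"UAA", "UAG", "UGA"}
# Vertebrate mitochondrial convention: AGA/AGG terminate, UGA does not.
VERTEBRATE_MITO_STOPS = {"UAA", "UAG", "AGA", "AGG"}


def first_stop(rna: str, stops: set[str]):
    """Return the 0-based index of the first in-frame stop codon, or None."""
    rna = rna.upper().replace("T", "U")
    for i in range(0, len(rna) - 2, 3):
        if rna[i:i + 3] in stops:
            return i
    return None


# Made-up reading frame: AUG GCU UGA AAA AGA UAA
orf = "AUGGCUUGAAAAAGAUAA"
print("standard code stops at index", first_stop(orf, STANDARD_STOPS))
print("mitochondrial code stops at index", first_stop(orf, VERTEBRATE_MITO_STOPS))
```

    Under the standard code the frame ends at UGA, while under the mitochondrial set translation reads through UGA and ends at AGA, so applying the wrong table misplaces the end of a mitochondrial reading frame.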

  11. RNA sequencing of Sleeping Beauty transposon-induced tumors detects transposon-RNA fusions in forward genetic cancer screens.

    PubMed

    Temiz, Nuri A; Moriarity, Branden S; Wolf, Natalie K; Riordan, Jesse D; Dupuy, Adam J; Largaespada, David A; Sarver, Aaron L

    2016-01-01

    Forward genetic screens using Sleeping Beauty (SB)-mobilized T2/Onc transposons have been used to identify common insertion sites (CISs) associated with tumor formation. Recurrent sites of transposon insertion are commonly identified using ligation-mediated PCR (LM-PCR). Here, we use RNA sequencing (RNA-seq) data to directly identify transcriptional events mediated by T2/Onc. Surprisingly, the majority (∼80%) of LM-PCR identified junction fragments do not lead to observable changes in RNA transcripts. However, in CIS regions, direct transcriptional effects of transposon insertions are observed. We developed an automated method to systematically identify T2/Onc-genome RNA fusion sequences in RNA-seq data. RNA fusion-based CISs were identified corresponding to both DNA-based CISs (Cdkn2a, Mycl1, Nf2, Pten, Sema6d, and Rere) and additional regions strongly associated with cancer that were not observed by LM-PCR (Myc, Akt1, Pth, Csf1r, Fgfr2, Wisp1, Map3k5, and Map4k3). In addition to calculating recurrent CISs, we also present complementary methods to identify potential driver events via determination of strongly supported fusions and fusions with large transcript level changes in the absence of multitumor recurrence. These methods independently identify CIS regions and also point to cancer-associated genes like Braf. We anticipate RNA-seq analyses of tumors from forward genetic screens will become an efficient tool to identify causal events. PMID:26553456

  12. RNA sequencing of Sleeping Beauty transposon-induced tumors detects transposon-RNA fusions in forward genetic cancer screens

    PubMed Central

    Temiz, Nuri A.; Moriarity, Branden S.; Wolf, Natalie K.; Riordan, Jesse D.; Dupuy, Adam J.; Largaespada, David A.; Sarver, Aaron L.

    2016-01-01

    Forward genetic screens using Sleeping Beauty (SB)-mobilized T2/Onc transposons have been used to identify common insertion sites (CISs) associated with tumor formation. Recurrent sites of transposon insertion are commonly identified using ligation-mediated PCR (LM-PCR). Here, we use RNA sequencing (RNA-seq) data to directly identify transcriptional events mediated by T2/Onc. Surprisingly, the majority (∼80%) of LM-PCR identified junction fragments do not lead to observable changes in RNA transcripts. However, in CIS regions, direct transcriptional effects of transposon insertions are observed. We developed an automated method to systematically identify T2/Onc-genome RNA fusion sequences in RNA-seq data. RNA fusion-based CISs were identified corresponding to both DNA-based CISs (Cdkn2a, Mycl1, Nf2, Pten, Sema6d, and Rere) and additional regions strongly associated with cancer that were not observed by LM-PCR (Myc, Akt1, Pth, Csf1r, Fgfr2, Wisp1, Map3k5, and Map4k3). In addition to calculating recurrent CISs, we also present complementary methods to identify potential driver events via determination of strongly supported fusions and fusions with large transcript level changes in the absence of multitumor recurrence. These methods independently identify CIS regions and also point to cancer-associated genes like Braf. We anticipate RNA-seq analyses of tumors from forward genetic screens will become an efficient tool to identify causal events. PMID:26553456

  13. miRBase: integrating microRNA annotation and deep-sequencing data.

    PubMed

    Kozomara, Ana; Griffiths-Jones, Sam

    2011-01-01

    miRBase is the primary online repository for all microRNA sequences and annotation. The current release (miRBase 16) contains over 15,000 microRNA gene loci in over 140 species, and over 17,000 distinct mature microRNA sequences. Deep-sequencing technologies have delivered a sharp rise in the rate of novel microRNA discovery. We have mapped reads from short RNA deep-sequencing experiments to microRNAs in miRBase and developed web interfaces to view these mappings. The user can view all read data associated with a given microRNA annotation, filter reads by experiment and count, and search for microRNAs by tissue- and stage-specific expression. These data can be used as a proxy for relative expression levels of microRNA sequences, provide detailed evidence for microRNA annotations and alternative isoforms of mature microRNAs, and allow us to revisit previous annotations. miRBase is available online at: http://www.mirbase.org/. PMID:21037258

  14. Identification of two proteins that bind to a pyrimidine-rich sequence in the 3'-untranslated region of GAP-43 mRNA.

    PubMed Central

    Irwin, N; Baekelandt, V; Goritchenko, L; Benowitz, L I

    1997-01-01

    GAP-43 is a membrane phosphoprotein that is important for the development and plasticity of neural connections. In undifferentiated PC12 pheochromocytoma cells, GAP-43 mRNA degrades rapidly (t1/2 = 5 h), but becomes stable when cells are treated with nerve growth factor. To identify trans-acting factors that may influence mRNA stability, we combined column chromatography and gel mobility shift assays to isolate GAP-43 mRNA binding proteins from neonatal bovine brain tissue. This resulted in the isolation of two proteins that bind specifically and competitively to a pyrimidine-rich sequence in the 3'-untranslated region of GAP-43 mRNA. Partial amino acid sequencing revealed that one of the RNA binding proteins coincides with FBP (far upstream element binding protein), previously characterized as a protein that resembles hnRNP K and which binds to a single-stranded, pyrimidine-rich DNA sequence upstream of the c-myc gene to activate its expression. The other binding protein shares sequence homology with PTB, a polypyrimidine tract binding protein implicated in RNA splicing and regulation of translation initiation. The two proteins bind to a 26 nt pyrimidine-rich sequence lying 300 nt downstream of the end of the coding region, in an area shown by others to confer instability on a reporter mRNA in transient transfection assays. We therefore propose that FBP and the PTB-like protein may compete for binding at the same site to influence the stability of GAP-43 mRNA. PMID:9092640

  15. Deletion analysis of the 5' untranslated leader sequence of tobacco mosaic virus RNA.

    PubMed

    Takamatsu, N; Watanabe, Y; Iwasaki, T; Shiba, T; Meshi, T; Okada, Y

    1991-03-01

    To determine the sequences essential for viral multiplication in the 5' untranslated leader sequence of tobacco mosaic virus RNA, mutant TMV-L (a tomato strain) RNAs which carry several deletions in this 71-nucleotide sequence were constructed by an in vitro transcription system and their multiplication was analyzed by introducing mutant RNA into tobacco protoplasts by electroporation. Large deletions of the sequence from nucleotides 9 to 47 or 25 to 71 abolished viral multiplication; when about 10-nucleotide deletions were introduced throughout this 5' leader sequence, only deletion of the sequence from nucleotides 2 to 8 abolished detectable viral multiplication. This mutant RNA, however, directed the synthesis of the 130,000-molecular-weight protein in a rabbit reticulocyte lysate in vitro translation system, and consequently this 5'-proximal portion appears likely to be essential for replication. PMID:1995954

  16. Use of S1 nuclease in deep sequencing for detection of double-stranded RNA viruses.

    PubMed

    Shimada, Saya; Nagai, Makoto; Moriyama, Hiromitsu; Fukuhara, Toshiyuki; Koyama, Satoshi; Omatsu, Tsutomu; Furuya, Tetsuya; Shirai, Junsuke; Mizutani, Tetsuya

    2015-09-01

    Metagenomic approaches using next-generation DNA sequencing have facilitated the detection of many pathogenic viruses from fecal samples. However, in many cases, the majority of the detected sequences originate from the host genome and bacterial flora in the gut. Here, to improve the efficiency of detecting double-stranded (ds) RNA viruses in samples, we evaluated the applicability of S1 nuclease to deep sequencing. Treating total RNA with S1 nuclease resulted in 1.5- to 28.4-fold and 10.1- to 208.9-fold increases in sequence reads of group A rotavirus in fecal and viral culture samples, respectively. Moreover, the increased coverage of mapping to reference sequences allowed for sufficient genotyping using analytical software. These results suggest that library construction using S1 nuclease is useful for deep sequencing in the detection of dsRNA viruses. PMID:25843154

  17. Sequence variation between 462 human individuals fine-tunes functional sites of RNA processing.

    PubMed

    Ferreira, Pedro G; Oti, Martin; Barann, Matthias; Wieland, Thomas; Ezquina, Suzana; Friedländer, Marc R; Rivas, Manuel A; Esteve-Codina, Anna; Rosenstiel, Philip; Strom, Tim M; Lappalainen, Tuuli; Guigó, Roderic; Sammeth, Michael

    2016-01-01

    Recent advances in the cost-efficiency of sequencing technologies enabled the combined DNA- and RNA-sequencing of human individuals at the population-scale, making genome-wide investigations of the inter-individual genetic impact on gene expression viable. Employing mRNA-sequencing data from the Geuvadis Project and genome sequencing data from the 1000 Genomes Project we show that the computational analysis of DNA sequences around splice sites and poly-A signals is able to explain several observations in the phenotype data. In contrast to widespread assessments of statistically significant associations between DNA polymorphisms and quantitative traits, we developed a computational tool to pinpoint the molecular mechanisms by which genetic markers drive variation in RNA-processing, cataloguing and classifying alleles that change the affinity of core RNA elements to their recognizing factors. The in silico models we employ further suggest RNA editing can moonlight as a splicing-modulator, albeit less frequently than genomic sequence diversity. Beyond existing annotations, we demonstrate that the ultra-high resolution of RNA-Seq combined from 462 individuals also provides evidence for thousands of bona fide novel elements of RNA processing (alternative splice sites, introns, and cleavage sites), which are often rare and lowly expressed but in other characteristics similar to their annotated counterparts. PMID:27617755

  18. RNA editing generates cellular subsets with diverse sequence within populations

    PubMed Central

    Harjanto, Dewi; Papamarkou, Theodore; Oates, Chris J.; Rayon-Estrada, Violeta; Papavasiliou, F. Nina; Papavasiliou, Anastasia

    2016-01-01

    RNA editing is a mutational mechanism that specifically alters the nucleotide content in transcribed RNA. However, editing rates vary widely, and could result from equivalent editing amongst individual cells, or represent an average of variable editing within a population. Here we present a hierarchical Bayesian model that quantifies the variance of editing rates at specific sites using RNA-seq data from both single cells, and a cognate bulk sample to distinguish between these two possibilities. The model predicts high variance for specific edited sites in murine macrophages and dendritic cells, findings that we validated experimentally by using targeted amplification of specific editable transcripts from single cells. The model also predicts changes in variance in editing rates for specific sites in dendritic cells during the course of LPS stimulation. Our data demonstrate substantial variance in editing signatures amongst single cells, supporting the notion that RNA editing generates diversity within cellular populations. PMID:27418407
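
    The two possibilities contrasted in this abstract (equivalent editing in every cell versus a population average of variable editing) can be illustrated with a small sketch. This is not the hierarchical Bayesian model the authors describe; it swaps in a plain per-cell editing fraction and its population variance, and the read counts are invented purely for illustration.

```python
from statistics import mean, pvariance


def editing_rate(edited_reads, total_reads):
    """Fraction of reads at a site that carry the edited base."""
    if total_reads <= 0:
        raise ValueError("site has no coverage")
    return edited_reads / total_reads


def summarize(cells):
    """Return (mean rate, variance of rate) over (edited, total) pairs."""
    rates = [editing_rate(edited, total) for edited, total in cells]
    return mean(rates), pvariance(rates)


# Hypothetical counts: every cell edits about half of its transcripts.
uniform_cells = [(5, 10), (5, 10), (5, 10), (5, 10)]
# Hypothetical counts: half the cells edit everything, half edit nothing.
variable_cells = [(0, 10), (10, 10), (0, 10), (10, 10)]

# Both populations look identical in bulk (mean 0.5) but differ in variance.
print(summarize(uniform_cells))
print(summarize(variable_cells))
```

    A raw fraction ignores sampling noise at low coverage, which is one reason a model-based estimate of variance, as in the abstract, is preferable for real single-cell data.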

  19. RNA editing generates cellular subsets with diverse sequence within populations.

    PubMed

    Harjanto, Dewi; Papamarkou, Theodore; Oates, Chris J; Rayon-Estrada, Violeta; Papavasiliou, F Nina; Papavasiliou, Anastasia

    2016-01-01

    RNA editing is a mutational mechanism that specifically alters the nucleotide content in transcribed RNA. However, editing rates vary widely, and could result from equivalent editing amongst individual cells, or represent an average of variable editing within a population. Here we present a hierarchical Bayesian model that quantifies the variance of editing rates at specific sites using RNA-seq data from both single cells, and a cognate bulk sample to distinguish between these two possibilities. The model predicts high variance for specific edited sites in murine macrophages and dendritic cells, findings that we validated experimentally by using targeted amplification of specific editable transcripts from single cells. The model also predicts changes in variance in editing rates for specific sites in dendritic cells during the course of LPS stimulation. Our data demonstrate substantial variance in editing signatures amongst single cells, supporting the notion that RNA editing generates diversity within cellular populations. PMID:27418407

  20. Analysis of a marine picoplankton community by 16S rRNA gene cloning and sequencing.

    PubMed Central

    Schmidt, T M; DeLong, E F; Pace, N R

    1991-01-01

    The phylogenetic diversity of an oligotrophic marine picoplankton community was examined by analyzing the sequences of cloned ribosomal genes. This strategy does not rely on cultivation of the resident microorganisms. Bulk genomic DNA was isolated from picoplankton collected in the north central Pacific Ocean by tangential flow filtration. The mixed-population DNA was fragmented, size fractionated, and cloned into bacteriophage lambda. Thirty-eight clones containing 16S rRNA genes were identified in a screen of 3.2 × 10^4 recombinant phage, and portions of the rRNA gene were amplified by polymerase chain reaction and sequenced. The resulting sequences were used to establish the identities of the picoplankton by comparison with an established data base of rRNA sequences. Fifteen unique eubacterial sequences were obtained, including four from cyanobacteria and eleven from proteobacteria. A single eucaryote related to dinoflagellates was identified; no archaebacterial sequences were detected. The cyanobacterial sequences are all closely related to sequences from cultivated marine Synechococcus strains and with cyanobacterial sequences obtained from the Atlantic Ocean (Sargasso Sea). Several sequences were related to common marine isolates of the gamma subdivision of proteobacteria. In addition to sequences closely related to those of described bacteria, sequences were obtained from two phylogenetic groups of organisms that are not closely related to any known rRNA sequences from cultivated organisms. Both of these novel phylogenetic clusters are proteobacteria, one group within the alpha subdivision and the other distinct from known proteobacterial subdivisions. The rRNA sequences of the alpha-related group are nearly identical to those of some Sargasso Sea picoplankton, suggesting a global distribution of these organisms. PMID:2066334

  1. Informatics for RNA Sequencing: A Web Resource for Analysis on the Cloud.

    PubMed

    Griffith, Malachi; Walker, Jason R; Spies, Nicholas C; Ainscough, Benjamin J; Griffith, Obi L

    2015-08-01

    Massively parallel RNA sequencing (RNA-seq) has rapidly become the assay of choice for interrogating RNA transcript abundance and diversity. This article provides a detailed introduction to fundamental RNA-seq molecular biology and informatics concepts. We make available open-access RNA-seq tutorials that cover cloud computing, tool installation, relevant file formats, reference genomes, transcriptome annotations, quality-control strategies, expression, differential expression, and alternative splicing analysis methods. These tutorials and additional training resources are accompanied by complete analysis pipelines and test datasets made available without encumbrance at www.rnaseq.wiki. PMID:26248053

  2. Informatics for RNA Sequencing: A Web Resource for Analysis on the Cloud

    PubMed Central

    Griffith, Malachi; Walker, Jason R.; Spies, Nicholas C.; Ainscough, Benjamin J.; Griffith, Obi L.

    2015-01-01

    Massively parallel RNA sequencing (RNA-seq) has rapidly become the assay of choice for interrogating RNA transcript abundance and diversity. This article provides a detailed introduction to fundamental RNA-seq molecular biology and informatics concepts. We make available open-access RNA-seq tutorials that cover cloud computing, tool installation, relevant file formats, reference genomes, transcriptome annotations, quality-control strategies, expression, differential expression, and alternative splicing analysis methods. These tutorials and additional training resources are accompanied by complete analysis pipelines and test datasets made available without encumbrance at www.rnaseq.wiki. PMID:26248053

  3. Phylogenetic analysis of oryx species using partial sequences of mitochondrial rRNA genes.

    PubMed

    Khan, H A; Arif, I A; Al Farhan, A H; Al Homaidan, A A

    2008-01-01

    We conducted a comparative evaluation of 12S rRNA and 16S rRNA genes of the mitochondrial genome for molecular differentiation among three oryx species (Oryx leucoryx, Oryx dammah and Oryx gazella) with respect to two closely related outgroups, addax and roan. Our findings showed the failure of 12S rRNA gene to differentiate between the genus Oryx and addax, whereas a 342-bp partial sequence of 16S rRNA accurately grouped all five taxa studied, suggesting the utility of 16S rRNA segment for molecular phylogeny of oryx at the genus and possibly species levels. PMID:19048493

  4. Chromosomal localization and sequence variation of 5S rRNA gene in five Capsicum species.

    PubMed

    Park, Y K; Park, K C; Park, C H; Kim, N S

    2000-02-29

    Chromosomal localization and sequence analysis of the 5S rRNA gene were carried out in five Capsicum species. Fluorescence in situ hybridization revealed that chromosomal location of the 5S rRNA gene was conserved in a single locus at a chromosome which was assigned to chromosome 1 by the synteny relationship with tomato. In sequence analysis, the repeating units of the 5S rRNA genes in the Capsicum species were variable in size from 278 bp to 300 bp. In sequence comparison of our results to the results with other Solanaceae plants as published by others, the coding region was highly conserved, but the spacer regions varied in size and sequence. T stretch regions, just after the end of the coding sequences, were more prominent in the Capsicum species than in two other plants. Highly G·C-rich regions, which might have similar functions as that of the GC islands in the genes transcribed by RNA PolII, were observed after the T stretch region. Although we could not observe the TATA like sequences, an AT rich segment at -27 to -18 was detected in the 5S rRNA genes of the Capsicum species. Species relationship among the Capsicum species was also studied by the sequence comparison of the 5S rRNA genes. While C. chinense, C. frutescens, and C. annuum formed one lineage, C. baccatum was revealed to be an intermediate species between the former three species and C. pubescens. PMID:10774742

  5. New perspectives on the diversification of the RNA interference system: insights from comparative genomics and small RNA sequencing

    PubMed Central

    Burroughs, Alexander Maxwell; Ando, Yoshinari; Aravind, L

    2014-01-01

    Our understanding of the pervasive involvement of small RNAs in regulating diverse biological processes has been greatly augmented by recent application of deep-sequencing technologies to small RNA across diverse eukaryotes. We review the currently-known small RNA classes and place them in context of the reconstructed evolutionary history of the RNAi protein machinery. This synthesis indicates the earliest versions of eukaryotic RNAi systems likely utilized small RNA processed from three types of precursors: 1) sense-antisense transcriptional products, 2) genome-encoded, imperfectly-complementary hairpin sequences, and 3) larger non-coding RNA precursor sequences. Structural dissection of PIWI proteins along with recent discovery of novel families (including Med13 of the Mediator complex) suggest that emergence of a distinct architecture with the N-terminal domains (also occurring separately fused to endoDNases in prokaryotes) formed via duplication of an ancestral unit was key to their recruitment as primary RNAi effectors and use of small RNAs of certain preferred lengths. Prokaryotic PIWI proteins are typically components of several RNA-directed DNA restriction or CRISPR/Cas systems. However, eukaryotic versions appear to have emerged from a subset that evolved RNA-directed RNA interference. They were recruited alongside RNaseIII domains and RdRP domains, also from prokaryotic systems, to form the core eukaryotic RNAi system. Like certain regulatory systems, RNAi diversified into two distinct but linked arms concomitant with eukaryotic nucleo-cytoplasmic compartmentalization. Subsequent elaboration of RNAi proceeded via diversification of the core protein machinery through lineage-specific expansions and recruitment of new components from prokaryotes (nucleases and small RNA-modifying enzymes), allowing for diversification of associating small RNAs. PMID:24311560

  6. Sequence analysis of the 3' non-coding region of mouse immunoglobulin light chain messenger RNA.

    PubMed Central

    Hamlyn, P H; Gillam, S; Smith, M; Milstein, C

    1977-01-01

    Using an oligonucleotide d(pT10-C-A) as primer, cDNA has been transcribed from the 3' non-coding region of mouse immunoglobulin light chain mRNA and sequenced by a modification (1) of the 'plus-minus' gel method (2). The sequence obtained has partially corrected and extended a previously obtained sequence (3). The new data contains an unusual sequence in which a trinucleotide is repeated seven times. PMID:405661

  7. Fluorescent in situ sequencing (FISSEQ) of RNA for gene expression profiling in intact cells and tissues

    PubMed Central

    Lee, Je Hyuk; Daugharthy, Evan R.; Scheiman, Jonathan; Kalhor, Reza; Ferrante, Thomas C.; Terry, Richard; Turczyk, Brian M.; Yang, Joyce L.; Lee, Ho Suk; Aach, John; Zhang, Kun; Church, George M.

    2014-01-01

    RNA sequencing measures the quantitative change in gene expression over the whole transcriptome, but it lacks spatial context. On the other hand, in situ hybridization provides the location of gene expression, but only for a small number of genes. Here we detail a protocol for genome-wide profiling of gene expression in situ in fixed cells and tissues, in which RNA is converted into cross-linked cDNA amplicons and sequenced manually on a confocal microscope. Unlike traditional RNA-seq, our method enriches for context-specific transcripts over house-keeping and/or structural RNA, and it preserves the tissue architecture for RNA localization studies. Our protocol is written for researchers experienced in cell microscopy with minimal computing skills. Library construction and sequencing can be completed within 14 d, with image analysis requiring an additional 2 d. PMID:25675209

  8. Empirical analysis of RNA robustness and evolution using high-throughput sequencing of ribozyme reactions.

    PubMed

    Hayden, Eric J

    2016-08-15

    RNA molecules provide a realistic but tractable model of a genotype to phenotype relationship. This relationship has been extensively investigated computationally using secondary structure prediction algorithms. Enzymatic RNA molecules, or ribozymes, offer access to genotypic and phenotypic information in the laboratory. Advancements in high-throughput sequencing technologies have enabled the analysis of sequences in the lab that now rivals what can be accomplished computationally. This has motivated a resurgence of in vitro selection experiments and opened new doors for the analysis of the distribution of RNA functions in genotype space. A body of computational experiments has investigated the persistence of specific RNA structures despite changes in the primary sequence, and how this mutational robustness can promote adaptations. This article summarizes recent approaches that were designed to investigate the role of mutational robustness during the evolution of RNA molecules in the laboratory, and presents theoretical motivations, experimental methods and approaches to data analysis. PMID:27215494

  9. RNA sequencing using fluorescent-labeled dideoxynucleotides and automated fluorescence detection.

    PubMed Central

    Bauer, G J

    1990-01-01

    Although dideoxy-terminated sequencing of RNA, using reverse transcriptase and oligodeoxynucleotide primers, is now a well established method, the accuracy is limited by sequence ambiguities due to unspecific chain termination events. A protocol is described which circumvents these ambiguities by using fluorescence labels tagged to dideoxynucleotides. Only chain terminations caused by dideoxynucleotides were detected while prematurely terminated cDNAs remain undetectable. In addition, the remaining multiple signals at nucleotide positions can be assigned to sequence heterogeneities within the RNA sequence to be determined. PMID:1690393

  10. RNA sequence and transcriptional properties of the 3' end of the Newcastle disease virus genome

    SciTech Connect

    Kurilla, M.G.; Stone, H.O.; Keene, J.D.

    1985-09-01

    The 3' end of the genomic RNA of Newcastle disease virus (NDV) has been sequenced and the leader RNA defined. Using hybridization to a 3'-end-labeled genome, leader RNA species from in vitro transcription reactions and from infected cell extracts were found to be 47 and 53 nucleotides long. In addition, the start site of the 3'-proximal mRNA was determined by sequence analysis of in vitro (beta-32P)GTP-labeled transcription products. The genomic sequence extending beyond the leader region demonstrated an open reading frame for at least 42 amino acids and probably represents the amino terminus of the nucleocapsid protein (NP). The terminal 8 nucleotides of the NDV genome were identical to those of measles virus and Sendai virus while the sequence of the distal half of the leader region was more similar to that of vesicular stomatitis virus. These data argue for strong evolutionary relatedness between the paramyxovirus and rhabdovirus groups.

  11. Large-scale sequencing and the natural history of model human RNA viruses

    PubMed Central

    Dugan, Vivien G; Saira, Kazima; Ghedin, Elodie

    2012-01-01

    RNA virus exploration within the field of medical virology has greatly benefited from technological developments in genomics, deepening our understanding of viral dynamics and emergence. Large-scale first-generation technology sequencing projects have expedited molecular epidemiology studies at an unprecedented scale for two pathogenic RNA viruses chosen as models: influenza A virus and dengue. Next-generation sequencing approaches are now leading to a more in-depth analysis of virus genetic diversity, which is greater for RNA than DNA viruses because of high replication rates and the absence of proofreading activity of the RNA-dependent RNA polymerase. In the field of virus discovery, technological advancements and metagenomic approaches are expanding the catalogs of novel viruses by facilitating our probing into the RNA virus world. PMID:23682295

  12. Small RNA and RNA-IP Sequencing Identifies and Validates Novel MicroRNAs in Human Mesenchymal Stem Cells.

    PubMed

    Tsai, Chin-Han; Liao, Ko-Hsun; Shih, Chuan-Chi; Chan, Chia-Hao; Hsieh, Jui-Yu; Tsai, Cheng-Fong; Wang, Hsei-Wei; Chang, Shing-Jyh

    2016-03-01

    Organ regeneration therapies using multipotent mesenchymal stem cells (MSCs) are currently being investigated for a variety of common complex diseases. Understanding the molecular regulation of MSC biology will benefit regenerative medicine. MicroRNAs (miRNAs) act as regulators in MSC stemness. There are approximately 2500 currently known human miRNAs that have been recorded in the miRBase v21 database. In the present study, we identified novel microRNAs involved in MSC stemness and differentiation by obtaining the global microRNA expression profiles (miRNomes) of MSCs from two anatomical locations, bone marrow (BM-MSCs) and umbilical cord Wharton's jelly (WJ-MSCs), and from osteogenically and adipogenically differentiated progenies of BM-MSCs. Small RNA sequencing (smRNA-seq) and bioinformatics analyses predicted that 49 uncharacterized miRNA candidates had high cellular expression values in MSCs. Another independent batch of Ago1/2-based RNA immunoprecipitation (RNA-IP) sequencing datasets validated the existence of 40 unreported miRNAs in cells and their associations with the RNA-induced silencing complex (RISC). Nine of these 40 new miRNAs were universally overexpressed in both MSC types; nine others were overexpressed in differentiated cells. A novel miRNA (UNI-118-3p) was specifically expressed in BM-MSCs, as verified using RT-qPCR. Taken together, this report offers comprehensive miRNome profiles for two MSC types, as well as cells differentiated from BM-MSCs. MSC transplantation has the potential to ameliorate degenerative disorders and repair damaged tissues. Interventions involving the above 40 new microRNA members in transplanted MSCs may potentially guide future clinical applications. PMID:26910904

  13. The nucleotide sequence at the 5' end of foot and mouth disease virus RNA.

    PubMed Central

    Harris, T J

    1979-01-01

    Foot and mouth disease virus RNA has been treated with RNase H in the presence of oligo (dG) specifically to digest the poly(C) tract which lies near the 5' end of the molecule (10). The short (S) fragment containing the 5' end of the RNA was separated from the remainder of the RNA (L fragment) by gel electrophoresis. RNA ligase mediated labelling of the 3' end of S fragment showed that the RNase H digestion gave rise to molecules that differed only in the number of cytidylic acid residues remaining at their 3' ends and did not leave the unique 3' end necessary for fast sequence analysis. As the 5' end of S fragment prepared from virus RNA is blocked by VPg, S fragment was prepared from virus specific messenger RNA which does not contain this protein. This RNA was labelled at the 5' end using polynucleotide kinase and the sequence of 70 nucleotides at the 5' end determined by partial enzyme digestion sequencing on polyacrylamide gels. Some of this sequence was confirmed from an analysis of the oligonucleotides derived by RNase T1 digestion of S fragment. The sequence obtained indicates that there is a stable hairpin loop at the 5' terminus of the RNA before an initiation codon 33 nucleotides from the 5' end. In addition, the RNase T1 analysis suggests that there are short repeated sequences in S fragment and that an eleven nucleotide inverted complementary repeat of a sequence near the 3' end of the RNA is present at the junction of S fragment and the poly(C) tract. PMID:231762

  14. Complete Sequence Construction of the Highly Repetitive Ribosomal RNA Gene Repeats in Eukaryotes Using Whole Genome Sequence Data.

    PubMed

    Agrawal, Saumya; Ganley, Austen R D

    2016-01-01

    The ribosomal RNA genes (rDNA) encode the major rRNA species of the ribosome, and thus are essential across life. These genes are highly repetitive in most eukaryotes, forming blocks of tandem repeats that form the core of nucleoli. The primary role of the rDNA in encoding rRNA has been long understood, but more recently the rDNA has been implicated in a number of other important biological phenomena, including genome stability, cell cycle, and epigenetic silencing. Noncoding elements, primarily located in the intergenic spacer region, appear to mediate many of these phenomena. Although sequence information is available for the genomes of many organisms, in almost all cases rDNA repeat sequences are lacking, primarily due to problems in assembling these intriguing regions during whole genome assemblies. Here, we present a method to obtain complete rDNA repeat unit sequences from whole genome assemblies. Limitations of next generation sequencing (NGS) data make them unsuitable for assembling complete rDNA unit sequences; therefore, the method we present relies on the use of Sanger whole genome sequence data. Our method makes use of the Arachne assembler, which can assemble highly repetitive regions such as the rDNA in a memory-efficient way. We provide a detailed step-by-step protocol for generating rDNA sequences from whole genome Sanger sequence data using Arachne, for refining complete rDNA unit sequences, and for validating the sequences obtained. In principle, our method will work for any species where the rDNA is organized into tandem repeats. This will help researchers working on species without a complete rDNA sequence, those working on evolutionary aspects of the rDNA, and those interested in conducting phylogenetic footprinting studies with the rDNA. PMID:27576718

  15. Comprehensive analysis of human small RNA sequencing data provides insights into expression profiles and miRNA editing

    PubMed Central

    Gong, Jing; Wu, Yuliang; Zhang, Xiantong; Liao, Yifang; Sibanda, Vusumuzi Leroy; Liu, Wei; Guo, An-Yuan

    2014-01-01

    MicroRNAs (miRNAs) play key regulatory roles in various biological processes and diseases. A comprehensive analysis of large scale small RNA sequencing data (smRNA-seq) will be very helpful to explore tissue or disease specific miRNA markers and uncover miRNA variants. Here, we systematically analyzed 410 human smRNA-seq datasets, whose samples come from 24 tissue/disease/cell lines. We tested the mapping strategies and found that it was necessary to make multiple-round mappings with different mismatch parameters. miRNA expression profiles revealed that on average ∼70% of known miRNAs were expressed at a low level or not expressed (RPM < 1) in a sample and only ∼9% of known miRNAs were relatively highly expressed (RPM > 100). About 30% of known miRNAs were not expressed in all of the samples we used. The miRNA expression profiles were compiled into an online database (HMED, http://bioinfo.life.hust.edu.cn/smallRNA/). Dozens of tissue/disease specific miRNAs, disease/control dysregulated miRNAs and miRNAs with arm switching events were discovered. Further, we identified some highly confident editing sites including 24 A-to-I sites and 23 C-to-U sites. About half of them were widespread miRNA editing sites in different tissues. We found that the two types of editing sites differ with regard to location, editing level and frequency. Our analyses for expression profiles, specific miRNA markers, arm switching, and editing sites may provide valuable information for further studies of miRNA function and biomarker finding. PMID:25692236
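
    The RPM (reads per million) thresholds quoted in this abstract can be made concrete with a short sketch. It assumes RPM is computed against the total of the counted miRNA reads in one sample, which is one common convention and may differ from the normalization the authors used; the miRNA names, read counts, and the "intermediate" label for values between the two thresholds are hypothetical.

```python
def reads_per_million(counts):
    """Scale raw read counts so that they sum to one million per sample."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sample has no mapped reads")
    return {name: count * 1_000_000 / total for name, count in counts.items()}


def classify(rpm, low=1.0, high=100.0):
    """Bin an RPM value using the thresholds quoted in the abstract."""
    if rpm < low:
        return "low_or_absent"   # RPM < 1
    if rpm > high:
        return "high"            # RPM > 100
    return "intermediate"        # label assumed for this sketch


# Hypothetical raw counts for three miRNAs in a single sample.
sample_counts = {"miR-a": 1_999_800, "miR-b": 199, "miR-c": 1}

rpm = reads_per_million(sample_counts)
for name, value in rpm.items():
    print(name, value, classify(value))
```

    Because a single dominant miRNA can absorb most of the reads in a small RNA library, the bin a given miRNA falls into depends strongly on the chosen denominator.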

  16. Quantitative Assessment of RNA-Protein Interactions with High Throughput Sequencing - RNA Affinity Profiling (HiTS-RAP)

    PubMed Central

    Ozer, Abdullah; Tome, Jacob M.; Friedman, Robin C.; Gheba, Dan; Schroth, Gary P.; Lis, John T.

    2016-01-01

    Because RNA-protein interactions play a central role in a wide array of biological processes, methods that enable a quantitative assessment of these interactions in a high-throughput manner are in great demand. Recently, we developed the High Throughput Sequencing-RNA Affinity Profiling (HiTS-RAP) assay, which couples sequencing on an Illumina GAIIx with the quantitative assessment of one or several proteins’ interactions with millions of different RNAs in a single experiment. We have successfully used HiTS-RAP to analyze interactions of EGFP and NELF-E proteins with their corresponding canonical and mutant RNA aptamers. Here, we provide a detailed protocol for HiTS-RAP, which can be completed in about a month (8 days hands-on time) including the preparation and testing of recombinant proteins and DNA templates, clustering DNA templates on a flowcell, high-throughput sequencing and protein binding with GAIIx, and finally data analysis. We also highlight aspects of HiTS-RAP that can be further improved and points of comparison between HiTS-RAP and two other recently developed methods, RNA-MaP and RBNS. A successful HiTS-RAP experiment provides the sequence and binding curves for approximately 200 million RNAs in a single experiment. PMID:26182240
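
    The abstract does not say how the binding curves are modeled. As a rough illustration of what a binding curve is, the following Python sketch assumes a simple single-site equilibrium model, in which the fraction of RNA bound equals [P] / ([P] + Kd). The function name, concentrations, and Kd value are hypothetical and are not taken from the HiTS-RAP protocol.

```python
def fraction_bound(protein_conc, kd):
    """Fraction of RNA bound under an assumed single-site equilibrium model.

    protein_conc and kd must be in the same concentration units.
    """
    if protein_conc < 0 or kd <= 0:
        raise ValueError("protein_conc must be >= 0 and kd must be > 0")
    return protein_conc / (protein_conc + kd)


# Hypothetical titration series (nM) and dissociation constant (nM).
titration_nm = [0.0, 1.0, 10.0, 100.0, 1000.0]
assumed_kd_nm = 10.0
binding_curve = [(c, fraction_bound(c, assumed_kd_nm)) for c in titration_nm]
```

    Under this assumed model, half of the RNA is bound when the protein concentration equals Kd, which is why Kd is often read off a curve as the half-saturation point.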

  17. Method for rapid base sequencing in DNA and RNA with two base labeling

    DOEpatents

    Jett, J.H.; Keller, R.A.; Martin, J.C.; Posner, R.G.; Marrone, B.L.; Hammond, M.L.; Simpson, D.J.

    1995-04-11

    A method is described for rapid-base sequencing in DNA and RNA with two-base labeling and employing fluorescent detection of single molecules at two wavelengths. Bases modified to accept fluorescent labels are used to replicate a single DNA or RNA strand to be sequenced. The bases are then sequentially cleaved from the replicated strand, excited with a chosen spectrum of electromagnetic radiation, and the fluorescence from individual, tagged bases detected in the order of cleavage from the strand. 4 figures.

  18. Method for rapid base sequencing in DNA and RNA with two base labeling

    DOEpatents

    Jett, James H.; Keller, Richard A.; Martin, John C.; Posner, Richard G.; Marrone, Babetta L.; Hammond, Mark L.; Simpson, Daniel J.

    1995-01-01

    Method for rapid-base sequencing in DNA and RNA with two-base labeling and employing fluorescent detection of single molecules at two wavelengths. Bases modified to accept fluorescent labels are used to replicate a single DNA or RNA strand to be sequenced. The bases are then sequentially cleaved from the replicated strand, excited with a chosen spectrum of electromagnetic radiation, and the fluorescence from individual, tagged bases detected in the order of cleavage from the strand.

  19. ARM-Seq: AlkB-facilitated RNA methylation sequencing reveals a complex landscape of modified tRNA fragments

    PubMed Central

    Cozen, Aaron E.; Quartley, Erin; Holmes, Andrew D.; Robinson, Eva H.; Phizicky, Eric M.; Lowe, Todd M.

    2015-01-01

    High throughput RNA sequencing has accelerated discovery of the complex regulatory roles of small RNAs, but RNAs containing modified nucleosides may escape detection when those modifications interfere with reverse transcription during RNA-seq library preparation. Here we describe AlkB-facilitated RNA Methylation sequencing (ARM-Seq) which uses pre-treatment with Escherichia coli AlkB to demethylate 1-methyladenosine, 3-methylcytidine, and 1-methylguanosine, all commonly found in transfer RNAs. Comparative methylation analysis using ARM-Seq provides the first detailed, transcriptome-scale map of these modifications, and reveals an abundance of previously undetected, methylated small RNAs derived from tRNAs. ARM-Seq demonstrates that tRNA-derived small RNAs accurately recapitulate the m1A modification state for well-characterized yeast tRNAs, and generates new predictions for a large number of human tRNAs, including tRNA precursors and mitochondrial tRNAs. Thus, ARM-Seq provides broad utility for identifying previously overlooked methyl-modified RNAs, can efficiently monitor methylation state, and may reveal new roles for tRNA-derived RNAs as biomarkers or signaling molecules. PMID:26237225

  20. tRNAfeature: An algorithm for tRNA features to identify tRNA genes in DNA sequences.

    PubMed

    Yang, Cheng-Hong; Lin, Yu-Da; Chuang, Li-Yeh

    2016-09-01

    The identification of transfer RNAs (tRNAs) is critical for a detailed understanding of the evolution of biological organisms and viruses. However, some tRNAs are difficult to recognize due to their unusual sub-structures and may result in the detection of the wrong anticodon. Therefore, the detection of unusual sub-structures of tRNA genes remains an important challenge. In this study, we propose a method to identify tRNA genes based on tRNA features. tRNAfeature attempts to refold the sequence with single-stranded regions longer than those found in the canonical and conventional structural models for tRNA. We predicted a set of 53926 archaeal, eubacterial and eukaryotic tRNA genes annotated in tRNADB-CE and scanned the tRNA genes in whole genome sequencing. The results indicate that tRNAfeature is more powerful than other existing methods for identifying tRNAs. PMID:27291467

  1. Accurate taxonomy assignments from 16S rRNA sequences produced by highly parallel pyrosequencers

    PubMed Central

    Liu, Zongzhi; DeSantis, Todd Z.; Andersen, Gary L.; Knight, Rob

    2008-01-01

    The recent introduction of massively parallel pyrosequencers allows rapid, inexpensive analysis of microbial community composition using 16S ribosomal RNA (rRNA) sequences. However, a major challenge is to design a workflow so that taxonomic information can be accurately and rapidly assigned to each read, so that the composition of each community can be linked back to likely ecological roles played by members of each species, genus, family or phylum. Here, we use three large 16S rRNA datasets to test whether taxonomic information based on the full-length sequences can be recaptured by short reads that simulate the pyrosequencer outputs. We find that different taxonomic assignment methods vary radically in their ability to recapture the taxonomic information in full-length 16S rRNA sequences: most methods are sensitive to the region of the 16S rRNA gene that is targeted for sequencing, but many combinations of methods and rRNA regions produce consistent and accurate results. To process large datasets of partial 16S rRNA sequences obtained from surveys of various microbial communities, including those from human body habitats, we recommend the use of Greengenes or RDP classifier with fragments of at least 250 bases, starting from one of the primers R357, R534, R798, F343 or F517. PMID:18723574

  2. Species Identification and Profiling of Complex Microbial Communities Using Shotgun Illumina Sequencing of 16S rRNA Amplicon Sequences

    PubMed Central

    Lay, Christophe; Ho, Eliza Xin Pei; Low, Louie; Hibberd, Martin Lloyd; Nagarajan, Niranjan

    2013-01-01

    The high throughput and cost-effectiveness afforded by short-read sequencing technologies, in principle, enable researchers to perform 16S rRNA profiling of complex microbial communities at unprecedented depth and resolution. Existing Illumina sequencing protocols are, however, limited by the fraction of the 16S rRNA gene that is interrogated and therefore limit the resolution and quality of the profiling. To address this, we present the design of a novel protocol for shotgun Illumina sequencing of the bacterial 16S rRNA gene, optimized to amplify more than 90% of sequences in the Greengenes database and with the ability to distinguish nearly twice as many species-level OTUs compared to existing protocols. Using several in silico and experimental datasets, we demonstrate that despite the presence of multiple variable and conserved regions, the resulting shotgun sequences can be used to accurately quantify the constituents of complex microbial communities. The reconstruction of a significant fraction of the 16S rRNA gene also enabled high precision (>90%) in species-level identification thereby opening up potential application of this approach for clinical microbial characterization. PMID:23579286

  3. Research Techniques Made Simple: Bacterial 16S Ribosomal RNA Gene Sequencing in Cutaneous Research.

    PubMed

    Jo, Jay-Hyun; Kennedy, Elizabeth A; Kong, Heidi H

    2016-03-01

    Skin serves as a protective barrier and also harbors numerous microorganisms collectively comprising the skin microbiome. As a result of recent advances in sequencing (next-generation sequencing), our understanding of microbial communities on skin has advanced substantially. In particular, the 16S ribosomal RNA gene sequencing technique has played an important role in efforts to identify the global communities of bacteria in healthy individuals and patients with various disorders in multiple topographical regions over the skin surface. Here, we describe basic principles, study design, and a workflow of 16S ribosomal RNA gene sequencing methodology, primarily for investigators who are not familiar with this approach. This article will also discuss some applications and challenges of 16S ribosomal RNA sequencing as well as directions for future development. PMID:26902128

  4. High-Throughput Mapping of Single-Neuron Projections by Sequencing of Barcoded RNA.

    PubMed

    Kebschull, Justus M; Garcia da Silva, Pedro; Reid, Ashlan P; Peikon, Ian D; Albeanu, Dinu F; Zador, Anthony M

    2016-09-01

    Neurons transmit information to distant brain regions via long-range axonal projections. In the mouse, area-to-area connections have only been systematically mapped using bulk labeling techniques, which obscure the diverse projections of intermingled single neurons. Here we describe MAPseq (Multiplexed Analysis of Projections by Sequencing), a technique that can map the projections of thousands or even millions of single neurons by labeling large sets of neurons with random RNA sequences ("barcodes"). Axons are filled with barcode mRNA, each putative projection area is dissected, and the barcode mRNA is extracted and sequenced. Applying MAPseq to the locus coeruleus (LC), we find that individual LC neurons have preferred cortical targets. By recasting neuroanatomy, which is traditionally viewed as a problem of microscopy, as a problem of sequencing, MAPseq harnesses advances in sequencing technology to permit high-throughput interrogation of brain circuits. PMID:27545715

  5. Efficient Nucleic Acid Extraction and 16S rRNA Gene Sequencing for Bacterial Community Characterization.

    PubMed

    Anahtar, Melis N; Bowman, Brittany A; Kwon, Douglas S

    2016-01-01

    There is a growing appreciation for the role of microbial communities as critical modulators of human health and disease. High throughput sequencing technologies have allowed for the rapid and efficient characterization of bacterial communities using 16S rRNA gene sequencing from a variety of sources. Although readily available tools for 16S rRNA sequence analysis have standardized computational workflows, sample processing for DNA extraction remains a continued source of variability across studies. Here we describe an efficient, robust, and cost effective method for extracting nucleic acid from swabs. We also delineate downstream methods for 16S rRNA gene sequencing, including generation of sequencing libraries, data quality control, and sequence analysis. The workflow can accommodate multiple samples types, including stool and swabs collected from a variety of anatomical locations and host species. Additionally, recovered DNA and RNA can be separated and used for other applications, including whole genome sequencing or RNA-seq. The method described allows for a common processing approach for multiple sample types and accommodates downstream analysis of genomic, metagenomic and transcriptional information. PMID:27168460

  6. A method for accurate determination of terminal sequences of viral genomic RNA.

    PubMed

    Weng, Z; Xiong, Z

    1995-09-01

    A combination of ligation-anchored PCR and anchored cDNA cloning techniques were used to clone the termini of the saguaro cactus virus (SCV) RNA genome. The terminal sequences of the viral genome were subsequently determined from the clones. The 5' terminus was cloned by ligation-anchored PCR, whereas the 3' terminus was obtained by a technique we term anchored cDNA cloning. In anchored cDNA cloning, an anchor oligonucleotide was prepared by phosphorylation at the 5' end, followed by addition of a dideoxynucleotide at the 3' end to block the free hydroxyl group. The 5' end of the anchor was subsequently ligated to the 3' end of SCV RNA. The anchor-ligated, chimerical viral RNA was then reverse-transcribed into cDNA using a primer complementary to the anchor. The cDNA containing the complete 3'-terminal sequence was converted into ds-cDNA, cloned, and sequenced. Two restriction sites, one within the viral sequence and one within the primer sequence, were used to facilitate cloning. The combination of these techniques proved to be an easy and accurate way to determine the terminal sequences of SCV RNA genome and should be applicable to any other RNA molecules with unknown terminal sequences. PMID:9132274
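
    In the anchored cDNA cloning step described above, the reverse-transcription primer is complementary to the anchor oligonucleotide. A minimal Python sketch of deriving such a primer follows; the anchor sequence is a hypothetical placeholder, since the abstract does not give the actual oligonucleotide.

```python
# Translation table for the four DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return dna.upper().translate(COMPLEMENT)[::-1]


# Hypothetical anchor oligonucleotide (5'->3'); not the sequence used in the study.
anchor = "GGATCCGAATTCTAGA"

# A primer complementary to the anchor, also written 5'->3'.
primer = reverse_complement(anchor)
```

    Because the anchor is ligated to the 3' end of the RNA, a primer of this kind anneals to the anchor and is extended across the RNA's 3' terminus.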

  7. Sequence heterogeneity in the two 16S rRNA genes of Phormium yellow leaf phytoplasma.

    PubMed Central

    Liefting, L W; Andersen, M T; Beever, R E; Gardner, R C; Forster, R L

    1996-01-01

    Phormium yellow leaf (PYL) phytoplasma causes a lethal disease of the monocotyledon, New Zealand flax (Phormium tenax). The 16S rRNA genes of PYL phytoplasma were amplified from infected flax by PCR and cloned, and the nucleotide sequences were determined. DNA sequencing and Southern hybridization analysis of genomic DNA indicated the presence of two copies of the 16S rRNA gene. The two 16S rRNA genes exhibited sequence heterogeneity in 4 nucleotide positions and could be distinguished by the restriction enzymes BpmI and BsrI. This is the first record in which sequence heterogeneity in the 16S rRNA genes of a phytoplasma has been determined by sequence analysis. A phylogenetic tree based on 16S rRNA gene sequences showed that PYL phytoplasma is most closely related to the stolbur and German grapevine yellows phytoplasmas, which form the stolbur subgroup of the aster yellows group. This phylogenetic position of PYL phytoplasma was supported by 16S/23S spacer region sequence data. PMID:8795200

  8. Efficient Nucleic Acid Extraction and 16S rRNA Gene Sequencing for Bacterial Community Characterization

    PubMed Central

    Anahtar, Melis N.; Bowman, Brittany A.; Kwon, Douglas S.

    2016-01-01

    There is a growing appreciation for the role of microbial communities as critical modulators of human health and disease. High throughput sequencing technologies have allowed for the rapid and efficient characterization of bacterial communities using 16S rRNA gene sequencing from a variety of sources. Although readily available tools for 16S rRNA sequence analysis have standardized computational workflows, sample processing for DNA extraction remains a continued source of variability across studies. Here we describe an efficient, robust, and cost effective method for extracting nucleic acid from swabs. We also delineate downstream methods for 16S rRNA gene sequencing, including generation of sequencing libraries, data quality control, and sequence analysis. The workflow can accommodate multiple samples types, including stool and swabs collected from a variety of anatomical locations and host species. Additionally, recovered DNA and RNA can be separated and used for other applications, including whole genome sequencing or RNA-seq. The method described allows for a common processing approach for multiple sample types and accommodates downstream analysis of genomic, metagenomic and transcriptional information. PMID:27168460

  9. A 5'-proximal RNA sequence of murine coronavirus as a potential initiation site for genomic-length mRNA transcription.

    PubMed Central

    Zhang, X; Lai, M M

    1996-01-01

    Coronavirus transcription is a discontinuous process, involving interactions between a trans-acting leader and the intergenic transcription initiation sequences. A 9-nucleotide (nt) sequence (UUUAUAAAC), which is located immediately downstream of the leader at the 5' terminus of the mouse hepatitis virus (MHV) genomic RNA, contains a sequence resembling the consensus intergenic sequence (UCUAAAC). It has been shown previously that the presence of the 9-nt sequence facilitates leader RNA switching and may enhance subgenomic mRNA transcription. It is unclear how the 9-nt sequence exerts these functions. In this study, we inserted the 9-nt sequence into a defective interfering (DI) RNA reporter system and demonstrated that mRNA transcription could be initiated from the 9-nt sequence almost as efficiently as from the intergenic sequence between genes 6 and 7. Sequence analysis of the mRNAs showed that the 9-nt sequence served as a site of fusion between the leaders and mRNA. The transcription initiation function of the 9-nt sequence could not be substituted by other 5'-terminal sequences. When the entire 5'-terminal sequence, including four copies of the UCUAA sequence plus the 9-nt sequence, was present, transcription could be initiated from any of the UCUAA copies or the 9-nt sequence, resulting in different copy numbers of the UCUAA sequence and the deletion of the 9-nt sequence in some mRNAs. All of these heterogeneous RNA species were also detected from the 5'-terminal region of the viral genomic-length RNA in MHV-infected cells. These results thus suggest that the heterogeneity of the copy number of UCUAA sequences at the 5' end, the deletion of the 9-nt sequence in viral and DI RNAs, and the leader RNA switching are the results of transcriptional initiation from the 9-nt site. They also show that an mRNA species (mRNA 1) that lacks the 9-nt sequence can be synthesized during MHV infection. Therefore, MHV genomic RNA replication and mRNA 1 transcription may be

  10. Adenovirus type 12-specific RNA sequences during productive infection of KB cells.

    PubMed Central

    Smiley, J R; Mak, S

    1976-01-01

    The complementary strands of adenovirus type 12 DNA were separated, and virus-specific RNA was analyzed by saturation hybridization in solution. Late during infection whole cell RNA hybridized to 75% of the light (l) strand and 15% of the heavy (h) strand, whereas cytoplasmic RNA hybridized to 65% of the l strand and 15% of the h strand. Late nuclear RNA hybridized to about 90% of the l strand and at least 36% of the h strand. Double-stranded RNA was isolated from infected cells late after infection, which annealed to greater than 30% of each of the two complementary DNA strands. Early whole cell RNA hybridized to 45 to 50% of the l strand and 15% of the h strand, whereas early cytoplasmic RNA hybridized to about 15% of each of the complementary strands. All early cytoplasmic sequences were present in the cytoplasm at late times. PMID:950688

  11. Complete sequence and gene organization of the Nosema heliothidis ribosomal RNA gene region.

    PubMed

    Dong, Shinan; Shen, Zhongyuan; Zhu, Feng; Tang, Xudong; Xu, Li

    2011-01-01

    By sequencing the entire ribosomal RNA (rRNA) gene region of Nosema heliothidis isolated from cotton bollworm (Helicoverpa armigera), we showed that its gene organization is similar to the type species, Nosema bombycis: the 5'-large subunit rRNA (2,490 bp)-internal transcribed spacer (192 bp)-small subunit rRNA (1,232 bp)-intergenic spacer (274 bp)-5S rRNA (115 bp)-3'. We constructed two phylogenetic trees, analyzed phylogenetic relationships, examined rRNA organization of microsporidia, and compared the secondary structure of small subunit rRNA with closely related microsporidia. The latter two features may provide important information for the classification and phylogenetic analysis of microsporidia. PMID:21895841
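
    The segment lengths listed above can be laid out as coordinates along the rRNA gene region. The short Python sketch below does this arithmetic, assuming the five segments abut with no gaps or overlaps, as the 5'-...-3' notation suggests; the abbreviated feature names and 1-based coordinates are conventions chosen here for illustration.

```python
# Segment names (abbreviated here) and lengths in bp, in 5'->3' order as listed in the abstract.
RRNA_REGION = [
    ("LSU rRNA", 2490),
    ("ITS", 192),
    ("SSU rRNA", 1232),
    ("IGS", 274),
    ("5S rRNA", 115),
]


def layout(features):
    """Return (name, start, end) tuples with 1-based inclusive coordinates.

    Assumption: consecutive segments are directly adjacent.
    """
    coords = []
    position = 1
    for name, length in features:
        coords.append((name, position, position + length - 1))
        position += length
    return coords


region_coords = layout(RRNA_REGION)
total_length = sum(length for _, length in RRNA_REGION)
```

    Under that adjacency assumption, the five segments sum to 4,303 bp.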

  12. Excess of Yra1 RNA-Binding Factor Causes Transcription-Dependent Genome Instability, Replication Impairment and Telomere Shortening

    PubMed Central

    Gavaldá, Sandra; Santos-Pereira, José M.; García-Rubio, María L.; Luna, Rosa; Aguilera, Andrés

    2016-01-01

    Yra1 is an essential nuclear factor of the evolutionarily conserved family of hnRNP-like export factors that when overexpressed impairs mRNA export and cell growth. To investigate further the relevance of proper Yra1 stoichiometry in the cell, we overexpressed Yra1 by transforming yeast cells with YRA1 intron-less constructs and analyzed its effect on gene expression and genome integrity. We found that YRA1 overexpression induces DNA damage and leads to a transcription-associated hyperrecombination phenotype that is mediated by RNA:DNA hybrids. In addition, it confers a genome-wide replication retardation as seen by reduced BrdU incorporation and accumulation of the Rrm3 helicase. In addition, YRA1 overexpression causes a cell senescence-like phenotype and telomere shortening. ChIP-chip analysis shows that overexpressed Yra1 is loaded to transcribed chromatin along the genome and to Y’ telomeric regions, where Rrm3 is also accumulated, suggesting an impairment of telomere replication. Our work not only demonstrates that a proper stoichiometry of the Yra1 mRNA binding and export factor is required to maintain genome integrity and telomere homeostasis, but suggests that the cellular imbalance between transcribed RNA and specific RNA-binding factors may become a major cause of genome instability mediated by co-transcriptional replication impairment. PMID:27035147

  13. Characterization and phylogenetic relationships among microsporidia infecting silkworm, Bombyx mori, using inter simple sequence repeat (ISSR) and small subunit rRNA (SSU-rRNA) sequence analysis.

    PubMed

    Rao, S Nageswara; Nath, B Surendra; Saratchandra, B

    2005-06-01

    This study is the first report on the genetic characterization and relationships among different microsporidia infecting the silkworm, Bombyx mori, using inter simple sequence repeat PCR (ISSR-PCR) analysis. Six different microsporidians were distinguished through molecular DNA typing using ISSR-PCR. Thus, ISSR-PCR analysis can be a powerful tool to detect polymorphisms and identify microsporidians, which are difficult to study with microscopy because of their extremely small size. Of the 100 ISSR primers tested, only 28 primers had reproducibility and high polymorphism (93%). A total of 24 ISSR primers produced 55 unique genetic markers, which could be used to differentiate the microsporidians from each other. Among the 28 SSRs tested, the most abundant were (CA)n, (GA)n, and (GT)n repeats. The degree of band sharing was used to evaluate genetic similarity between different microsporidian isolates and to construct a phylogenetic tree using Jaccard's similarity coefficient. The results indicate that the DNA profiles based on ISSR markers can be used as diagnostic tools to identify different microsporidia with considerable accuracy. In addition, the small subunit ribosomal RNA (SSU-rRNA) sequence gene was amplified, cloned, and sequenced from each of the 6 microsporidian isolates. These sequences were compared with 20 other microsporidian SSU-rRNA sequences to develop a phylogenetic tree for the microsporidia isolated from the silkworms. This method was found to be useful in establishing the phylogenetic relationships among the different microsporidians isolated from silkworms. Of the 6 microsporidian isolates, NIK-1s revealed an SSU-rRNA gene sequence similar to Nosema bombycis, indicating that NIK-1s is similar to N. bombycis; the remaining 5 isolates, which differed from each other and from N. bombycis, were considered to be different variants belonging to the species N. bombycis. PMID:16121233
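
    The band-sharing comparison mentioned above relies on Jaccard's similarity coefficient: the number of bands shared by two isolates divided by the number of bands present in either. A minimal Python sketch follows; the band sizes are hypothetical and stand in for scored ISSR-PCR gel bands, which the abstract does not report.

```python
def jaccard_similarity(bands_a, bands_b):
    """Jaccard coefficient: shared bands / bands present in either profile."""
    a, b = set(bands_a), set(bands_b)
    union = a | b
    if not union:
        raise ValueError("both band profiles are empty")
    return len(a & b) / len(union)


# Hypothetical band sizes (bp) scored as present for two isolates.
isolate_1_bands = {300, 450, 700, 1200}
isolate_2_bands = {300, 450, 900}

similarity = jaccard_similarity(isolate_1_bands, isolate_2_bands)
```

    A matrix of such pairwise values (or the corresponding distances, 1 minus similarity) is the usual input to a clustering method when building a tree from band profiles; the abstract does not specify which clustering method was used.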

  14. Draft Genome Sequences of Leviviridae RNA Phages EC and MB Recovered from San Francisco Wastewater

    PubMed Central

    DeRisi, Joseph L.

    2015-01-01

    We report here the draft genome sequences of marine RNA phages EC and MB assembled from metagenomic sequencing of organisms in San Francisco wastewater. These phages showed moderate translated amino acid identity to other enterobacteria phages and appear to constitute novel members of the Leviviridae family. PMID:26112785

  15. Draft Genome Sequences of Leviviridae RNA Phages EC and MB Recovered from San Francisco Wastewater.

    PubMed

    Greninger, Alexander L; DeRisi, Joseph L

    2015-01-01

    We report here the draft genome sequences of marine RNA phages EC and MB assembled from metagenomic sequencing of organisms in San Francisco wastewater. These phages showed moderate translated amino acid identity to other enterobacteria phages and appear to constitute novel members of the Leviviridae family. PMID:26112785

  16. Molecular Diagnosis of Actinomadura madurae Infection by 16S rRNA Deep Sequencing

    PubMed Central

    SenGupta, Dhruba J.; Hoogestraat, Daniel R.; Cummings, Lisa A.; Bryant, Bronwyn H.; Natividad, Catherine; Thielges, Stephanie; Monsaas, Peter W.; Chau, Mimosa; Barbee, Lindley A.; Rosenthal, Christopher; Cookson, Brad T.; Hoffman, Noah G.

    2013-01-01

    Next-generation DNA sequencing can be used to catalog individual organisms within complex, polymicrobial specimens. Here, we utilized deep sequencing of 16S rRNA to implicate Actinomadura madurae as the cause of mycetoma in a diabetic patient when culture and conventional molecular methods were overwhelmed by overgrowth of other organisms. PMID:24108607

  17. Genome Sequence of Saccharomyces cerevisiae Double-Stranded RNA Virus L-A-28

    PubMed Central

    Konovalovas, Aleksandras

    2016-01-01

    We cloned and sequenced the complete genome of the L-A-28 virus from the Saccharomyces cerevisiae K28 killer strain. This sequence completes the set of currently identified L-A helper viruses required for expression of double-stranded RNA-originated killer phenotypes in baking yeast. PMID:27313294

  18. Genome Sequence of Saccharomyces cerevisiae Double-Stranded RNA Virus L-A-28.

    PubMed

    Konovalovas, Aleksandras; Serviené, Elena; Serva, Saulius

    2016-01-01

    We cloned and sequenced the complete genome of the L-A-28 virus from the Saccharomyces cerevisiae K28 killer strain. This sequence completes the set of currently identified L-A helper viruses required for expression of double-stranded RNA-originated killer phenotypes in baking yeast. PMID:27313294

  19. Differential DNA and RNA sequence discrimination by PNA having charged side chains.

    PubMed

    De Costa, N Tilani S; Heemstra, Jennifer M

    2014-05-15

    PNA sequences modified with charged side chains were evaluated for base-pairing sequence selectivity under physiological conditions. PNA having negatively charged aspartic acid side chains shows higher selectivity with RNA, while PNA having positively charged lysine side chains shows higher selectivity with DNA. These observations provide insight into the binding selectivity of modified PNA in antisense and antigene applications. PMID:24731279

  20. Taxonomic Assessment of Rumen Microbiota Using Total RNA and Targeted Amplicon Sequencing Approaches.

    PubMed

    Li, Fuyong; Henderson, Gemma; Sun, Xu; Cox, Faith; Janssen, Peter H; Guan, Le Luo

    2016-01-01

    Taxonomic characterization of active gastrointestinal microbiota is essential to detect shifts in microbial communities and functions under various conditions. This study aimed to identify and quantify potentially active rumen microbiota using total RNA sequencing and to compare the outcomes of this approach with the widely used targeted RNA/DNA amplicon sequencing technique. Total RNA isolated from rumen digesta samples from five beef steers was subjected to Illumina paired-end sequencing (RNA-seq), and bacterial and archaeal amplicons of partial 16S rRNA/rDNA were subjected to 454 pyrosequencing (RNA/DNA Amplicon-seq). Taxonomic assessments of the RNA-seq, RNA Amplicon-seq, and DNA Amplicon-seq datasets were performed using a pipeline developed in house. The detected major microbial phylotypes were common among the three datasets, with seven bacterial phyla, fifteen bacterial families, and five archaeal taxa commonly identified across all datasets. There were also unique microbial taxa detected in each dataset. Elusimicrobia and Verrucomicrobia phyla; Desulfovibrionaceae, Elusimicrobiaceae, and Sphaerochaetaceae families; and Methanobrevibacter woesei were only detected in the RNA-Seq and RNA Amplicon-seq datasets, whereas Streptococcaceae was only detected in the DNA Amplicon-seq dataset. In addition, the relative abundances of four bacterial phyla, eight bacterial families and one archaeal taxon were different among the three datasets. This is the first study to compare the outcomes of rumen microbiota profiling between RNA-seq and RNA/DNA Amplicon-seq datasets. Our results illustrate the differences between these methods in characterizing microbiota both qualitatively and quantitatively for the same sample, and so caution must be exercised when comparing data. PMID:27446027
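
    The comparison of taxa shared across, or unique to, the three datasets amounts to set operations. The Python sketch below uses the family names given in the abstract for the dataset-specific taxa; "shared_family_1" is a hypothetical placeholder for the commonly detected families, which the abstract does not name.

```python
# Families detected per dataset. "shared_family_1" is a placeholder, not a real taxon.
detected = {
    "RNA-seq": {
        "shared_family_1",
        "Desulfovibrionaceae",
        "Elusimicrobiaceae",
        "Sphaerochaetaceae",
    },
    "RNA Amplicon-seq": {
        "shared_family_1",
        "Desulfovibrionaceae",
        "Elusimicrobiaceae",
        "Sphaerochaetaceae",
    },
    "DNA Amplicon-seq": {"shared_family_1", "Streptococcaceae"},
}

# Families found in every dataset.
common = set.intersection(*detected.values())

# Families found in both RNA-based datasets but not in the DNA-based one.
rna_only = (detected["RNA-seq"] & detected["RNA Amplicon-seq"]) - detected["DNA Amplicon-seq"]

# Families found only in the DNA-based dataset.
dna_only = detected["DNA Amplicon-seq"] - detected["RNA-seq"] - detected["RNA Amplicon-seq"]
```

    This covers only the qualitative (presence/absence) side of the comparison; differences in relative abundance would need the per-taxon read counts.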

  1. Taxonomic Assessment of Rumen Microbiota Using Total RNA and Targeted Amplicon Sequencing Approaches

    PubMed Central

    Li, Fuyong; Henderson, Gemma; Sun, Xu; Cox, Faith; Janssen, Peter H.; Guan, Le Luo

    2016-01-01

    Taxonomic characterization of active gastrointestinal microbiota is essential to detect shifts in microbial communities and functions under various conditions. This study aimed to identify and quantify potentially active rumen microbiota using total RNA sequencing and to compare the outcomes of this approach with the widely used targeted RNA/DNA amplicon sequencing technique. Total RNA isolated from rumen digesta samples from five beef steers was subjected to Illumina paired-end sequencing (RNA-seq), and bacterial and archaeal amplicons of partial 16S rRNA/rDNA were subjected to 454 pyrosequencing (RNA/DNA Amplicon-seq). Taxonomic assessments of the RNA-seq, RNA Amplicon-seq, and DNA Amplicon-seq datasets were performed using a pipeline developed in house. The detected major microbial phylotypes were common among the three datasets, with seven bacterial phyla, fifteen bacterial families, and five archaeal taxa commonly identified across all datasets. There were also unique microbial taxa detected in each dataset. Elusimicrobia and Verrucomicrobia phyla; Desulfovibrionaceae, Elusimicrobiaceae, and Sphaerochaetaceae families; and Methanobrevibacter woesei were only detected in the RNA-Seq and RNA Amplicon-seq datasets, whereas Streptococcaceae was only detected in the DNA Amplicon-seq dataset. In addition, the relative abundances of four bacterial phyla, eight bacterial families and one archaeal taxon were different among the three datasets. This is the first study to compare the outcomes of rumen microbiota profiling between RNA-seq and RNA/DNA Amplicon-seq datasets. Our results illustrate the differences between these methods in characterizing microbiota both qualitatively and quantitatively for the same sample, and so caution must be exercised when comparing data. PMID:27446027

  2. Nucleotide sequence of an exceptionally long 5.8S ribosomal RNA from Crithidia fasciculata.

    PubMed

    Schnare, M N; Gray, M W

    1982-03-25

    In Crithidia fasciculata, a trypanosomatid protozoan, the large ribosomal subunit contains five small RNA species (e, f, g, i, j) in addition to 5S rRNA [Gray, M.W. (1981) Mol. Cell. Biol. 1, 347-357]. The complete primary sequence of species i is shown here to be pAACGUGUmCGCGAUGGAUGACUUGGCUUCCUAUCUCGUUGA ... AGAmACGCAGUAAAGUGCGAUAAGUGGUApsiCAAUUGmCAGAAUCAUUCAAUUACCGAAUCUUUGAACGAAACGG ... CGCAUGGGAGAAGCUCUUUUGAGUCAUCCCCGUGCAUGCCAUAUUCUCCAmGUGUCGAA(C)OH. This sequence establishes that species i is a 5.8S rRNA, despite its exceptional length (171-172 nucleotides). The extra nucleotides in C. fasciculata 5.8S rRNA are located in a region whose primary sequence and length are highly variable among 5.8S rRNAs, but which is capable of forming a stable hairpin loop structure (the "G+C-rich hairpin"). The sequence of C. fasciculata 5.8S rRNA is no more closely related to that of another protozoan, Acanthamoeba castellanii, than it is to representative 5.8S rRNA sequences from the other eukaryotic kingdoms, emphasizing the deep phylogenetic divisions that seem to exist within the Kingdom Protista. PMID:7079176

  3. Self-Assembly of Measles Virus Nucleocapsid-like Particles: Kinetics and RNA Sequence Dependence.

    PubMed

    Milles, Sigrid; Jensen, Malene Ringkjøbing; Communie, Guillaume; Maurin, Damien; Schoehn, Guy; Ruigrok, Rob W H; Blackledge, Martin

    2016-08-01

    Measles virus RNA genomes are packaged into helical nucleocapsids (NCs), comprising thousands of nucleo-proteins (N) that bind the entire genome. N-RNA provides the template for replication and transcription by the viral polymerase and is a promising target for viral inhibition. Elucidation of mechanisms regulating this process has been severely hampered by the inability to controllably assemble NCs. Here, we demonstrate self-organization of N into NC-like particles in vitro upon addition of RNA, providing a simple and versatile tool for investigating assembly. Real-time NMR and fluorescence spectroscopy reveal biphasic assembly kinetics. Remarkably, assembly depends strongly on the RNA sequence, with the genomic 5' end and poly-Adenine sequences assembling efficiently, while sequences such as poly-Uracil are incompetent for NC formation. This observation has important consequences for understanding the assembly process. PMID:27270664

  4. Sequence characterization of 5S ribosomal RNA from eight gram positive procaryotes

    NASA Technical Reports Server (NTRS)

    Woese, C. R.; Luehrsen, K. R.; Pribula, C. D.; Fox, G. E.

    1976-01-01

    Complete nucleotide sequences are presented for 5S rRNA from Bacillus subtilis, B. firmus, B. pasteurii, B. brevis, Lactobacillus brevis, and Streptococcus faecalis, and 5S rRNA oligonucleotide catalogs and partial sequence data are given for B. cereus and Sporosarcina ureae. These data demonstrate a striking consistency of 5S rRNA primary and secondary structure within a given bacterial grouping. An exception is B. brevis, in which the 5S rRNA sequence varies significantly from that of other bacilli in the tuned helix and the procaryotic loop. The localization of these variations suggests that B. brevis occupies an ecological niche that selects such changes. It is noted that this organism produces antibiotics which affect ribosome function.

  5. A known expressed sequence tag, BM742401, is a potent lincRNA inhibiting cancer metastasis.

    PubMed

    Park, Seong-Min; Park, Sung-Joon; Kim, Hee-Jin; Kwon, Oh-Hyung; Kang, Tae-Wook; Sohn, Hyun-Ahm; Kim, Seon-Kyu; Moo Noh, Seung; Song, Kyu-Sang; Jang, Se-Jin; Sung Kim, Yong; Kim, Seon-Young

    2013-01-01

    Long intergenic non-coding RNAs (lincRNAs) have historically been ignored in cancer biology. However, thousands of lincRNAs have been identified in mammals using recently developed genomic tools, including microarray and high-throughput RNA sequencing (RNA-seq). Several of the lincRNAs identified have been well characterized for their functions in carcinogenesis. Here we performed RNA-seq experiments comparing gastric cancer with normal tissues to find differentially expressed transcripts in intergenic regions. By analyzing our own RNA-seq and public microarray data, we identified 31 transcripts, including a known expressed sequence tag, BM742401. BM742401 was downregulated in cancer, and its downregulation was associated with poor survival in gastric cancer patients. Ectopic overexpression of BM742401 inhibited metastasis-related phenotypes and decreased the concentration of extracellular MMP9. These results suggest that BM742401 is a potential lincRNA marker and therapeutic target. PMID:23846333

  6. Complete nucleotide sequence of the genomic RNA of tobacco mosaic virus strain Cg.

    PubMed

    Yamanaka, T; Komatani, H; Meshi, T; Naito, S; Ishikawa, M; Ohno, T

    1998-01-01

    Tobacco mosaic virus (TMV)-Cg is a crucifer-infecting tobamovirus that was isolated from field-grown garlic. We determined the complete nucleotide sequence of the genomic RNA of TMV-Cg. The genomic RNA of TMV-Cg consists of 6303 nucleotides and encodes four large open reading frames, organized basically in the same way as that of other tobamoviruses. The nucleotide and deduced amino acid sequences are very similar to those of the other crucifer-infecting tobamoviruses that have been sequenced so far. PMID:9608662

  7. The RNA sequence context defines the mechanistic routes by which yeast arginyl-tRNA synthetase charges tRNA.

    PubMed

    Sissler, M; Giegé, R; Florentz, C

    1998-06-01

    Arginylation of tRNA transcripts by yeast arginyl-tRNA synthetase can be triggered by two alternate recognition sets in anticodon loops: C35 and U36 or G36 in tRNA(Arg) and C36 and G37 in tRNA(Asp) (Sissler M, Giegé R, Florentz C, 1996, EMBO J 15:5069-5076). Kinetic studies on tRNA variants were done to explore the mechanisms by which these sets are expressed. Although the synthetase interacts in a similar manner with tRNA(Arg) and tRNA(Asp), the details of the interaction patterns are idiosyncratic, especially in anticodon loops (Sissler M, Eriani G, Martin F, Giegé R, Florentz C, 1997, Nucleic Acids Res 25:4899-4906). Exchange of individual recognition elements between arginine and aspartate tRNA frameworks strongly blocks arginylation of the mutated tRNAs, whereas full exchange of the recognition sets leads to efficient arginine acceptance of the transplanted tRNAs. Unpredictably, the similar catalytic efficiencies of native and transplanted tRNAs originate from different k(cat) and Km combinations. A closer analysis reveals that efficient arginylation results from strong anticooperative effects between individual recognition elements. Nonrecognition nucleotides as well as the tRNA architecture are additional factors that tune efficiency. Altogether, arginyl-tRNA synthetase is able to utilize different context-dependent mechanistic routes to be activated. This confers biological advantages to the arginine aminoacylation system and sheds light on its evolutionary relationship with the aspartate system. PMID:9622124

  8. Identification of characteristic oligonucleotides in the bacterial 16S ribosomal RNA sequence dataset

    NASA Technical Reports Server (NTRS)

    Zhang, Zhengdong; Willson, Richard C.; Fox, George E.

    2002-01-01

    MOTIVATION: The phylogenetic structure of the bacterial world has been intensively studied by comparing sequences of 16S ribosomal RNA (16S rRNA). This database of sequences is now widely used to design probes for the detection of specific bacteria or groups of bacteria one at a time. The success of such methods reflects the fact that there are local sequence segments that are highly characteristic of particular organisms or groups of organisms. It is not clear, however, to what extent such signature sequences exist in the 16S rRNA dataset. A better understanding of the numbers and distribution of highly informative oligonucleotide sequences may facilitate the design of hybridization arrays that can characterize the phylogenetic position of an unknown organism or serve as the basis for the development of novel approaches for use in bacterial identification. RESULTS: A computer-based algorithm that characterizes the extent to which any individual oligonucleotide sequence in 16S rRNA is characteristic of any particular bacterial grouping was developed. A measure of signature quality, Q(s), was formulated and subsequently calculated for every individual oligonucleotide sequence in the size range of 5-11 nucleotides and for 15mers with reference to each cluster and subcluster in a 929 organism representative phylogenetic tree. Subsequently, the perfect signature sequences were compared to the full set of 7322 sequences to see how common false positives were. The work completed here establishes beyond any doubt that highly characteristic oligonucleotides exist in the bacterial 16S rRNA sequence dataset in large numbers. Over 16,000 15mers were identified that might be useful as signatures. Signature oligonucleotides are available for over 80% of the nodes in the representative tree.
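
    The idea of a "perfect" signature oligonucleotide can be illustrated with a minimal Python sketch. It is not the authors' algorithm and does not compute their Q(s) measure, which the abstract does not define; it only assumes that a perfect signature is a k-mer found in every sequence of a chosen group and in no sequence outside it. The function names and the toy sequences are hypothetical.

```python
def kmers(seq, k):
    """Return the set of all length-k substrings of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def perfect_signatures(sequences, group, k):
    """Find k-mers present in every sequence of `group` and in no other sequence.

    sequences: dict mapping an organism name to its rRNA sequence (toy data here)
    group:     set of organism names forming the cluster of interest
    """
    inside = [kmers(s, k) for name, s in sequences.items() if name in group]
    outside = [kmers(s, k) for name, s in sequences.items() if name not in group]
    if not inside:
        return set()
    shared = set.intersection(*inside)        # k-mers common to the whole group
    seen_elsewhere = set().union(*outside)    # k-mers seen in any other organism
    return shared - seen_elsewhere


# Hypothetical toy sequences, not real 16S rRNA data.
toy = {
    "A1": "GGAUCCGAUUACG",
    "A2": "CCGAUUACGGAUC",
    "B1": "GGAUCCUUUACGA",
}
signatures = perfect_signatures(toy, {"A1", "A2"}, k=5)
```

    A graded quality score would relax the all-or-nothing rule used here, for example by tolerating a few misses inside the group or a few hits outside it.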

  9. Diversity of host species and strains of Pneumocystis carinii is based on rRNA sequences.

    PubMed Central

    Shah, J S; Pieciak, W; Liu, J; Buharin, A; Lane, D J

    1996-01-01

    We have amplified by PCR Pneumocystis carinii cytoplasmic small-subunit rRNA (variously referred to as 16S-like or 18S-like rRNA) genes from DNA extracted from bronchoalveolar lavage and induced sputum specimens from patients positive for P. carinii and from infected ferret lung tissue. The amplification products were cloned into pUC18, and individual clones were sequenced. Comparison of the determined sequences with each other and with published rat and partial human P. carinii small-subunit rRNA gene sequences reveals that, although all P. carinii small-subunit rRNAs are closely related (approximately 96% identity), small-subunit rRNA genes isolated from different host species (human, rat, and ferret) exhibit distinctive patterns of sequence variation. Two types of sequences were isolated from the infected ferret lung tissue, one as a predominant species and the other as a minor species. There was 96% identity between the two types. In situ hybridization of the infected ferret lung tissue with oligonucleotide probes specific for each type revealed that there were two distinct strains of P. carinii present in the ferret lung tissue. Unlike the ferret P. carinii isolates, the small-subunit rRNA gene sequences from different human P. carinii isolates have greater than 99% identity and are distinct from all rat and ferret sequences so far inspected or reported in the literature. Southern blot hybridization analysis of PCR amplification products from several additional bronchoalveolar lavage or induced sputum specimens from P. carinii-infected patients, using a 32P-labeled oligonucleotide probe specific for human P. carinii, also suggests that all of the human P. carinii isolates are identical. These findings indicate that human P. carinii isolates may represent a distinct species of P. carinii distinguishable from rat and ferret P. carinii on the basis of characterization of small-subunit rRNA gene sequences. PMID:8770515
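
    Percent identity figures such as those quoted above depend on how aligned columns are counted. The following Python sketch shows one common convention; it is an assumption for illustration, not the method used in the study. It expects two sequences that are already aligned to the same length, skips columns where both carry a gap, and counts a gap opposite a residue as a mismatch. The function name is hypothetical.

```python
def percent_identity(a, b):
    """Percent of aligned columns in which both sequences carry the same residue.

    Assumes a and b are pre-aligned strings of equal length with '-' for gaps.
    Columns where both sequences have a gap are ignored.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    columns = [(x, y) for x, y in zip(a, b) if not (x == "-" and y == "-")]
    if not columns:
        return 0.0
    matches = sum(1 for x, y in columns if x == y)
    return 100.0 * matches / len(columns)


# Hypothetical toy alignment: 3 of 4 columns match.
example = percent_identity("ACGU", "ACGA")
```

    Other conventions (for example, dividing by the shorter sequence length or excluding all gapped columns) give different values for the same alignment, so reported identities are only comparable when the convention is the same.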

  10. The phylogenetic utility and functional constraint of microRNA flanking sequences

    PubMed Central

    Kenny, Nathan J.; Sin, Yung Wa; Hayward, Alexander; Paps, Jordi; Chu, Ka Hou; Hui, Jerome H. L.

    2015-01-01

    MicroRNAs (miRNAs) have recently risen to prominence as novel factors responsible for post-transcriptional regulation of gene expression. miRNA genes have been posited as highly conserved in the clades in which they exist. Consequently, miRNAs have been used as rare genome change characters to estimate phylogeny by tracking their gain and loss. However, their short length (21–23 bp) has limited their perceived utility in sequence-based phylogenetic inference. Here, using reference taxa with established phylogenetic relationships, we demonstrate that miRNA sequences are of high utility in quantitative, rather than in qualitative, phylogenetic analysis. The clear orthology among miRNA genes from different species makes it straightforward to identify and align these sequences from even fragmentary datasets. We also identify significant sequence conservation in the regions directly flanking miRNA genes, and show that this too is of utility in phylogenetic analysis, as well as highlighting conserved regions that will be of interest to other fields. Employing miRNA sequences from 12 sequenced drosophilid genomes, together with a Tribolium castaneum outgroup, we demonstrate that this approach is robust using Bayesian and maximum-likelihood methods. The utility of these characters is further demonstrated in the rhabditid nematodes and primates. As next-generation sequencing makes it more cost-effective to sequence genomes and small RNA libraries, this methodology provides an alternative data source for phylogenetic analysis. The approach allows rapid resolution of relationships between both closely related and rapidly evolving species, and provides an additional tool for investigation of relationships within the tree of life. PMID:25694624

  11. Deep Sequencing Analysis of Nucleolar Small RNAs: RNA Isolation and Library Preparation.

    PubMed

    Bai, Baoyan; Laiho, Marikki

    2016-01-01

    The nucleolus is a subcellular compartment with a key function in ribosome biogenesis. The nucleolus is rich in noncoding RNAs, mostly the ribosomal RNAs and small nucleolar RNAs. Surprisingly, several miRNAs have also been detected in the nucleolus, raising the question as to whether other small RNA species are present and functional in the nucleolus. We have developed a strategy for stepwise enrichment of nucleolar small RNAs from the total nucleolar RNA extracts and subsequent construction of nucleolar small RNA libraries which are suitable for deep sequencing. Our method successfully isolates the small RNA population from total RNAs and monitors the RNA quality in each step to ensure that small RNAs recovered represent the actual small RNA population in the nucleolus and not degradation products from larger RNAs. We have further applied this approach to characterize the distribution of small RNAs in different cellular compartments. PMID:27576723

  12. Next-generation sequencing of the porcine skeletal muscle transcriptome for computational prediction of microRNA gene targets

    Technology Transfer Automated Retrieval System (TEKTRAN)

    MicroRNA are a class of small RNAs that regulate gene expression by inhibiting translation of protein encoding transcripts. Inhibition is exerted through targeting of a microRNA-protein complex by base-pairing of the microRNA sequence to cognate recognition sequences in the 3’ untranslated region (...

  13. Analyzing the microRNA Transcriptome in Plants Using Deep Sequencing Data

    PubMed Central

    Yang, Xiaozeng; Li, Lei

    2012-01-01

    MicroRNAs (miRNAs) are 20- to 24-nucleotide endogenous small RNA molecules emerging as an important class of sequence-specific, trans-acting regulators for modulating gene expression at the post-transcription level. There has been a surge of interest in the past decade in identifying miRNAs and profiling their expression pattern using various experimental approaches. In particular, ultra-deep sampling of specifically prepared low-molecular-weight RNA libraries based on next-generation sequencing technologies has been used successfully in diverse species. The challenge now is to effectively deconvolute the complex sequencing data to provide comprehensive and reliable information on the miRNAs, miRNA precursors, and expression profile of miRNA genes. Here we review the recently developed computational tools and their applications in profiling the miRNA transcriptomes, with an emphasis on the model plant Arabidopsis thaliana. Also highlighted are progress and insight into miRNA biology derived from analyzing available deep sequencing data. PMID:24832228

  14. StarScan: a web server for scanning small RNA targets from degradome sequencing data

    PubMed Central

    Liu, Shun; Li, Jun-Hao; Wu, Jie; Zhou, Ke-Ren; Zhou, Hui; Yang, Jian-Hua; Qu, Liang-Hu

    2015-01-01

    Endogenous small non-coding RNAs (sRNAs), including microRNAs, PIWI-interacting RNAs and small interfering RNAs, play important gene regulatory roles in animals and plants by pairing to the protein-coding and non-coding transcripts. However, computationally assigning these various sRNAs to their regulatory target genes remains technically challenging. Recently, a high-throughput degradome sequencing method was applied to identify biologically relevant sRNA cleavage sites. In this study, an integrated web-based tool, StarScan (sRNA target Scan), was developed for scanning sRNA targets using degradome sequencing data from 20 species. Given an sRNA sequence from plants or animals, our web server performs an ultrafast and exhaustive search for potential sRNA–target interactions in annotated and unannotated genomic regions. The interactions between small RNAs and target transcripts were further evaluated using a novel tool, alignScore. A novel tool, degradomeBinomTest, was developed to quantify the abundance of degradome fragments located at the 9–11th nucleotide from the sRNA 5′ end. This is the first web server for discovering potential sRNA-mediated RNA cleavage events in plants and animals, which affords mechanistic insights into the regulatory roles of sRNAs. The StarScan web server is available at http://mirlab.sysu.edu.cn/starscan/. PMID:25990732
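
    The abstract does not describe how degradomeBinomTest is computed. As a rough illustration of what a binomial enrichment test at positions 9–11 could look like, the Python sketch below computes a one-sided binomial tail probability. The null model is an assumption made here: fragment 5′ ends fall uniformly across a pairing window, so the chance of landing on one of the three cleavage positions is 3 divided by the window length. The function name, window length and counts are all hypothetical.

```python
from math import comb


def binom_upper_tail(x, n, p):
    """Return P(X >= x) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(x, n + 1))


# Hypothetical numbers: 40 degradome fragment ends fall within a 21-nt pairing
# window, and 25 of them sit at positions 9-11 from the sRNA 5' end.
window_length = 21
p_null = 3 / window_length   # assumed uniform null over window positions
p_value = binom_upper_tail(25, 40, p_null)
```

    A small tail probability would suggest that fragment ends cluster at the expected cleavage positions more often than the uniform null predicts; the real tool may use a different null model or correction.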

  15. The novel organization and complete sequence of the ribosomal RNA gene of Nosema bombycis.

    PubMed

    Huang, Wei-Fone; Tsai, Shu-Jen; Lo, Chu-Fang; Soichi, Yamane; Wang, Chung-Hsiung

    2004-05-01

    We present here for the first time the complete DNA sequence data (4301bp) of the ribosomal RNA (rRNA) gene of the microsporidian type species, Nosema bombycis. Sequences for the large subunit gene (LSUrRNA: 2497bp, GenBank Accession No. ), the internal transcribed spacer (ITS: 179bp, GenBank Accession No. ), the small subunit gene (SSUrRNA: 1232bp), intergenic spacer (IGS: 279bp), and 5S region (114bp) are also given, and the secondary structure of the large subunit is discussed. The organization of the N. bombycis rRNA gene is LSUrRNA-ITS-SSUrRNA-IGS-5S. This novel arrangement, in which the LSU is 5' of the SSU, is the reverse of the organizational sequence (i.e., SSU-ITS-LSU) found in all previously reported microsporidian rRNAs, including Nosema apis. This unique character in the type species may have taxonomic implications for the members of the genus Nosema. PMID:15050536

  16. MicroRNA transcriptome in the newborn mouse ovaries determined by massive parallel sequencing.

    PubMed

    Ahn, Hyo Won; Morin, Ryan D; Zhao, Han; Harris, Ronald A; Coarfa, Cristian; Chen, Zi-Jiang; Milosavljevic, Aleksandar; Marra, Marco A; Rajkovic, Aleksandar

    2010-07-01

    Small non-coding RNAs, such as microRNAs (miRNAs), are involved in diverse biological processes including organ development and tissue differentiation. Global disruption of miRNA biogenesis in Dicer knockout mice disrupts early embryogenesis and primordial germ cell formation. However, the role of miRNAs in early folliculogenesis is poorly understood. In order to identify a full transcriptome set of small RNAs expressed in the newborn (NB) ovary, we extracted the small RNA fraction from mouse NB ovary tissues and subjected it to massive parallel sequencing using the Genome Analyzer from Illumina. Massive sequencing produced 4 655 992 reads of 33 bp each, representing a total of 154 Mbp of sequence data. The Pash alignment algorithm mapped 50.13% of the reads to the mouse genome. Sequence reads were clustered based on overlapping mapping coordinates and intersected with known miRNAs, small nucleolar RNAs (snoRNAs), piwi-interacting RNA (piRNA) clusters and repetitive genomic regions; 25.2% of the reads mapped to known miRNAs, 25.5% to genomic repeats, 3.5% to piRNAs and 0.18% to snoRNAs. Three hundred and ninety-eight known miRNA species were among the sequenced small RNAs, along with 118 isomiR sequences that are not in the miRBase database. The let-7 family was the most abundantly expressed miRNA family, and the mmu-mir-672, mmu-mir-322, mmu-mir-503 and mmu-mir-465 families were the most abundant X-linked miRNAs detected. The X-linked mmu-mir-503, mmu-mir-672 and mmu-mir-465 families showed preferential expression in testes and ovaries. We also identified four novel miRNAs that are preferentially expressed in gonads. Gonadal selective miRNAs may play important roles in ovarian development, folliculogenesis and female fertility. PMID:20215419
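
    The sequencing yield quoted in this abstract can be checked with a few lines of Python. The read count, read length and mapping rate are taken from the abstract; the variable names are illustrative only. The sketch does not convert the per-class percentages (miRNAs, repeats, piRNAs, snoRNAs) into read counts, because the abstract does not say whether they refer to all reads or to mapped reads.

```python
# Figures as reported in the abstract.
reads = 4_655_992        # number of sequence reads
read_length_bp = 33      # length of each read in base pairs
mapped_fraction = 0.5013 # fraction of reads mapped by the Pash algorithm

# Total sequence output in base pairs and in megabase pairs.
total_bp = reads * read_length_bp
total_mbp = total_bp / 1e6

# Approximate number of reads mapped to the mouse genome.
mapped_reads = reads * mapped_fraction

print(f"total: {total_bp} bp (about {total_mbp:.0f} Mbp)")
print(f"mapped reads: about {mapped_reads:.0f}")
```

    The product is 153,647,736 bp, which rounds to the 154 Mbp stated in the abstract.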

  17. Sequence requirements for localization and packaging of Ty3 retroelement RNA

    PubMed Central

    Clemens, Kristina; Bilanchone, Virginia; Beliakova-Bethell, Nadejda; Larsen, Liza S.Z.; Nguyen, Kim; Sandmeyer, Suzanne

    2012-01-01

    Retroviruses and retrotransposons package genomic RNA into virus-like particles (VLPs) in a poorly understood process. Expression of the budding yeast retrotransposon Ty3 results in the formation of cytoplasmic Ty3 VLP assembly foci comprised of Ty3 RNA and proteins, and cellular factors associated with RNA processing body (PB) components, which modulate translation and effect nonsense-mediated decay (NMD). A series of Ty3 RNA variants were tested to understand the effects of read-through translation via programmed frameshifting on RNA localization and packaging into VLPs, and to identify the roles of coding and non-coding sequences in those processes. These experiments showed that a low level of read-through translation of the downstream open reading frame (as opposed to no translation or translation without frameshifting) is important for localization of full-length Ty3 RNA to foci. Ty3 RNA variants associated with PB components via independent determinants in the native Ty3 untranslated regions (UTRs) and in GAG3-POL3 sequences flanked by UTRs adapted from non-Ty3 transcripts. However, despite localization, RNAs containing GAG3-POL3 but lacking Ty3 UTRs were not packaged efficiently. Surprisingly, sequences within Ty3 UTRs, which bind the initiator tRNAMet proposed to provide the dimerization interface, were not required for packaging of full-length Ty3 RNA into VLPs. In summary, our results demonstrate that Gag3 is sufficient and required for localization and packaging of RNAs containing Ty3 UTRs and support a role for POL3 sequences, translation of which is attenuated by programmed frameshifting, in both localization and packaging of the Ty3 full-length gRNA. PMID:23073180

  18. R3D-2-MSA: the RNA 3D structure-to-multiple sequence alignment server.

    PubMed

    Cannone, Jamie J; Sweeney, Blake A; Petrov, Anton I; Gutell, Robin R; Zirbel, Craig L; Leontis, Neocles

    2015-07-01

    The RNA 3D Structure-to-Multiple Sequence Alignment Server (R3D-2-MSA) is a new web service that seamlessly links RNA three-dimensional (3D) structures to high-quality RNA multiple sequence alignments (MSAs) from diverse biological sources. In this first release, R3D-2-MSA provides manual and programmatic access to curated, representative ribosomal RNA sequence alignments from bacterial, archaeal, eukaryal and organellar ribosomes, using nucleotide numbers from representative atomic-resolution 3D structures. A web-based front end is available for manual entry and an Application Program Interface for programmatic access. Users can specify up to five ranges of nucleotides and 50 nucleotide positions per range. The R3D-2-MSA server maps these ranges to the appropriate columns of the corresponding MSA and returns the contents of the columns, either for display in a web browser or in JSON format for subsequent programmatic use. The browser output page provides a 3D interactive display of the query, a full list of sequence variants with taxonomic information and a statistical summary of distinct sequence variants found. The output can be filtered and sorted in the browser. Previous user queries can be viewed at any time by resubmitting the output URL, which encodes the search and re-generates the results. The service is freely available with no login requirement at http://rna.bgsu.edu/r3d-2-msa. PMID:26048960
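
    The query limits stated above (up to five ranges, up to 50 nucleotide positions per range) can be checked on the client side before a request is sent. The Python sketch below is a hypothetical helper, not part of the R3D-2-MSA API, and it makes no request to the server. It assumes each range is an inclusive pair of plain integer nucleotide numbers, which may not hold for structures whose numbering includes insertion codes.

```python
MAX_RANGES = 5                 # limit stated in the abstract
MAX_POSITIONS_PER_RANGE = 50   # limit stated in the abstract


def validate_ranges(ranges):
    """Check a list of (start, end) inclusive nucleotide ranges against the limits.

    Returns True when the query is within limits; raises ValueError otherwise.
    """
    if not ranges:
        raise ValueError("at least one range is required")
    if len(ranges) > MAX_RANGES:
        raise ValueError(f"at most {MAX_RANGES} ranges are allowed")
    for start, end in ranges:
        if start > end:
            raise ValueError(f"range {start}-{end} is reversed")
        if end - start + 1 > MAX_POSITIONS_PER_RANGE:
            raise ValueError(
                f"range {start}-{end} exceeds {MAX_POSITIONS_PER_RANGE} positions"
            )
    return True


# Hypothetical query: two ranges, each within the per-range limit.
ok = validate_ranges([(1405, 1410), (1490, 1500)])
```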

  19. R3D-2-MSA: the RNA 3D structure-to-multiple sequence alignment server

    PubMed Central

    Cannone, Jamie J.; Sweeney, Blake A.; Petrov, Anton I.; Gutell, Robin R.; Zirbel, Craig L.; Leontis, Neocles

    2015-01-01

    The RNA 3D Structure-to-Multiple Sequence Alignment Server (R3D-2-MSA) is a new web service that seamlessly links RNA three-dimensional (3D) structures to high-quality RNA multiple sequence alignments (MSAs) from diverse biological sources. In this first release, R3D-2-MSA provides manual and programmatic access to curated, representative ribosomal RNA sequence alignments from bacterial, archaeal, eukaryal and organellar ribosomes, using nucleotide numbers from representative atomic-resolution 3D structures. A web-based front end is available for manual entry and an Application Program Interface for programmatic access. Users can specify up to five ranges of nucleotides and 50 nucleotide positions per range. The R3D-2-MSA server maps these ranges to the appropriate columns of the corresponding MSA and returns the contents of the columns, either for display in a web browser or in JSON format for subsequent programmatic use. The browser output page provides a 3D interactive display of the query, a full list of sequence variants with taxonomic information and a statistical summary of distinct sequence variants found. The output can be filtered and sorted in the browser. Previous user queries can be viewed at any time by resubmitting the output URL, which encodes the search and re-generates the results. The service is freely available with no login requirement at http://rna.bgsu.edu/r3d-2-msa. PMID:26048960

  20. Sequence and phylogenetic analysis of SSU rRNA gene of five microsporidia.

    PubMed

    Dong, ShiNan; Shen, ZhongYuan; Xu, Li; Zhu, Feng

    2010-01-01

    The complete small subunit rRNA (SSU rRNA) gene sequences of five microsporidia including Nosema heliothidis, and four novel microsporidia isolated from Pieris rapae, Phyllobrotica armta, Hemerophila atrilineata, and Bombyx mori, respectively, were obtained by PCR amplification, cloning, and sequencing. Two phylogenetic trees based on SSU rRNA sequences had been constructed by using Neighbor-Joining of Phylip software and UPGMA of MEGA4.0 software. The taxonomic status of four novel microsporidia was determined by analysis of phylogenetic relationship, length, G+C content, identity, and divergence of the SSU rRNA sequences. The results showed that the microsporidia isolated from Pieris rapae, Phyllobrotica armta, and Hemerophila atrilineata have close phylogenetic relationship with the Nosema, while another microsporidium isolated from Bombyx mori is closely related to the Endoreticulatus. So, we temporarily classify three novel species of microsporidia to genus Nosema, as Nosema sp. PR, Nosema sp. PA, Nosema sp. HA. Another is temporarily classified into genus Endoreticulatus, as Endoreticulatus sp. Zhenjiang. The result indicated as well that it is feasible and valuable to elucidate phylogenetic relationships and taxonomic status of microsporidian species by analyzing information from SSU rRNA sequences of microsporidia. PMID:19768503

  1. Splice site consensus sequences are preferentially accessible to nucleases in isolated adenovirus RNA.

    PubMed Central

    Munroe, S H; Duthie, R S

    1986-01-01

    The conformation of RNA sequences spanning five 3' splice sites and two 5' splice sites in adenovirus mRNA was probed by partial digestion with single-strand specific nucleases. Although cleavage of nucleotides near both 3' and 5' splice sites was observed, most striking was the preferential digestion of sequences near the 3' splice site. At each 3' splice site a region of very strong cleavage is observed at low concentrations of enzyme near the splice site consensus sequence or the upstream branch point consensus sequence. Additional sites of moderately strong cutting near the branch point consensus sequence were observed in those sequences where the splice site was the preferred target. Since recognition of the 3' splice site and branch site appear to be early events in mRNA splicing, these observations may indicate that the local conformation of the splice site sequences may play a direct or indirect role in enhancing the accessibility of sequences important for splicing. PMID:3024107

  2. High-quality RNA extraction from copepods for Next Generation Sequencing: A comparative study.

    PubMed

    Asai, Sneha; Ianora, Adrianna; Lauritano, Chiara; Lindeque, Penelope K; Carotenuto, Ylenia

    2015-12-01

    Despite the ecological importance of copepods, few Next Generation Sequencing (NGS) studies have been performed on small crustaceans, and a standard method for RNA extraction is lacking. In this study, we compared three commonly used methods: TRIzol®, Aurum Total RNA Mini Kit and Qiagen RNeasy Micro Kit, in combination with preservation reagents TRIzol® or RNAlater®, to obtain RNA of high quality and quantity from copepods for NGS. Total RNA was extracted from the copepods Calanus helgolandicus, Centropages typicus and Temora stylifera and its quantity and quality were evaluated using NanoDrop, agarose gel electrophoresis and Agilent Bioanalyzer. Our results demonstrate that preservation of copepods in RNAlater® and extraction with Qiagen RNeasy Micro Kit was the optimal isolation method for obtaining RNA of high quality and quantity for NGS studies of C. helgolandicus. Intriguingly, C. helgolandicus 28S rRNA is formed by two subunits that separate after heat-denaturation and migrate along with 18S rRNA. This unique property of protostome RNA has never been reported in copepods. Overall, our comparative study on RNA extraction protocols will help increase gene expression studies on copepods using high-throughput applications, such as RNA-Seq and microarrays. PMID:25546577

  3. The reverse transcription signature of N-1-methyladenosine in RNA-Seq is sequence dependent

    PubMed Central

    Hauenschild, Ralf; Tserovski, Lyudmil; Schmid, Katharina; Thüring, Kathrin; Winz, Marie-Luise; Sharma, Sunny; Entian, Karl-Dieter; Wacheul, Ludivine; Lafontaine, Denis L. J.; Anderson, James; Alfonzo, Juan; Hildebrandt, Andreas; Jäschke, Andres; Motorin, Yuri; Helm, Mark

    2015-01-01

    The combination of Reverse Transcription (RT) and high-throughput sequencing has emerged as a powerful combination to detect modified nucleotides in RNA via analysis of either abortive RT-products or of the incorporation of mismatched dNTPs into cDNA. Here we simultaneously analyze both parameters in detail with respect to the occurrence of N-1-methyladenosine (m1A) in the template RNA. This naturally occurring modification is associated with structural effects, but it is also known as a mediator of antibiotic resistance in ribosomal RNA. In structural probing experiments with dimethylsulfate, m1A is routinely detected by RT-arrest. A specifically developed RNA-Seq protocol was tailored to the simultaneous analysis of RT-arrest and misincorporation patterns. By application to a variety of native and synthetic RNA preparations, we found a characteristic signature of m1A, which, in addition to an arrest rate, features misincorporation as a significant component. Detailed analysis suggests that the signature depends on RNA structure and on the nature of the nucleotide 3′ of m1A in the template RNA, meaning it is sequence dependent. The RT-signature of m1A was used for inspection and confirmation of suspected modification sites and resulted in the identification of hitherto unknown m1A residues in trypanosomal tRNA. PMID:26365242

  4. Study design requirements for RNA sequencing-based breast cancer diagnostics.

    PubMed

    Mer, Arvind Singh; Klevebring, Daniel; Grönberg, Henrik; Rantalainen, Mattias

    2016-01-01

    Sequencing-based molecular characterization of tumors provides information required for individualized cancer treatment. There are well-defined molecular subtypes of breast cancer that provide improved prognostication compared to routine biomarkers. However, molecular subtyping is not yet implemented in routine breast cancer care. Clinical translation is dependent on subtype prediction models providing high sensitivity and specificity. In this study we evaluate sample size and RNA-sequencing read requirements for breast cancer subtyping to facilitate rational design of translational studies. We applied subsampling to ascertain the effect of training sample size and the number of RNA sequencing reads on classification accuracy of molecular subtype and routine biomarker prediction models (unsupervised and supervised). Subtype classification accuracy improved with increasing sample size up to N = 750 (accuracy = 0.93), although with a modest improvement beyond N = 350 (accuracy = 0.92). Prediction of routine biomarkers achieved accuracy of 0.94 (ER) and 0.92 (Her2) at N = 200. Subtype classification improved with RNA-sequencing library size up to 5 million reads. Development of molecular subtyping models for cancer diagnostics requires well-designed studies. Sample size and the number of RNA sequencing reads directly influence accuracy of molecular subtyping. Results in this study provide key information for rational design of translational studies aiming to bring sequencing-based diagnostics to the clinic. PMID:26830453

  5. Study design requirements for RNA sequencing-based breast cancer diagnostics

    PubMed Central

    Mer, Arvind Singh; Klevebring, Daniel; Grönberg, Henrik; Rantalainen, Mattias

    2016-01-01

    Sequencing-based molecular characterization of tumors provides information required for individualized cancer treatment. There are well-defined molecular subtypes of breast cancer that provide improved prognostication compared to routine biomarkers. However, molecular subtyping is not yet implemented in routine breast cancer care. Clinical translation is dependent on subtype prediction models providing high sensitivity and specificity. In this study we evaluate sample size and RNA-sequencing read requirements for breast cancer subtyping to facilitate rational design of translational studies. We applied subsampling to ascertain the effect of training sample size and the number of RNA sequencing reads on classification accuracy of molecular subtype and routine biomarker prediction models (unsupervised and supervised). Subtype classification accuracy improved with increasing sample size up to N = 750 (accuracy = 0.93), although with a modest improvement beyond N = 350 (accuracy = 0.92). Prediction of routine biomarkers achieved accuracy of 0.94 (ER) and 0.92 (Her2) at N = 200. Subtype classification improved with RNA-sequencing library size up to 5 million reads. Development of molecular subtyping models for cancer diagnostics requires well-designed studies. Sample size and the number of RNA sequencing reads directly influence accuracy of molecular subtyping. Results in this study provide key information for rational design of translational studies aiming to bring sequencing-based diagnostics to the clinic. PMID:26830453

  6. Two methods for full-length RNA sequencing for low quantities of cells and single cells

    PubMed Central

    Pan, Xinghua; Durrett, Russell E.; Zhu, Haiying; Tanaka, Yoshiaki; Li, Yumei; Zi, Xiaoyuan; Marjani, Sadie L.; Euskirchen, Ghia; Ma, Chao; LaMotte, Robert H.; Park, In-Hyun; Snyder, Michael P.; Mason, Christopher E.; Weissman, Sherman M.

    2013-01-01

    The ability to determine the gene expression pattern in low quantities of cells or single cells is important for resolving a variety of problems in many biological disciplines. A robust description of the expression signature of a single cell requires determination of the full-length sequence of the expressed mRNAs in the cell, yet existing methods have either 3′ biased or variable transcript representation. Here, we report our protocols for the amplification and high-throughput sequencing of very small amounts of RNA, using either semirandom primed PCR or phi29 DNA polymerase-based DNA amplification of the cDNA generated with oligo-dT and/or random oligonucleotide primers. Unlike existing methods, these protocols produce relatively uniformly distributed sequences covering the full length of almost all transcripts independent of their sizes, from 1,000 to 10 cells, and even with single cells. Both protocols produced satisfactory detection/coverage of the abundant mRNAs from a single K562 erythroleukemic cell or a single dorsal root ganglion neuron. The phi29-based method produces long products with less noise, uses an isothermal reaction, and is simple to practice. The semirandom primed PCR procedure is more sensitive and reproducible at low transcript levels or with low quantities of cells. These methods provide tools for mRNA sequencing or RNA sequencing when only low quantities of cells, a single cell, or even degraded RNA are available for profiling. PMID:23267071

  7. The landscape of fusion transcripts in spitzoid melanoma and biologically indeterminate spitzoid tumors by RNA sequencing

    PubMed Central

    Wu, Gang; Barnhill, Raymond L.; Lee, Seungjae; Li, Yongjin; Shao, Ying; Easton, John; Dalton, James; Zhang, Jinghui; Pappo, Alberto; Bahrami, Armita

    2016-01-01

    Kinase activation by chromosomal translocations is a common mechanism that drives tumorigenesis in spitzoid neoplasms. To explore the landscape of fusion transcripts in these tumors, we performed whole-transcriptome sequencing using formalin-fixed paraffin-embedded tissues in malignant or biologically indeterminate spitzoid tumors from 7 patients (age 2–14 years). RNA sequence libraries enriched for coding regions were prepared and the sequencing was analyzed by a novel assembly-based algorithm designed for detecting complex fusions. In addition, tumor samples were screened for hotspot TERT promoter mutations, and telomerase expression was assessed by TERT mRNA in situ hybridization (ISH). Two patients had widespread metastasis and subsequently died of disease, and 5 patients had a benign clinical course on limited follow-up (mean: 30 months). RNA sequencing and TERT mRNA ISH were successful in 6 tumors and unsuccessful in 1 disseminating tumor due to low RNA quality. RNA sequencing identified a kinase fusion in 5 of the 6 sequenced tumors: TPM3–NTRK1 (2 tumors), complex rearrangements involving TPM3, ALK, and IL6R (1 tumor), BAIAP2L1–BRAF (1 tumor), and EML4–BRAF (1 disseminating tumor). All predicted chimeric transcripts were expressed at high levels and contained the intact kinase domain. In addition, 2 tumors each contained a second fusion gene, ARID1B-SNX9 or PTPRZ1-NFAM1. The detected chimeric genes were validated by home-brew break-apart or fusion fluorescence in situ hybridization. The 2 disseminating tumors each harbored the TERT promoter −124C>T (Chr 5:1,295,228 hg19 coordinate) mutation whereas the remaining 5 tumors retained the wild-type gene. The presence of the −124C>T mutation correlated with telomerase expression by TERT mRNA ISH. In summary, we demonstrated complex fusion transcripts and novel partner genes for BRAF by RNA sequencing of FFPE samples. The diversity of gene fusions demonstrated by RNA sequencing defines the molecular

  8. Prediction of Immunomodulatory potential of an RNA sequence for designing non-toxic siRNAs and RNA-based vaccine adjuvants

    PubMed Central

    Chaudhary, Kumardeep; Nagpal, Gandharva; Dhanda, Sandeep Kumar; Raghava, Gajendra P. S.

    2016-01-01

    Our innate immune system recognizes a foreign RNA sequence of a pathogen and activates the immune system to eliminate the pathogen from our body. This immunomodulatory potential of RNA can be used to design RNA-based immunotherapy and vaccine adjuvants. In the case of siRNA-based therapy, the immunomodulatory effect of an RNA sequence is unwanted as it may cause immunotoxicity. Thus, we developed a method for designing a single-stranded RNA (ssRNA) sequence with desired immunomodulatory potential, for use in RNA-based therapeutics, immunotherapy and vaccine adjuvants. The dataset used for training and testing our models consists of 602 experimentally verified immunomodulatory oligoribonucleotides (IMORNs) that are ssRNA sequences of length 17 to 27 nucleotides and 520 circulating miRNAs as non-immunomodulatory sequences. We developed prediction models using various features that include composition-based features, binary profile, selected features, and hybrid features. All models were evaluated using five-fold cross-validation and external validation techniques, achieving a maximum mean Matthews Correlation Coefficient (MCC) of 0.86 with 93% accuracy. We identified motifs using MERCI software and observed the abundance of adenine (A) in motifs. Based on the above study, we developed a web server, imRNA, comprising various modules important for designing RNA-based therapeutics (http://crdd.osdd.net/raghava/imrna/). PMID:26861761
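
    Two quantities mentioned in this abstract lend themselves to a short sketch: a composition-based feature vector and the Matthews Correlation Coefficient. The function names are hypothetical, and the feature shown (single-nucleotide frequencies) is only one simple example of a composition-based feature; the abstract does not specify which compositions the authors used.

```python
import math

def nucleotide_composition(seq):
    """Fraction of A, C, G and U in an RNA sequence (T is read as U)."""
    seq = seq.upper().replace("T", "U")
    return [seq.count(base) / len(seq) for base in "ACGU"]

def mcc(tp, tn, fp, fn):
    """Matthews Correlation Coefficient from a binary confusion matrix.

    MCC = (TP*TN - FP*FN) / sqrt((TP+FP)(TP+FN)(TN+FP)(TN+FN)).
    Returns 0.0 when the denominator is zero (a common convention).
    """
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0

features = nucleotide_composition("AAGU")   # [0.5, 0.0, 0.25, 0.25]
perfect = mcc(tp=50, tn=50, fp=0, fn=0)     # 1.0 for a perfect classifier
```

    MCC is often preferred over plain accuracy when the two classes differ in size, as with the 602 positive and 520 negative sequences here, because it accounts for all four cells of the confusion matrix.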

  9. Use of yeast nuclear DNA sequences to define the mitochondrial RNA polymerase promoter in vitro.

    PubMed Central

    Marczynski, G T; Schultz, P W; Jaehning, J A

    1989-01-01

    We have extended an earlier observation that the TATA box for the nuclear GAL10 gene serves as a promoter for the mitochondrial RNA polymerase in in vitro transcription reactions (C. S. Winkley, M. J. Keller, and J. A. Jaehning, J. Biol. Chem. 260:14214-14223, 1985). In this work, we demonstrate that other nuclear genes also have upstream sequences that function in vitro as mitochondrial RNA polymerase promoters. These genes include the GAL7 and MEL1 genes, which are regulated in concert with the GAL10 gene, the sigma repetitive element, and the 2-micron plasmid origin of replication. We used in vitro transcription reactions to test a large number of nuclear DNA sequences that contain critical mitochondrial promoter sequences as defined by Biswas et al. (T. K. Biswas, J. C. Edwards, M. Rabinowitz, and G. S. Getz, J. Biol. Chem. 262:13690-13696, 1987). The results of these experiments allowed us to extend the definition of essential promoter elements. This extended sequence, -ACTATAAACGatcATAG-, was frequently found in the upstream regulatory regions of nuclear genes. On the basis of these observations, we hypothesized that either (i) a catalytic RNA polymerase related to the mitochondrial enzyme functions in the nucleus of the yeast cell or (ii) a DNA sequence recognition factor is shared by the two genetic compartments. By using cells deficient in the catalytic core of the mitochondrial RNA polymerase (rpo41-) and sensitive assays for transcripts initiating from the nuclear promoter sequences, we have conclusively ruled out a role for the catalytic RNA polymerase in synthesizing transcripts from all of the nuclear sequences analyzed. The possibility that a DNA sequence recognition factor functions in both the nucleus and the mitochondria remains to be tested. PMID:2677667
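
    Searching upstream regions for the extended sequence quoted above could look like the sketch below. It rests on a labeled assumption: the lowercase letters in -ACTATAAACGatcATAG- are treated as positions where any base is allowed, which the abstract does not state. The function names are hypothetical.

```python
import re

MOTIF = "ACTATAAACGatcATAG"  # sequence as quoted in the abstract

def motif_regex(motif):
    """Uppercase letters must match; lowercase are wildcards (an assumption)."""
    body = "".join(c if c.isupper() else "[ACGT]" for c in motif)
    return re.compile("(?=(" + body + "))")  # lookahead finds overlapping hits

def revcomp(seq):
    """Reverse complement of an uppercase DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

def find_sites(seq, motif=MOTIF):
    """Return sorted (start, strand) pairs, start given on the forward strand."""
    seq = seq.upper()
    pattern = motif_regex(motif)
    hits = [(m.start(), "+") for m in pattern.finditer(seq)]
    rc = revcomp(seq)
    hits += [(len(seq) - m.start() - len(motif), "-") for m in pattern.finditer(rc)]
    return sorted(hits)

# Toy upstream region with one forward-strand match starting at index 2.
sites = find_sites("GGACTATAAACGGGGATAGCC")
```

    If the lowercase positions are instead meant to be matched exactly, replacing the wildcard with the uppercase base in motif_regex would give the stricter search.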

  10. Bioinformatics of Cancer ncRNA in High Throughput Sequencing: Present State and Challenges

    PubMed Central

    Jorge, Natasha Andressa Nogueira; Ferreira, Carlos Gil; Passetti, Fabio

    2012-01-01

    Numerous genome sequencing projects have produced an unprecedented amount of data, providing significant information for the discovery of novel non-coding RNAs (ncRNAs). Several ncRNAs have been described to control gene expression and play important roles during cell differentiation and homeostasis. In the last decade, high throughput methods in conjunction with approaches in bioinformatics have been used to identify, classify, and evaluate the expression of hundreds of ncRNAs in normal and pathological states, such as cancer. Patient outcomes have already been associated with differential expression of ncRNAs in normal and tumoral tissues, providing new insights for the development of innovative therapeutic strategies in oncology. In this review, we present and discuss bioinformatics advances in the development of computational approaches to discover ncRNAs and analyze ncRNA data in oncology using high throughput sequencing technologies. PMID:23251139