Sample records for rna structural features

  1. New Era of Studying RNA Secondary Structure and Its Influence on Gene Regulation in Plants.

    PubMed

    Yang, Xiaofei; Yang, Minglei; Deng, Hongjing; Ding, Yiliang

    2018-01-01

    The dynamic structure of RNA plays a central role in post-transcriptional regulation of gene expression such as RNA maturation, degradation, and translation. With the rise of next-generation sequencing, the study of RNA structure has been transformed from in vitro low-throughput RNA structure probing methods to in vivo high-throughput RNA structure profiling. The development of these methods enables incremental studies on the function of RNA structure to be performed, revealing new insights into novel regulatory mechanisms of RNA structure in plants. Genome-wide scale RNA structure profiling allows us to investigate general RNA structural features over tens of thousands of mRNAs and to compare RNA structuromes between plant species. Here, we provide a comprehensive and up-to-date overview of: (i) RNA structure probing methods; (ii) the biological functions of RNA structure; (iii) genome-wide RNA structural features corresponding to their regulatory mechanisms; and (iv) RNA structurome evolution in plants.

  2. TRANSAT: method for detecting the conserved helices of functional RNA structures, including transient, pseudo-knotted and alternative structures.

    PubMed

    Wiebe, Nicholas J P; Meyer, Irmtraud M

    2010-06-24

    The prediction of functional RNA structures has attracted increased interest, as it allows us to study the potential functional roles of many genes. RNA structure prediction methods, however, assume that there is a unique functional RNA structure and also do not predict functional features required for in vivo folding. In order to understand how functional RNA structures form in vivo, we require sophisticated experiments or reliable prediction methods. So far, there exist only a few experimentally validated transient RNA structures. On the computational side, there exist several computer programs which aim to predict the co-transcriptional folding pathway in vivo, but these make a range of simplifying assumptions and do not capture all features known to influence RNA folding in vivo. We want to investigate if evolutionarily related RNA genes fold in a similar way in vivo. To this end, we have developed a new computational method, Transat, which detects conserved helices of high statistical significance. We introduce the method, present a comprehensive performance evaluation and show that Transat is able to predict the structural features of known reference structures including pseudo-knotted ones as well as those of known alternative structural configurations. Transat can also identify unstructured sub-sequences bound by other molecules and provides evidence for new helices which may define folding pathways, supporting the notion that homologous RNA sequences not only assume a similar reference RNA structure, but also fold similarly. Finally, we show that the structural features predicted by Transat differ from those assuming thermodynamic equilibrium. Unlike the existing methods for predicting folding pathways, our method works in a comparative way. This has the disadvantage of not being able to predict features as a function of time, but has the considerable advantage of highlighting conserved features and of not requiring a detailed knowledge of the cellular environment.

  3. Conserved and variable domains of RNase MRP RNA.

    PubMed

    Dávila López, Marcela; Rosenblad, Magnus Alm; Samuelsson, Tore

    2009-01-01

    Ribonuclease MRP is a eukaryotic ribonucleoprotein complex consisting of one RNA molecule and 7-10 protein subunits. One important function of MRP is to catalyze an endonucleolytic cleavage during processing of rRNA precursors. RNase MRP is evolutionarily related to RNase P, which is critical for tRNA processing. A large number of MRP RNA sequences that are now available have been used to identify conserved primary and secondary structure features of the molecule. MRP RNA has structural features in common with P RNA such as a conserved catalytic core, but it also has unique features and is characterized by a domain highly variable between species. Information regarding primary and secondary structure features is of interest not only in basic studies of the function of MRP RNA, but also because mutations in the RNA give rise to human genetic diseases such as cartilage-hair hypoplasia.

  4. Protein functional features are reflected in the patterns of mRNA translation speed.

    PubMed

    López, Daniel; Pazos, Florencio

    2015-07-09

    The degeneracy of the genetic code makes it possible for the same amino acid string to be coded by different messenger RNA (mRNA) sequences. These "synonymous mRNAs" may differ largely in a number of aspects related to their overall translational efficiency, such as secondary structure content and availability of the encoded transfer RNAs (tRNAs). Consequently, they may render different yields of the translated polypeptides. These mRNA features related to translation efficiency also play a role locally, resulting in a non-uniform translation speed along the mRNA, which has been previously related to some protein structural features and also used to explain some dramatic effects of "silent" single-nucleotide polymorphisms (SNPs). In this work we perform the first large scale analysis of the relationship between three experimental proxies of mRNA local translation efficiency and the local features of the corresponding encoded proteins. We found that a number of protein functional and structural features are reflected in the patterns of ribosome occupancy, secondary structure and tRNA availability along the mRNA. One or more of these proxies of translation speed have distinctive patterns around the mRNA regions coding for certain protein local features. In some cases the three patterns follow a similar trend. We also show specific examples where these patterns of translation speed point to the protein's important structural and functional features. This supports the idea that the genome not only codes the protein functional features as sequences of amino acids, but also as subtle patterns of mRNA properties which, probably through local effects on the translation speed, have some consequence on the final polypeptide. These results open the possibility of predicting a protein's functional regions based on a single genomic sequence, and have implications for heterologous protein expression and fine-tuning protein function.
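
    To make the notion of "synonymous mRNAs" concrete, the following minimal sketch counts how many distinct coding sequences encode a given peptide under the standard genetic code. It is an illustration only, not part of the cited study; the function name count_synonymous_mrnas and the table name CODON_DEGENERACY are hypothetical, and stop codons are ignored.

```python
from math import prod

# Number of codons per amino acid in the standard genetic code
# (one-letter amino acid codes; the 61 sense codons only).
CODON_DEGENERACY = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}


def count_synonymous_mrnas(peptide: str) -> int:
    """Return how many distinct coding sequences encode the peptide.

    Hypothetical helper: multiplies the codon degeneracy of each residue.
    """
    return prod(CODON_DEGENERACY[aa] for aa in peptide.upper())


if __name__ == "__main__":
    # A short toy peptide; even a few residues allow many synonymous mRNAs.
    print(count_synonymous_mrnas("MKLS"))
```

    The count grows multiplicatively with peptide length, which is why synonymous variants can differ so widely in structure content and tRNA usage.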

  5. In vivo genome-wide profiling of RNA secondary structure reveals novel regulatory features.

    PubMed

    Ding, Yiliang; Tang, Yin; Kwok, Chun Kit; Zhang, Yu; Bevilacqua, Philip C; Assmann, Sarah M

    2014-01-30

    RNA structure has critical roles in processes ranging from ligand sensing to the regulation of translation, polyadenylation and splicing. However, a lack of genome-wide in vivo RNA structural data has limited our understanding of how RNA structure regulates gene expression in living cells. Here we present a high-throughput, genome-wide in vivo RNA structure probing method, structure-seq, in which dimethyl sulphate methylation of unprotected adenines and cytosines is identified by next-generation sequencing. Application of this method to Arabidopsis thaliana seedlings yielded the first in vivo genome-wide RNA structure map at nucleotide resolution for any organism, with quantitative structural information across more than 10,000 transcripts. Our analysis reveals a three-nucleotide periodic repeat pattern in the structure of coding regions, as well as a less-structured region immediately upstream of the start codon, and shows that these features are strongly correlated with translation efficiency. We also find patterns of strong and weak secondary structure at sites of alternative polyadenylation, as well as strong secondary structure at 5' splice sites that correlates with unspliced events. Notably, in vivo structures of messenger RNAs annotated for stress responses are poorly predicted in silico, whereas mRNA structures of genes related to cell function maintenance are well predicted. Global comparison of several structural features between these two categories shows that the mRNAs associated with stress responses tend to have more single-strandedness, longer maximal loop length and higher free energy per nucleotide, features that may allow these RNAs to undergo conformational changes in response to environmental conditions. Structure-seq allows the RNA structurome and its biological roles to be interrogated on a genome-wide scale and should be applicable to any organism.
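
    One simple way to look for a three-nucleotide periodic pattern in a per-nucleotide signal is to average the signal separately at codon positions 1, 2 and 3. The sketch below does this for a list of numbers; it is not the structure-seq analysis pipeline, the function name codon_phase_means is hypothetical, and the toy profile is synthetic data invented purely for illustration.

```python
def codon_phase_means(reactivity, cds_start=0):
    """Average a per-nucleotide signal separately for codon positions 1, 2 and 3.

    reactivity: sequence of floats, one value per nucleotide (assumed input).
    cds_start:  0-based index of the first nucleotide of the coding region.
    """
    sums = [0.0, 0.0, 0.0]
    counts = [0, 0, 0]
    for i, value in enumerate(reactivity[cds_start:]):
        sums[i % 3] += value
        counts[i % 3] += 1
    # A phase with no observations yields NaN rather than raising an error.
    return [s / c if c else float("nan") for s, c in zip(sums, counts)]


if __name__ == "__main__":
    # Synthetic toy profile with an exaggerated period-3 pattern.
    toy_profile = [0.9, 0.2, 0.4] * 5
    print(codon_phase_means(toy_profile))
```

    Clearly different means across the three phases would hint at periodicity; a real analysis would also need a significance test and many transcripts.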

  6. In silico methods for co-transcriptional RNA secondary structure prediction and for investigating alternative RNA structure expression.

    PubMed

    Meyer, Irmtraud M

    2017-05-01

    RNA transcripts are the primary products of active genes in any living organism, including many viruses. Their cellular destiny not only depends on primary sequence signals, but can also be determined by RNA structure. Recent experimental evidence shows that many transcripts can be assigned more than a single functional RNA structure throughout their cellular life and that structure formation happens co-transcriptionally, i.e. as the transcript is synthesised in the cell. Moreover, functional RNA structures are not limited to non-coding transcripts, but can also feature in coding transcripts. The picture that now emerges is that RNA structures constitute an additional layer of information that can be encoded in any RNA transcript (and on top of other layers of information such as protein-context) in order to exert a wide range of functional roles. Moreover, different encoded RNA structures can be expressed at different stages of a transcript's life in order to alter the transcript's behaviour depending on its actual cellular context. Similar to the concept of alternative splicing for protein-coding genes, where a single transcript can yield different proteins depending on cellular context, it is thus appropriate to propose the notion of alternative RNA structure expression for any given transcript. This review introduces several computational strategies that my group developed to detect different aspects of RNA structure expression in vivo. Two aspects are of particular interest to us: (1) RNA secondary structure features that emerge during co-transcriptional folding and (2) functional RNA structure features that are expressed at different times of a transcript's life and potentially mutually exclusive. Copyright © 2017. Published by Elsevier Inc.

  7. On the importance of cotranscriptional RNA structure formation

    PubMed Central

    Lai, Daniel; Proctor, Jeff R.; Meyer, Irmtraud M.

    2013-01-01

    The expression of genes, both coding and noncoding, can be significantly influenced by RNA structural features of their corresponding transcripts. There is by now mounting experimental and some theoretical evidence that structure formation in vivo starts during transcription and that this cotranscriptional folding determines the functional RNA structural features that are being formed. Several decades of research in bioinformatics have resulted in a wide range of computational methods for predicting RNA secondary structures. Almost all state-of-the-art methods in terms of prediction accuracy, however, completely ignore the process of structure formation and focus exclusively on the final RNA structure. This review hopes to bridge this gap. We summarize the existing evidence for cotranscriptional folding and then review the different, currently used strategies for RNA secondary-structure prediction. Finally, we propose a range of ideas on how state-of-the-art methods could be potentially improved by explicitly capturing the process of cotranscriptional structure formation. PMID:24131802

  8. Nucleic Acid Database (NDB)

    Science.gov Websites

    the NDB archive or in the Non-Redundant list. Advanced Search: search for structures based on structural features, chemical features, binding modes, citation and experimental information. Featured Tools: RNA 3D Motif Atlas, a representative collection of RNA 3D internal and hairpin loop motifs; Non-redundant Lists.

  9. bpRNA: large-scale automated annotation and analysis of RNA secondary structure.

    PubMed

    Danaee, Padideh; Rouches, Mason; Wiley, Michelle; Deng, Dezhong; Huang, Liang; Hendrix, David

    2018-05-09

    While RNA secondary structure prediction from sequence data has made remarkable progress, there is a need for improved strategies for annotating the features of RNA secondary structures. Here, we present bpRNA, a novel annotation tool capable of parsing RNA structures, including complex pseudoknot-containing RNAs, to yield an objective, precise, compact, unambiguous, easily-interpretable description of all loops, stems, and pseudoknots, along with the positions, sequence, and flanking base pairs of each such structural feature. We also introduce several new informative representations of RNA structure types to improve structure visualization and interpretation. We have further used bpRNA to generate a web-accessible meta-database, 'bpRNA-1m', of over 100 000 single-molecule, known secondary structures; this is both more fully and accurately annotated and over 20-times larger than existing databases. We use a subset of the database with highly similar (≥90% identical) sequences filtered out to report on statistical trends in sequence, flanking base pairs, and length. Both the bpRNA method and the bpRNA-1m database will be valuable resources both for specific analysis of individual RNA molecules and large-scale analyses such as are useful for updating RNA energy parameters for computational thermodynamic predictions, improving machine learning models for structure prediction, and for benchmarking structure-prediction algorithms.
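
    As a rough illustration of what annotating a secondary structure involves, the sketch below extracts base pairs and hairpin loops from a dot-bracket string. It is not the bpRNA algorithm: it handles only pseudoknot-free structures written with "(", ")" and ".", and the function names parse_dot_bracket and hairpin_loops are hypothetical.

```python
def parse_dot_bracket(structure):
    """Return sorted (i, j) base pairs (0-based) from a pseudoknot-free dot-bracket string."""
    stack, pairs = [], []
    for j, symbol in enumerate(structure):
        if symbol == "(":
            stack.append(j)  # remember the opening position
        elif symbol == ")":
            if not stack:
                raise ValueError(f"unmatched ')' at position {j}")
            pairs.append((stack.pop(), j))  # pair with the most recent '('
        elif symbol != ".":
            raise ValueError(f"unsupported symbol {symbol!r} at position {j}")
    if stack:
        raise ValueError(f"unmatched '(' at position {stack[-1]}")
    return sorted(pairs)


def hairpin_loops(structure):
    """Return (start, end) index ranges of unpaired runs closed directly by one base pair."""
    pairs = parse_dot_bracket(structure)
    return [
        (i + 1, j - 1)
        for i, j in pairs
        if j - i > 1 and set(structure[i + 1:j]) == {"."}
    ]


if __name__ == "__main__":
    toy = "((...))..(....)"
    print(parse_dot_bracket(toy))
    print(hairpin_loops(toy))
```

    A stack suffices here because nested structures pair each closing bracket with the most recent unmatched opening bracket; pseudoknots break that assumption and need additional bracket types.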

  10. Improve the prediction of RNA-binding residues using structural neighbours.

    PubMed

    Li, Quan; Cao, Zanxia; Liu, Haiyan

    2010-03-01

    The interactions between RNA-binding proteins (RBPs) and RNA play key roles in managing some of the cell's basic functions. The identification and prediction of RNA binding sites is important for understanding the RNA-binding mechanism. Computational approaches are being developed to predict RNA-binding residues based on sequence- or structure-derived features. To achieve higher prediction accuracy, improvements on current prediction methods are necessary. We identified that the structural neighbors of RNA-binding and non-RNA-binding residues have different amino acid compositions. Combining this structure-derived feature with evolutionary (PSSM) and other structural information (secondary structure and solvent accessibility) significantly improves the predictions over existing methods. Using a multiple linear regression approach and 6-fold cross validation, our best model achieves an overall correct rate of 87.8% and an MCC of 0.47, with a specificity of 93.4%, and correctly predicts 52.4% of the RNA-binding residues for a dataset containing 107 non-homologous RNA-binding proteins. Compared with existing methods, including the amino acid compositions of structural neighbors leads to a clear improvement. A web server was developed for predicting RNA binding residues in a protein sequence (or structure), which is available at http://mcgill.3322.org/RNA/.
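
    The performance figures quoted above (overall correct rate, specificity, fraction of binding residues recovered, MCC) all derive from a binary confusion matrix. The sketch below shows the standard definitions; the function name binary_metrics is hypothetical and the example counts are arbitrary toy numbers, not data from the cited study.

```python
from math import sqrt


def binary_metrics(tp, fp, tn, fn):
    """Compute common binary-classification metrics from confusion-matrix counts.

    tp/fn: binding residues predicted as binding / non-binding.
    tn/fp: non-binding residues predicted as non-binding / binding.
    """
    total = tp + fp + tn + fn
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return {
        "accuracy": (tp + tn) / total if total else float("nan"),
        "sensitivity": tp / (tp + fn) if (tp + fn) else float("nan"),
        "specificity": tn / (tn + fp) if (tn + fp) else float("nan"),
        # MCC is conventionally set to 0 when any marginal count is zero.
        "mcc": (tp * tn - fp * fn) / denom if denom else 0.0,
    }


if __name__ == "__main__":
    # Arbitrary toy counts for illustration only.
    for name, value in binary_metrics(tp=50, fp=10, tn=90, fn=50).items():
        print(f"{name}: {value:.3f}")
```

    Because binding residues are typically a minority class, accuracy and specificity can be high while sensitivity stays modest; MCC summarizes all four counts in a single balanced score.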

  11. Structural features of microRNA (miRNA) precursors and their relevance to miRNA biogenesis and small interfering RNA/short hairpin RNA design.

    PubMed

    Krol, Jacek; Sobczak, Krzysztof; Wilczynska, Urszula; Drath, Maria; Jasinska, Anna; Kaczynska, Danuta; Krzyzosiak, Wlodzimierz J

    2004-10-01

    We have established the structures of 10 human microRNA (miRNA) precursors using biochemical methods. Eight of these structures turned out to be different from those that were computer-predicted. The differences localized in the terminal loop region and at the opposite side of the precursor hairpin stem. We have analyzed the features of these structures from the perspectives of miRNA biogenesis and active strand selection. We demonstrated the different thermodynamic stability profiles for pre-miRNA hairpins harboring miRNAs at their 5'- and 3'-sides and discussed their functional implications. Our results showed that miRNA prediction based on predicted precursor structures may give ambiguous results, and the success rate is significantly higher for the experimentally determined structures. On the other hand, the differences between the predicted and experimentally determined structures did not affect the stability of termini produced through "conceptual dicing." This result confirms the value of thermodynamic analysis based on mfold as a predictor of strand selection by RNAi-induced silencing complex (RISC).

  12. Accurate prediction of RNA-binding protein residues with two discriminative structural descriptors.

    PubMed

    Sun, Meijian; Wang, Xia; Zou, Chuanxin; He, Zenghui; Liu, Wei; Li, Honglin

    2016-06-07

    RNA-binding proteins participate in many important biological processes concerning RNA-mediated gene regulation, and several computational methods have been recently developed to predict the protein-RNA interactions of RNA-binding proteins. Newly developed discriminative descriptors will help to improve the prediction accuracy of these prediction methods and provide further meaningful information for researchers. In this work, we designed two structural features (residue electrostatic surface potential and triplet interface propensity) and according to the statistical and structural analysis of protein-RNA complexes, the two features were powerful for identifying RNA-binding protein residues. Using these two features and other excellent structure- and sequence-based features, a random forest classifier was constructed to predict RNA-binding residues. The area under the receiver operating characteristic curve (AUC) of five-fold cross-validation for our method on training set RBP195 was 0.900, and when applied to the test set RBP68, the prediction accuracy (ACC) was 0.868, and the F-score was 0.631. The good prediction performance of our method revealed that the two newly designed descriptors could be discriminative for inferring protein residues interacting with RNAs. To facilitate the use of our method, a web-server called RNAProSite, which implements the proposed method, was constructed and is freely available at http://lilab.ecust.edu.cn/NABind .

  13. Structural Features of a Picornavirus Polymerase Involved in the Polyadenylation of Viral RNA

    PubMed Central

    Kempf, Brian J.; Kelly, Michelle M.; Springer, Courtney L.; Peersen, Olve B.

    2013-01-01

    Picornaviruses have 3′ polyadenylated RNA genomes, but the mechanisms by which these genomes are polyadenylated during viral replication remain obscure. Based on prior studies, we proposed a model wherein the poliovirus RNA-dependent RNA polymerase (3Dpol) uses a reiterative transcription mechanism while replicating the poly(A) and poly(U) portions of viral RNA templates. To further test this model, we examined whether mutations in 3Dpol influenced the polyadenylation of virion RNA. We identified nine alanine substitution mutations in 3Dpol that resulted in shorter or longer 3′ poly(A) tails in virion RNA. These mutations could disrupt structural features of 3Dpol required for the recruitment of a cellular poly(A) polymerase; however, the structural orientation of these residues suggests a direct role of 3Dpol in the polyadenylation of RNA genomes. Reaction mixtures containing purified 3Dpol and a template RNA with a defined poly(U) sequence provided data consistent with a template-dependent reiterative transcription mechanism for polyadenylation. The phylogenetically conserved structural features of 3Dpol involved in the polyadenylation of virion RNA include a thumb domain alpha helix that is positioned in the minor groove of the double-stranded RNA product and lysine and arginine residues that interact with the phosphates of both the RNA template and product strands. PMID:23468507

  14. RNA-TVcurve: a Web server for RNA secondary structure comparison based on a multi-scale similarity of its triple vector curve representation.

    PubMed

    Li, Ying; Shi, Xiaohu; Liang, Yanchun; Xie, Juan; Zhang, Yu; Ma, Qin

    2017-01-21

    RNAs have been found to carry diverse functionalities in nature. Inferring the similarity between two given RNAs is a fundamental step to understand and interpret their functional relationship. The majority of functional RNAs show conserved secondary structures, rather than sequence conservation. Algorithms relying on sequence-based features usually have limitations in their prediction performance. Hence, integrating RNA structure features is critical for RNA analysis. Existing algorithms mainly fall into two categories: alignment-based and alignment-free. The alignment-free algorithms of RNA comparison usually have lower time complexity than alignment-based algorithms. An alignment-free RNA comparison algorithm was proposed, in which a novel numerical representation, RNA-TVcurve (triple vector curve representation), of RNA sequence and corresponding secondary structure features is provided. Then a multi-scale similarity score of two given RNAs was designed based on wavelet decomposition of their numerical representation. In support of RNA mutation and phylogenetic analysis, a web server (RNA-TVcurve) was designed based on this alignment-free RNA comparison algorithm. It provides three functional modules: 1) visualization of numerical representation of RNA secondary structure; 2) detection of single-point mutation based on secondary structure; and 3) comparison of pairwise and multiple RNA secondary structures. The web server requires RNA primary sequences as input, while corresponding secondary structures are optional. For primary sequences alone, the web server can compute the secondary structures using the free energy minimization algorithm of the RNAfold tool from the Vienna RNA package. RNA-TVcurve is the first integrated web server, based on an alignment-free method, to deliver a suite of RNA analysis functions, including visualization, mutation analysis and multiple RNA structure comparison. The comparison results with two popular RNA comparison tools, RNApdist and RNAdistance, showed that RNA-TVcurve can efficiently capture subtle relationships among RNAs for mutation detection and non-coding RNA classification. All the relevant results are shown in an intuitive graphical manner, and can be freely downloaded from this server. RNA-TVcurve, along with test examples and detailed documents, is available at: http://ml.jlu.edu.cn/tvcurve/ .

  15. Regulatory effects of cotranscriptional RNA structure formation and transitions.

    PubMed

    Liu, Sheng-Rui; Hu, Chun-Gen; Zhang, Jin-Zhi

    2016-09-01

    RNAs, which play significant roles in many fundamental biological processes of life, fold into sophisticated and precise structures. RNA folding is a dynamic and intricate process, in which conformational transitions of coding and noncoding RNAs form the primary elements of genetic regulation. The cellular environment contains various intrinsic and extrinsic factors that potentially affect RNA folding in vivo, and experimental and theoretical evidence increasingly indicates that the highly flexible features of the RNA structure are affected by these factors, which include the flanking sequence context, physicochemical conditions, cis RNA-RNA interactions, and RNA interactions with other molecules. Furthermore, distinct RNA structures have been identified that govern almost all steps of biological processes in cells, including transcriptional activation and termination, transcriptional mutagenesis, 5'-capping, splicing, 3'-polyadenylation, mRNA export and localization, and translation. Here, we briefly summarize the dynamic and complex features of RNA folding along with a wide variety of intrinsic and extrinsic factors that affect RNA folding. We then provide several examples to elaborate RNA structure-mediated regulation at the transcriptional and posttranscriptional levels. Finally, we illustrate the regulatory roles of RNA structure and discuss advances pertaining to RNA structure in plants. WIREs RNA 2016, 7:562-574. doi: 10.1002/wrna.1350 © 2016 Wiley Periodicals, Inc.

  16. DSSR-enhanced visualization of nucleic acid structures in Jmol

    PubMed Central

    Hanson, Robert M.

    2017-01-01

    Sophisticated and interactive visualizations are essential for making sense of the intricate 3D structures of macromolecules. For proteins, secondary structural components are routinely featured in molecular graphics visualizations. However, the field of RNA structural bioinformatics is still lagging behind; for example, current molecular graphics tools lack built-in support even for base pairs, double helices, or hairpin loops. DSSR (Dissecting the Spatial Structure of RNA) is an integrated and automated command-line tool for the analysis and annotation of RNA tertiary structures. It calculates a comprehensive and unique set of features for characterizing RNA, as well as DNA structures. Jmol is a widely used, open-source Java viewer for 3D structures, with a powerful scripting language. JSmol, its reincarnation based on native JavaScript, has a predominant position in the post Java-applet era for web-based visualization of molecular structures. The DSSR-Jmol integration presented here makes salient features of DSSR readily accessible, either via the Java-based Jmol application itself, or its HTML5-based equivalent, JSmol. The DSSR web service accepts 3D coordinate files (in mmCIF or PDB format) initiated from a Jmol or JSmol session and returns DSSR-derived structural features in JSON format. This seamless combination of DSSR and Jmol/JSmol brings the molecular graphics of 3D RNA structures to a similar level as that for proteins, and enables a much deeper analysis of structural characteristics. It fills a gap in RNA structural bioinformatics, and is freely accessible (via the Jmol application or the JSmol-based website http://jmol.x3dna.org). PMID:28472503

  17. Comprehensive comparative analysis and identification of RNA-binding protein domains: multi-class classification and feature selection.

    PubMed

    Jahandideh, Samad; Srinivasasainagendra, Vinodh; Zhi, Degui

    2012-11-07

    RNA-protein interaction plays an important role in various cellular processes, such as protein synthesis, gene regulation, post-transcriptional gene regulation, alternative splicing, and infections by RNA viruses. In this study, using the Gene Ontology Annotated (GOA) and Structural Classification of Proteins (SCOP) databases, an automatic procedure was designed to capture structurally solved RNA-binding protein domains in different subclasses. Subsequently, we applied tuned multi-class SVM (TMCSVM), Random Forest (RF), and multi-class ℓ1/ℓq-regularized logistic regression (MCRLR) for analyzing and classifying RNA-binding protein domains based on a comprehensive set of sequence and structural features. In this study, we compared the prediction accuracy of three different state-of-the-art predictor methods. In our results, TMCSVM outperforms the other methods, which suggests the potential of TMCSVM as a useful tool for facilitating the multi-class prediction of RNA-binding protein domains. On the other hand, MCRLR, by elucidating the importance of features for their contribution to the predictive accuracy of RNA-binding protein domain subclasses, helps us to provide some biological insights into the roles of sequences and structures in protein-RNA interactions.

  18. DSSR-enhanced visualization of nucleic acid structures in Jmol.

    PubMed

    Hanson, Robert M; Lu, Xiang-Jun

    2017-07-03

    Sophisticated and interactive visualizations are essential for making sense of the intricate 3D structures of macromolecules. For proteins, secondary structural components are routinely featured in molecular graphics visualizations. However, the field of RNA structural bioinformatics is still lagging behind; for example, current molecular graphics tools lack built-in support even for base pairs, double helices, or hairpin loops. DSSR (Dissecting the Spatial Structure of RNA) is an integrated and automated command-line tool for the analysis and annotation of RNA tertiary structures. It calculates a comprehensive and unique set of features for characterizing RNA, as well as DNA structures. Jmol is a widely used, open-source Java viewer for 3D structures, with a powerful scripting language. JSmol, its reincarnation based on native JavaScript, has a predominant position in the post Java-applet era for web-based visualization of molecular structures. The DSSR-Jmol integration presented here makes salient features of DSSR readily accessible, either via the Java-based Jmol application itself, or its HTML5-based equivalent, JSmol. The DSSR web service accepts 3D coordinate files (in mmCIF or PDB format) initiated from a Jmol or JSmol session and returns DSSR-derived structural features in JSON format. This seamless combination of DSSR and Jmol/JSmol brings the molecular graphics of 3D RNA structures to a similar level as that for proteins, and enables a much deeper analysis of structural characteristics. It fills a gap in RNA structural bioinformatics, and is freely accessible (via the Jmol application or the JSmol-based website http://jmol.x3dna.org). © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  19. Structure, recognition and adaptive binding in RNA aptamer complexes.

    PubMed

    Patel, D J; Suri, A K; Jiang, F; Jiang, L; Fan, P; Kumar, R A; Nonin, S

    1997-10-10

    Novel features of RNA structure, recognition and discrimination have been recently elucidated through the solution structural characterization of RNA aptamers that bind cofactors, aminoglycoside antibiotics, amino acids and peptides with high affinity and specificity. This review presents the solution structures of RNA aptamer complexes with adenosine monophosphate, flavin mononucleotide, arginine/citrulline and tobramycin together with an example of hydrogen exchange measurements of the base-pair kinetics for the AMP-RNA aptamer complex. A comparative analysis of the structures of these RNA aptamer complexes yields the principles, patterns and diversity associated with RNA architecture, molecular recognition and adaptive binding associated with complex formation.

  20. The nucleotide sequence of the entire ribosomal DNA operon and the structure of the large subunit rRNA of Giardia muris.

    PubMed

    van Keulen, H; Gutell, R R; Campbell, S R; Erlandsen, S L; Jarroll, E L

    1992-10-01

    The total nucleotide sequence of the rDNA of Giardia muris, an intestinal protozoan parasite of rodents, has been determined. The repeat unit is 7668 basepairs (bp) in size and consists of a spacer of 3314 bp, a small-subunit rRNA (SSU-rRNA) gene of 1429 bp, and a large-subunit rRNA (LSU-rRNA) gene of 2698 bp. The spacer contains long direct repeats and is heterogeneous in size. The LSU-rRNA of G. muris was compared to that of the human intestinal parasite Giardia duodenalis, to the bird parasite Giardia ardeae, and to that of Escherichia coli. The LSU-rRNA has a size comparable to the 23S rRNA of E. coli but shows structural features typical for eukaryotes. Some variable regions are typically small and account for the overall smaller size of this rRNA. The structure of the G. muris LSU-rRNA is similar to that of the other Giardia rRNA, but each rRNA has characteristic features residing in a number of variable regions.

  1. A folded viral noncoding RNA blocks host cell exoribonucleases through a conformationally dynamic RNA structure.

    PubMed

    Steckelberg, Anna-Lena; Akiyama, Benjamin M; Costantino, David A; Sit, Tim L; Nix, Jay C; Kieft, Jeffrey S

    2018-06-19

    Folded RNA elements that block processive 5' → 3' cellular exoribonucleases (xrRNAs) to produce biologically active viral noncoding RNAs have been discovered in flaviviruses, potentially revealing a new mode of RNA maturation. However, whether this RNA structure-dependent mechanism exists elsewhere and, if so, whether a singular RNA fold is required, have been unclear. Here we demonstrate the existence of authentic RNA structure-dependent xrRNAs in dianthoviruses, plant-infecting viruses unrelated to animal-infecting flaviviruses. These xrRNAs have no sequence similarity to known xrRNAs; thus, we used a combination of biochemistry and virology to characterize their sequence requirements and mechanism of stopping exoribonucleases. By solving the structure of a dianthovirus xrRNA by X-ray crystallography, we reveal a complex fold that is very different from that of the flavivirus xrRNAs. However, both versions of xrRNAs contain a unique topological feature, a pseudoknot that creates a protective ring around the 5' end of the RNA structure; this may be a defining structural feature of xrRNAs. Single-molecule FRET experiments reveal that the dianthovirus xrRNAs undergo conformational changes and can use "codegradational remodeling," exploiting the exoribonucleases' degradation-linked helicase activity to help form their resistant structure; such a mechanism has not previously been reported. Convergent evolution has created RNA structure-dependent exoribonuclease resistance in different contexts, which establishes it as a general RNA maturation mechanism and defines xrRNAs as an authentic functional class of RNAs.

  2. Principles for Predicting RNA Secondary Structure Design Difficulty.

    PubMed

    Anderson-Lee, Jeff; Fisker, Eli; Kosaraju, Vineet; Wu, Michelle; Kong, Justin; Lee, Jeehyung; Lee, Minjae; Zada, Mathew; Treuille, Adrien; Das, Rhiju

    2016-02-27

    Designing RNAs that form specific secondary structures is enabling better understanding and control of living systems through RNA-guided silencing, genome editing and protein organization. Little is known, however, about which RNA secondary structures might be tractable for downstream sequence design, increasing the time and expense of design efforts due to inefficient secondary structure choices. Here, we present insights into specific structural features that increase the difficulty of finding sequences that fold into a target RNA secondary structure, summarizing the design efforts of tens of thousands of human participants and three automated algorithms (RNAInverse, INFO-RNA and RNA-SSD) in the Eterna massive open laboratory. Subsequent tests through three independent RNA design algorithms (NUPACK, DSS-Opt and MODENA) confirmed the hypothesized importance of several features in determining design difficulty, including sequence length, mean stem length, symmetry and specific difficult-to-design motifs such as zigzags. Based on these results, we have compiled an Eterna100 benchmark of 100 secondary structure design challenges that span a large range in design difficulty to help test future efforts. Our in silico results suggest new routes for improving computational RNA design methods and for extending these insights to assess "designability" of single RNA structures, as well as of switches for in vitro and in vivo applications. Copyright © 2015 The Authors. Published by Elsevier Ltd. All rights reserved.
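
    Two of the difficulty features named in this abstract, sequence length and mean stem length, can be read directly off a target structure. The sketch below is illustrative only and is not taken from the paper: it assumes the target is given in unpseudoknotted dot-bracket notation, and it assumes a "stem" means a maximal run of directly stacked base pairs (the paper may define it differently). The function names are hypothetical.

```python
def stem_lengths(structure):
    """Return the length of each stem in an unpseudoknotted dot-bracket string.

    Assumption: a stem is a maximal run of stacked pairs (i, j), (i+1, j-1), ...
    """
    stack, partner = [], {}
    for i, ch in enumerate(structure):
        if ch == "(":
            stack.append(i)
        elif ch == ")":
            if not stack:
                raise ValueError("unbalanced structure: extra ')'")
            partner[stack.pop()] = i  # map opening index -> closing index
        elif ch != ".":
            raise ValueError(f"unexpected character {ch!r}")
    if stack:
        raise ValueError("unbalanced structure: extra '('")

    lengths = []
    for i in sorted(partner):
        j = partner[i]
        if partner.get(i - 1) == j + 1:
            continue  # this pair is stacked inside a stem that was already counted
        n = 0
        while partner.get(i + n) == j - n:
            n += 1  # walk inward along the stacked pairs
        lengths.append(n)
    return lengths


def mean_stem_length(structure):
    """Mean stem length, or 0.0 for a structure without base pairs."""
    lengths = stem_lengths(structure)
    return sum(lengths) / len(lengths) if lengths else 0.0
```

    For example, `mean_stem_length("((.(...).))")` averages one two-pair stem and one single-pair stem, and `len(structure)` gives the sequence-length feature.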

  3. A deep learning framework for modeling structural features of RNA-binding protein targets

    PubMed Central

    Zhang, Sai; Zhou, Jingtian; Hu, Hailin; Gong, Haipeng; Chen, Ligong; Cheng, Chao; Zeng, Jianyang

    2016-01-01

    RNA-binding proteins (RBPs) play important roles in the post-transcriptional control of RNAs. Identifying RBP binding sites and characterizing RBP binding preferences are key steps toward understanding the basic mechanisms of post-transcriptional gene regulation. Though numerous computational methods have been developed for modeling RBP binding preferences, discovering a complete structural representation of the RBP targets by integrating their available structural features in all three dimensions is still a challenging task. In this paper, we develop a general and flexible deep learning framework for modeling structural binding preferences and predicting binding sites of RBPs, which takes (predicted) RNA tertiary structural information into account for the first time. Our framework constructs a unified representation that characterizes the structural specificities of RBP targets in all three dimensions, which can be further used to predict novel candidate binding sites and discover potential binding motifs. Through testing on real CLIP-seq datasets, we have demonstrated that our deep learning framework can automatically extract effective hidden structural features from the encoded raw sequence and structural profiles, and predict accurate RBP binding sites. In addition, we have conducted the first study to show that integrating the additional RNA tertiary structural features can improve the model performance in predicting RBP binding sites, especially for the polypyrimidine tract-binding protein (PTB), which also provides new evidence to support the view that RBPs may have specific tertiary structural binding preferences. In particular, the tests on the internal ribosome entry site (IRES) segments yield satisfactory results with experimental support from the literature and further demonstrate the necessity of incorporating RNA tertiary structural information into the prediction model. The source code of our approach can be found at https://github.com/thucombio/deepnet-rbp. PMID:26467480

  4. Sequence, Structure, and Context Preferences of Human RNA Binding Proteins.

    PubMed

    Dominguez, Daniel; Freese, Peter; Alexis, Maria S; Su, Amanda; Hochman, Myles; Palden, Tsultrim; Bazile, Cassandra; Lambert, Nicole J; Van Nostrand, Eric L; Pratt, Gabriel A; Yeo, Gene W; Graveley, Brenton R; Burge, Christopher B

    2018-06-07

    RNA binding proteins (RBPs) orchestrate the production, processing, and function of mRNAs. Here, we present the affinity landscapes of 78 human RBPs using an unbiased assay that determines the sequence, structure, and context preferences of these proteins in vitro by deep sequencing of bound RNAs. These data enable construction of "RNA maps" of RBP activity without requiring crosslinking-based assays. We found an unexpectedly low diversity of RNA motifs, implying frequent convergence of binding specificity toward a relatively small set of RNA motifs, many with low compositional complexity. Offsetting this trend, however, we observed extensive preferences for contextual features distinct from short linear RNA motifs, including spaced "bipartite" motifs, biased flanking nucleotide composition, and bias away from or toward RNA structure. Our results emphasize the importance of contextual features in RNA recognition, which likely enable targeting of distinct subsets of transcripts by different RBPs that recognize the same linear motif. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.

  5. The RNA Newton polytope and learnability of energy parameters.

    PubMed

    Forouzmand, Elmirasadat; Chitsaz, Hamidreza

    2013-07-01

    Computational RNA structure prediction is a mature, important problem that has received a new wave of attention with the discovery of regulatory non-coding RNAs and the advent of high-throughput transcriptome sequencing. Despite nearly two score years of research on RNA secondary structure and RNA-RNA interaction prediction, the accuracy of the state-of-the-art algorithms is still far from satisfactory. So far, researchers have proposed increasingly complex energy models and improved parameter estimation methods, experimental and/or computational, in anticipation of endowing their methods with enough power to solve the problem. The output has disappointingly been only modest improvements, not matching the expectations. Even recent massively featured machine learning approaches were not able to break the barrier. Why is that? The first step toward high-accuracy structure prediction is to pick an energy model that is inherently capable of predicting each and every one of the known structures to date. In this article, we introduce the notion of learnability of the parameters of an energy model as a measure of such an inherent capability. We say that the parameters of an energy model are learnable iff there exists at least one set of such parameters that renders every known RNA structure to date the minimum free energy structure. We derive a necessary condition for the learnability and give a dynamic programming algorithm to assess it. Our algorithm computes the convex hull of the feature vectors of all feasible structures in the ensemble of a given input sequence. Interestingly, that convex hull coincides with the Newton polytope of the partition function as a polynomial in energy parameters. To the best of our knowledge, this is the first approach toward computing the RNA Newton polytope and a systematic assessment of the inherent capabilities of an energy model. The worst case complexity of our algorithm is exponential in the number of features. However, dimensionality reduction techniques can provide approximate solutions to avoid the curse of dimensionality. We demonstrated the application of our theory to a simple energy model consisting of a weighted count of A-U, C-G and G-U base pairs. Our results show that this simple energy model satisfies the necessary condition for more than half of the input unpseudoknotted sequence-structure pairs (55%) chosen from the RNA STRAND v2.0 database and severely violates the condition for ~ 13%, which provides a set of hard cases that require further investigation. From 1350 RNA strands, the observed 3D feature vector for 749 strands is on the surface of the computed polytope. For 289 RNA strands, the observed feature vector is not on the boundary of the polytope but its distance from the boundary is not more than one. A distance of one essentially means one base pair difference between the observed structure and the closest point on the boundary of the polytope, which need not be the feature vector of a structure. For 171 sequences, this distance is larger than two, and for only 11 sequences, this distance is larger than five. The source code is available at http://compbio.cs.wayne.edu/software/rna-newton-polytope.
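
    The simple energy model in this abstract, a weighted count of A-U, C-G and G-U base pairs, maps each sequence-structure pair to a 3D feature vector. The sketch below illustrates that mapping only; it is not the authors' code and does not compute the Newton polytope. The function names, the dot-bracket input format and the example weights are assumptions made for illustration.

```python
def base_pairs(structure):
    """Return (i, j) index pairs from an unpseudoknotted dot-bracket string."""
    stack, pairs = [], []
    for i, ch in enumerate(structure):
        if ch == "(":
            stack.append(i)
        elif ch == ")":
            if not stack:
                raise ValueError("unbalanced structure: extra ')'")
            pairs.append((stack.pop(), i))
        elif ch != ".":
            raise ValueError(f"unexpected character {ch!r}")
    if stack:
        raise ValueError("unbalanced structure: extra '('")
    return pairs


def feature_vector(sequence, structure):
    """Count (A-U, C-G, G-U) base pairs of a sequence-structure pair."""
    seq = sequence.upper().replace("T", "U")
    if len(seq) != len(structure):
        raise ValueError("sequence and structure must have equal length")
    counts = {"AU": 0, "CG": 0, "GU": 0}
    for i, j in base_pairs(structure):
        key = "".join(sorted((seq[i], seq[j])))  # order-independent pair label
        if key not in counts:
            raise ValueError(f"non-canonical pair {seq[i]}-{seq[j]}")
        counts[key] += 1
    return (counts["AU"], counts["CG"], counts["GU"])


def energy(features, weights):
    """Weighted count: dot product of the feature vector and the parameters."""
    return sum(f * w for f, w in zip(features, weights))


# Hypothetical weights (not fitted values), one per pair type: A-U, C-G, G-U.
example_weights = (-2.0, -3.0, -1.0)
print(energy(feature_vector("GGGAAAUCC", "(((...)))"), example_weights))
```

    Under this model, a structure is the minimum free energy structure for a given weight vector exactly when its feature vector minimizes the dot product over all feasible structures, which is why only vectors on the boundary of the convex hull can be optimal.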

  6. RStrucFam: a web server to associate structure and cognate RNA for RNA-binding proteins from sequence information.

    PubMed

    Ghosh, Pritha; Mathew, Oommen K; Sowdhamini, Ramanathan

    2016-10-07

    RNA-binding proteins (RBPs) interact with their cognate RNA(s) to form large biomolecular assemblies. They are versatile in their functionality and are involved in a myriad of processes inside the cell. RBPs with similar structural features and common biological functions are grouped together into families and superfamilies. It will be useful to obtain an early understanding and association of the RNA-binding property of sequences of gene products. Here, we report a web server, RStrucFam, to predict the structure, type of cognate RNA(s) and function(s) of proteins, where possible, from mere sequence information. The web server employs Hidden Markov Model scan (hmmscan) to enable association to a back-end database of structural and sequence families. The database (HMMRBP) comprises 437 HMMs of RBP families of known structure that have been generated using structure-based sequence alignments and 746 sequence-centric RBP family HMMs. The input protein sequence is associated with structural or sequence domain families, if structure or sequence signatures exist. In case of association of the protein with a family of known structures, output features such as a multiple structure-based sequence alignment (MSSA) of the query with all other members of that family are provided. Further, cognate RNA partner(s) for that protein, Gene Ontology (GO) annotations, if any, and a homology model of the protein can be obtained. The users can also browse through the database for details pertaining to each family, protein or RNA and their related information based on keyword search or RNA motif search. RStrucFam is a web server that exploits structurally conserved features of RBPs, derived from known family members and imprinted in mathematical profiles, to predict putative RBPs from sequence information. Proteins that fail to associate with such structure-centric families are further queried against the sequence-centric RBP family HMMs in the HMMRBP database. Further, all other essential information pertaining to an RBP, such as overall function annotations, is provided. The web server can be accessed at the following link: http://caps.ncbs.res.in/rstrucfam .

  7. Analysis of sequencing data for probing RNA secondary structures and protein-RNA binding in studying posttranscriptional regulations.

    PubMed

    Hu, Xihao; Wu, Yang; Lu, Zhi John; Yip, Kevin Y

    2016-11-01

    High-throughput sequencing has been used to study posttranscriptional regulations, where the identification of protein-RNA binding is a major and fast-developing sub-area, which in turn benefits from the sequencing methods for whole-transcriptome probing of RNA secondary structures. In the study of RNA secondary structures using high-throughput sequencing, bases are modified or cleaved according to their structural features, which alter the resulting composition of sequencing reads. In the study of protein-RNA binding, methods have been proposed to immuno-precipitate (IP) protein-bound RNA transcripts in vitro or in vivo. By sequencing these transcripts, the protein-RNA interactions and the binding locations can be identified. For both types of data, read counts are affected by a combination of confounding factors, including expression levels of transcripts, sequence biases, mapping errors and the probing or IP efficiency of the experimental protocols. Careful processing of the sequencing data and proper extraction of important features are fundamentally important to a successful analysis. Here we review and compare different experimental methods for probing RNA secondary structures and binding sites of RNA-binding proteins (RBPs), and the computational methods proposed for analyzing the corresponding sequencing data. We suggest how these two types of data should be integrated to study the structural properties of RBP binding sites as a systematic way to better understand posttranscriptional regulations. © The Author 2015. Published by Oxford University Press. For Permissions, please email: journals.permissions@oup.com.

  8. Ab initio RNA folding by discrete molecular dynamics: From structure prediction to folding mechanisms

    PubMed Central

    Ding, Feng; Sharma, Shantanu; Chalasani, Poornima; Demidov, Vadim V.; Broude, Natalia E.; Dokholyan, Nikolay V.

    2008-01-01

    RNA molecules with novel functions have revived interest in the accurate prediction of RNA three-dimensional (3D) structure and folding dynamics. However, existing methods are inefficient in automated 3D structure prediction. Here, we report a robust computational approach for rapid folding of RNA molecules. We develop a simplified RNA model for discrete molecular dynamics (DMD) simulations, incorporating base-pairing and base-stacking interactions. We demonstrate correct folding of 150 structurally diverse RNA sequences. The majority of DMD-predicted 3D structures have <4 Å deviations from experimental structures. The secondary structures corresponding to the predicted 3D structures consist of 94% native base-pair interactions. Folding thermodynamics and kinetics of tRNAPhe, pseudoknots, and mRNA fragments in DMD simulations are in agreement with previous experimental findings. Folding of RNA molecules features transient, non-native conformations, suggesting non-hierarchical RNA folding. Our method allows rapid conformational sampling of RNA folding, with computational time increasing linearly with RNA length. We envision this approach as a promising tool for RNA structural and functional analyses. PMID:18456842

  9. MiRNA-miRNA synergistic network: construction via co-regulating functional modules and disease miRNA topological features.

    PubMed

    Xu, Juan; Li, Chuan-Xing; Li, Yong-Sheng; Lv, Jun-Ying; Ma, Ye; Shao, Ting-Ting; Xu, Liang-De; Wang, Ying-Ying; Du, Lei; Zhang, Yun-Peng; Jiang, Wei; Li, Chun-Quan; Xiao, Yun; Li, Xia

    2011-02-01

    Synergistic regulations among multiple microRNAs (miRNAs) are important to understand the mechanisms of complex post-transcriptional regulations in humans. Complex diseases are affected by several miRNAs rather than a single miRNA. So, it is a challenge to identify miRNA synergism and thereby further determine miRNA functions at a system-wide level and investigate disease miRNA features in the miRNA-miRNA synergistic network from a new view. Here, we constructed a miRNA-miRNA functional synergistic network (MFSN) via co-regulating functional modules that have three features: common targets of corresponding miRNA pairs, enriched in the same gene ontology category and close proximity in the protein interaction network. Predicted miRNA synergism is validated by significantly high co-expression of functional modules and significantly negative regulation to functional modules. We found that the MFSN exhibits a scale free, small world and modular architecture. Furthermore, the topological features of disease miRNAs in the MFSN are distinct from non-disease miRNAs. They have more synergism, indicating their higher complexity of functions and are the global central cores of the MFSN. In addition, miRNAs associated with the same disease are close to each other. The structure of the MFSN and the features of disease miRNAs are validated to be robust using different miRNA target data sets.

  10. repRNA: a web server for generating various feature vectors of RNA sequences.

    PubMed

    Liu, Bin; Liu, Fule; Fang, Longyun; Wang, Xiaolong; Chou, Kuo-Chen

    2016-02-01

    With the rapid growth of RNA sequences generated in the postgenomic age, it is highly desirable to develop a flexible method that can generate various kinds of vectors to represent these sequences by focusing on their different features. This is because nearly all the existing machine-learning methods, such as SVM (support vector machine) and KNN (k-nearest neighbor), can only handle vectors but not sequences. To meet the increasing demands and speed up the genome analyses, we have developed a new web server, called "representations of RNA sequences" (repRNA). Compared with the existing methods, repRNA is much more comprehensive, flexible and powerful, as reflected by the following facts: (1) it can generate 11 different modes of feature vectors for users to choose according to their investigation purposes; (2) it allows users to select the features from 22 built-in physicochemical properties and even user-defined ones; (3) the resultant feature vectors and the secondary structures of the corresponding RNA sequences can be visualized. The repRNA web server is freely accessible to the public at http://bioinformatics.hitsz.edu.cn/repRNA/ .
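
    The point that classifiers such as SVM and KNN need fixed-length vectors rather than sequences can be illustrated with one of the simplest possible encodings, normalized k-mer composition. This is a generic sketch and does not reproduce repRNA's modes or its interface; the function name and parameters are hypothetical.

```python
from itertools import product


def kmer_vector(sequence, k=2, alphabet="ACGU"):
    """Encode an RNA sequence as a fixed-length vector of k-mer frequencies.

    The vector has len(alphabet) ** k entries in lexicographic k-mer order,
    so sequences of any length map to vectors of the same dimension.
    """
    seq = sequence.upper().replace("T", "U")
    kmers = ["".join(p) for p in product(alphabet, repeat=k)]
    counts = dict.fromkeys(kmers, 0)
    total = 0
    for i in range(len(seq) - k + 1):
        window = seq[i:i + k]
        if window in counts:  # windows with ambiguous bases (e.g. N) are skipped
            counts[window] += 1
            total += 1
    if total == 0:
        return [0.0] * len(kmers)
    return [counts[m] / total for m in kmers]
```

    With `k=2` every sequence becomes a 16-dimensional vector, which can be passed directly to any vector-based learner.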

  11. COME: a robust coding potential calculation tool for lncRNA identification and characterization based on multiple features.

    PubMed

    Hu, Long; Xu, Zhiyu; Hu, Boqin; Lu, Zhi John

    2017-01-09

    Recent genomic studies suggest that novel long non-coding RNAs (lncRNAs) are specifically expressed and far outnumber annotated lncRNA sequences. To identify and characterize novel lncRNAs in RNA sequencing data from new samples, we have developed COME, a coding potential calculation tool based on multiple features. It integrates multiple sequence-derived and experiment-based features using a decompose-compose method, which makes it more accurate and robust than other well-known tools. We also showed that COME was able to substantially improve the consistency of prediction results from other coding potential calculators. Moreover, COME annotates and characterizes each predicted lncRNA transcript with multiple lines of supporting evidence, which are not provided by other tools. Remarkably, we found that one subgroup of lncRNAs classified by such supporting features (i.e. conserved local RNA secondary structure) was highly enriched in a well-validated database (lncRNAdb). We further found that the conserved structural domains on lncRNAs had a better chance than other RNA regions to interact with RNA binding proteins, based on the recent eCLIP-seq data in human, indicating their potential regulatory roles. Overall, we present COME as an accurate, robust and multiple-feature supported method for the identification and characterization of novel lncRNAs. The software implementation is available at https://github.com/lulab/COME. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  12. Mutational robustness accelerates the origin of novel RNA phenotypes through phenotypic plasticity.

    PubMed

    Wagner, Andreas

    2014-02-18

    Novel phenotypes can originate either through mutations in existing genotypes or through phenotypic plasticity, the ability of one genotype to form multiple phenotypes. From molecules to organisms, plasticity is a ubiquitous feature of life, and a potential source of exaptations, adaptive traits that originated for nonadaptive reasons. Another ubiquitous feature is robustness to mutations, although it is unknown whether such robustness helps or hinders the origin of new phenotypes through plasticity. RNA is ideal to address this question, because it shows extensive plasticity in its secondary structure phenotypes, a consequence of their continual folding and unfolding, and these phenotypes have important biological functions. Moreover, RNA is to some extent robust to mutations. This robustness structures RNA genotype space into myriad connected networks of genotypes with the same phenotype, and it influences the dynamics of evolving populations on a genotype network. In this study I show that both effects help accelerate the exploration of novel phenotypes through plasticity. My observations are based on many RNA molecules sampled at random from RNA sequence space, and on 30 biological RNA molecules. They are thus not only a generic feature of RNA sequence space but are relevant for the molecular evolution of biological RNA. Copyright © 2014 Biophysical Society. Published by Elsevier Inc. All rights reserved.

  13. Prediction and Dissection of Protein-RNA Interactions by Molecular Descriptors.

    PubMed

    Liu, Zhi-Ping; Chen, Luonan

    2016-01-01

    Protein-RNA interactions play crucial roles in numerous biological processes. However, detecting the interactions and binding sites between protein and RNA by traditional experiments is still time-consuming and labor-intensive. Thus, it is important to develop bioinformatics methods for predicting protein-RNA interactions and binding sites. Accurate prediction of protein-RNA interactions and recognition will greatly help to decipher the interaction mechanisms between protein and RNA, as well as to improve RNA-related protein engineering and drug design. In this work, we summarize the current bioinformatics strategies for predicting protein-RNA interactions and dissecting protein-RNA interaction mechanisms from local structure binding motifs. In particular, we focus on the feature-based machine learning methods, in which the molecular descriptors of protein and RNA are extracted and integrated as feature vectors representing the interaction events and recognition residues. In addition, the available methods are classified and compared comprehensively. The molecular descriptors are expected to elucidate the binding mechanisms of protein-RNA interaction and reveal the functional implications from the perspective of structural complementarity.

  14. nRC: non-coding RNA Classifier based on structural features.

    PubMed

    Fiannaca, Antonino; La Rosa, Massimo; La Paglia, Laura; Rizzo, Riccardo; Urso, Alfonso

    2017-01-01

    Non-coding RNA (ncRNA) are small non-coding sequences involved in gene expression regulation of many biological processes and diseases. The recent discovery of a large set of different ncRNAs with biologically relevant roles has opened the way to develop methods able to discriminate between the different ncRNA classes. Moreover, the lack of knowledge about the complete mechanisms in regulative processes, together with the development of high-throughput technologies, has required the help of bioinformatics tools to provide biologists and clinicians with a deeper comprehension of the functional roles of ncRNAs. In this work, we introduce a new ncRNA classification tool, nRC (non-coding RNA Classifier). Our approach is based on feature extraction from the ncRNA secondary structure together with a supervised classification algorithm implementing a deep learning architecture based on convolutional neural networks. We tested our approach for the classification of 13 different ncRNA classes. We obtained classification scores using the most common statistical measures. In particular, we reach an accuracy and sensitivity score of about 74%. The proposed method outperforms other similar classification methods based on secondary structure features and machine learning algorithms, including the RNAcon tool that, to date, is the reference classifier. The nRC tool is freely available as a docker image at https://hub.docker.com/r/tblab/nrc/. The source code of the nRC tool is also available at https://github.com/IcarPA-TBlab/nrc.

  15. Information-Theoretic Uncertainty of SCFG-Modeled Folding Space of The Non-coding RNA

    PubMed Central

    Manzourolajdad, Amirhossein; Wang, Yingfeng; Shaw, Timothy I.; Malmberg, Russell L.

    2012-01-01

    RNA secondary structure ensembles define probability distributions for alternative equilibrium secondary structures of an RNA sequence. Shannon’s Entropy is a measure for the amount of diversity present in any ensemble. In this work, Shannon’s entropy of the SCFG ensemble on an RNA sequence is derived and implemented in polynomial time for both structurally ambiguous and unambiguous grammars. Micro RNA sequences generally have low folding entropy, as previously discovered. Surprisingly, signs of significantly high folding entropy were observed in certain ncRNA families. More effective models coupled with targeted randomization tests can lead to a better insight into folding features of these families. PMID:23160142
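
    The quantity measured in this abstract, the Shannon entropy of a probability distribution over alternative secondary structures, can be illustrated on a small explicit distribution. The sketch below is not the polynomial-time SCFG algorithm derived in the paper; it assumes the structure probabilities have already been enumerated, which is only feasible for toy ensembles. The function name is hypothetical.

```python
import math


def shannon_entropy(probabilities, base=2.0):
    """Shannon entropy H = -sum(p * log(p)) of an explicit distribution.

    `probabilities` holds one probability per alternative structure and
    must sum to 1; zero-probability structures contribute nothing.
    """
    if any(p < 0 for p in probabilities):
        raise ValueError("probabilities must be non-negative")
    if not math.isclose(sum(probabilities), 1.0, abs_tol=1e-9):
        raise ValueError("probabilities must sum to 1")
    return -sum(p * math.log(p, base) for p in probabilities if p > 0)
```

    A sequence whose ensemble is dominated by one structure has entropy near zero, whereas an ensemble spread evenly over many structures has high entropy, which is the sense in which low folding entropy is reported for micro RNA sequences.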

  16. Dendrimers as Carriers for siRNA Delivery and Gene Silencing: A Review

    PubMed Central

    Huang, Weizhe; He, Ziying

    2013-01-01

    RNA interference (RNAi) was first reported in the literature in 1998 and has rapidly become a promising tool for therapeutic applications in gene therapy. In a typical RNAi process, small interfering RNAs (siRNA) are used to specifically downregulate the expression of the targeted gene, a process known as “gene silencing.” One key point for successful gene silencing is to employ a safe and efficient siRNA delivery system. In this context, dendrimers are emerging as potential nonviral vectors to deliver siRNA for RNAi purposes. Dendrimers have attracted intense interest since research on them emerged in the 1980s and are extensively studied as efficient DNA delivery vectors in gene transfer applications, due to their unique features based on their well-defined and multivalent structures. Knowing that DNA and RNA possess a similar structure in terms of nucleic acid framework and electronegative nature, one can also use the excellent DNA delivery properties of dendrimers to develop effective siRNA delivery systems. In this review, the development of dendrimer-based siRNA delivery vectors is summarized, focusing on the vector features (siRNA delivery efficiency, cytotoxicity, etc.) of different types of dendrimers and the related investigations on structure-activity relationships to promote safe and efficient siRNA delivery systems. PMID:24288498

  17. nextPARS: parallel probing of RNA structures in Illumina

    PubMed Central

    Saus, Ester; Willis, Jesse R.; Pryszcz, Leszek P.; Hafez, Ahmed; Llorens, Carlos; Himmelbauer, Heinz

    2018-01-01

    RNA molecules play important roles in virtually every cellular process. These functions are often mediated through the adoption of specific structures that enable RNAs to interact with other molecules. Thus, determining the secondary structures of RNAs is central to understanding their function and evolution. In recent years several sequencing-based approaches have been developed that allow probing structural features of thousands of RNA molecules present in a sample. Here, we describe nextPARS, a novel Illumina-based implementation of in vitro parallel probing of RNA structures. Our approach achieves comparable accuracy to previous implementations, while enabling higher throughput and sample multiplexing. PMID:29358234

  18. Global Organization of a Positive-strand RNA Virus Genome

    PubMed Central

    Wu, Baodong; Grigull, Jörg; Ore, Moriam O.; Morin, Sylvie; White, K. Andrew

    2013-01-01

    The genomes of plus-strand RNA viruses contain many regulatory sequences and structures that direct different viral processes. The traditional view of these RNA elements is as local structures present in non-coding regions. However, this view is changing due to the discovery of regulatory elements in coding regions and functional long-range intra-genomic base pairing interactions. The ∼4.8 kb long RNA genome of the tombusvirus tomato bushy stunt virus (TBSV) contains these types of structural features, including six different functional long-distance interactions. We hypothesized that to achieve these multiple interactions this viral genome must utilize a large-scale organizational strategy and, accordingly, we sought to assess the global conformation of the entire TBSV genome. Atomic force micrographs of the genome indicated a mostly condensed structure composed of interconnected protrusions extending from a central hub. This configuration was consistent with the genomic secondary structure model generated using high-throughput selective 2′-hydroxyl acylation analysed by primer extension (i.e. SHAPE), which predicted different sized RNA domains originating from a central region. Known RNA elements were identified in both domain and inter-domain regions, and novel structural features were predicted and functionally confirmed. Interestingly, only two of the six long-range interactions known to form were present in the structural model. However, for those interactions that did not form, complementary partner sequences were positioned relatively close to each other in the structure, suggesting that the secondary structure level of viral genome structure could provide a basic scaffold for the formation of different long-range interactions. The higher-order structural model for the TBSV RNA genome provides a snapshot of the complex framework that allows multiple functional components to operate in concert within a confined context. PMID:23717202

  19. Hepatitis Delta Antigen Requires a Flexible Quasi-Double-Stranded RNA Structure To Bind and Condense Hepatitis Delta Virus RNA in a Ribonucleoprotein Complex

    PubMed Central

    Griffin, Brittany L.; Chasovskikh, Sergey; Dritschilo, Anatoly

    2014-01-01

    ABSTRACT The circular genome and antigenome RNAs of hepatitis delta virus (HDV) form characteristic unbranched, quasi-double-stranded RNA secondary structures in which short double-stranded helical segments are interspersed with internal loops and bulges. The ribonucleoprotein complexes (RNPs) formed by these RNAs with the virus-encoded protein hepatitis delta antigen (HDAg) perform essential roles in the viral life cycle, including viral replication and virion formation. Little is understood about the formation and structure of these complexes and how they function in these key processes. Here, the specific RNA features required for HDAg binding and the topology of the complexes formed were investigated. Selective 2′OH acylation analyzed by primer extension (SHAPE) applied to free and HDAg-bound HDV RNAs indicated that the characteristic secondary structure of the RNA is preserved when bound to HDAg. Notably, the analysis indicated that predicted unpaired positions in the RNA remained dynamic in the RNP. Analysis of the in vitro binding activity of RNAs in which internal loops and bulges were mutated and of synthetically designed RNAs demonstrated that the distinctive secondary structure, not the primary RNA sequence, is the major determinant of HDAg RNA binding specificity. Atomic force microscopy analysis of RNPs formed in vitro revealed complexes in which the HDV RNA is substantially condensed by bending or wrapping. Our results support a model in which the internal loops and bulges in HDV RNA contribute flexibility to the quasi-double-stranded structure that allows RNA bending and condensing by HDAg. IMPORTANCE RNA-protein complexes (RNPs) formed by the hepatitis delta virus RNAs and protein, HDAg, perform critical roles in virus replication. Neither the structures of these RNPs nor the RNA features required to form them have been characterized. 
HDV RNA is unusual in that it forms an unbranched quasi-double-stranded structure in which short base-paired segments are interspersed with internal loops and bulges. We analyzed the role of the HDV RNA sequence and secondary structure in the formation of a minimal RNP and visualized the structure of this RNP using atomic force microscopy. Our results indicate that HDAg does not recognize the primary sequence of the RNA; rather, the principal contribution of unpaired bases in HDV RNA to HDAg binding is to allow flexibility in the unbranched quasi-double-stranded RNA structure. Visualization of RNPs by atomic force microscopy indicated that the RNA is significantly bent or condensed in the complex. PMID:24741096

  20. Sinorhizobium meliloti YbeY is an endoribonuclease with unprecedented catalytic features, acting as silencing enzyme in riboregulation.

    PubMed

    Saramago, Margarida; Peregrina, Alexandra; Robledo, Marta; Matos, Rute G; Hilker, Rolf; Serrania, Javier; Becker, Anke; Arraiano, Cecilia M; Jiménez-Zurdo, José I

    2017-02-17

    Structural and biochemical features suggest that the almost ubiquitous bacterial YbeY protein may serve catalytic and/or Hfq-like protective functions central to small RNA (sRNA)-mediated regulation and RNA metabolism. We have biochemically and genetically characterized the YbeY ortholog of the legume symbiont Sinorhizobium meliloti (SmYbeY). Co-immunoprecipitation (CoIP) with a FLAG-tagged SmYbeY yielded a poor enrichment in RNA species, compared to Hfq CoIP-RNA uncovered previously by a similar experimental setup. Purified SmYbeY behaved as a monomer that indistinctly cleaved single- and double-stranded RNA substrates, a unique ability among bacterial endoribonucleases. SmYbeY-mediated catalysis was supported by the divalent metal ions Mg2+, Mn2+ and Ca2+, which influenced in a different manner cleavage efficiency and reactivity patterns, with Ca2+ specifically blocking activity on double-stranded and some structured RNA molecules. SmYbeY loss-of-function compromised expression of core energy and RNA metabolism genes, whilst promoting accumulation of motility, late symbiotic and transport mRNAs. Some of the latter transcripts are known Hfq-binding sRNA targets and might be SmYbeY substrates. Genetic reporter and in vitro assays confirmed that SmYbeY is required for sRNA-mediated down-regulation of the amino acid ABC transporter prbA mRNA. We have thus discovered a bacterial endoribonuclease with unprecedented catalytic features, acting also as gene silencing enzyme.

  1. Sinorhizobium meliloti YbeY is an endoribonuclease with unprecedented catalytic features, acting as silencing enzyme in riboregulation

    PubMed Central

    Saramago, Margarida; Peregrina, Alexandra; Robledo, Marta; Matos, Rute G.; Hilker, Rolf; Serrania, Javier; Becker, Anke; Arraiano, Cecilia M.

    2017-01-01

    Abstract Structural and biochemical features suggest that the almost ubiquitous bacterial YbeY protein may serve catalytic and/or Hfq-like protective functions central to small RNA (sRNA)-mediated regulation and RNA metabolism. We have biochemically and genetically characterized the YbeY ortholog of the legume symbiont Sinorhizobium meliloti (SmYbeY). Co-immunoprecipitation (CoIP) with a FLAG-tagged SmYbeY yielded a poor enrichment in RNA species, compared to Hfq CoIP-RNA uncovered previously by a similar experimental setup. Purified SmYbeY behaved as a monomer that indistinctly cleaved single- and double-stranded RNA substrates, a unique ability among bacterial endoribonucleases. SmYbeY-mediated catalysis was supported by the divalent metal ions Mg2+, Mn2+ and Ca2+, which influenced in a different manner cleavage efficiency and reactivity patterns, with Ca2+ specifically blocking activity on double-stranded and some structured RNA molecules. SmYbeY loss-of-function compromised expression of core energy and RNA metabolism genes, whilst promoting accumulation of motility, late symbiotic and transport mRNAs. Some of the latter transcripts are known Hfq-binding sRNA targets and might be SmYbeY substrates. Genetic reporter and in vitro assays confirmed that SmYbeY is required for sRNA-mediated down-regulation of the amino acid ABC transporter prbA mRNA. We have thus discovered a bacterial endoribonuclease with unprecedented catalytic features, acting also as gene silencing enzyme. PMID:28180335

  2. Structural architecture of the human long non-coding RNA, steroid receptor RNA activator

    PubMed Central

    Novikova, Irina V.; Hennelly, Scott P.; Sanbonmatsu, Karissa Y.

    2012-01-01

    While functional roles of several long non-coding RNAs (lncRNAs) have been determined, the molecular mechanisms are not well understood. Here, we report the first experimentally derived secondary structure of a human lncRNA, the steroid receptor RNA activator (SRA), 0.87 kB in size. The SRA RNA is a non-coding RNA that coactivates several human sex hormone receptors and is strongly associated with breast cancer. Coding isoforms of SRA are also expressed to produce proteins, making the SRA gene a unique bifunctional system. Our experimental findings (SHAPE, in-line, DMS and RNase V1 probing) reveal that this lncRNA has a complex structural organization, consisting of four domains, with a variety of secondary structure elements. We examine the coevolution of the SRA gene at the RNA structure and protein structure levels using comparative sequence analysis across vertebrates. Rapid evolutionary stabilization of RNA structure, combined with frame-disrupting mutations in conserved regions, suggests that evolutionary pressure preserves the RNA structural core rather than its translational product. We perform similar experiments on alternatively spliced SRA isoforms to assess their structural features. PMID:22362738

  3. Secondary structural entropy in RNA switch (Riboswitch) identification.

    PubMed

    Manzourolajdad, Amirhossein; Arnold, Jonathan

    2015-04-28

    RNA regulatory elements play a significant role in gene regulation. Riboswitches, a widespread group of regulatory RNAs, are vital components of many bacterial genomes. These regulatory elements generally function by forming a ligand-induced alternative fold that controls access to ribosome binding sites or other regulatory sites in RNA. Riboswitch-mediated mechanisms are ubiquitous across bacterial genomes. A typical class of riboswitch has its own unique structural and biological complexity, making de novo riboswitch identification a formidable task. Traditionally, riboswitches have been identified through comparative genomics based on sequence and structural homology. The limitations of structural-homology-based approaches, coupled with the assumption that there is a great diversity of undiscovered riboswitches, suggest the need for alternative methods for riboswitch identification, possibly based on features intrinsic to their structure. As of yet, no such reliable method has been proposed. We used structural entropy of riboswitch sequences as a measure of their secondary structural dynamics. Entropy values of a diverse set of riboswitches were compared to those of their mutants, their dinucleotide shuffles, and their reverse complement sequences under different stochastic context-free grammar folding models. Significance of our results was evaluated by comparison to other approaches, such as the base-pairing entropy and energy landscapes dynamics. Classifiers based on structural entropy optimized via sequence and structural features were devised as riboswitch identifiers and tested on Bacillus subtilis, Escherichia coli, and Synechococcus elongatus as an exploration of structural entropy based approaches. The unusually long untranslated region of the cotH in Bacillus subtilis, as well as upstream regions of certain genes, such as the sucC genes, were associated with significant structural entropy values in genome-wide examinations. 
Various tests show that there is in fact a relationship between higher structural entropy and the potential for the RNA sequence to have alternative structures, within the limitations of our methodology. This relationship, though modest, is consistent across various tests. Understanding the behavior of structural entropy as a fairly new feature for RNA conformational dynamics, however, may require extensive exploratory investigation both across RNA sequences and folding models.
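
    The entropy idea in this abstract can be illustrated with a minimal sketch. It assumes a toy ensemble in which each candidate secondary structure already has a probability; the abstract's stochastic context-free grammar models would compute this quantity over the full ensemble rather than from an explicit list. The function name `shannon_entropy` and the example probabilities are hypothetical.

```python
import math


def shannon_entropy(probabilities):
    """Shannon entropy (in bits) of a distribution over candidate structures.

    The weights are normalized first, so unnormalized scores are accepted.
    This is a toy stand-in, not the grammar-based computation in the abstract.
    """
    total = sum(probabilities)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    entropy = 0.0
    for p in probabilities:
        if p < 0:
            raise ValueError("weights must be non-negative")
        if p > 0:
            q = p / total
            entropy -= q * math.log2(q)
    return entropy


# Hypothetical ensembles: one dominant fold versus four equally likely folds.
single_fold = [1.0]
four_folds = [0.25, 0.25, 0.25, 0.25]
print(shannon_entropy(single_fold), shannon_entropy(four_folds))
```

    In this toy setting a sequence with several equally likely folds scores higher than one with a single dominant fold, which is the direction of the relationship the abstract reports.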

  4. The mitochondrial genomes of Campodea fragilis and C. lubbocki (Hexapoda: Diplura): high genetic divergence in a morphologically uniform taxon

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Podsiadlowski, L.; Carapelli, A.; Nardi, F.

    2005-12-01

    Mitochondrial genomes from two dipluran hexapods of the genus Campodea have been sequenced. Gene order is the same as in most other hexapods and crustaceans. Secondary structures of tRNAs reveal specific structural changes in tRNA-C, tRNA-R, tRNA-S1 and tRNA-S2. Comparative analyses of nucleotide and amino acid composition, as well as structural features of both ribosomal RNA subunits, reveal substantial differences among the analyzed taxa. Although the two Campodea species are morphologically highly uniform, genetic divergence is larger than expected, suggesting a long evolutionary history under stable ecological conditions.

  5. Efficient RNA structure comparison algorithms.

    PubMed

    Arslan, Abdullah N; Anandan, Jithendar; Fry, Eric; Monschke, Keith; Ganneboina, Nitin; Bowerman, Jason

    2017-12-01

    Recently proposed relative addressing-based ([Formula: see text]) RNA secondary structure representation has important features by which an RNA structure database can be stored into a suffix array. A fast substructure search algorithm has been proposed based on binary search on this suffix array. Using this substructure search algorithm, we present a fast algorithm that finds the largest common substructure of given multiple RNA structures in [Formula: see text] format. The multiple RNA structure comparison problem is NP-hard in its general formulation. We introduced a new problem for comparing multiple RNA structures. This problem has more strict similarity definition and objective, and we propose an algorithm that solves this problem efficiently. We also develop another comparison algorithm that iteratively calls this algorithm to locate nonoverlapping large common substructures in compared RNAs. With the new resulting tools, we improved the RNASSAC website (linked from http://faculty.tamuc.edu/aarslan ). This website now also includes two drawing tools: one specialized for preparing RNA substructures that can be used as input by the search tool, and another one for automatically drawing the entire RNA structure from a given structure sequence.
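
    The search step described above, binary search over a suffix array, can be sketched as follows. This is a simplified illustration and not the authors' implementation: the relative addressing-based format is not reproduced in this record, so plain dot-bracket strings stand in for it, and the function names are hypothetical.

```python
def build_suffix_array(text):
    """Return suffix start positions sorted by the suffix they begin.

    Naive construction that sorts full suffixes; fine for short examples.
    """
    return sorted(range(len(text)), key=lambda i: text[i:])


def find_occurrences(text, suffix_array, pattern):
    """Return all start positions of pattern, found by two binary searches."""
    m = len(pattern)

    def prefix(rank):
        start = suffix_array[rank]
        return text[start:start + m]

    # Lower bound: first suffix whose m-character prefix is >= pattern.
    lo, hi = 0, len(suffix_array)
    while lo < hi:
        mid = (lo + hi) // 2
        if prefix(mid) < pattern:
            lo = mid + 1
        else:
            hi = mid
    first = lo

    # Upper bound: first suffix whose m-character prefix is > pattern.
    hi = len(suffix_array)
    while lo < hi:
        mid = (lo + hi) // 2
        if prefix(mid) <= pattern:
            lo = mid + 1
        else:
            hi = mid
    return sorted(suffix_array[first:lo])


# Hypothetical structure string: two hairpins written in dot-bracket notation.
structure = "((..))((.))"
index = build_suffix_array(structure)
print(find_occurrences(structure, index, "((."))
```

    Because every occurrence of a pattern is a prefix of some suffix, all matches sit in one contiguous block of the sorted array, which is why two binary searches are enough.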

  6. R-chie: a web server and R package for visualizing RNA secondary structures

    PubMed Central

    Lai, Daniel; Proctor, Jeff R.; Zhu, Jing Yun A.; Meyer, Irmtraud M.

    2012-01-01

    Visually examining RNA structures can greatly aid in understanding their potential functional roles and in evaluating the performance of structure prediction algorithms. As many functional roles of RNA structures can already be studied given the secondary structure of the RNA, various methods have been devised for visualizing RNA secondary structures. Most of these methods depict a given RNA secondary structure as a planar graph consisting of base-paired stems interconnected by roundish loops. In this article, we present an alternative method of depicting RNA secondary structure as arc diagrams. This is well suited for structures that are difficult or impossible to represent as planar stem-loop diagrams. Arc diagrams can intuitively display pseudo-knotted structures, as well as transient and alternative structural features. In addition, they facilitate the comparison of known and predicted RNA secondary structures. An added benefit is that structure information can be displayed in conjunction with a corresponding multiple sequence alignment, thereby highlighting structure and primary sequence conservation and variation. We have implemented the visualization algorithm as a web server R-chie as well as a corresponding R package called R4RNA, which allows users to run the software locally and across a range of common operating systems. PMID:22434875
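
    The arc representation described above can be illustrated with a short sketch. It is written in Python for illustration and does not use the R4RNA API; it assumes the structure is given in extended dot-bracket notation, where a second bracket type marks pseudoknotted pairs. Each base pair becomes an arc (i, j), and two arcs cross exactly when the pairs are pseudoknotted. The function names are hypothetical.

```python
def parse_pairs(structure):
    """Return sorted (i, j) base pairs from extended dot-bracket notation."""
    openers = {"(": ")", "[": "]", "{": "}", "<": ">"}
    closers = {close: open_ for open_, close in openers.items()}
    stacks = {open_: [] for open_ in openers}
    pairs = []
    for pos, ch in enumerate(structure):
        if ch in openers:
            stacks[ch].append(pos)
        elif ch in closers:
            stack = stacks[closers[ch]]
            if not stack:
                raise ValueError(f"unmatched {ch!r} at position {pos}")
            pairs.append((stack.pop(), pos))
        elif ch != ".":
            raise ValueError(f"unexpected character {ch!r} at position {pos}")
    for open_, stack in stacks.items():
        if stack:
            raise ValueError(f"unmatched {open_!r} at position {stack[-1]}")
    return sorted(pairs)


def crossing_arcs(pairs):
    """Return every pair of arcs (i, j), (k, l) with i < k < j < l."""
    crossings = []
    for idx, (i, j) in enumerate(pairs):
        for k, l in pairs[idx + 1:]:
            if i < k < j < l:
                crossings.append(((i, j), (k, l)))
    return crossings


# Hypothetical example: two helices whose arcs interleave (a pseudoknot).
arcs = parse_pairs("((..[[..))..]]")
print(arcs)
print(len(crossing_arcs(arcs)))
```

    A planar stem-loop drawing cannot show such crossing pairs without distortion, whereas in an arc diagram they simply appear as intersecting arcs.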

  7. Rtools: a web server for various secondary structural analyses on single RNA sequences.

    PubMed

    Hamada, Michiaki; Ono, Yukiteru; Kiryu, Hisanori; Sato, Kengo; Kato, Yuki; Fukunaga, Tsukasa; Mori, Ryota; Asai, Kiyoshi

    2016-07-08

    The secondary structures, as well as the nucleotide sequences, are the important features of RNA molecules to characterize their functions. According to the thermodynamic model, however, the probability of any secondary structure is very small. As a consequence, any tool to predict the secondary structures of RNAs has limited accuracy. On the other hand, there are a few tools to compensate for the imperfect predictions by calculating and visualizing the secondary structural information from RNA sequences. It is desirable to obtain the rich information from those tools through a friendly interface. We implemented a web server of the tools to predict secondary structures and to calculate various structural features based on the energy models of secondary structures. By just giving an RNA sequence to the web server, the user can get the different types of solutions of the secondary structures, the marginal probabilities such as base-pairing probabilities, loop probabilities and accessibilities of the local bases, the energy changes by arbitrary base mutations as well as the measures for validations of the predicted secondary structures. The web server is available at http://rtools.cbrc.jp, which integrates software tools, CentroidFold, CentroidHomfold, IPKnot, CapR, Raccess, Rchange and RintD. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  8. First Mitochondrial Genome from Nemouridae (Plecoptera) Reveals Novel Features of the Elongated Control Region and Phylogenetic Implications

    PubMed Central

    Chen, Zhi-Teng; Du, Yu-Zhou

    2017-01-01

    The complete mitochondrial genome (mitogenome) of Nemoura nankinensis (Plecoptera: Nemouridae) was sequenced as the first reported mitogenome from the family Nemouridae. The N. nankinensis mitogenome was the longest (16,602 bp) among reported plecopteran mitogenomes, and it contains 37 genes including 13 protein-coding genes (PCGs), 22 transfer RNA (tRNA) genes and two ribosomal RNA (rRNA) genes. Most PCGs used standard ATN as start codons, and TAN as termination codons. All tRNA genes of N. nankinensis could fold into the cloverleaf secondary structures except for trnSer (AGN), whose dihydrouridine (DHU) arm was reduced to a small loop. There was also a large non-coding region (control region, CR) in the N. nankinensis mitogenome. The 1751 bp CR was the longest and had the highest A+T content (81.8%) among stoneflies. A large tandem repeat region, five potential stem-loop (SL) structures, four tRNA-like structures and four conserved sequence blocks (CSBs) were detected in the elongated CR. The presence of these tRNA-like structures in the CR has never been reported in other plecopteran mitogenomes. These novel features of the elongated CR in N. nankinensis may have functions associated with the process of replication and transcription. Finally, phylogenetic reconstruction suggested that Nemouridae was the sister-group of Capniidae. PMID:28475163
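
    The A+T content quoted above is a simple base-composition percentage. The sketch below shows one way to compute it, assuming a DNA sequence held as a plain string; the function name `at_content` and the example sequence are hypothetical, and the example is not the N. nankinensis control region.

```python
def at_content(sequence):
    """Percentage of A and T among the unambiguous bases (A, C, G, T)."""
    counted = [base for base in sequence.upper() if base in "ACGT"]
    if not counted:
        raise ValueError("sequence contains no unambiguous bases")
    at_bases = sum(1 for base in counted if base in "AT")
    return 100.0 * at_bases / len(counted)


# Hypothetical toy sequence; ambiguous bases such as N are ignored.
print(round(at_content("ATTTAGCNATAT"), 1))

# Gene tally stated in the abstract: 13 PCGs + 22 tRNA genes + 2 rRNA genes.
print(13 + 22 + 2)
```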

  9. First Mitochondrial Genome from Nemouridae (Plecoptera) Reveals Novel Features of the Elongated Control Region and Phylogenetic Implications.

    PubMed

    Chen, Zhi-Teng; Du, Yu-Zhou

    2017-05-05

    The complete mitochondrial genome (mitogenome) of Nemoura nankinensis (Plecoptera: Nemouridae) was sequenced as the first reported mitogenome from the family Nemouridae. The N. nankinensis mitogenome was the longest (16,602 bp) among reported plecopteran mitogenomes, and it contains 37 genes including 13 protein-coding genes (PCGs), 22 transfer RNA (tRNA) genes and two ribosomal RNA (rRNA) genes. Most PCGs used standard ATN as start codons, and TAN as termination codons. All tRNA genes of N. nankinensis could fold into the cloverleaf secondary structures except for trnSer (AGN), whose dihydrouridine (DHU) arm was reduced to a small loop. There was also a large non-coding region (control region, CR) in the N. nankinensis mitogenome. The 1751 bp CR was the longest and had the highest A+T content (81.8%) among stoneflies. A large tandem repeat region, five potential stem-loop (SL) structures, four tRNA-like structures and four conserved sequence blocks (CSBs) were detected in the elongated CR. The presence of these tRNA-like structures in the CR has never been reported in other plecopteran mitogenomes. These novel features of the elongated CR in N. nankinensis may have functions associated with the process of replication and transcription. Finally, phylogenetic reconstruction suggested that Nemouridae was the sister-group of Capniidae.

  10. RAG-3D: A search tool for RNA 3D substructures

    DOE PAGES

    Zahran, Mai; Sevim Bayrak, Cigdem; Elmetwaly, Shereef; ...

    2015-08-24

    In this study, to address many challenges in RNA structure/function prediction, the characterization of RNA's modular architectural units is required. Using the RNA-As-Graphs (RAG) database, we have previously explored the existence of secondary structure (2D) submotifs within larger RNA structures. Here we present RAG-3D—a dataset of RNA tertiary (3D) structures and substructures plus a web-based search tool—designed to exploit graph representations of RNAs for the goal of searching for similar 3D structural fragments. The objects in RAG-3D consist of 3D structures translated into 3D graphs, cataloged based on the connectivity between their secondary structure elements. Each graph is additionally described in terms of its subgraph building blocks. The RAG-3D search tool then compares a query RNA 3D structure to those in the database to obtain structurally similar structures and substructures. This comparison reveals conserved 3D RNA features and thus may suggest functional connections. Though RNA search programs based on similarity in sequence, 2D, and/or 3D structural elements are available, our graph-based search tool may be advantageous for illuminating similarities that are not obvious; using motifs rather than sequence space also reduces search times considerably. Ultimately, such substructuring could be useful for RNA 3D structure prediction, structure/function inference and inverse folding.

  11. RAG-3D: a search tool for RNA 3D substructures

    PubMed Central

    Zahran, Mai; Sevim Bayrak, Cigdem; Elmetwaly, Shereef; Schlick, Tamar

    2015-01-01

    To address many challenges in RNA structure/function prediction, the characterization of RNA's modular architectural units is required. Using the RNA-As-Graphs (RAG) database, we have previously explored the existence of secondary structure (2D) submotifs within larger RNA structures. Here we present RAG-3D—a dataset of RNA tertiary (3D) structures and substructures plus a web-based search tool—designed to exploit graph representations of RNAs for the goal of searching for similar 3D structural fragments. The objects in RAG-3D consist of 3D structures translated into 3D graphs, cataloged based on the connectivity between their secondary structure elements. Each graph is additionally described in terms of its subgraph building blocks. The RAG-3D search tool then compares a query RNA 3D structure to those in the database to obtain structurally similar structures and substructures. This comparison reveals conserved 3D RNA features and thus may suggest functional connections. Though RNA search programs based on similarity in sequence, 2D, and/or 3D structural elements are available, our graph-based search tool may be advantageous for illuminating similarities that are not obvious; using motifs rather than sequence space also reduces search times considerably. Ultimately, such substructuring could be useful for RNA 3D structure prediction, structure/function inference and inverse folding. PMID:26304547

  12. RAG-3D: A search tool for RNA 3D substructures

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zahran, Mai; Sevim Bayrak, Cigdem; Elmetwaly, Shereef

    In this study, to address many challenges in RNA structure/function prediction, the characterization of RNA's modular architectural units is required. Using the RNA-As-Graphs (RAG) database, we have previously explored the existence of secondary structure (2D) submotifs within larger RNA structures. Here we present RAG-3D—a dataset of RNA tertiary (3D) structures and substructures plus a web-based search tool—designed to exploit graph representations of RNAs for the goal of searching for similar 3D structural fragments. The objects in RAG-3D consist of 3D structures translated into 3D graphs, cataloged based on the connectivity between their secondary structure elements. Each graph is additionally described in terms of its subgraph building blocks. The RAG-3D search tool then compares a query RNA 3D structure to those in the database to obtain structurally similar structures and substructures. This comparison reveals conserved 3D RNA features and thus may suggest functional connections. Though RNA search programs based on similarity in sequence, 2D, and/or 3D structural elements are available, our graph-based search tool may be advantageous for illuminating similarities that are not obvious; using motifs rather than sequence space also reduces search times considerably. Ultimately, such substructuring could be useful for RNA 3D structure prediction, structure/function inference and inverse folding.

  13. The structural analysis of the mitochondrial SSUrRNA implies a close phylogenetic relationship between mitochondria from plants and from the heterotrophic alga Prototheca wickerhamii.

    PubMed

    Wolff, G; Kück, U

    1990-04-01

    The gene for the mitochondrial small subunit rRNA (SSUrRNA) from the heterotrophic alga Prototheca wickerhamii has been isolated from a gene library of extranuclear DNA. Sequence and structural analyses allow the determination of a secondary structure model for this rRNA. In addition, several sequence motifs are present which are typically found in SSUrRNAs of various mitochondrial origins. Unexpectedly, the Prototheca RNA sequence has more features in common with mitochondrial SSUrRNAs from plants than with that from the green alga Chlamydomonas reinhardtii. The phylogenetic relationship between mitochondria from plants and algae is discussed.

  14. ModeRNA: a tool for comparative modeling of RNA 3D structure

    PubMed Central

    Rother, Magdalena; Rother, Kristian; Puton, Tomasz; Bujnicki, Janusz M.

    2011-01-01

    RNA is a large group of functionally important biomacromolecules. In striking analogy to proteins, the function of RNA depends on its structure and dynamics, which in turn is encoded in the linear sequence. However, while there are numerous methods for computational prediction of protein three-dimensional (3D) structure from sequence, with comparative modeling being the most reliable approach, there are very few such methods for RNA. Here, we present ModeRNA, a software tool for comparative modeling of RNA 3D structures. As an input, ModeRNA requires a 3D structure of a template RNA molecule, and a sequence alignment between the target to be modeled and the template. It must be emphasized that a good alignment is required for successful modeling, and for large and complex RNA molecules the development of a good alignment usually requires manual adjustments of the input data based on previous expertise of the respective RNA family. ModeRNA can model post-transcriptional modifications, a functionally important feature analogous to post-translational modifications in proteins. ModeRNA can also model DNA structures or use them as templates. It is equipped with many functions for merging fragments of different nucleic acid structures into a single model and analyzing their geometry. Windows and UNIX implementations of ModeRNA with comprehensive documentation and a tutorial are freely available. PMID:21300639

  15. RNA Dependent RNA Polymerases: Insights from Structure, Function and Evolution.

    PubMed

    Venkataraman, Sangita; Prasad, Burra V L S; Selvarajan, Ramasamy

    2018-02-10

    RNA dependent RNA polymerase (RdRp) is one of the most versatile enzymes of RNA viruses that is indispensable for replicating the genome as well as for carrying out transcription. The core structural features of RdRps are conserved, despite the divergence in their sequences. The structure of RdRp resembles that of a cupped right hand and consists of fingers, palm and thumb subdomains. The catalysis involves the participation of conserved aspartates and divalent metal ions. Complexes of RdRps with substrates, inhibitors and metal ions provide a comprehensive view of their functional mechanism and offer valuable insights regarding the development of antivirals. In this article, we provide an overview of the structural aspects of RdRps and their complexes from the Group III, IV and V viruses and their structure-based phylogeny.

  16. RNA Dependent RNA Polymerases: Insights from Structure, Function and Evolution

    PubMed Central

    Venkataraman, Sangita; Prasad, Burra V L S; Selvarajan, Ramasamy

    2018-01-01

    RNA dependent RNA polymerase (RdRp) is one of the most versatile enzymes of RNA viruses that is indispensable for replicating the genome as well as for carrying out transcription. The core structural features of RdRps are conserved, despite the divergence in their sequences. The structure of RdRp resembles that of a cupped right hand and consists of fingers, palm and thumb subdomains. The catalysis involves the participation of conserved aspartates and divalent metal ions. Complexes of RdRps with substrates, inhibitors and metal ions provide a comprehensive view of their functional mechanism and offer valuable insights regarding the development of antivirals. In this article, we provide an overview of the structural aspects of RdRps and their complexes from the Group III, IV and V viruses and their structure-based phylogeny. PMID:29439438

  17. Mapping protein-RNA interactions by RCAP, RNA-cross-linking and peptide fingerprinting.

    PubMed

    Vaughan, Robert C; Kao, C Cheng

    2015-01-01

    RNA nanotechnology often features protein-RNA complexes. The interactions between proteins and large RNAs are difficult to study using traditional structure-based methods like NMR or X-ray crystallography. RCAP, an approach that uses a reversible-cross-linking affinity purification method coupled with mass spectrometry, has been developed to map regions within proteins that contact RNA. This chapter details how RCAP is applied to map protein-RNA contacts within virions.

  18. A parallel implementation of the Wuchty algorithm with additional experimental filters to more thoroughly explore RNA conformational space.

    PubMed

    Stone, Jonathan W; Bleckley, Samuel; Lavelle, Sean; Schroeder, Susan J

    2015-01-01

    We present new modifications to the Wuchty algorithm in order to better define and explore possible conformations for an RNA sequence. The new features, including parallelization, energy-independent lonely pair constraints, context-dependent chemical probing constraints, helix filters, and optional multibranch loops, provide useful tools for exploring the landscape of RNA folding. Chemical probing alone may not necessarily define a single unique structure. The helix filters and optional multibranch loops are global constraints on RNA structure that are an especially useful tool for generating models of encapsidated viral RNA for which cryoelectron microscopy or crystallography data may be available. The computations generate a combinatorially complete set of structures near a free energy minimum and thus provide data on the density and diversity of structures near the bottom of a folding funnel for an RNA sequence. The conformational landscapes for some RNA sequences may resemble a low, wide basin rather than a steep funnel that converges to a single structure.

  19. R3D Align web server for global nucleotide to nucleotide alignments of RNA 3D structures.

    PubMed

    Rahrig, Ryan R; Petrov, Anton I; Leontis, Neocles B; Zirbel, Craig L

    2013-07-01

    The R3D Align web server provides online access to 'RNA 3D Align' (R3D Align), a method for producing accurate nucleotide-level structural alignments of RNA 3D structures. The web server provides a streamlined and intuitive interface, input data validation and output that is more extensive and easier to read and interpret than related servers. The R3D Align web server offers a unique Gallery of Featured Alignments, providing immediate access to pre-computed alignments of large RNA 3D structures, including all ribosomal RNAs, as well as guidance on effective use of the server and interpretation of the output. By accessing the non-redundant lists of RNA 3D structures provided by the Bowling Green State University RNA group, R3D Align connects users to structure files in the same equivalence class and the best-modeled representative structure from each group. The R3D Align web server is freely accessible at http://rna.bgsu.edu/r3dalign/.

  20. The Spot 42 RNA: A regulatory small RNA with roles in the central metabolism.

    PubMed

    Bækkedal, Cecilie; Haugen, Peik

    2015-01-01

    The Spot 42 RNA is a 109 nucleotide long (in Escherichia coli) noncoding small regulatory RNA (sRNA) encoded by the spf (spot forty-two) gene. spf is found in gamma-proteobacteria and the majority of experimental work on Spot 42 RNA has been performed using E. coli, and recently Aliivibrio salmonicida. In the cell Spot 42 RNA plays essential roles as a regulator in carbohydrate metabolism and uptake, and its expression is activated by glucose, and inhibited by the cAMP-CRP complex. Here we summarize the current knowledge on Spot 42, and present the natural distribution of spf, show family-specific secondary structural features of Spot 42, and link highly conserved structural regions to mRNA target binding.

  1. DMS-MaPseq for genome-wide or targeted RNA structure probing in vivo.

    PubMed

    Zubradt, Meghan; Gupta, Paromita; Persad, Sitara; Lambowitz, Alan M; Weissman, Jonathan S; Rouskin, Silvi

    2017-01-01

    Coupling of structure-specific in vivo chemical modification to next-generation sequencing is transforming RNA secondary structure studies in living cells. The dominant strategy for detecting in vivo chemical modifications uses reverse transcriptase truncation products, which introduce biases and necessitate population-average assessments of RNA structure. Here we present dimethyl sulfate (DMS) mutational profiling with sequencing (DMS-MaPseq), which encodes DMS modifications as mismatches using a thermostable group II intron reverse transcriptase. DMS-MaPseq yields a high signal-to-noise ratio, can report multiple structural features per molecule, and allows both genome-wide studies and focused in vivo investigations of even low-abundance RNAs. We apply DMS-MaPseq for the first analysis of RNA structure within an animal tissue and to identify a functional structure involved in noncanonical translation initiation. Additionally, we use DMS-MaPseq to compare the in vivo structure of pre-mRNAs with their mature isoforms. These applications illustrate DMS-MaPseq's capacity to dramatically expand in vivo analysis of RNA structure.
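
    Because the abstract describes DMS modifications being read out as mismatches, the per-position signal can be pictured as a mismatch fraction. The sketch below is a simplified illustration and not the published analysis pipeline: it assumes gapless alignments given as (start, read) tuples against a reference string, and the function name `mismatch_rates` and the example reads are hypothetical.

```python
def mismatch_rates(reference, aligned_reads):
    """Per-position mismatch fraction from gapless alignments.

    aligned_reads is an iterable of (start, read_sequence) tuples, where
    start is the 0-based reference position of the first read base.
    Positions without coverage are reported as None.
    """
    coverage = [0] * len(reference)
    mismatches = [0] * len(reference)
    for start, read in aligned_reads:
        for offset, base in enumerate(read):
            pos = start + offset
            # Skip bases that fall off the reference and uncalled bases.
            if pos < 0 or pos >= len(reference) or base == "N":
                continue
            coverage[pos] += 1
            if base != reference[pos]:
                mismatches[pos] += 1
    return [m / c if c else None for m, c in zip(mismatches, coverage)]


# Hypothetical reads: one carries a mismatch at the first position.
reads = [(0, "ACGT"), (0, "TCGT"), (2, "GT")]
print(mismatch_rates("ACGT", reads))
```

    Since DMS mainly reports on unpaired adenines and cytosines, a real analysis would typically restrict interpretation to those positions; the sketch reports every position for simplicity.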

  2. regSNPs-splicing: a tool for prioritizing synonymous single-nucleotide substitution.

    PubMed

    Zhang, Xinjun; Li, Meng; Lin, Hai; Rao, Xi; Feng, Weixing; Yang, Yuedong; Mort, Matthew; Cooper, David N; Wang, Yue; Wang, Yadong; Wells, Clark; Zhou, Yaoqi; Liu, Yunlong

    2017-09-01

    While synonymous single-nucleotide variants (sSNVs) have largely been unstudied, since they do not alter protein sequence, mounting evidence suggests that they may affect RNA conformation, splicing, and the stability of nascent mRNAs to promote various diseases. Accurately prioritizing deleterious sSNVs from a pool of neutral ones can significantly improve our ability to select functional genetic variants identified from various genome-sequencing projects, and, therefore, advance our understanding of disease etiology. In this study, we develop a computational algorithm to prioritize sSNVs based on their impact on mRNA splicing and protein function. In addition to genomic features that potentially affect splicing regulation, our proposed algorithm also includes dozens of structural features that characterize the functions of alternatively spliced exons on protein function. Our systematic evaluation on thousands of sSNVs suggests that several structural features, including intrinsic disorder protein scores, solvent accessible surface areas, protein secondary structures, and known and predicted protein family domains, show significant differences between disease-causing and neutral sSNVs. Our result suggests that the protein structure features offer an added dimension of information while distinguishing disease-causing and neutral synonymous variants. The inclusion of structural features increases the predictive accuracy for functional sSNV prioritization.

  3. A novel knowledge-based potential for RNA 3D structure evaluation

    NASA Astrophysics Data System (ADS)

    Yang, Yi; Gu, Qi; Zhang, Ben-Gong; Shi, Ya-Zhou; Shao, Zhi-Gang

    2018-03-01

    Ribonucleic acids (RNAs) play a vital role in biology, and knowledge of their three-dimensional (3D) structure is required to understand their biological functions. Recently structural prediction methods have been developed to address this issue, but a series of RNA 3D structures are generally predicted by most existing methods. Therefore, the evaluation of the predicted structures is generally indispensable. Although several methods have been proposed to assess RNA 3D structures, the existing methods are not precise enough. In this work, a new all-atom knowledge-based potential is developed for more accurately evaluating RNA 3D structures. The potential not only includes local and nonlocal interactions but also fully considers the specificity of each RNA by introducing a retraining mechanism. Based on extensive test sets generated from independent methods, the proposed potential correctly distinguished the native state and ranked near-native conformations to effectively select the best. Furthermore, the proposed potential precisely captured RNA structural features such as base-stacking and base-pairing. Comparisons with existing potential methods show that the proposed potential is very reliable and accurate in RNA 3D structure evaluation. Project supported by the National Science Foundation of China (Grants Nos. 11605125, 11105054, 11274124, and 11401448).

  4. Insights into Structural and Mechanistic Features of Viral IRES Elements

    PubMed Central

    Martinez-Salas, Encarnacion; Francisco-Velilla, Rosario; Fernandez-Chamorro, Javier; Embarek, Azman M.

    2018-01-01

    Internal ribosome entry site (IRES) elements are cis-acting RNA regions that promote internal initiation of protein synthesis using cap-independent mechanisms. However, distinct types of IRES elements present in the genome of various RNA viruses perform the same function despite lacking conservation of sequence and secondary RNA structure. Likewise, IRES elements differ in host factor requirement to recruit the ribosomal subunits. In spite of this diversity, evolutionarily conserved motifs in each family of RNA viruses preserve sequences impacting on RNA structure and RNA–protein interactions important for IRES activity. Indeed, IRES elements adopting remarkably different structural organizations contain RNA structural motifs that play an essential role in recruiting ribosomes, initiation factors and/or RNA-binding proteins using different mechanisms. Therefore, given that a universal IRES motif remains elusive, it is critical to understand how diverse structural motifs deliver functions relevant for IRES activity. This will be useful for understanding the molecular mechanisms beyond cap-independent translation, as well as the evolutionary history of these regulatory elements. Moreover, it could improve the accuracy of predicting IRES-like motifs hidden in genome sequences. This review summarizes recent advances on the diversity and biological relevance of RNA structural motifs for viral IRES elements. PMID:29354113

  5. A new way to see RNA

    PubMed Central

    Keating, Kevin S.; Humphris, Elisabeth L.; Pyle, Anna Marie

    2015-01-01

    Unlike proteins, the RNA backbone has numerous degrees of freedom (eight, if one counts the sugar pucker), making RNA modeling, structure building and prediction a multidimensional problem of exceptionally high complexity. And yet RNA tertiary structures are not infinite in their structural morphology; rather, they are built from a limited set of discrete units. In order to reduce the dimensionality of the RNA backbone in a physically reasonable way, a shorthand notation was created that reduced the RNA backbone torsion angles to two (η and θ, analogous to ϕ and ψ in proteins). When these torsion angles are calculated for nucleotides in a crystallographic database and plotted against one another, one obtains a plot analogous to a Ramachandran plot (the η/θ plot), with highly populated and unpopulated regions. Nucleotides that occupy proximal positions on the plot have identical structures and are found in the same units of tertiary structure. In this review, we describe the statistical validation of the η/θ formalism and the exploration of features within the η/θ plot. We also describe the application of the η/θ formalism in RNA motif discovery, structural comparison, RNA structure building and tertiary structure prediction. More than a tool, however, the η/θ formalism has provided new insights into RNA structure itself, revealing its fundamental components and the factors underlying RNA architectural form. PMID:21729350
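
    As a rough illustration of the η/θ shorthand, the sketch below computes the two pseudotorsions from backbone coordinates. The abstract does not give the atom definitions; the code assumes the commonly cited ones (η: C4'(i-1), P(i), C4'(i), P(i+1); θ: P(i), C4'(i), P(i+1), C4'(i+1)). The input layout (a list of dicts keyed by "P" and "C4'") and the function names are hypothetical.

```python
import math


def _sub(a, b):
    return tuple(x - y for x, y in zip(a, b))


def _dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def dihedral(p0, p1, p2, p3):
    """Signed dihedral angle in degrees defined by four points."""
    b0 = _sub(p0, p1)
    b1 = _sub(p2, p1)
    b2 = _sub(p3, p2)
    norm = math.sqrt(_dot(b1, b1))
    b1 = tuple(x / norm for x in b1)
    # Project the outer bonds onto the plane perpendicular to the central bond.
    v = _sub(b0, tuple(_dot(b0, b1) * x for x in b1))
    w = _sub(b2, tuple(_dot(b2, b1) * x for x in b1))
    return math.degrees(math.atan2(_dot(_cross(b1, v), w), _dot(v, w)))


def eta_theta(residues):
    """Return (eta, theta) for each interior nucleotide of one chain.

    residues: list of dicts with "P" and "C4'" coordinates (assumed format).
    """
    angles = []
    for i in range(1, len(residues) - 1):
        prev, cur, nxt = residues[i - 1], residues[i], residues[i + 1]
        eta = dihedral(prev["C4'"], cur["P"], cur["C4'"], nxt["P"])
        theta = dihedral(cur["P"], cur["C4'"], nxt["P"], nxt["C4'"])
        angles.append((eta, theta))
    return angles
```

    Plotting the resulting (η, θ) pairs against one another for many nucleotides would give the Ramachandran-like plot the abstract refers to; terminal nucleotides are skipped here because they lack one of the neighbouring atoms.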

  6. Structural features of influenza A virus panhandle RNA enabling the activation of RIG-I independently of 5′-triphosphate

    PubMed Central

    Lee, Mi-Kyung; Kim, Hee-Eun; Park, Eun-Byeol; Lee, Janghyun; Kim, Ki-Hun; Lim, Kyungeun; Yum, Seoyun; Lee, Young-Hoon; Kang, Suk-Jo; Lee, Joon-Hwa; Choi, Byong-Seok

    2016-01-01

    Retinoic acid-inducible gene I (RIG-I) recognizes specific molecular patterns of viral RNAs for inducing type I interferon. The C-terminal domain (CTD) of RIG-I binds to double-stranded RNA (dsRNA) with the 5′-triphosphate (5′-PPP), which induces a conformational change in RIG-I to an active form. It has been suggested that RIG-I detects infection of influenza A virus by recognizing the 5′-triphosphorylated panhandle structure of the viral RNA genome. Influenza panhandle RNA has a unique structure with a sharp helical bending. In spite of extensive studies of how viral RNAs activate RIG-I, whether the structural elements of the influenza panhandle RNA confer the ability to activate RIG-I signaling has been poorly explored. Here, we investigated the dynamics of the influenza panhandle RNA in complex with RIG-I CTD using NMR spectroscopy and showed that the bending structure of the panhandle RNA negates the requirement of a 5′-PPP moiety for RIG-I activation. PMID:27288441

  7. Insights into the structural features and stability of peptide nucleic acid with a D-prolyl-2-aminocyclopentane carboxylic acid backbone that binds to DNA and RNA.

    PubMed

    Poomsuk, Nattawee; Vilaivan, Tirayut; Siriwong, Khatcharin

    2018-06-12

    Peptide nucleic acid (PNA) is a powerful biomolecule with a wide variety of important applications. In this work, the molecular structures and binding affinity of PNA with a D-prolyl-2-aminocyclopentane carboxylic acid backbone (acpcPNA) that binds to both DNA and RNA were studied using molecular dynamics simulations. The simulated structures of acpcPNA-DNA and acpcPNA-RNA duplexes more closely resembled the typical structures of B-DNA and A-RNA than the corresponding duplexes of aegPNA. The calculated binding free energies are in good agreement with the experimental results that the acpcPNA-DNA duplex is more stable than the acpcPNA-RNA duplex regardless of the base sequences. The results provide further insights in the relationship between structure and stability of this unique PNA system. Copyright © 2018 Elsevier Inc. All rights reserved.

  8. Cas9 versus Cas12a/Cpf1: Structure-function comparisons and implications for genome editing.

    PubMed

    Swarts, Daan C; Jinek, Martin

    2018-05-22

    Cas9 and Cas12a are multidomain CRISPR-associated nucleases that can be programmed with a guide RNA to bind and cleave complementary DNA targets. The guide RNA sequence can be varied, making these effector enzymes versatile tools for genome editing and gene regulation applications. While Cas9 is currently the best-characterized and most widely used nuclease for such purposes, Cas12a (previously named Cpf1) has recently emerged as an alternative for Cas9. Cas9 and Cas12a have distinct evolutionary origins and exhibit different structural architectures, resulting in distinct molecular mechanisms. Here we compare the structural and mechanistic features that distinguish Cas9 and Cas12a, and describe how these features modulate their activity. We discuss implications for genome editing, and how they may influence the choice of Cas9 or Cas12a for specific applications. Finally, we review recent studies in which Cas12a has been utilized as a genome editing tool. This article is categorized under: RNA Interactions with Proteins and Other Molecules > Protein-RNA Interactions: Functional Implications Regulatory RNAs/RNAi/Riboswitches > Biogenesis of Effector Small RNAs RNA Interactions with Proteins and Other Molecules > RNA-Protein Complexes. © 2018 Wiley Periodicals, Inc.

  9. Crystal structure of RlmAI: Implications for understanding the 23S rRNA G745/G748-methylation at the macrolide antibiotic-binding site

    PubMed Central

    Das, Kalyan; Acton, Thomas; Chiang, Yiwen; Shih, Lydia; Arnold, Eddy; Montelione, Gaetano T.

    2004-01-01

    The RlmA class of enzymes (RlmAI and RlmAII) catalyzes N1-methylation of a guanine base (G745 in Gram-negative and G748 in Gram-positive bacteria) of hairpin 35 of 23S rRNA. We have determined the crystal structure of Escherichia coli RlmAI at 2.8-Å resolution, providing 3D structure information for the RlmA class of RNA methyltransferases. The dimeric protein structure exhibits features that provide new insights into its molecular function. Each RlmAI molecule has a Zn-binding domain, responsible for specific recognition and binding of its rRNA substrate, and a methyltransferase domain. The asymmetric RlmAI dimer observed in the crystal structure has a well defined W-shaped RNA-binding cleft. Two S-adenosyl-l-methionine substrate molecules are located at the two valleys of the W-shaped RNA-binding cleft. The unique shape of the RNA-binding cleft, different from that of known RNA-binding proteins, is highly specific and structurally complements the 3D structure of hairpin 35 of bacterial 23S rRNA. Apart from the hairpin 35, parts of hairpins 33 and 34 also interact with the RlmAI dimer. PMID:14999102

  10. URS DataBase: universe of RNA structures and their motifs.

    PubMed

    Baulin, Eugene; Yacovlev, Victor; Khachko, Denis; Spirin, Sergei; Roytberg, Mikhail

    2016-01-01

    The Universe of RNA Structures DataBase (URSDB) stores information obtained from all RNA-containing PDB entries (2935 entries in October 2015). The content of the database is updated regularly. The database consists of 51 tables containing indexed data on various elements of the RNA structures. The database provides a web interface allowing the user to select a subset of structures with desired features and to obtain various statistical data for a selected subset of structures or for all structures. In particular, one can easily obtain statistics on geometric parameters of base pairs, on structural motifs (stems, loops, etc.) or on different types of pseudoknots. The user can also view and get information on an individual structure or its selected parts, e.g. RNA-protein hydrogen bonds. URSDB employs a new original definition of loops in RNA structures. That definition fits both pseudoknot-free and pseudoknotted secondary structures and coincides with the classical definition in the case of pseudoknot-free structures. To our knowledge, URSDB is the first database supporting searches based on topological classification of pseudoknots and on extended loop classification. Database URL: http://server3.lpm.org.ru/urs/. © The Author(s) 2016. Published by Oxford University Press.

  11. The Spot 42 RNA: A regulatory small RNA with roles in the central metabolism

    PubMed Central

    Bækkedal, Cecilie; Haugen, Peik

    2015-01-01

    The Spot 42 RNA is a 109 nucleotide long (in Escherichia coli) noncoding small regulatory RNA (sRNA) encoded by the spf (spot forty-two) gene. spf is found in gamma-proteobacteria and the majority of experimental work on Spot 42 RNA has been performed using E. coli, and recently Aliivibrio salmonicida. In the cell Spot 42 RNA plays essential roles as a regulator in carbohydrate metabolism and uptake, and its expression is activated by glucose, and inhibited by the cAMP-CRP complex. Here we summarize the current knowledge on Spot 42, and present the natural distribution of spf, show family-specific secondary structural features of Spot 42, and link highly conserved structural regions to mRNA target binding. PMID:26327359

  12. Single molecule imaging of RNA polymerase II using atomic force microscopy

    NASA Astrophysics Data System (ADS)

    Rhodin, Thor; Fu, Jianhua; Umemura, Kazuo; Gad, Mohammed; Jarvis, Suzi; Ishikawa, Mitsuru

    2003-03-01

    An atomic force microscopy (AFM) study of the shape, orientation and surface topology of RNA polymerase II supported on silanized freshly cleaved mica was made. The overall aim is to define the molecular topology of RNA polymerase II in appropriate fluids to help clarify the relationship of conformational features to biofunctionality. A Nanoscope III atomic force microscope was used in the tapping mode with oxide-sharpened (8-10 nm) Si3N4 probes in aqueous zinc chloride buffer. The main structural features observed by AFM were compared to those derived from electron-density plots based on X-ray crystallographic studies. The conformational features included a bilobal silhouette with an inverted umbrella-shaped crater connected to a reaction site. These studies provide a starting point for constructing a 3D-AFM profiling analysis of proteins such as RNA polymerase complexes.

  13. A computational method for predicting regulation of human microRNAs on the influenza virus genome

    PubMed Central

    2013-01-01

    Background: While it has been suggested that host microRNAs (miRNAs) may downregulate viral gene expression as an antiviral defense mechanism, such a mechanism has not been explored in the influenza virus for human flu studies. As it is difficult to conduct related experiments on humans, computational studies can provide some insight. Although many computational tools have been designed for miRNA target prediction, there is a need for cross-species prediction, especially for predicting viral targets of human miRNAs. However, finding putative human miRNAs targeting the influenza virus genome is still challenging. Results: We developed machine-learning features and conducted comprehensive data training for predicting interactions between H1N1 genome segments and host miRNA. We defined our seed region as the first ten nucleotides from the 5' end of the miRNA to the 3' end of the miRNA and integrated various features including the number of consecutive matching bases in the seed region of 10 bases, a triplet feature in seed regions, thermodynamic energy, penalty of bulges and wobbles at binding sites, and the secondary structure of viral RNA for the prediction. Conclusions: Compared to general predictive models, our model fully takes into account the conservation patterns and features of viral RNA secondary structures, and greatly improves the prediction accuracy. Our model identified some key miRNAs including hsa-miR-489, hsa-miR-325, hsa-miR-876-3p and hsa-miR-2117, which target HA, PB2, MP and NS of H1N1, respectively. Our study provided an interesting hypothesis concerning the miRNA-based antiviral defense mechanism against influenza virus in humans, i.e., the binding between human miRNA and viral RNAs may not result in gene silencing but rather may block the viral RNA replication. PMID:24565017
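
    Two of the seed-region features named in this abstract (the number of consecutive matching bases and the count of wobbles) can be sketched as follows. This is an illustrative assumption, not the authors' feature extraction: the function name, the ungapped antiparallel pairing, and the choice to count only Watson-Crick pairs as "matching" are all hypothetical.

```python
WATSON_CRICK = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G")}
WOBBLE = {("G", "U"), ("U", "G")}


def seed_features(mirna, target_site, seed_len=10):
    """Toy seed-region features for one miRNA/target-site pair.

    mirna: miRNA sequence written 5'->3'.
    target_site: candidate viral site written 5'->3'; it is reversed so
                 that its 3' end pairs with the 5' end of the miRNA.
    Gaps and bulges are not modelled in this sketch.
    """
    seed = mirna[:seed_len]
    partner = target_site[::-1][:seed_len]
    longest = run = wobbles = 0
    for m_base, t_base in zip(seed, partner):
        if (m_base, t_base) in WATSON_CRICK:
            run += 1
            longest = max(longest, run)
        else:
            run = 0  # any non-Watson-Crick pair breaks the run
            if (m_base, t_base) in WOBBLE:
                wobbles += 1
    return {"longest_match_run": longest, "wobbles": wobbles}


if __name__ == "__main__":
    # let-7a sequence used purely as a familiar example input
    print(seed_features("UGAGGUAGUAGGUUGUAUAGUU", "UACUACCUCA"))
```

    In a fuller model these counts would sit alongside the thermodynamic, triplet and secondary-structure features the abstract lists, none of which are attempted here.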

  14. Using RNA Sequence and Structure for the Prediction of Riboswitch Aptamer: A Comprehensive Review of Available Software and Tools

    PubMed Central

    Antunes, Deborah; Jorge, Natasha A. N.; Caffarena, Ernesto R.; Passetti, Fabio

    2018-01-01

    RNA molecules are essential players in many fundamental biological processes. Prokaryotes and eukaryotes have distinct RNA classes with specific structural features and functional roles. Computational prediction of protein structures is a research field in which high confidence three-dimensional protein models can be proposed based on the sequence alignment between target and templates. However, to date, only a few approaches have been developed for the computational prediction of RNA structures. Similar to proteins, RNA structures may be altered due to the interaction with various ligands, including proteins, other RNAs, and metabolites. A riboswitch is a molecular mechanism, found in the three kingdoms of life, in which the RNA structure is modified by the binding of a metabolite. It can regulate multiple gene expression mechanisms, such as transcription, translation initiation, and mRNA splicing and processing. Due to their nature, these entities also act on the regulation of gene expression and detection of small metabolites and have the potential to help in the discovery of new classes of antimicrobial agents. In this review, we describe software and web servers currently available for riboswitch aptamer identification and secondary and tertiary structure prediction, including applications. PMID:29403526

  15. NMR structure of the Aquifex aeolicus tmRNA pseudoknot PK1: new insights into the recoding event of the ribosomal trans-translation

    PubMed Central

    Nonin-Lecomte, Sylvie; Felden, Brice; Dardel, Frédéric

    2006-01-01

    The transfer-messenger RNA (tmRNA) pseudoknot PK1 is essential for bacterial trans-translation, a ribosomal rescue mechanism. We report the solution structure of PK1 from Aquifex aeolicus, which despite an unprecedented small number of nucleotides and thus an unprecedented compact size, displays a very high thermal stability. Several unusual structural features account for these properties and indicate that PK1 belongs to the class of ribosomal frameshift pseudoknots. This suggests a similarity between the mechanism of programmed ribosomal frameshifting and trans-translation. PMID:16595798

  16. NMR structure of the Aquifex aeolicus tmRNA pseudoknot PK1: new insights into the recoding event of the ribosomal trans-translation.

    PubMed

    Nonin-Lecomte, Sylvie; Felden, Brice; Dardel, Frédéric

    2006-01-01

    The transfer-messenger RNA (tmRNA) pseudoknot PK1 is essential for bacterial trans-translation, a ribosomal rescue mechanism. We report the solution structure of PK1 from Aquifex aeolicus, which despite an unprecedented small number of nucleotides and thus an unprecedented compact size, displays a very high thermal stability. Several unusual structural features account for these properties and indicate that PK1 belongs to the class of ribosomal frameshift pseudoknots. This suggests a similarity between the mechanism of programmed ribosomal frameshifting and trans-translation.

  17. Distribution of genotype network sizes in sequence-to-structure genotype-phenotype maps.

    PubMed

    Manrubia, Susanna; Cuesta, José A

    2017-04-01

    An essential quantity to ensure evolvability of populations is the navigability of the genotype space. Navigability, understood as the ease with which alternative phenotypes are reached, relies on the existence of sufficiently large and mutually attainable genotype networks. The size of genotype networks (e.g. the number of RNA sequences folding into a particular secondary structure or the number of DNA sequences coding for the same protein structure) is astronomically large in all functional molecules investigated: an exhaustive experimental or computational study of all RNA folds or all protein structures becomes impossible even for moderately long sequences. Here, we analytically derive the distribution of genotype network sizes for a hierarchy of models which successively incorporate features of increasingly realistic sequence-to-structure genotype-phenotype maps. The main feature of these models relies on the characterization of each phenotype through a prototypical sequence whose sites admit a variable fraction of letters of the alphabet. Our models interpolate between two limit distributions: a power-law distribution, when the ordering of sites in the prototypical sequence is strongly constrained, and a lognormal distribution, as suggested for RNA, when different orderings of the same set of sites yield different phenotypes. Our main result is the qualitative and quantitative identification of those features of sequence-to-structure maps that lead to different distributions of genotype network sizes. © 2017 The Author(s).

  18. Picornaviral Polymerase Structure, Function, and Fidelity Modulation

    PubMed Central

    Peersen, Olve B.

    2017-01-01

    Like all positive strand RNA viruses, the picornaviruses replicate their genomes using a virally encoded RNA-dependent RNA polymerase enzyme known as 3Dpol. Over the past decade we have made tremendous advances in our understanding of 3Dpol structure and function, including the discovery of a novel mechanism for closing the active site that allows these viruses to easily fine tune replication fidelity and quasispecies distributions. This review summarizes current knowledge of picornaviral polymerase structure and how the enzyme interacts with RNA and other viral proteins to form stable and processive elongation complexes. The picornaviral RdRPs are among the smallest viral polymerases, but their fundamental molecular mechanism for catalysis appears to be generally applicable as a common feature of all positive strand RNA virus polymerases. PMID:28163093

  19. Methylation guide RNA evolution in archaea: structure, function and genomic organization of 110 C/D box sRNA families across six Pyrobaculum species.

    PubMed

    Lui, Lauren M; Uzilov, Andrew V; Bernick, David L; Corredor, Andrea; Lowe, Todd M; Dennis, Patrick P

    2018-05-16

    Archaeal homologs of eukaryotic C/D box small nucleolar RNAs (C/D box sRNAs) guide precise 2'-O-methyl modification of ribosomal and transfer RNAs. Although C/D box sRNA genes constitute one of the largest RNA gene families in archaeal thermophiles, most genomes have incomplete sRNA gene annotation because reliable, fully automated detection methods are not available. We expanded and curated a comprehensive gene set across six species of the crenarchaeal genus Pyrobaculum, particularly rich in C/D box sRNA genes. Using high-throughput small RNA sequencing, specialized computational searches and comparative genomics, we analyzed 526 Pyrobaculum C/D box sRNAs, organizing them into 110 families based on synteny and conservation of guide sequences which determine methylation targets. We examined gene duplications and rearrangements, including one family that has expanded in a pattern similar to retrotransposed repetitive elements in eukaryotes. New training data and inclusion of kink-turn secondary structural features enabled creation of an improved search model. Our analyses provide the most comprehensive, dynamic view of C/D box sRNA evolutionary history within a genus, in terms of modification function, feature plasticity, and gene mobility.

  20. DMS-MaPseq for genome-wide or targeted RNA structure probing in vivo

    PubMed Central

    Zubradt, Meghan; Gupta, Paromita; Persad, Sitara; Lambowitz, Alan M.; Weissman, Jonathan S.; Rouskin, Silvi

    2017-01-01

    Coupling structure-specific in vivo chemical modification to next-generation sequencing is transforming RNA secondary structural studies in living cells. The dominant strategy for detecting in vivo chemical modifications uses reverse transcriptase truncation products, which introduces biases and necessitates population-average assessments of RNA structure. Here we present dimethyl sulfate mutational profiling with sequencing (DMS-MaPseq), which encodes DMS modifications as mismatches using a thermostable group II intron reverse transcriptase (TGIRT). DMS-MaPseq yields a high signal-to-noise ratio, can report multiple structural features per molecule, and allows both genome-wide studies and focused in vivo investigations of even low abundance RNAs. We apply DMS-MaPseq for the first analysis of RNA structure within an animal tissue and to identify a functional structure involved in non-canonical translation initiation. Additionally, we use DMS-MaPseq to compare the in vivo structure of pre-mRNAs to their mature isoforms. These applications illustrate DMS-MaPseq’s capacity to dramatically expand in vivo analysis of RNA structure. PMID:27819661

  1. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Akiyama, Benjamin M.; Laurence, Hannah M.; Massey, Aaron R.

    The outbreak of Zika virus (ZIKV) and associated fetal microcephaly mandates efforts to understand the molecular processes of infection. Related flaviviruses produce noncoding subgenomic flaviviral RNAs (sfRNAs) that are linked to pathogenicity in fetal mice. These viruses make sfRNAs by co-opting a cellular exonuclease via structured RNAs called xrRNAs. We found that ZIKV-infected monkey and human epithelial cells, mouse neurons, and mosquito cells produce sfRNAs. The RNA structure that is responsible for ZIKV sfRNA production forms a complex fold that is likely found in many pathogenic flaviviruses. Mutations that disrupt the structure affect exonuclease resistance in vitro and sfRNA formation during infection. The complete ZIKV xrRNA structure clarifies the mechanism of exonuclease resistance and identifies features that may modulate function in diverse flaviviruses.

  2. Two molecular features contribute to the Argonaute specificity for the microRNA and RNAi pathways in C. elegans.

    PubMed

    Jannot, Guillaume; Boisvert, Marie-Eve L; Banville, Isabelle H; Simard, Martin J

    2008-05-01

    In Caenorhabditis elegans, specific Argonaute proteins are dedicated to the RNAi and microRNA pathways. To uncover how the precise Argonaute selection occurs, we designed dsRNA triggers containing both miRNA and siRNA sequences. While dsRNA carrying nucleotide mismatches can only enter the miRNA pathway, a fully complementary dsRNA successfully rescues let-7 miRNA function and initiates silencing by RNAi. We demonstrated that RDE-1 is essential for RNAi induced by the perfectly paired trigger, yet is not required for silencing by the let-7 miRNA. In contrast, ALG-1/ALG-2 are required for the miRNA function, but not for the siRNA-directed gene silencing. Finally, a dsRNA containing a bulged miRNA and a perfectly paired siRNA can enter both pathways, suggesting that the sorting of small RNAs occurs after the dsRNA trigger has been processed by Dicer. Thus, our data suggest that the selection of Argonaute proteins is affected by two molecular features: (1) the structure of the small RNA duplex; and (2) the Argonautes' specific characteristics.

  3. RNA helicase proteins as chaperones and remodelers

    PubMed Central

    Jarmoskaite, Inga; Russell, Rick

    2014-01-01

    Superfamily 2 helicase proteins are ubiquitous in RNA biology and have an extraordinarily broad set of functional roles. Central among these roles are to promote rearrangements of structured RNAs and to remodel RNA-protein complexes (RNPs), allowing formation of native RNA structure or progression through a functional cycle of structures. While all superfamily 2 helicases share a conserved helicase core, they are divided evolutionarily into several families, and it is principally proteins from three families, the DEAD-box, DEAH/RHA and Ski2-like families, that function to manipulate structured RNAs and RNPs. Strikingly, there are emerging differences in the mechanisms of these proteins, both between families and within the largest family (DEAD-box), and these differences appear to be tuned to their RNA or RNP substrates and their specific roles. This review outlines basic mechanistic features of the three families and surveys individual proteins and the current understanding of their biological substrates and mechanisms. PMID:24635478

  4. Mutations Abrogating VP35 Interaction with Double-Stranded RNA Render Ebola Virus Avirulent in Guinea Pigs

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Prins, Kathleen C.; Delpeut, Sebastien; Leung, Daisy W.

    2010-10-11

    Ebola virus (EBOV) protein VP35 is a double-stranded RNA (dsRNA) binding inhibitor of host interferon (IFN)-α/β responses that also functions as a viral polymerase cofactor. Recent structural studies identified key features, including a central basic patch, required for VP35 dsRNA binding activity. To address the functional significance of these VP35 structural features for EBOV replication and pathogenesis, two point mutations, K319A/R322A, that abrogate VP35 dsRNA binding activity and severely impair its suppression of IFN-α/β production were identified. Solution nuclear magnetic resonance (NMR) spectroscopy and X-ray crystallography reveal minimal structural perturbations in the K319A/R322A VP35 double mutant and suggest that loss of basic charge leads to altered function. Recombinant EBOVs encoding the mutant VP35 exhibit, relative to wild-type VP35 viruses, minimal growth attenuation in IFN-defective Vero cells but severe impairment in IFN-competent cells. In guinea pigs, the VP35 mutant virus revealed a complete loss of virulence. Strikingly, the VP35 mutant virus effectively immunized animals against subsequent wild-type EBOV challenge. These in vivo studies, using recombinant EBOV viruses, combined with the accompanying biochemical and structural analyses directly correlate VP35 dsRNA binding and IFN inhibition functions with viral pathogenesis. Moreover, these studies provide a framework for the development of antivirals targeting this critical EBOV virulence factor.

  5. Inforna 2.0: A Platform for the Sequence-Based Design of Small Molecules Targeting Structured RNAs.

    PubMed

    Disney, Matthew D; Winkelsas, Audrey M; Velagapudi, Sai Pradeep; Southern, Mark; Fallahi, Mohammad; Childs-Disney, Jessica L

    2016-06-17

    The development of small molecules that target RNA is challenging yet, if successful, could advance the development of chemical probes to study RNA function or precision therapeutics to treat RNA-mediated disease. Previously, we described Inforna, an approach that can mine motifs (secondary structures) within target RNAs, which are deduced from the RNA sequence, and compare them to a database of known RNA motif-small molecule binding partners. Output generated by Inforna includes the motif found in both the database and the desired RNA target, lead small molecules for that target, and other related meta-data. Lead small molecules can then be tested for binding and affecting cellular (dys)function. Herein, we describe Inforna 2.0, which incorporates all known RNA motif-small molecule binding partners reported in the scientific literature, a chemical similarity searching feature, and an improved user interface and is freely available via an online web server. By incorporation of interactions identified by other laboratories, the database has been doubled, containing 1936 RNA motif-small molecule interactions, including 244 unique small molecules and 1331 motifs. Interestingly, chemotype analysis of the compounds that bind RNA in the database reveals features in small molecule chemotypes that are privileged for binding. Further, this updated database expanded the number of cellular RNAs for which lead compounds can be identified.

  6. Know Your Enemy: Successful Bioinformatic Approaches to Predict Functional RNA Structures in Viral RNAs.

    PubMed

    Lim, Chun Shen; Brown, Chris M

    2017-01-01

    Structured RNA elements may control virus replication, transcription and translation, and their distinct features are being exploited by novel antiviral strategies. Viral RNA elements continue to be discovered using combinations of experimental and computational analyses. However, the wealth of sequence data, notably from deep viral RNA sequencing, viromes, and metagenomes, necessitates computational approaches being used as an essential discovery tool. In this review, we describe practical approaches being used to discover functional RNA elements in viral genomes. In addition to success stories in new and emerging viruses, these approaches have revealed some surprising new features of well-studied viruses e.g., human immunodeficiency virus, hepatitis C virus, influenza, and dengue viruses. Some notable discoveries were facilitated by new comparative analyses of diverse viral genome alignments. Importantly, comparative approaches for finding RNA elements embedded in coding and non-coding regions differ. With the exponential growth of computer power we have progressed from stem-loop prediction on single sequences to cutting edge 3D prediction, and from command line to user friendly web interfaces. Despite these advances, many powerful, user friendly prediction tools and resources are underutilized by the virology community.

  7. Know Your Enemy: Successful Bioinformatic Approaches to Predict Functional RNA Structures in Viral RNAs

    PubMed Central

    Lim, Chun Shen; Brown, Chris M.

    2018-01-01

    Structured RNA elements may control virus replication, transcription and translation, and their distinct features are being exploited by novel antiviral strategies. Viral RNA elements continue to be discovered using combinations of experimental and computational analyses. However, the wealth of sequence data, notably from deep viral RNA sequencing, viromes, and metagenomes, necessitates computational approaches being used as an essential discovery tool. In this review, we describe practical approaches being used to discover functional RNA elements in viral genomes. In addition to success stories in new and emerging viruses, these approaches have revealed some surprising new features of well-studied viruses e.g., human immunodeficiency virus, hepatitis C virus, influenza, and dengue viruses. Some notable discoveries were facilitated by new comparative analyses of diverse viral genome alignments. Importantly, comparative approaches for finding RNA elements embedded in coding and non-coding regions differ. With the exponential growth of computer power we have progressed from stem-loop prediction on single sequences to cutting edge 3D prediction, and from command line to user friendly web interfaces. Despite these advances, many powerful, user friendly prediction tools and resources are underutilized by the virology community. PMID:29354101

  8. Assembly and analysis of eukaryotic Argonaute–RNA complexes in microRNA-target recognition

    PubMed Central

    Gan, Hin Hark; Gunsalus, Kristin C.

    2015-01-01

    Experimental studies have uncovered a variety of microRNA (miRNA)–target duplex structures that include perfect, imperfect and seedless duplexes. However, non-canonical binding modes from imperfect/seedless duplexes are not well predicted by computational approaches, which rely primarily on sequence and secondary structural features, nor have their tertiary structures been characterized because solved structures to date are limited to near perfect, straight duplexes in Argonautes (Agos). Here, we use structural modeling to examine the role of Ago dynamics in assembling viable eukaryotic miRNA-induced silencing complexes (miRISCs). We show that combinations of low-frequency, global modes of motion of Ago domains are required to accommodate RNA duplexes in model human and C. elegans Ago structures. Models of viable miRISCs imply that Ago adopts variable conformations at distinct target sites that generate distorted, imperfect miRNA-target duplexes. Ago's ability to accommodate a duplex is dependent on the region where structural distortions occur: distortions in solvent-exposed seed and 3′-end regions are less likely to produce steric clashes than those in the central duplex region. Energetic analyses of assembled miRISCs indicate that target recognition is also driven by favorable Ago-duplex interactions. Such structural insights into Ago loading and target recognition mechanisms may provide a more accurate assessment of miRNA function. PMID:26432829

  9. R2R--software to speed the depiction of aesthetic consensus RNA secondary structures.

    PubMed

    Weinberg, Zasha; Breaker, Ronald R

    2011-01-04

    With continuing identification of novel structured noncoding RNAs, there is an increasing need to create schematic diagrams showing the consensus features of these molecules. RNA structural diagrams are typically made either with general-purpose drawing programs like Adobe Illustrator, or with automated or interactive programs specific to RNA. Unfortunately, the use of applications like Illustrator is extremely time consuming, while existing RNA-specific programs produce figures that are useful, but usually not of the same aesthetic quality as those produced at great cost in Illustrator. Additionally, most existing RNA-specific applications are designed for drawing single RNA molecules, not consensus diagrams. We created R2R, a computer program that facilitates the generation of aesthetic and readable drawings of RNA consensus diagrams in a fraction of the time required with general-purpose drawing programs. Since the inference of a consensus RNA structure typically requires a multiple-sequence alignment, the R2R user annotates the alignment with commands directing the layout and annotation of the RNA. R2R creates SVG or PDF output that can be imported into Adobe Illustrator, Inkscape or CorelDRAW. R2R can be used to create consensus sequence and secondary structure models for novel RNA structures or to revise models when new representatives for known RNA classes become available. Although R2R does not currently have a graphical user interface, it has proven useful in our efforts to create 100 schematic models of distinct noncoding RNA classes. R2R makes it possible to obtain high-quality drawings of the consensus sequence and structural models of many diverse RNA structures with a more practical amount of effort. R2R software is available at http://breaker.research.yale.edu/R2R and as an Additional file.

  10. Evaluation of sequence alignments and oligonucleotide probes with respect to three-dimensional structure of ribosomal RNA using ARB software package

    PubMed Central

    Kumar, Yadhu; Westram, Ralf; Kipfer, Peter; Meier, Harald; Ludwig, Wolfgang

    2006-01-01

    Background Availability of high-resolution RNA crystal structures for the 30S and 50S ribosomal subunits and the subsequent validation of comparative secondary structure models have prompted the biologists to use three-dimensional structure of ribosomal RNA (rRNA) for evaluating sequence alignments of rRNA genes. Furthermore, the secondary and tertiary structural features of rRNA are highly useful and successfully employed in designing rRNA targeted oligonucleotide probes intended for in situ hybridization experiments. RNA3D, a program to combine sequence alignment information with three-dimensional structure of rRNA was developed. Integration into ARB software package, which is used extensively by the scientific community for phylogenetic analysis and molecular probe designing, has substantially extended the functionality of ARB software suite with 3D environment. Results Three-dimensional structure of rRNA is visualized in OpenGL 3D environment with the abilities to change the display and overlay information onto the molecule, dynamically. Phylogenetic information derived from the multiple sequence alignments can be overlaid onto the molecule structure in a real time. Superimposition of both statistical and non-statistical sequence associated information onto the rRNA 3D structure can be done using customizable color scheme, which is also applied to a textual sequence alignment for reference. Oligonucleotide probes designed by ARB probe design tools can be mapped onto the 3D structure along with the probe accessibility models for evaluation with respect to secondary and tertiary structural conformations of rRNA. Conclusion Visualization of three-dimensional structure of rRNA in an intuitive display provides the biologists with the greater possibilities to carry out structure based phylogenetic analysis. Coupled with secondary structure models of rRNA, RNA3D program aids in validating the sequence alignments of rRNA genes and evaluating probe target sites. 
Superimposition of the information derived from the multiple sequence alignment onto the molecule dynamically allows the researchers to observe any sequence inherited characteristics (phylogenetic information) in real-time environment. The extended ARB software package is made freely available for the scientific community via . PMID:16672074

  11. URS DataBase: universe of RNA structures and their motifs

    PubMed Central

    Baulin, Eugene; Yacovlev, Victor; Khachko, Denis; Spirin, Sergei; Roytberg, Mikhail

    2016-01-01

    The Universe of RNA Structures DataBase (URSDB) stores information obtained from all RNA-containing PDB entries (2935 entries in October 2015). The content of the database is updated regularly. The database consists of 51 tables containing indexed data on various elements of the RNA structures. The database provides a web interface allowing user to select a subset of structures with desired features and to obtain various statistical data for a selected subset of structures or for all structures. In particular, one can easily obtain statistics on geometric parameters of base pairs, on structural motifs (stems, loops, etc.) or on different types of pseudoknots. The user can also view and get information on an individual structure or its selected parts, e.g. RNA–protein hydrogen bonds. URSDB employs a new original definition of loops in RNA structures. That definition fits both pseudoknot-free and pseudoknotted secondary structures and coincides with the classical definition in case of pseudoknot-free structures. To our knowledge, URSDB is the first database supporting searches based on topological classification of pseudoknots and on extended loop classification. Database URL: http://server3.lpm.org.ru/urs/ PMID:27242032

  12. Influence of Na+ and Mg2+ ions on RNA structures studied with molecular dynamics simulations.

    PubMed

    Fischer, Nina M; Polêto, Marcelo D; Steuer, Jakob; van der Spoel, David

    2018-06-01

    The structure of ribonucleic acid (RNA) polymers is strongly dependent on the presence of, in particular Mg2+ cations to stabilize structural features. Only in high-resolution X-ray crystallography structures can ions be identified reliably. Here, we perform molecular dynamics simulations of 24 RNA structures with varying ion concentrations. Twelve of the structures were helical and the others complex folded. The aim of the study is to predict ion positions but also to evaluate the impact of different types of ions (Na+ or Mg2+) and the ionic strength on structural stability and variations of RNA. As a general conclusion Mg2+ is found to conserve the experimental structure better than Na+ and, where experimental ion positions are available, they can be reproduced with reasonable accuracy. If a large surplus of ions is present the added electrostatic screening makes prediction of binding-sites less reproducible. Distinct differences in ion-binding between helical and complex folded structures are found. The strength of binding (ΔG‡ for breaking RNA atom-ion interactions) is found to differ between roughly 10 and 26 kJ/mol for the different RNA atoms. Differences in stability between helical and complex folded structures and of the influence of metal ions on either are discussed.
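    To put the reported 10 to 26 kJ/mol range of ΔG‡ values in perspective, the sketch below compares two barriers through the Boltzmann factor exp(-ΔG‡/RT). It is only an illustration: the temperature of 298.15 K and the function name are assumptions, not values taken from the abstract, and a common pre-exponential factor is assumed for both interactions.

```python
import math

R_KJ = 8.314462618e-3  # gas constant in kJ/(mol*K)


def relative_breaking_rate(dg_low_kj: float, dg_high_kj: float,
                           temperature_k: float = 298.15) -> float:
    """Ratio of escape rates for two barriers, k_low / k_high.

    Assumes k is proportional to exp(-dG/RT) with the same prefactor
    for both interactions (an assumption made for illustration only).
    """
    if temperature_k <= 0:
        raise ValueError("temperature must be positive")
    return math.exp((dg_high_kj - dg_low_kj) / (R_KJ * temperature_k))


if __name__ == "__main__":
    # Weakest versus strongest RNA atom-ion interaction quoted in the abstract.
    print(f"{relative_breaking_rate(10.0, 26.0):.0f}x faster")
```

    Under these assumptions, a 16 kJ/mol difference in barrier height corresponds to a difference of several hundred-fold in how readily the interaction breaks.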

  13. Computational study of RNA folding kinetics and thermodynamics

    NASA Astrophysics Data System (ADS)

    Morgan, Steven Robert

    RNA in its many forms is involved in the processes of protein manufacture, gene splicing, catalysis and gene regulation. It is also the store of genetic information in some viruses. The function of the RNA is determined by its structure, and it is the purpose of this thesis to investigate kinetic and thermodynamic properties of RNA secondary structures in order to obtain a better understanding of their formation and function. Our main tenet is that kinetic formation of RNA structure is necessary to explain features found in natural RNA structures, as well as aspects of the biological function of RNA. Firstly we show that examination of the energies of fragments of RNA secondary structure provides evidence for kinetic formation of structure. Local regions of RNA of length less than about 100 nucleotides adopt a conformation with energy near or equal to the minimum possible for those regions, whilst the energies of larger domains are much further from their respective minima. This is consistent with the patterns that would be expected if RNA structure is folded kinetically during transcription. A Monte-Carlo algorithm is then used to model the kinetic folding of RNA during transcriptional growth. The algorithm is capable of finding the correct structure of a natural RNA for which the minimum free energy approach is unsuccessful. In the viral phage MS2, kinetically formed RNA structure plays an important role in the regulation of gene expression. The folding algorithm can accurately model this by kinetically controlling access to the gene initiation region. The algorithm is also successfully used to model the control of replication in the ColE1 plasmid. Taking a different approach, we then use a simplified model of RNA secondary structure to investigate the size of energy barriers between degenerate minimum energy structures. 
This model has much in common with physical systems such as spin glasses, and in fact shows similar behaviour to these systems in that energy barriers between structures grow quickly with the length of the RNA sequence. These barriers will serve to trap RNA in non-optimal structures. Together these studies demonstrate the necessity of studying RNA secondary structure from a kinetic point of view, and provide clear directions in which further work may be taken. Kinetic models of RNA secondary structure should continue to prove useful in modelling the structure and function of RNA.
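    The abstract does not describe the move set or acceptance rule of its Monte-Carlo folding algorithm, so the following is only a generic sketch of the standard Metropolis criterion, which many Monte-Carlo schemes use to decide whether a proposed structural change (for example, forming or breaking a helix) is accepted. The function names, the energy unit (kcal/mol) and the default temperature of 310.15 K are assumptions made for illustration.

```python
import math
import random

R_KCAL = 1.987e-3  # gas constant in kcal/(mol*K)


def acceptance_probability(delta_e: float, temperature_k: float = 310.15) -> float:
    """Metropolis acceptance probability for an energy change delta_e (kcal/mol).

    Downhill or neutral moves are always accepted; uphill moves are accepted
    with probability exp(-delta_e / RT).
    """
    if delta_e <= 0:
        return 1.0
    return math.exp(-delta_e / (R_KCAL * temperature_k))


def accept_move(delta_e: float, temperature_k: float = 310.15,
                rng: random.Random = random.Random()) -> bool:
    """Draw a uniform random number and apply the Metropolis rule."""
    return rng.random() < acceptance_probability(delta_e, temperature_k)


if __name__ == "__main__":
    for d in (-3.0, 0.5, 2.0, 5.0):
        print(d, round(acceptance_probability(d), 4))
```

    Because uphill moves become exponentially less likely as the barrier grows, a simulation of this kind can remain trapped in non-optimal structures, which is the behaviour the thesis attributes to large energy barriers.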

  14. Rose spring dwarf-associated virus has RNA structural and gene-expression features like those of Barley yellow dwarf virus

    PubMed Central

    Salem, Nida’ M.; Miller, W. Allen; Rowhani, Adib; Golino, Deborah A.; Moyne, Anne-Laure; Falk, Bryce W.

    2015-01-01

    We determined the complete nucleotide sequence of the Rose spring dwarf-associated virus (RSDaV) genomic RNA (GenBank accession no. EU024678) and compared its predicted RNA structural characteristics affecting gene expression. A cDNA library was derived from RSDaV double-stranded RNAs (dsRNAs) purified from infected tissue. Nucleotide sequence analysis of the cloned cDNAs, plus for clones generated by 5′- and 3′-RACE showed the RSDaV genomic RNA to be 5,808 nucleotides. The genomic RNA contains five major open reading frames (ORFs), and three small ORFs in the 3′-terminal 800 nucleotides, typical for viruses of genus Luteovirus in the family Luteoviridae. Northern blot hybridization analysis revealed the genomic RNA and two prominent subgenomic RNAs of approximately 3 kb and 1 kb. Putative 5′ ends of the sgRNAs were predicted by identification of conserved sequences and secondary structures which resembled the Barley yellow dwarf virus (BYDV) genomic RNA 5′ end and subgenomic RNA promoter sequences. Secondary structures of the BYDV-like ribosomal frameshift elements and cap-independent translation elements, including long-distance base pairing spanning four kb were identified. These contain similarities but also informative differences with the BYDV structures, including a strikingly different structure predicted for the 3′ cap-independent translation element. These analyses of the RSDaV genomic RNA show more complexity for the RNA structural elements for members of the Luteoviridae. PMID:18329064

  15. Rose spring dwarf-associated virus has RNA structural and gene-expression features like those of Barley yellow dwarf virus.

    PubMed

    Salem, Nida' M; Miller, W Allen; Rowhani, Adib; Golino, Deborah A; Moyne, Anne-Laure; Falk, Bryce W

    2008-06-05

    We determined the complete nucleotide sequence of the Rose spring dwarf-associated virus (RSDaV) genomic RNA (GenBank accession no. EU024678) and compared its predicted RNA structural characteristics affecting gene expression. A cDNA library was derived from RSDaV double-stranded RNAs (dsRNAs) purified from infected tissue. Nucleotide sequence analysis of the cloned cDNAs, plus for clones generated by 5'- and 3'-RACE showed the RSDaV genomic RNA to be 5808 nucleotides. The genomic RNA contains five major open reading frames (ORFs), and three small ORFs in the 3'-terminal 800 nucleotides, typical for viruses of genus Luteovirus in the family Luteoviridae. Northern blot hybridization analysis revealed the genomic RNA and two prominent subgenomic RNAs of approximately 3 kb and 1 kb. Putative 5' ends of the sgRNAs were predicted by identification of conserved sequences and secondary structures which resembled the Barley yellow dwarf virus (BYDV) genomic RNA 5' end and subgenomic RNA promoter sequences. Secondary structures of the BYDV-like ribosomal frameshift elements and cap-independent translation elements, including long-distance base pairing spanning four kb were identified. These contain similarities but also informative differences with the BYDV structures, including a strikingly different structure predicted for the 3' cap-independent translation element. These analyses of the RSDaV genomic RNA show more complexity for the RNA structural elements for members of the Luteoviridae.

  16. Zika virus produces noncoding RNAs using a multi-pseudoknot structure that confounds a cellular exonuclease

    DOE PAGES

    Akiyama, Benjamin M.; Laurence, Hannah M.; Massey, Aaron R.; ...

    2016-11-10

    The outbreak of Zika virus (ZIKV) and associated fetal microcephaly mandates efforts to understand the molecular processes of infection. Related flaviviruses produce noncoding subgenomic flaviviral RNAs (sfRNAs) that are linked to pathogenicity in fetal mice. These viruses make sfRNAs by co-opting a cellular exonuclease via structured RNAs called xrRNAs. We found that ZIKV-infected monkey and human epithelial cells, mouse neurons, and mosquito cells produce sfRNAs. The RNA structure that is responsible for ZIKV sfRNA production forms a complex fold that is likely found in many pathogenic flaviviruses. Mutations that disrupt the structure affect exonuclease resistance in vitro and sfRNA formation during infection. The complete ZIKV xrRNA structure clarifies the mechanism of exonuclease resistance and identifies features that may modulate function in diverse flaviviruses.

  17. The importance of mRNA structure in determining the pathogenicity of synonymous and non-synonymous mutations in haemophilia

    PubMed Central

    Hamasaki-Katagiri, Nobuko; Lin, Brian C.; Simon, Jonathan; Hunt, Ryan C.; Schiller, Tal; Russek-Cohen, Estelle; Komar, Anton A.; Bar, Haim; Kimchi-Sarfaty, Chava

    2016-01-01

    Introduction Mutational analysis is commonly used to support the diagnosis and management of haemophilia. This has allowed for the generation of large mutation databases which provide unparalleled insight into genotype-phenotype relationships. Haemophilia is associated with inversions, deletions, insertions, nonsense and missense mutations. Both synonymous and non-synonymous mutations influence the base pairing of messenger RNA (mRNA), which can alter mRNA structure, cellular half-life and ribosome processivity/elongation. However, the role of mRNA structure in determining the pathogenicity of point mutations in haemophilia has not been evaluated. Aim To evaluate mRNA thermodynamic stability and associated RNA prediction software as a means to distinguish between neutral and disease-associated mutations in haemophilia. Methods Five mRNA structure prediction software programs were used to assess the thermodynamic stability of mRNA fragments carrying neutral vs. disease-associated and synonymous vs. non-synonymous point mutations in F8, F9 and a third X-linked gene, DMD (dystrophin). Results In F8 and DMD, disease-associated mutations tend to occur in more structurally stable mRNA regions, represented by lower MFE (minimum free energy) levels. In comparing multiple software packages for mRNA structure prediction, a 101–151 nucleotide fragment length appears to be a feasible range for structuring future studies. Conclusion mRNA thermodynamic stability is one predictive characteristic, which when combined with other RNA and protein features, may offer significant insight when screening sequencing data for novel disease-associated mutations. Our results also suggest potential utility in evaluating the mRNA thermodynamic stability profile of a gene when determining the viability of interchanging codons for biological and therapeutic applications. PMID:27933712
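    As an illustration of the fragment-based setup described above, the sketch below cuts a wild-type fragment and the matching mutant fragment out of an mRNA sequence so that both can be passed to a structure prediction program. The abstract does not state how fragments were positioned, so centring the fragment on the mutation, the function name and the default length of 101 nt are assumptions; the MFE calculation itself is left to external software.

```python
def mutant_fragment(mrna: str, pos: int, alt: str,
                    length: int = 101) -> tuple[str, str]:
    """Return (wild_type, mutant) fragments centred on 0-based position `pos`.

    `alt` is the substituted nucleotide. Fragments are truncated when the
    mutation lies closer than length // 2 to either end of the transcript.
    """
    if length % 2 == 0:
        raise ValueError("length must be odd so the mutation sits at the centre")
    if not 0 <= pos < len(mrna):
        raise ValueError("pos lies outside the sequence")
    if len(alt) != 1:
        raise ValueError("alt must be a single nucleotide")
    half = length // 2
    start = max(0, pos - half)
    end = min(len(mrna), pos + half + 1)
    wild = mrna[start:end]
    offset = pos - start  # index of the mutated base within the fragment
    mutant = wild[:offset] + alt + wild[offset + 1:]
    return wild, mutant


if __name__ == "__main__":
    toy = "ACGU" * 50  # hypothetical 200-nt sequence, not real F8 or F9 data
    wt, mt = mutant_fragment(toy, pos=100, alt="G", length=101)
    print(len(wt), wt[50], mt[50])
```

    Repeating the call with lengths across the 101 to 151 nt range gives the set of fragments whose predicted stabilities could then be compared between packages.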

  18. RNA 3D Structure Modeling by Combination of Template-Based Method ModeRNA, Template-Free Folding with SimRNA, and Refinement with QRNAS.

    PubMed

    Piatkowski, Pawel; Kasprzak, Joanna M; Kumar, Deepak; Magnus, Marcin; Chojnowski, Grzegorz; Bujnicki, Janusz M

    2016-01-01

    RNA encompasses an essential part of all known forms of life. The functions of many RNA molecules are dependent on their ability to form complex three-dimensional (3D) structures. However, experimental determination of RNA 3D structures is laborious and challenging, and therefore, the majority of known RNAs remain structurally uncharacterized. To address this problem, computational structure prediction methods were developed that either utilize information derived from known structures of other RNA molecules (by way of template-based modeling) or attempt to simulate the physical process of RNA structure formation (by way of template-free modeling). All computational methods suffer from various limitations that make theoretical models less reliable than high-resolution experimentally determined structures. This chapter provides a protocol for computational modeling of RNA 3D structure that overcomes major limitations by combining two complementary approaches: template-based modeling that is capable of predicting global architectures based on similarity to other molecules but often fails to predict local unique features, and template-free modeling that can predict the local folding, but is limited to modeling the structure of relatively small molecules. Here, we combine the use of a template-based method ModeRNA with a template-free method SimRNA. ModeRNA requires a sequence alignment of the target RNA sequence to be modeled with a template of the known structure; it generates a model that predicts the structure of a conserved core and provides a starting point for modeling of variable regions. SimRNA can be used to fold small RNAs (<80 nt) without any additional structural information, and to refold parts of models for larger RNAs that have a correctly modeled core. ModeRNA can be either downloaded, compiled and run locally or run through a web interface at http://genesilico.pl/modernaserver/ . 
SimRNA is currently available to download for local use as a precompiled software package at http://genesilico.pl/software/stand-alone/simrna and as a web server at http://genesilico.pl/SimRNAweb . For model optimization we use QRNAS, available at http://genesilico.pl/qrnas .

  19. MultiSETTER: web server for multiple RNA structure comparison.

    PubMed

    Čech, Petr; Hoksza, David; Svozil, Daniel

    2015-08-12

    Understanding the architecture and function of RNA molecules requires methods for comparing and analyzing their tertiary and quaternary structures. While structural superposition of short RNAs is achievable in a reasonable time, large structures represent a much bigger challenge. Therefore, we have developed a fast and accurate algorithm for RNA pairwise structure superposition called SETTER and implemented it in the SETTER web server. However, though biological relationships can be inferred by a pairwise structure alignment, key features preserved by evolution can be identified only from a multiple structure alignment. Thus, we extended the SETTER algorithm to the alignment of multiple RNA structures and developed the MultiSETTER algorithm. In this paper, we present the updated version of the SETTER web server that implements a user friendly interface to the MultiSETTER algorithm. The server accepts RNA structures either as a list of PDB IDs or as user-defined PDB files. After the superposition is computed, structures are visualized in 3D and several reports and statistics are generated. To the best of our knowledge, the MultiSETTER web server is the first publicly available tool for a multiple RNA structure alignment. The MultiSETTER server offers the visual inspection of an alignment in 3D space which may reveal structural and functional relationships not captured by other multiple alignment methods based either on a sequence or on secondary structure motifs.

  20. ClaRNA: a classifier of contacts in RNA 3D structures based on a comparative analysis of various classification schemes

    PubMed Central

    Waleń, Tomasz; Chojnowski, Grzegorz; Gierski, Przemysław; Bujnicki, Janusz M.

    2014-01-01

    The understanding of folding and function of RNA molecules depends on the identification and classification of interactions between ribonucleotide residues. We developed a new method named ClaRNA for computational classification of contacts in RNA 3D structures. Unique features of the program are the ability to identify imperfect contacts and to process coarse-grained models. Each doublet of spatially close ribonucleotide residues in a query structure is compared to clusters of reference doublets obtained by analysis of a large number of experimentally determined RNA structures, and assigned a score that describes its similarity to one or more known types of contacts, including pairing, stacking, base–phosphate and base–ribose interactions. The accuracy of ClaRNA is 0.997 for canonical base pairs, 0.983 for non-canonical pairs and 0.961 for stacking interactions. The generalized squared correlation coefficient (GC2) for ClaRNA is 0.969 for canonical base pairs, 0.638 for non-canonical pairs and 0.824 for stacking interactions. The classifier can be easily extended to include new types of spatial relationships between pairs or larger assemblies of nucleotide residues. ClaRNA is freely available via a web server that includes an extensive set of tools for processing and visualizing structural information about RNA molecules. PMID:25159614

  1. Similarities and Differences between RNA and DNA Double-Helical Structures in Circular Dichroism Spectroscopy: A SAC-CI Study.

    PubMed

    Miyahara, Tomoo; Nakatsuji, Hiroshi; Sugiyama, Hiroshi

    2016-11-17

    The helical structures of DNA and RNA are investigated experimentally using circular dichroism (CD) spectroscopy. The signs and the shapes of the CD spectra are much different between the right- and left-handed structures as well as between DNA and RNA. The main difference lies in the sign at around 295 nm of the CD spectra: it is positive for the right-handed B-DNA and the left-handed Z-RNA but is negative for the left-handed Z-DNA and the right-handed A-RNA. We calculated the SAC-CI CD spectra of DNA and RNA using the tetramer models, which include both hydrogen-bonding and stacking interactions that are important in both DNA and RNA. The SAC-CI results reproduced the features at around 295 nm of the experimental CD spectra of each DNA and RNA, and elucidated that the strong stacking interaction between the two base pairs is the origin of the negative peaks at 295 nm of the CD spectra for both DNA and RNA. On the basis of these facts, we discuss the similarities and differences between RNA and DNA double-helical structures in the CD spectroscopy based on the ChiraSac methodology.

  2. Overview of methods in RNA nanotechnology: synthesis, purification, and characterization of RNA nanoparticles.

    PubMed

    Haque, Farzin; Guo, Peixuan

    2015-01-01

    RNA nanotechnology encompasses the use of RNA as a construction material to build homogeneous nanostructures by bottom-up self-assembly with defined size, structure, and stoichiometry; this pioneering concept demonstrated in 1998 (Guo et al., Molecular Cell 2:149-155, 1998; featured in Cell) has emerged as a new field that also involves materials engineering and synthetic structural biology (Guo, Nature Nanotechnology 5:833-842, 2010). The field of RNA nanotechnology has skyrocketed over the last few years, as evidenced by the burst of publications in prominent journals on RNA nanostructures and their applications in nanomedicine and nanotechnology. Rapid advances in RNA chemistry, RNA biophysics, and RNA biology have created new opportunities for translating basic science into clinical practice. RNA nanotechnology holds considerable promise in this regard. Increased evidence also suggests that a substantial part of the 98.5 % of human genome (Lander et al. Nature 409:860-921, 2001) that used to be called "junk DNA" actually codes for noncoding RNA. As we understand more about how RNA structures are related to function, we can fabricate synthetic RNA nanoparticles for the diagnosis and treatment of diseases. This chapter provides a brief overview of the field regarding the design, construction, purification, and characterization of RNA nanoparticles for diverse applications in nanotechnology and nanomedicine.

  3. RNA synthetic mechanisms employed by diverse families of RNA viruses.

    PubMed

    McDonald, Sarah M

    2013-01-01

    RNA viruses are ubiquitous in nature, infecting every known organism on the planet. These viruses can also be notorious human pathogens with significant medical and economic burdens. Central to the lifecycle of an RNA virus is the synthesis of new RNA molecules, a process that is mediated by specialized virally encoded enzymes called RNA-dependent RNA polymerases (RdRps). RdRps directly catalyze phosphodiester bond formation between nucleoside triphosphates in an RNA-templated manner. These enzymes are strikingly conserved in their structural and functional features, even among diverse RNA viruses belonging to different families. During host cell infection, the activities of viral RdRps are often regulated by viral cofactor proteins. Cofactors can modulate the type and timing of RNA synthesis by directly engaging the RdRp and/or by indirectly affecting its capacity to recognize template RNA. High-resolution structures of RdRps as apoenzymes, bound to RNA templates, in the midst of catalysis, and/or interacting with regulatory cofactor proteins, have dramatically increased our understanding of viral RNA synthetic mechanisms. Combined with elegant biochemical studies, such structures are providing a scientific platform for the rational design of antiviral agents aimed at preventing and treating RNA virus-induced diseases. Copyright © 2013 John Wiley & Sons, Ltd.

  4. The nucleotide sequence of a major glycine transfer RNA from the posterior silk gland of Bombyx mori L.

    PubMed Central

    Zúñiga, M C; Steitz, J A

    1977-01-01

    The nucleotide sequence of tRNA1Gly isolated from the posterior silk gland of Bombyx mori has been determined. This transfer RNA is present in high amounts in the posterior silk gland during the fifth larval instar. It has a GCC anticodon, capable of decoding a major glycine codon in the fibroin messenger RNA, GGU. Structural features of Bombyx tRNA1Gly and its homology to other eukaryotic glycine tRNAs are discussed. PMID:414206

  5. The origin and evolution of tRNA inferred from phylogenetic analysis of structure.

    PubMed

    Sun, Feng-Jie; Caetano-Anollés, Gustavo

    2008-01-01

    The evolutionary history of the two structural and functional domains of tRNA is controversial but harbors the secrets of early translation and the genetic code. To explore the origin and evolution of tRNA, we reconstructed phylogenetic trees directly from molecular structure. Forty-two structural characters describing the geometry of 571 tRNAs and three statistical parameters describing thermodynamic and mechanical features of molecules quantitatively were used to derive phylogenetic trees of molecules and molecular substructures. Trees of molecules failed to group tRNA according to amino acid specificity and did not reveal the tripartite nature of life, probably due to loss of phylogenetic signal or because tRNA diversification predated organismal diversification. Trees of substructures derived from both structural and statistical characters support the origin of tRNA in the acceptor arm and the hypothesis that the top half domain composed of acceptor and pseudouridine (TPsiC) arms is more ancient than the bottom half domain composed of dihydrouridine (DHU) and anticodon arms. This constitutes the cornerstone of the genomic tag hypothesis that postulates tRNAs were ancient telomeres in the RNA world. The trees of substructures suggest a model for the evolution of the major functional and structural components of tRNA. In this model, short RNA hairpins with stems homologous to the acceptor arm of present day tRNAs were extended with regions homologous to TPsiC and anticodon arms. The DHU arm was then incorporated into the resulting three-stemmed structure to form a proto-cloverleaf structure. The variable region was the last structural addition to the molecular repertoire of evolving tRNA substructures.

  6. Self-assembly of multi-stranded RNA motifs into lattices and tubular structures

    DOE PAGES

    Stewart, Jaimie Marie; Subramanian, Hari K. K.; Franco, Elisa

    2017-02-16

    Rational design of nucleic acid molecules yields self-assembling scaffolds with increasing complexity, size and functionality. It is an open question whether design methods tailored to build DNA nanostructures can be adapted to build RNA nanostructures with comparable features. We demonstrate the formation of RNA lattices and tubular assemblies from double crossover (DX) tiles, a canonical motif in DNA nanotechnology. Tubular structures can exceed 1 μm in length, suggesting that this DX motif can produce very robust lattices. Some of these tubes spontaneously form with left-handed chirality. We obtain assemblies by using two methods: a protocol where gel-extracted RNA strands are slowly annealed, and a one-pot transcription and anneal procedure. We then identify the tile nick position as a structural requirement for lattice formation. These results demonstrate that stable RNA structures can be obtained with design tools imported from DNA nanotechnology. These large assemblies could be potentially integrated with a variety of functional RNA motifs for drug or nanoparticle delivery, or for colocalization of cellular components.

  7. Self-assembly of multi-stranded RNA motifs into lattices and tubular structures

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Stewart, Jaimie Marie; Subramanian, Hari K. K.; Franco, Elisa

    Rational design of nucleic acid molecules yields self-assembling scaffolds with increasing complexity, size and functionality. It is an open question whether design methods tailored to build DNA nanostructures can be adapted to build RNA nanostructures with comparable features. We demonstrate the formation of RNA lattices and tubular assemblies from double crossover (DX) tiles, a canonical motif in DNA nanotechnology. Tubular structures can exceed 1 μm in length, suggesting that this DX motif can produce very robust lattices. Some of these tubes spontaneously form with left-handed chirality. We obtain assemblies by using two methods: a protocol where gel-extracted RNA strands are slowly annealed, and a one-pot transcription and anneal procedure. We then identify the tile nick position as a structural requirement for lattice formation. These results demonstrate that stable RNA structures can be obtained with design tools imported from DNA nanotechnology. These large assemblies could be potentially integrated with a variety of functional RNA motifs for drug or nanoparticle delivery, or for colocalization of cellular components.

  8. Self-assembly of multi-stranded RNA motifs into lattices and tubular structures

    PubMed Central

    Stewart, Jaimie Marie; Subramanian, Hari K. K.

    2017-01-01

    Rational design of nucleic acid molecules yields self-assembling scaffolds with increasing complexity, size and functionality. It is an open question whether design methods tailored to build DNA nanostructures can be adapted to build RNA nanostructures with comparable features. Here we demonstrate the formation of RNA lattices and tubular assemblies from double crossover (DX) tiles, a canonical motif in DNA nanotechnology. Tubular structures can exceed 1 μm in length, suggesting that this DX motif can produce very robust lattices. Some of these tubes spontaneously form with left-handed chirality. We obtain assemblies by using two methods: a protocol where gel-extracted RNA strands are slowly annealed, and a one-pot transcription and anneal procedure. We identify the tile nick position as a structural requirement for lattice formation. Our results demonstrate that stable RNA structures can be obtained with design tools imported from DNA nanotechnology. These large assemblies could be potentially integrated with a variety of functional RNA motifs for drug or nanoparticle delivery, or for colocalization of cellular components. PMID:28204562

  9. The Crystal Structure and RNA-Binding of an Orthomyxovirus Nucleoprotein

    PubMed Central

    Zheng, Wenjie; Olson, John; Vakharia, Vikram; Tao, Yizhi Jane

    2013-01-01

    Genome packaging for viruses with segmented genomes is often a complex problem. This is particularly true for influenza viruses and other orthomyxoviruses, whose genome consists of multiple negative-sense RNAs encapsidated as ribonucleoprotein (RNP) complexes. To better understand the structural features of orthomyxovirus RNPs that allow them to be packaged, we determined the crystal structure of the nucleoprotein (NP) of a fish orthomyxovirus, the infectious salmon anemia virus (ISAV) (genus Isavirus). As the major protein component of the RNPs, ISAV-NP possesses a bi-lobular structure similar to the influenza virus NP. Because both RNA-free and RNA-bound ISAV-NP form stable dimers in solution, we were able to measure the NP RNA binding affinity as well as the stoichiometry using recombinant proteins and synthetic oligos. Our RNA binding analysis revealed that each ISAV-NP binds ∼12 nts of RNA, shorter than the 24–28 nts originally estimated for the influenza A virus NP based on population average. The 12-nt stoichiometry was further confirmed by results from electron microscopy and dynamic light scattering. Considering that RNPs of ISAV and the influenza viruses have similar morphologies and dimensions, our findings suggest that NP-free RNA may exist on orthomyxovirus RNPs, and selective RNP packaging may be accomplished through direct RNA-RNA interactions. PMID:24068932

  10. The separation between the 5'-3' ends in long RNA molecules is short and nearly constant.

    PubMed

    Leija-Martínez, Nehemías; Casas-Flores, Sergio; Cadena-Nava, Rubén D; Roca, Joan A; Mendez-Cabañas, José A; Gomez, Eduardo; Ruiz-Garcia, Jaime

    2014-12-16

    RNA molecules play different roles in coding, decoding and gene expression regulation. Such roles are often associated with the RNA secondary or tertiary structures. The folding dynamics lead to multiple secondary structures of long RNA molecules, since an RNA molecule might fold into multiple distinct native states. Despite an ensemble of different structures, it has been theoretically proposed that the separation between the 5' and 3' ends of long single-stranded RNA molecules (ssRNA) remains constant, independent of their base content and length. Here, we present the first experimental measurements of the end-to-end separation in long ssRNA molecules. To determine this separation, we use single molecule Fluorescence Resonance Energy Transfer of fluorescently end-labeled ssRNA molecules ranging from 500 to 5500 nucleotides in length, obtained from two viruses and a fungus. We found that the end-to-end separation is indeed short, within 5-9 nm. It is remarkable that the separation of the ends of all RNA molecules studied remains small and similar, despite the origin, length and differences in their secondary structure. This implies that the ssRNA molecules are 'effectively circularized', something that might be a general feature of RNAs, and could result in fine-tuning for translation and gene expression regulation. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
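
    As a rough illustration of how a FRET efficiency maps to a dye-to-dye distance, the sketch below uses the standard Förster relation E = 1 / (1 + (r/R0)^6). The abstract does not state the dye pair, the Förster radius R0, or the analysis the authors actually used, so the function names and the 5.0 nm value of R0 are assumptions chosen only for illustration.

```python
def fret_efficiency(distance_nm, r0_nm):
    """Foerster relation: transfer efficiency for a donor-acceptor distance."""
    return 1.0 / (1.0 + (distance_nm / r0_nm) ** 6)


def fret_distance(efficiency, r0_nm):
    """Invert the Foerster relation to recover distance from efficiency."""
    if not 0.0 < efficiency < 1.0:
        raise ValueError("efficiency must lie strictly between 0 and 1")
    return r0_nm * (1.0 / efficiency - 1.0) ** (1.0 / 6.0)


if __name__ == "__main__":
    R0_NM = 5.0  # hypothetical Foerster radius, not taken from the abstract
    for r in (5.0, 7.0, 9.0):  # distances spanning the reported 5-9 nm range
        print(f"r = {r:.1f} nm -> E = {fret_efficiency(r, R0_NM):.3f}")
```

    Because the efficiency falls off with the sixth power of distance, separations well above R0 give small efficiencies, which is why the choice of dye pair matters for measurements in this range.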

  11. NOBAI: a web server for character coding of geometrical and statistical features in RNA structure

    PubMed Central

    Knudsen, Vegeir; Caetano-Anollés, Gustavo

    2008-01-01

    The Numeration of Objects in Biology: Alignment Inferences (NOBAI) web server provides a web interface to the applications in the NOBAI software package. This software codes topological and thermodynamic information related to the secondary structure of RNA molecules as multi-state phylogenetic characters, builds character matrices directly in NEXUS format and provides sequence randomization options. The web server is an effective tool that facilitates the search for evolutionary history embedded in the structure of functional RNA molecules. The NOBAI web server is accessible at ‘http://www.manet.uiuc.edu/nobai/nobai.php’. This web site is free and open to all users and there is no login requirement. PMID:18448469

  12. RNApdbee--a webserver to derive secondary structures from pdb files of knotted and unknotted RNAs.

    PubMed

    Antczak, Maciej; Zok, Tomasz; Popenda, Mariusz; Lukasiak, Piotr; Adamiak, Ryszard W; Blazewicz, Jacek; Szachniuk, Marta

    2014-07-01

    In RNA structural biology and bioinformatics, access to a correct RNA secondary structure and its proper representation is of crucial importance. This is especially true in the field of secondary and 3D RNA structure prediction. Here, we introduce RNApdbee, a new tool that extracts RNA secondary structure from a PDB file and presents it in both textual and graphical form. RNApdbee supports processing of knotted and unknotted structures of large RNAs, also within protein complexes. The method works not only for first-order but also for higher-order pseudoknots, and gives information about canonical and non-canonical base pairs. A combination of these features is unique among existing applications for RNA structure analysis. Additionally, a function for converting between the text notations, i.e. BPSEQ, CT and extended dot-bracket, is provided. In order to facilitate a more comprehensive study, the webserver integrates the functionality of RNAView, MC-Annotate and 3DNA/DSSR, being the most common tools used for automated identification and classification of RNA base pairs. RNApdbee is implemented as a publicly available webserver with an intuitive interface and can be freely accessed at http://rnapdbee.cs.put.poznan.pl/. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
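
    To make the notation conversion mentioned above concrete, here is a minimal sketch of turning a dot-bracket string into BPSEQ-style rows (position, base, partner position, with 0 for unpaired). It is not RNApdbee's code; the function name is hypothetical, and only four bracket families are handled, whereas extended dot-bracket notation may use further symbols.

```python
def dot_bracket_to_bpseq(sequence, structure):
    """Return BPSEQ-style rows (index, base, partner) for a dot-bracket string."""
    if len(sequence) != len(structure):
        raise ValueError("sequence and structure must have the same length")
    # One stack per bracket family, so crossing (pseudoknotted) pairs resolve.
    openers = {"(": ")", "[": "]", "{": "}", "<": ">"}
    closers = {close: open_ for open_, close in openers.items()}
    stacks = {open_: [] for open_ in openers}
    partner = [0] * len(sequence)  # 0 marks an unpaired position
    for i, symbol in enumerate(structure, start=1):
        if symbol in openers:
            stacks[symbol].append(i)
        elif symbol in closers:
            stack = stacks[closers[symbol]]
            if not stack:
                raise ValueError(f"unmatched {symbol!r} at position {i}")
            j = stack.pop()
            partner[i - 1] = j
            partner[j - 1] = i
        elif symbol != ".":
            raise ValueError(f"unsupported symbol {symbol!r} at position {i}")
    if any(stacks.values()):
        raise ValueError("unmatched opening bracket")
    return [(i, base, partner[i - 1]) for i, base in enumerate(sequence, start=1)]


if __name__ == "__main__":
    for row in dot_bracket_to_bpseq("GGGAAACCC", "(((...)))"):
        print(*row)
```

    Keeping a separate stack per bracket type is what lets a string such as "([)]" describe two crossing pairs, the simplest form of a pseudoknot.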

  13. Structural RNAs of known and unknown function identified in malaria parasites by comparative genomics and RNA analysis

    PubMed Central

    Chakrabarti, Kausik; Pearson, Michael; Grate, Leslie; Sterne-Weiler, Timothy; Deans, Jonathan; Donohue, John Paul; Ares, Manuel

    2007-01-01

    As the genomes of more eukaryotic pathogens are sequenced, understanding how molecular differences between parasite and host might be exploited to provide new therapies has become a major focus. Central to cell function are RNA-containing complexes involved in gene expression, such as the ribosome, the spliceosome, snoRNAs, RNase P, and telomerase, among others. In this article we identify by comparative genomics and validate by RNA analysis numerous previously unknown structural RNAs encoded by the Plasmodium falciparum genome, including the telomerase RNA, U3, 31 snoRNAs, as well as previously predicted spliceosomal snRNAs, SRP RNA, MRP RNA, and RNAse P RNA. Furthermore, we identify six new RNA coding genes of unknown function. To investigate the relationships of the RNA coding genes to other genomic features in related parasites, we developed a genome browser for P. falciparum (http://areslab.ucsc.edu/cgi-bin/hgGateway). Additional experiments provide evidence supporting the prediction that snoRNAs guide methylation of a specific position on U4 snRNA, as well as predicting an snRNA promoter element particular to Plasmodium sp. These findings should allow detailed structural comparisons between the RNA components of the gene expression machinery of the parasite and its vertebrate hosts. PMID:17901154

  14. Turning limited experimental information into 3D models of RNA.

    PubMed

    Flores, Samuel Coulbourn; Altman, Russ B

    2010-09-01

    Our understanding of RNA functions in the cell is evolving rapidly. As for proteins, the detailed three-dimensional (3D) structure of RNA is often key to understanding its function. Although crystallography and nuclear magnetic resonance (NMR) can determine the atomic coordinates of some RNA structures, many 3D structures present technical challenges that make these methods difficult to apply. The great flexibility of RNA, its charged backbone, dearth of specific surface features, and propensity for kinetic traps all conspire with its long folding time, to challenge in silico methods for physics-based folding. On the other hand, base-pairing interactions (either in runs to form helices or isolated tertiary contacts) and motifs are often available from relatively low-cost experiments or informatics analyses. We present RNABuilder, a novel code that uses internal coordinate mechanics to satisfy user-specified base pairing and steric forces under chemical constraints. The code recapitulates the topology and characteristic L-shape of tRNA and obtains an accurate noncrystallographic structure of the Tetrahymena ribozyme P4/P6 domain. The algorithm scales nearly linearly with molecule size, opening the door to the modeling of significantly larger structures.

  15. Steric interactions lead to collective tilting motion in the ribosome during mRNA-tRNA translocation

    NASA Astrophysics Data System (ADS)

    Nguyen, Kien; Whitford, Paul C.

    2016-02-01

    Translocation of mRNA and tRNA through the ribosome is associated with large-scale rearrangements of the head domain in the 30S ribosomal subunit. To elucidate the relationship between 30S head dynamics and mRNA-tRNA displacement, we apply molecular dynamics simulations using an all-atom structure-based model. Here we provide a statistical analysis of 250 spontaneous transitions between the A/P-P/E and P/P-E/E ensembles. Consistent with structural studies, the ribosome samples a chimeric ap/P-pe/E intermediate, where the 30S head is rotated ~18°. It then transiently populates a previously unreported intermediate ensemble, which is characterized by a ~10° tilt of the head. To identify the origins of head tilting, we analyse 781 additional simulations in which specific steric features are perturbed. These calculations show that head tilting may be attributed to specific steric interactions between tRNA and the 30S subunit (PE loop and protein S13). Taken together, this study demonstrates how molecular structure can give rise to large-scale collective rearrangements.

  16. Structure of RDE-4 dsRBDs and mutational studies provide insights into dsRNA recognition in the Caenorhabditis elegans RNAi pathway.

    PubMed

    Chiliveri, Sai Chaitanya; Deshmukh, Mandar V

    2014-02-15

    The association of RDE-4 (RNAi defective 4), a protein containing two dsRBDs (dsRNA-binding domains), with long dsRNA and Dcr-1 (Dicer1 homologue) initiates the siRNA pathway in Caenorhabditis elegans. Unlike its homologues in higher eukaryotes, RDE-4 dsRBDs possess weak (micromolar) affinity for short dsRNA. With increasing length of dsRNA, RDE-4 exhibits enhanced affinity due to co-operativity. The linker and dsRBD2 are indispensable for RDE-4's simultaneous interaction with dsRNA and Dcr-1. In the present study, we have determined the solution structures of RDE-4 constructs that contain both dsRBDs and the linker region. In addition to the canonical dsRBD fold, both dsRBDs of RDE-4 show modified structural features such as truncation in the β1-β2 loop that rationalize RDE-4's relatively weak dsRNA affinity. Structure and binding studies demonstrate that dsRBD2 plays a decisive role in the RDE-4-dsRNA interaction; however, in contrast with previous findings, we found ephemeral interaction of RDE-4 dsRBD1 with dsRNA. More importantly, mutations in two tandem lysine residues (Lys217 and Lys218) in dsRBD2 impair RDE-4's dsRNA-binding ability and could obliterate RNAi initiation in C. elegans. Additionally, we postulate a structural basis for the minimal requirement of linker and dsRBD2 for RDE-4's association with dsRNA and Dcr-1.

  17. R2R - software to speed the depiction of aesthetic consensus RNA secondary structures

    PubMed Central

    2011-01-01

    Background: With continuing identification of novel structured noncoding RNAs, there is an increasing need to create schematic diagrams showing the consensus features of these molecules. RNA structural diagrams are typically made either with general-purpose drawing programs like Adobe Illustrator, or with automated or interactive programs specific to RNA. Unfortunately, the use of applications like Illustrator is extremely time consuming, while existing RNA-specific programs produce figures that are useful, but usually not of the same aesthetic quality as those produced at great cost in Illustrator. Additionally, most existing RNA-specific applications are designed for drawing single RNA molecules, not consensus diagrams. Results: We created R2R, a computer program that facilitates the generation of aesthetic and readable drawings of RNA consensus diagrams in a fraction of the time required with general-purpose drawing programs. Since the inference of a consensus RNA structure typically requires a multiple-sequence alignment, the R2R user annotates the alignment with commands directing the layout and annotation of the RNA. R2R creates SVG or PDF output that can be imported into Adobe Illustrator, Inkscape or CorelDRAW. R2R can be used to create consensus sequence and secondary structure models for novel RNA structures or to revise models when new representatives for known RNA classes become available. Although R2R does not currently have a graphical user interface, it has proven useful in our efforts to create 100 schematic models of distinct noncoding RNA classes. Conclusions: R2R makes it possible to obtain high-quality drawings of the consensus sequence and structural models of many diverse RNA structures with a more practical amount of effort. R2R software is available at http://breaker.research.yale.edu/R2R and as an Additional file. PMID:21205310

  18. Insights into RNA binding by the anticancer drug cisplatin from the crystal structure of cisplatin-modified ribosome

    PubMed Central

    Melnikov, Sergey V.; Söll, Dieter; Steitz, Thomas A.

    2016-01-01

    Cisplatin is a widely prescribed anticancer drug, which triggers cell death by covalent binding to a broad range of biological molecules. Among cisplatin targets, cellular RNAs remain the most poorly characterized molecules. Although cisplatin was shown to inactivate essential RNAs, including ribosomal, spliceosomal and telomeric RNAs, cisplatin binding sites in most RNA molecules are unknown, and therefore it remains challenging to study how modifications of RNA by cisplatin contribute to its toxicity. Here we report a 2.6 Å-resolution X-ray structure of a cisplatin-modified 70S ribosome, which describes cisplatin binding to the ribosome and provides the first nearly atomic model of a cisplatin–RNA complex. We observe nine cisplatin molecules bound to the ribosome and reveal consensus structural features of the cisplatin-binding sites. Two of the cisplatin molecules modify conserved functional centers of the ribosome—the mRNA-channel and the GTPase center. In the mRNA-channel, cisplatin intercalates between the ribosome and the messenger RNA, suggesting that the observed inhibition of protein synthesis by cisplatin is caused by impaired mRNA-translocation. Our structure provides an insight into RNA targeting and inhibition by cisplatin, which can help predict cisplatin-binding sites in other cellular RNAs and design studies to elucidate a link between RNA modifications by cisplatin and cisplatin toxicity. PMID:27079977

  19. La-related protein 1 (LARP1) repression of TOP mRNA translation is mediated through its cap-binding domain and controlled by an adjacent regulatory region

    PubMed Central

    Philippe, Lucas; Vasseur, Jean-Jacques; Debart, Françoise

    2018-01-01

    Cell growth is a complex process shaped by extensive and coordinated changes in gene expression. Among these is the tightly regulated translation of a family of growth-related mRNAs defined by a 5′ terminal oligopyrimidine (TOP) motif. TOP mRNA translation is partly controlled via the eukaryotic initiation factor 4F (eIF4F), a translation factor that recognizes the mRNA 5′ cap structure. Recent studies have also implicated La-related protein 1 (LARP1), which competes with eIF4F for binding to mRNA 5′ ends. However, it has remained controversial whether LARP1 represses TOP mRNA translation directly and, if so, what features define its mRNA targets. Here, we show that the C-terminal half of LARP1 is necessary and sufficient to control TOP mRNA translation in cells. This fragment contains the DM15 cap-binding domain as well as an adjacent regulatory region that we identified. We further demonstrate that purified LARP1 represses TOP mRNA translation in vitro through the combined recognition of both the TOP sequence and cap structure, and that its intrinsic repressive activity and affinity for these features are subject to regulation. These results support a model whereby the translation of TOP mRNAs is controlled by a growth-regulated competition between eIF4F and LARP1 for their 5′ ends. PMID:29244122

  20. Crystal structure of group II intron domain 1 reveals a template for RNA assembly

    DOE PAGES

    Zhao, Chen; Rajashankar, Kanagalaghatta R.; Marcia, Marco; ...

    2015-10-26

    Although the importance of large noncoding RNAs is increasingly appreciated, our understanding of their structures and architectural dynamics remains limited. In particular, we know little about RNA folding intermediates and how they facilitate the productive assembly of RNA tertiary structures. In this paper, we report the crystal structure of an obligate intermediate that is required during the earliest stages of group II intron folding. Composed of domain 1 from the Oceanobacillus iheyensis group II intron (266 nucleotides), this intermediate retains native-like features but adopts a compact conformation in which the active site cleft is closed. Transition between this closed and the open (native) conformation is achieved through discrete rotations of hinge motifs in two regions of the molecule. Finally, the open state is then stabilized by sequential docking of downstream intron domains, suggesting a 'first come, first folded' strategy that may represent a generalizable pathway for assembly of large RNA and ribonucleoprotein structures.

  1. RNA2D3D: a program for generating, viewing, and comparing 3-dimensional models of RNA.

    PubMed

    Martinez, Hugo M; Maizel, Jacob V; Shapiro, Bruce A

    2008-06-01

    Using primary and secondary structure information of an RNA molecule, the program RNA2D3D automatically and rapidly produces a first-order approximation of a 3-dimensional conformation consistent with this information. Applicable to structures of arbitrary branching complexity and pseudoknot content, it features efficient interactive graphical editing for the removal of any overlaps introduced by the initial generating procedure and for making conformational changes favorable to targeted features and subsequent refinement. With emphasis on fast exploration of alternative 3D conformations, one may interactively add or delete base-pairs, adjacent stems can be coaxially stacked or unstacked, single strands can be shaped to accommodate special constraints, and arbitrary subsets can be defined and manipulated as rigid bodies. Compaction, whereby base stacking within stems is optimally extended into connecting single strands, is also available as a means of strategically making the structures more compact and revealing folding motifs. Subsequent refinement of the first-order approximation, of modifications, and for the imposing of tertiary constraints is assisted with standard energy refinement techniques. Previously determined coordinates for any part of the molecule are readily incorporated, and any part of the modeled structure can be output as a PDB or XYZ file. Illustrative applications in the areas of ribozymes, viral kissing loops, viral internal ribosome entry sites, and nanobiology are presented.

  2. CryoEM structures of two spliceosomal complexes: starter and dessert at the spliceosome feast.

    PubMed

    Nguyen, Thi Hoang Duong; Galej, Wojciech P; Fica, Sebastian M; Lin, Pei-Chun; Newman, Andrew J; Nagai, Kiyoshi

    2016-02-01

    The spliceosome is formed on pre-mRNA substrates from five small nuclear ribonucleoprotein particles (U1, U2, U4/U6 and U5 snRNPs), and numerous non-snRNP factors. Saccharomyces cerevisiae U4/U6.U5 tri-snRNP comprises U5 snRNA, U4/U6 snRNA duplex and approximately 30 proteins and represents a substantial part of the spliceosome before activation. Schizosaccharomyces pombe U2.U6.U5 spliceosomal complex is a post-catalytic intron lariat spliceosome containing U2 and U5 snRNPs, NTC (nineteen complex), NTC-related proteins (NTR), U6 snRNA, and an RNA intron lariat. Two recent papers describe near-complete atomic structures of these complexes based on cryoEM single-particle analysis. The U4/U6.U5 tri-snRNP structure provides crucial insight into the activation mechanism of the spliceosome. The U2.U6.U5 complex reveals the striking architecture of NTC and NTR and important features of the group II intron-like catalytic RNA core remaining after spliced mRNA is released. These two structures greatly advance our understanding of the mechanism of pre-mRNA splicing. Copyright © 2016 The Authors. Published by Elsevier Ltd. All rights reserved.

  3. Splicing-Related Features of Introns Serve to Propel Evolution

    PubMed Central

    Luo, Yuping; Li, Chun; Gong, Xi; Wang, Yanlu; Zhang, Kunshan; Cui, Yaru; Sun, Yi Eve; Li, Siguang

    2013-01-01

    The role that spliceosomal intronic structures play in evolution has only begun to be elucidated. Comparative genomic analyses of fungal snoRNA sequences, which are often contained within introns and/or exons, revealed that about one-third of snoRNA-associated introns in three major snoRNA gene clusters manifested polymorphisms, likely resulting from intron loss and gain events during fungal evolution. Genomic deletions can clearly be observed as one mechanism underlying intron and exon loss, as well as generation of complex introns where several introns lie in juxtaposition without intercalating exons. Strikingly, by tracking conserved snoRNAs in introns, we found that some introns had moved from one position to another by excision from donor sites and insertion into target sites elsewhere in the genome without needing transposon structures. This study revealed the origin of many newly gained introns. Moreover, our analyses suggested that intron-containing sequences were more prone to sustainable structural changes than DNA sequences without introns, due to the ability of introns to jump within the genome via unknown mechanisms. We propose that splicing-related structural features of introns serve as an additional motor to propel evolution. PMID:23516505

  4. Protein-RNA interface residue prediction using machine learning: an assessment of the state of the art.

    PubMed

    Walia, Rasna R; Caragea, Cornelia; Lewis, Benjamin A; Towfic, Fadi; Terribilini, Michael; El-Manzalawy, Yasser; Dobbs, Drena; Honavar, Vasant

    2012-05-10

    RNA molecules play diverse functional and structural roles in cells. They function as messengers for transferring genetic information from DNA to proteins, as the primary genetic material in many viruses, as catalysts (ribozymes) important for protein synthesis and RNA processing, and as essential and ubiquitous regulators of gene expression in living organisms. Many of these functions depend on precisely orchestrated interactions between RNA molecules and specific proteins in cells. Understanding the molecular mechanisms by which proteins recognize and bind RNA is essential for comprehending the functional implications of these interactions, but the recognition 'code' that mediates interactions between proteins and RNA is not yet understood. Success in deciphering this code would dramatically impact the development of new therapeutic strategies for intervening in devastating diseases such as AIDS and cancer. Because of the high cost of experimental determination of protein-RNA interfaces, there is an increasing reliance on statistical machine learning methods for training predictors of RNA-binding residues in proteins. However, because of differences in the choice of datasets, performance measures, and data representations used, it has been difficult to obtain an accurate assessment of the current state of the art in protein-RNA interface prediction. We provide a review of published approaches for predicting RNA-binding residues in proteins and a systematic comparison and critical assessment of protein-RNA interface residue predictors trained using these approaches on three carefully curated non-redundant datasets. We directly compare two widely used machine learning algorithms (Naïve Bayes (NB) and Support Vector Machine (SVM)) using three different data representations in which features are encoded using either sequence- or structure-based windows. 
Our results show that (i) Sequence-based classifiers that use a position-specific scoring matrix (PSSM)-based representation (PSSMSeq) outperform those that use an amino acid identity based representation (IDSeq) or a smoothed PSSM (SmoPSSMSeq); (ii) Structure-based classifiers that use smoothed PSSM representation (SmoPSSMStr) outperform those that use PSSM (PSSMStr) as well as sequence identity based representation (IDStr). PSSMSeq classifiers, when tested on an independent test set of 44 proteins, achieve performance that is comparable to that of three state-of-the-art structure-based predictors (including those that exploit geometric features) in terms of Matthews Correlation Coefficient (MCC), although the structure-based methods achieve substantially higher Specificity (albeit at the expense of Sensitivity) compared to sequence-based methods. We also find that the expected performance of the classifiers on a residue level can be markedly different from that on a protein level. Our experiments show that the classifiers trained on three different non-redundant protein-RNA interface datasets achieve comparable cross-validation performance. However, we find that the results are significantly affected by differences in the distance threshold used to define interface residues. Our results demonstrate that protein-RNA interface residue predictors that use a PSSM-based encoding of sequence windows outperform classifiers that use other encodings of sequence windows. While structure-based methods that exploit geometric features can yield significant increases in the Specificity of protein-RNA interface residue predictions, such increases are offset by decreases in Sensitivity. 
These results underscore the importance of comparing alternative methods using rigorous statistical procedures, multiple performance measures, and datasets that are constructed based on several alternative definitions of interface residues and redundancy cutoffs as well as including evaluations on independent test sets into the comparisons.
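
    The performance measures named in the abstract above can be made concrete with a small sketch. This is an illustration only, not the authors' evaluation code: the function names and the toy labels are hypothetical, and Specificity is taken here as TN/(TN+FP). Some interface-prediction papers instead use TP/(TP+FP) under that name, so the sketch reports that quantity separately as precision. It also contrasts a pooled residue-level MCC with a per-protein average, since the abstract notes that the two can differ markedly.

```python
import math


def confusion(y_true, y_pred):
    """Count TP, TN, FP, FN for binary labels (1 = interface residue)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == 1 and p == 1 for t, p in pairs)
    tn = sum(t == 0 and p == 0 for t, p in pairs)
    fp = sum(t == 0 and p == 1 for t, p in pairs)
    fn = sum(t == 1 and p == 0 for t, p in pairs)
    return tp, tn, fp, fn


def _ratio(num, den):
    # Return 0.0 when a measure is undefined (zero denominator).
    return num / den if den else 0.0


def metrics(y_true, y_pred):
    """Sensitivity, specificity, precision and MCC for one label set."""
    tp, tn, fp, fn = confusion(y_true, y_pred)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return {
        "sensitivity": _ratio(tp, tp + fn),
        "specificity": _ratio(tn, tn + fp),  # assumption: TN / (TN + FP)
        "precision": _ratio(tp, tp + fp),
        "mcc": _ratio(tp * tn - fp * fn, denom),
    }


def residue_level_mcc(proteins):
    """Pool the residues of all proteins, then compute a single MCC."""
    all_true = [t for y_true, _ in proteins for t in y_true]
    all_pred = [p for _, y_pred in proteins for p in y_pred]
    return metrics(all_true, all_pred)["mcc"]


def protein_level_mcc(proteins):
    """Compute one MCC per protein, then average over proteins."""
    scores = [metrics(y_true, y_pred)["mcc"] for y_true, y_pred in proteins]
    return sum(scores) / len(scores)


# Hypothetical toy data: (true labels, predicted labels) per protein.
toy = [
    ([1, 1, 0, 0, 1, 0, 0, 0], [1, 0, 0, 0, 1, 1, 0, 0]),
    ([1, 0, 0, 0], [1, 0, 0, 0]),
]
print(metrics(*toy[0]))
print(residue_level_mcc(toy), protein_level_mcc(toy))
```

    On this toy input the pooled and per-protein MCC values differ, which is the kind of gap that makes it worth reporting both levels.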

  5. Nuclear RNA Exosome at 3.1 Å Reveals Substrate Specificities, RNA Paths, and Allosteric Inhibition of Rrp44/Dis3.

    PubMed

    Zinder, John C; Wasmuth, Elizabeth V; Lima, Christopher D

    2016-11-17

    The eukaryotic RNA exosome is an essential and conserved 3'-to-5' exoribonuclease complex that degrades or processes nearly every class of cellular RNA. The nuclear RNA exosome includes a 9-subunit non-catalytic core that binds Rrp44 (Dis3) and Rrp6 subunits to modulate their processive and distributive 3'-to-5' exoribonuclease activities, respectively. Here we utilize an engineered RNA with two 3' ends to obtain a crystal structure of an 11-subunit nuclear exosome bound to RNA at 3.1 Å. The structure reveals an extended RNA path to Rrp6 that penetrates into the non-catalytic core; contacts between the non-catalytic core and Rrp44, which inhibit exoribonuclease activity; and features of the Rrp44 exoribonuclease site that support its ability to degrade 3' phosphate RNA substrates. Using reconstituted exosome complexes, we show that 3' phosphate RNA is not a substrate for Rrp6 but is readily degraded by Rrp44 in the nuclear exosome. Copyright © 2016 Elsevier Inc. All rights reserved.

  6. Assemble: an interactive graphical tool to analyze and build RNA architectures at the 2D and 3D levels.

    PubMed

    Jossinet, Fabrice; Ludwig, Thomas E; Westhof, Eric

    2010-08-15

    Assemble is an intuitive graphical interface to analyze, manipulate and build complex 3D RNA architectures. It provides several advanced and unique features within the framework of a semi-automated modeling process that can be performed by homology and ab initio with or without electron density maps. Those include the interactive editing of a secondary structure and a searchable, embedded library of annotated tertiary structures. Assemble helps users with performing recurrent and otherwise tedious tasks in structural RNA research. Assemble is released under an open-source license (MIT license) and is freely available at http://bioinformatics.org/assemble. It is implemented in the Java language and runs on MacOSX, Linux and Windows operating systems.

  7. Molecular interactions within the halophilic, thermophilic, and mesophilic prokaryotic ribosomal complexes: clues to environmental adaptation.

    PubMed

    Mallik, Saurav; Kundu, Sudip

    2015-01-01

    Using the available crystal structures of 50S ribosomal subunits from three prokaryotic species: Escherichia coli (mesophilic), Thermus thermophilus (thermophilic), and Haloarcula marismortui (halophilic), we have analyzed different structural features of ribosomal RNAs (rRNAs), proteins, and of their interfaces. We have correlated these structural features with the environmental adaptation strategies of the corresponding species. While dense intra-rRNA packing is observed in the thermophile, loose intra-rRNA packing is observed in the halophile (both compared to the mesophile). Interestingly, protein-rRNA interfaces of both extremophiles are densely packed compared to those of the mesophile. The intersubunit bridge regions are almost devoid of cavities, probably ensuring the proper formation of each bridge (by not allowing any loosely packed region nearby). During rRNA binding, the ribosomal proteins experience some structural transitions. Here, we have analyzed the intrinsically disordered and ordered regions of the ribosomal proteins, which are subjected to such transitions. The intrinsically disordered and disorder-to-order transition sites of the thermophilic and mesophilic ribosomal proteins are simultaneously (i) highly conserved and (ii) slowly evolving compared to the rest of the protein structure. Although high conservation is observed at such sites of halophilic ribosomal proteins, a slow rate of evolution is absent. Such differences among the thermophile, mesophile, and halophile can be explained by their environmental adaptation strategies. Interestingly, a universal biophysical principle, evident as a linear relationship between the free energy of interface formation, interface area, and structural changes of r-proteins during assembly, is always maintained, irrespective of the environmental conditions.

  8. Structural similarities and functional differences clarify evolutionary relationships between tRNA healing enzymes and the myelin enzyme CNPase.

    PubMed

    Muruganandam, Gopinath; Raasakka, Arne; Myllykoski, Matti; Kursula, Inari; Kursula, Petri

    2017-05-16

    Eukaryotic tRNA splicing is an essential process in the transformation of a primary tRNA transcript into a mature functional tRNA molecule. 5'-phosphate ligation involves two steps: a healing reaction catalyzed by polynucleotide kinase (PNK) in association with cyclic phosphodiesterase (CPDase), and a sealing reaction catalyzed by an RNA ligase. The enzymes that catalyze tRNA healing in yeast and higher eukaryotes are homologous to the members of the 2H phosphoesterase superfamily, in particular to the vertebrate myelin enzyme 2',3'-cyclic nucleotide 3'-phosphodiesterase (CNPase). We employed different biophysical and biochemical methods to elucidate the overall structural and functional features of the tRNA healing enzymes yeast Trl1 PNK/CPDase and lancelet PNK/CPDase and compared them with vertebrate CNPase. The yeast and the lancelet enzymes have cyclic phosphodiesterase and polynucleotide kinase activity, while vertebrate CNPase lacks PNK activity. In addition, we also show that the healing enzymes are structurally similar to the vertebrate CNPase by applying synchrotron radiation circular dichroism spectroscopy and small-angle X-ray scattering. We provide a structural analysis of the tRNA healing enzyme PNK and CPDase domains together. Our results support evolution of vertebrate CNPase from tRNA healing enzymes with a loss of function at its N-terminal PNK-like domain.

  9. In vivo tmRNA protection by SmpB and pre-ribosome binding conformation in solution.

    PubMed

    Ranaei-Siadat, Ehsan; Mérigoux, Cécile; Seijo, Bili; Ponchon, Luc; Saliou, Jean-Michel; Bernauer, Julie; Sanglier-Cianférani, Sarah; Dardel, Fréderic; Vachette, Patrice; Nonin-Lecomte, Sylvie

    2014-10-01

    TmRNA is an abundant RNA in bacteria with tRNA and mRNA features. It is specialized in trans-translation, a translation rescuing system. We demonstrate that its partner protein SmpB binds the tRNA-like region (TLD) in vivo and chaperones the fold of the TLD-H2 region. We use an original approach combining the observation of tmRNA degradation pathways in a heterologous system, the analysis of the tmRNA digests by MS and NMR, and co-overproduction assays of tmRNA and SmpB. We study the conformation in solution of tmRNA alone or in complex with one SmpB before ribosome binding using SAXS. Our data show that Mg(2+) drives compaction of the RNA structure and that, in the absence of Mg(2+), SmpB has a similar effect albeit to a lesser extent. Our results show that tmRNA is intrinsically structured in solution with identical topology to that observed on complexes on ribosomes which should facilitate its subsequent recruitment by the 70S ribosome, free or preloaded with one SmpB molecule. © 2014 Ranaei-Siadat et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  10. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Januszyk, Kurt; Liu, Quansheng; Lima, Christopher D.

    The eukaryotic RNA exosome is a highly conserved multi-subunit complex that catalyzes degradation and processing of coding and noncoding RNA. A noncatalytic nine-subunit exosome core interacts with Rrp44 and Rrp6, two subunits that possess processive and distributive 3'-to-5' exoribonuclease activity, respectively. While both Rrp6 and Rrp44 are responsible for RNA processing in budding yeast, Rrp6 may play a more prominent role in processing, as it has been demonstrated to be inhibited by stable RNA secondary structure in vitro and because the null allele in budding yeast leads to the buildup of specific structured RNA substrates. Human RRP6, otherwise known as PM/SCL-100 or EXOSC10, shares sequence similarity to budding yeast Rrp6 and is proposed to catalyze 3'-to-5' exoribonuclease activity on a variety of nuclear transcripts including ribosomal RNA subunits, RNA that has been poly-adenylated by TRAMP, as well as other nuclear RNA transcripts destined for processing and/or destruction. To characterize human RRP6, we expressed the full-length enzyme as well as truncation mutants that retain catalytic activity, compared their activities to analogous constructs for Saccharomyces cerevisiae Rrp6, and determined the X-ray structure of a human construct containing the exoribonuclease and HRDC domains that retains catalytic activity. Structural data show that the human active site is more exposed when compared to the yeast structure, and biochemical data suggest that this feature may play a role in the ability of human RRP6 to productively engage and degrade structured RNA substrates more effectively than the analogous budding yeast enzyme.

  11. Protein structure and the sequential structure of mRNA: alpha-helix and beta-sheet signals at the nucleotide level.

    PubMed

    Brunak, S; Engelbrecht, J

    1996-06-01

    A direct comparison of experimentally determined protein structures and their corresponding protein coding mRNA sequences has been performed. We examine whether real world data support the hypothesis that clusters of rare codons correlate with the location of structural units in the resulting protein. The degeneracy of the genetic code allows for a biased selection of codons which may control the translational rate of the ribosome, and may thus in vivo have a catalyzing effect on the folding of the polypeptide chain. A complete search for GenBank nucleotide sequences coding for structural entries in the Brookhaven Protein Data Bank produced 719 protein chains with matching mRNA sequence, amino acid sequence, and secondary structure assignment. By neural network analysis, we found strong signals in mRNA sequence regions surrounding helices and sheets. These signals do not originate from the clustering of rare codons, but from the similarity of codons coding for very abundant amino acid residues at the N- and C-termini of helices and sheets. No correlation between the positioning of rare codons and the location of structural units was found. The mRNA signals were also compared with conserved nucleotide features of 16S-like ribosomal RNA sequences and related to mechanisms for maintaining the correct reading frame by the ribosome.

  12. Recognizing the enemy within: licensing RNA-guided genome defense

    PubMed Central

    Dumesic, Phillip A.; Madhani, Hiten D.

    2014-01-01

    How do cells distinguish normal genes from transposons? Although much has been learned about RNAi-related RNA silencing pathways responsible for genome defense, this fundamental question remains. The literature points to several classes of mechanisms. In some cases, double-stranded RNA structures produced by transposon inverted repeats or antisense integration trigger endo-siRNA biogenesis. In other instances, DNA features associated with transposons—such as their unusual copy number, chromosomal arrangement, and/or chromatin environment—license RNA silencing. Finally, recent studies have identified improper transcript processing events, such as stalled pre-mRNA splicing, as signals for siRNA production. Thus, the suboptimal gene expression properties of selfish elements can enable their identification by RNA silencing pathways. PMID:24280023

  13. Revealing the distinct folding phases of an RNA three-helix junction.

    PubMed

    Plumridge, Alex; Katz, Andrea M; Calvey, George D; Elber, Ron; Kirmizialtin, Serdal; Pollack, Lois

    2018-05-14

    Remarkable new insight has emerged into the biological role of RNA in cells. RNA folding and dynamics enable many of these newly discovered functions, calling for an understanding of RNA self-assembly and conformational dynamics. Because RNAs pass through multiple structures as they fold, an ensemble perspective is required to visualize the flow through fleetingly populated sets of states. Here, we combine microfluidic mixing technology and small angle X-ray scattering (SAXS) to measure the Mg-induced folding of a small RNA domain, the tP5abc three helix junction. Our measurements are interpreted using ensemble optimization to select atomically detailed structures that recapitulate each experimental curve. Structural ensembles, derived at key stages in both time-resolved studies and equilibrium titrations, reproduce the features of known intermediates, and more importantly, offer a powerful new structural perspective on the time-progression of folding. Distinct collapse phases along the pathway appear to be orchestrated by specific interactions with Mg ions. These key interactions subsequently direct motions of the backbone that position the partners of tertiary contacts for later bonding, and demonstrate a remarkable synergy between Mg and RNA across numerous time-scales.

  14. Initiation of translation in bacteria by a structured eukaryotic IRES RNA.

    PubMed

    Colussi, Timothy M; Costantino, David A; Zhu, Jianyu; Donohue, John Paul; Korostelev, Andrei A; Jaafar, Zane A; Plank, Terra-Dawn M; Noller, Harry F; Kieft, Jeffrey S

    2015-03-05

    The central dogma of gene expression (DNA to RNA to protein) is universal, but in different domains of life there are fundamental mechanistic differences within this pathway. For example, the canonical molecular signals used to initiate protein synthesis in bacteria and eukaryotes are mutually exclusive. However, the core structures and conformational dynamics of ribosomes that are responsible for the translation steps that take place after initiation are ancient and conserved across the domains of life. We wanted to explore whether an undiscovered RNA-based signal might be able to use these conserved features, bypassing mechanisms specific to each domain of life, and initiate protein synthesis in both bacteria and eukaryotes. Although structured internal ribosome entry site (IRES) RNAs can manipulate ribosomes to initiate translation in eukaryotic cells, an analogous RNA structure-based mechanism has not been observed in bacteria. Here we report our discovery that a eukaryotic viral IRES can initiate translation in live bacteria. We solved the crystal structure of this IRES bound to a bacterial ribosome to 3.8 Å resolution, revealing that despite differences between bacterial and eukaryotic ribosomes this IRES binds directly to both and occupies the space normally used by transfer RNAs. Initiation in both bacteria and eukaryotes depends on the structure of the IRES RNA, but in bacteria this RNA uses a different mechanism that includes a form of ribosome repositioning after initial recruitment. This IRES RNA bridges billions of years of evolutionary divergence and provides an example of an RNA structure-based translation initiation signal capable of operating in two domains of life.

  15. Structure of an Rrp6-RNA exosome complex bound to poly(A) RNA

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wasmuth, Elizabeth V.; Januszyk, Kurt; Lima, Christopher D.

    The eukaryotic RNA exosome processes and degrades RNA by directing substrates to the distributive or processive 3' to 5' exoribonuclease activities of Rrp6 or Rrp44, respectively. The non-catalytic nine-subunit exosome core (Exo9) features a prominent central channel. Although RNA can pass through the channel to engage Rrp44, it is not clear how RNA is directed to Rrp6 or whether Rrp6 uses the central channel. Here we report a 3.3 Å crystal structure of a ten-subunit RNA exosome complex from Saccharomyces cerevisiae composed of the Exo9 core and Rrp6 bound to single-stranded poly(A) RNA. The Rrp6 catalytic domain rests on top of the Exo9 S1/KH ring above the central channel, the RNA 3' end is anchored in the Rrp6 active site, and the remaining RNA traverses the S1/KH ring in an opposite orientation to that observed in a structure of a Rrp44-containing exosome complex. Solution studies with human and yeast RNA exosome complexes suggest that the RNA path to Rrp6 is conserved and dependent on the integrity of the S1/KH ring. Although path selection to Rrp6 or Rrp44 is stochastic in vitro, the fate of a particular RNA may be determined in vivo by the manner in which cofactors present RNA to the RNA exosome.

  16. Packaging of Mason-Pfizer monkey virus (MPMV) genomic RNA depends upon conserved long-range interactions (LRIs) between U5 and gag sequences

    PubMed Central

    Kalloush, Rawan M.; Vivet-Boudou, Valérie; Ali, Lizna M.; Mustafa, Farah; Marquet, Roland; Rizvi, Tahir A.

    2016-01-01

    MPMV has great potential for development as a vector for gene therapy. In this respect, precisely defining the sequences and structural motifs that are important for dimerization and packaging of its genomic RNA (gRNA) is of utmost importance. A distinguishing feature of the MPMV gRNA packaging signal is two phylogenetically conserved long-range interactions (LRIs) between U5 and gag complementary sequences, LRI-I and LRI-II. To test their biological significance in the MPMV life cycle, we introduced mutations into these structural motifs and tested their effects on MPMV gRNA packaging and propagation. Furthermore, we probed the structure of key mutants using SHAPE (selective 2′-hydroxyl acylation analyzed by primer extension). Disrupting base-pairing of the LRIs affected gRNA packaging and propagation, demonstrating their significance to the MPMV life cycle. A double mutant restoring a heterologous LRI-I was fully functional, whereas a similar LRI-II mutant failed to restore gRNA packaging and propagation. These results demonstrate that while LRI-I acts at the structural level, maintaining base-pairing is not sufficient for LRI-II function. In addition, in vitro RNA dimerization assays indicated that the loss of RNA packaging in LRI mutants could not be attributed to defects in dimerization. Our findings suggest that U5-gag LRIs play an important architectural role in maintaining the structure of the 5′ region of the MPMV gRNA, expanding the crucial role of LRIs to the nonlentiviral group of retroviruses. PMID:27095024

  17. 3′ Cap-Independent Translation Enhancers of Plant Viruses

    PubMed Central

    Simon, Anne E.; Miller, W. Allen

    2014-01-01

    In the absence of a 5′ cap, plant positive-strand RNA viruses have evolved a number of different elements in their 3′ untranslated region (UTR) to attract initiation factors and/or ribosomes to their templates. These 3′ cap-independent translational enhancers (3′ CITEs) take different forms, such as I-shaped, Y-shaped, T-shaped, or pseudoknotted structures, or radiate multiple helices from a central hub. Common features of most 3′ CITEs include the ability to bind a component of the translation initiation factor eIF4F complex and to engage in an RNA-RNA kissing-loop interaction with a hairpin loop located at the 5′ end of the RNA. The two T-shaped structures can bind to ribosomes and ribosomal subunits, with one structure also able to engage in a simultaneous long-distance RNA-RNA interaction. Several of these 3′ CITEs are interchangeable and there is evidence that natural recombination allows exchange of modular CITE units, which may overcome genetic resistance or extend the virus’s host range. PMID:23682606

  18. Noncanonical signal recognition particle RNAs in a major eukaryotic phylum revealed by purification of SRP from the human pathogen Cryptococcus neoformans

    PubMed Central

    Dumesic, Phillip A.; Rosenblad, Magnus A.; Samuelsson, Tore; Nguyen, Tiffany; Moresco, James J.; Yates, John R.; Madhani, Hiten D.

    2015-01-01

    Despite conservation of the signal recognition particle (SRP) from bacteria to man, computational approaches have failed to identify SRP components from genomes of many lower eukaryotes, raising the possibility that they have been lost or altered in those lineages. We report purification and analysis of SRP in the human pathogen Cryptococcus neoformans, providing the first description of SRP in basidiomycetous yeast. The C. neoformans SRP RNA displays a predicted structure in which the universally conserved helix 8 contains an unprecedented stem-loop insertion. Guided by this sequence, we computationally identified 152 SRP RNAs throughout the phylum Basidiomycota. This analysis revealed additional helix 8 alterations including single and double stem-loop insertions as well as loop diminutions affecting RNA structural elements that are otherwise conserved from bacteria to man. Strikingly, these SRP RNA features in Basidiomycota are accompanied by phylum-specific alterations in the RNA-binding domain of Srp54, the SRP protein subunit that directly interacts with helix 8. Our findings reveal unexpected fungal SRP diversity and suggest coevolution of the two most conserved SRP features—SRP RNA helix 8 and Srp54—in basidiomycetes. Because members of this phylum include important human and plant pathogens, these noncanonical features provide new targets for antifungal compound development. PMID:26275773

  19. In vivo tmRNA protection by SmpB and pre-ribosome binding conformation in solution

    PubMed Central

    Ranaei-Siadat, Ehsan; Mérigoux, Cécile; Seijo, Bili; Ponchon, Luc; Saliou, Jean-Michel; Bernauer, Julie; Sanglier-Cianférani, Sarah; Dardel, Fréderic

    2014-01-01

    TmRNA is an abundant RNA in bacteria with tRNA and mRNA features. It is specialized in trans-translation, a translation rescuing system. We demonstrate that its partner protein SmpB binds the tRNA-like region (TLD) in vivo and chaperones the fold of the TLD-H2 region. We use an original approach combining the observation of tmRNA degradation pathways in a heterologous system, the analysis of the tmRNA digests by MS and NMR, and co-overproduction assays of tmRNA and SmpB. We study the conformation in solution of tmRNA alone or in complex with one SmpB before ribosome binding using SAXS. Our data show that Mg2+ drives compaction of the RNA structure and that, in the absence of Mg2+, SmpB has a similar effect albeit to a lesser extent. Our results show that tmRNA is intrinsically structured in solution with identical topology to that observed on complexes on ribosomes which should facilitate its subsequent recruitment by the 70S ribosome, free or preloaded with one SmpB molecule. PMID:25135523

  20. 3dRPC: a web server for 3D RNA-protein structure prediction.

    PubMed

    Huang, Yangyu; Li, Haotian; Xiao, Yi

    2018-04-01

    RNA-protein interactions occur in many biological processes. To understand the mechanism of these interactions one needs to know the three-dimensional (3D) structures of RNA-protein complexes. 3dRPC is an algorithm for prediction of 3D RNA-protein complex structures and consists of a docking algorithm, RPDOCK, and a scoring function, 3dRPC-Score. RPDOCK is used to sample possible complex conformations of an RNA and a protein by calculating the geometric and electrostatic complementarities and stacking interactions at the RNA-protein interface according to the features of atom packing of the interface. 3dRPC-Score is a knowledge-based potential that uses the conformations of nucleotide-amino-acid pairs as statistical variables and that is used to choose the near-native complex conformations obtained from the docking method above. Recently, we built a web server for 3dRPC. Users can easily use 3dRPC without installing it locally. RNA and protein structures in PDB (Protein Data Bank) format are the only input files needed. It can also incorporate information on interface residues or residue pairs obtained from experiments or theoretical predictions to improve the prediction. The 3dRPC web server is available at http://biophy.hust.edu.cn/3dRPC. Contact: yxiao@hust.edu.cn.

  1. TBI server: a web server for predicting ion effects in RNA folding.

    PubMed

    Zhu, Yuhong; He, Zhaojian; Chen, Shi-Jie

    2015-01-01

    Metal ions play a critical role in the stabilization of RNA structures. Therefore, accurate prediction of the ion effects in RNA folding can have a far-reaching impact on our understanding of RNA structure and function. Multivalent ions, especially Mg²⁺, are essential for RNA tertiary structure formation. These ions can possibly become strongly correlated in the close vicinity of RNA surface. Most of the currently available software packages, which have widespread success in predicting ion effects in biomolecular systems, however, do not explicitly account for the ion correlation effect. Therefore, it is important to develop a software package/web server for the prediction of ion electrostatics in RNA folding by including ion correlation effects. The TBI web server http://rna.physics.missouri.edu/tbi_index.html provides predictions for the total electrostatic free energy, the different free energy components, and the mean number and the most probable distributions of the bound ions. A novel feature of the TBI server is its ability to account for ion correlation and ion distribution fluctuation effects. By accounting for the ion correlation and fluctuation effects, the TBI server is a unique online tool for computing ion-mediated electrostatic properties for given RNA structures. The results can provide important data for in-depth analysis for ion effects in RNA folding including the ion-dependence of folding stability, ion uptake in the folding process, and the interplay between the different energetic components.

  2. RiboSketch: Versatile Visualization of Multi-stranded RNA and DNA Secondary Structure.

    PubMed

    Lu, Jacob S; Bindewald, Eckart; Kasprzak, Wojciech; Shapiro, Bruce A

    2018-06-15

    Creating clear, visually pleasing 2D depictions of RNA and DNA strands and their interactions is important to facilitate and communicate insights related to nucleic acid structure. Here we present RiboSketch, a secondary structure image production application that enables the visualization of multistranded structures via layout algorithms, comprehensive editing capabilities, and a multitude of simulation modes. These interactive features allow RiboSketch to create publication quality diagrams for structures with a wide range of composition, size, and complexity. The program may be run in any web browser without the need for installation, or as a standalone Java application. https://binkley2.ncifcrf.gov/users/bindewae/ribosketch_web.

  3. Specific binding of a HeLa cell nuclear protein to RNA sequences in the human immunodeficiency virus transactivating region.

    PubMed Central

    Gaynor, R; Soultanakis, E; Kuwabara, M; Garcia, J; Sigman, D S

    1989-01-01

    The transactivator protein, tat, encoded by the human immunodeficiency virus is a key regulator of viral transcription. Activation by the tat protein requires sequences downstream of the transcription initiation site called the transactivating region (TAR). RNA derived from the TAR is capable of forming a stable stem-loop structure and the maintenance of both the stem structure and the loop sequences located between +19 and +44 is required for complete in vivo activation by tat. Gel retardation assays with RNA from both wild-type and mutant TAR constructs generated in vitro with SP6 polymerase indicated specific binding of HeLa nuclear proteins to the TAR. To characterize this RNA-protein interaction, a method of chemical "imprinting" has been developed using photoactivated uranyl acetate as the nucleolytic agent. This reagent nicks RNA under physiological conditions at all four nucleotides in a reaction that is independent of sequence and secondary structure. Specific interaction of cellular proteins with TAR RNA could be detected by enhanced cleavages or imprints surrounding the loop region. Mutations that either disrupted stem base-pairing or extensively changed the primary sequence resulted in alterations in the cleavage pattern of the TAR RNA. Structural features of the TAR RNA stem-loop essential for tat activation are also required for specific binding of the HeLa cell nuclear protein. PMID:2544877

  4. Packaging of Mason-Pfizer monkey virus (MPMV) genomic RNA depends upon conserved long-range interactions (LRIs) between U5 and gag sequences.

    PubMed

    Kalloush, Rawan M; Vivet-Boudou, Valérie; Ali, Lizna M; Mustafa, Farah; Marquet, Roland; Rizvi, Tahir A

    2016-06-01

    MPMV has great potential for development as a vector for gene therapy. In this respect, precisely defining the sequences and structural motifs that are important for dimerization and packaging of its genomic RNA (gRNA) is of utmost importance. A distinguishing feature of the MPMV gRNA packaging signal is two phylogenetically conserved long-range interactions (LRIs) between U5 and gag complementary sequences, LRI-I and LRI-II. To test their biological significance in the MPMV life cycle, we introduced mutations into these structural motifs and tested their effects on MPMV gRNA packaging and propagation. Furthermore, we probed the structure of key mutants using SHAPE (selective 2'-hydroxyl acylation analyzed by primer extension). Disrupting base-pairing of the LRIs affected gRNA packaging and propagation, demonstrating their significance to the MPMV life cycle. A double mutant restoring a heterologous LRI-I was fully functional, whereas a similar LRI-II mutant failed to restore gRNA packaging and propagation. These results demonstrate that while LRI-I acts at the structural level, maintaining base-pairing is not sufficient for LRI-II function. In addition, in vitro RNA dimerization assays indicated that the loss of RNA packaging in LRI mutants could not be attributed to defects in dimerization. Our findings suggest that U5-gag LRIs play an important architectural role in maintaining the structure of the 5' region of the MPMV gRNA, expanding the crucial role of LRIs to the nonlentiviral group of retroviruses. © 2016 Kalloush et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  5. Nucleic Acid Engineering: RNA Following the Trail of DNA.

    PubMed

    Kim, Hyejin; Park, Yongkuk; Kim, Jieun; Jeong, Jaepil; Han, Sangwoo; Lee, Jae Sung; Lee, Jong Bum

    2016-02-08

    The self-assembly feature of the naturally occurring biopolymer, DNA, has fascinated researchers in the fields of materials science and bioengineering. With the improved understanding of the chemical and structural nature of DNA, DNA-based constructs have been designed and fabricated from two-dimensional arbitrary shapes to reconfigurable three-dimensional nanodevices. Although DNA has been used successfully as a building block in a finely organized and controlled manner, its applications need to be explored. Hence, with the myriad of biological functions, RNA has recently attracted considerable attention to further the application of nucleic acid-based structures. This Review categorizes different approaches of engineering nucleic acid-based structures and introduces the concepts, principles, and applications of each technique, focusing on how DNA engineering is applied as a guide to RNA engineering.

  6. RNA Nanotechnology: Engineering, Assembly and Applications in Detection, Gene Delivery and Therapy

    PubMed Central

    Guo, Peixuan

    2010-01-01

    Biological macromolecules including DNA, RNA, and proteins, have intrinsic features that make them potential building blocks for the bottom-up fabrication of nanodevices. RNA is unique in nanoscale fabrication due to its amazing diversity of function and structure. RNA molecules can be designed and manipulated with a level of simplicity characteristic of DNA while possessing versatility in structure and function similar to that of proteins. RNA molecules typically contain a large variety of single stranded loops suitable for inter- and intra-molecular interaction. These loops can serve as mounting dovetails obviating the need for external linking dowels in fabrication and assembly. The self-assembly of nanoparticles from RNA involves cooperative interaction of individual RNA molecules that spontaneously assemble in a predefined manner to form a larger two- or three-dimensional structure. Within the realm of self-assembly there are two main categories, namely template and non-template. Template assembly involves interaction of RNA molecules under the influence of specific external sequence, forces, or spatial constraints such as RNA transcription, hybridization, replication, annealing, molding, or replicas. In contrast, non-template assembly involves formation of a larger structure by individual components without the influence of external forces. Examples of non-template assembly are ligation, chemical conjugation, covalent linkage, and loop/loop interaction of RNA, especially the formation of RNA multimeric complexes. The best characterized RNA multiplier and the first to be described in RNA nanotechnological application is the motor pRNA of bacteriophage phi29 which form dimers, trimers, and hexamers, via hand-in-hand interaction. phi29 pRNA can be redesigned to form a variety of structures and shapes including twins, tetramers, rods, triangles, and 3D arrays several microns in size via interaction of programmed helical regions and loops. 
3D RNA array formation requires a defined nucleotide number for twisting and a palindromic sequence. Such arrays are unusually stable and resistant to a wide range of temperatures, salt concentrations, and pH. Both the therapeutic siRNA or ribozyme and a receptor-binding RNA aptamer or other ligands have been engineered into individual pRNAs. Individual chimeric RNA building blocks harboring siRNA or other therapeutic molecules have been fabricated subsequently into a trimer through hand-in-hand interaction of the engineered right and left interlocking RNA loops. The incubation of these particles containing the receptor-binding aptamer or other ligands results in the binding and co-entry of trivalent therapeutic particles into cells. Such particles were subsequently shown to modulate the apoptosis of cancer cells in both cell cultures and animal trials. The use of such antigen-free 20–40 nm particles holds promise for the repeated long-term treatment of chronic diseases. Other potentially useful RNA molecules that form multimers include HIV RNA that contain kissing loop to form dimers, tecto-RNA that forms a “jigsaw puzzle,” and the Drosophila bicoid mRNA that forms multimers via “hand-by-arm” interactions. Applications of RNA molecules involving replication, molding, embossing, and other related techniques, have recently been described that allow the utilization of a variety of materials to enhance diversity and resolution of nanomaterials. It should eventually be possible to adapt RNA to facilitate construction of ordered, patterned, or pre-programmed arrays or superstructures. Given the potential for 3D fabrication, the chance to produce reversible self-assembly, and the ability of self-repair, editing and replication, RNA self-assembly will play an increasingly significant role in integrated biological nanofabrication. 
A random 100-nucleotide RNA library may exist in 1.6 × 10^60 varieties with multifarious structure to serve as a vital system for efficient fabrication, with a complexity and diversity far exceeding that of any current nanoscale system. This review covers the basic concepts of RNA structure and function, certain methods for the study of RNA structure, the approaches for engineering or fabricating RNA into nanoparticles or arrays, and special features of RNA molecules that form multimers. The most recent development in exploration of RNA nanoparticles for pathogen detection, drug/gene delivery, and therapeutic application is also introduced in this review. PMID:16430131
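
    The library-size figure quoted in the abstract above can be checked with a short calculation. The sketch below is illustrative only and is not part of the cited review; it assumes four possible nucleotides (A, C, G, U) at each of the 100 positions, and the function name `sequence_space` is hypothetical.

```python
# Size of the sequence space for a random RNA of a given length.
# Assumption: each position is independently one of 4 nucleotides (A, C, G, U).
def sequence_space(length: int, alphabet_size: int = 4) -> int:
    """Return the number of distinct sequences of the given length."""
    return alphabet_size ** length


n_sequences = sequence_space(100)

# Print in scientific notation with one decimal place.
print(f"{float(n_sequences):.1e}")
```

    Under that assumption the count is 4^100, which is on the order of 10^60 and agrees with the value given in the abstract.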

  7. High salt solution structure of a left-handed RNA double helix

    PubMed Central

    Popenda, Mariusz; Milecki, Jan; Adamiak, Ryszard W.

    2004-01-01

    Right-handed RNA duplexes of (CG)n sequence undergo salt-induced helicity reversal, forming left-handed RNA double helices (Z-RNA). In contrast to the thoroughly studied Z-DNA, no Z-RNA structure of natural origin is known. Here we report the NMR structure of a half-turn, left-handed RNA helix (CGCGCG)2 determined in 6 M NaClO4. This is the first nucleic acid motif determined at such high salt. Sequential assignments of non-exchangeable proton resonances of the Z-form were based on the hitherto unreported NOE connectivity path [H6(n)-H5′/H5″(n)-H8(n+1)-H1′(n+1)-H6(n+2)] found for left-handed helices. Z-RNA structure shows several conformational features significantly different from Z-DNA. Intra-strand but no inter-strand base stacking was observed for both CpG and GpC steps. Helical twist angles for CpG steps have small positive values (4–7°), whereas GpC steps have large negative values (−61°). In the full-turn model of Z-RNA (12.4 bp per turn), base pairs are much closer to the helix axis than in Z-DNA, thus both the very deep, narrow minor groove with buried cytidine 2′-OH groups, and the major groove are well defined. The 2′-OH group of cytidines plays a crucial role in the Z-RNA structure and its formation; 2′-O-methylation of cytidine, but not of guanosine residues prohibits A to Z helicity reversal. PMID:15292450

  8. Crystal structure and RNA-binding properties of an Hfq homolog from the deep-branching Aquificae: conservation of the lateral RNA-binding mode

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Stanek, Kimberly A.; Patterson-West, Jennifer; Randolph, Peter S.

    The host factor Hfq, as the bacterial branch of the Sm family, is an RNA-binding protein involved in the post-transcriptional regulation of mRNA expression and turnover. Hfq facilitates pairing between small regulatory RNAs (sRNAs) and their corresponding mRNA targets by binding both RNAs and bringing them into close proximity. Hfq homologs self-assemble into homo-hexameric rings with at least two distinct surfaces that bind RNA. Recently, another binding site, dubbed the 'lateral rim', has been implicated in sRNA·mRNA annealing; the RNA-binding properties of this site appear to be rather subtle, and its degree of evolutionary conservation is unknown. An Hfq homolog has been identified in the phylogenetically deep-branching thermophile Aquifex aeolicus (Aae), but little is known about the structure and function of Hfq from basal bacterial lineages such as the Aquificae. Therefore, Aae Hfq was cloned, overexpressed, purified, crystallized and biochemically characterized. Structures of Aae Hfq were determined in space groups P1 and P6, both to 1.5 Å resolution, and nanomolar-scale binding affinities for uridine- and adenosine-rich RNAs were discovered. Co-crystallization with U6 RNA reveals that the outer rim of the Aae Hfq hexamer features a well defined binding pocket that is selective for uracil. This Aae Hfq structure, combined with biochemical and biophysical characterization of the homolog, reveals deep evolutionary conservation of the lateral RNA-binding mode, and lays a foundation for further studies of Hfq-associated RNA biology in ancient bacterial phyla.

  9. A novel RNA binding surface of the TAM domain of TIP5/BAZ2A mediates epigenetic regulation of rRNA genes.

    PubMed

    Anosova, Irina; Melnik, Svitlana; Tripsianes, Konstantinos; Kateb, Fatiha; Grummt, Ingrid; Sattler, Michael

    2015-05-26

    The chromatin remodeling complex NoRC, comprising the subunits SNF2h and TIP5/BAZ2A, mediates heterochromatin formation at major clusters of repetitive elements, including rRNA genes, centromeres and telomeres. Association with chromatin requires the interaction of the TAM (TIP5/ARBP/MBD) domain of TIP5 with noncoding RNA, which targets NoRC to specific genomic loci. Here, we show that the NMR structure of the TAM domain of TIP5 resembles the fold of the MBD domain, found in methyl-CpG binding proteins. However, the TAM domain exhibits an extended MBD fold with unique C-terminal extensions that constitute a novel surface for RNA binding. Mutation of critical amino acids within this surface abolishes RNA binding in vitro and in vivo. Our results explain the distinct binding specificities of TAM and MBD domains to RNA and methylated DNA, respectively, and reveal structural features for the interaction of NoRC with non-coding RNA. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  10. Non-encapsidation Activities of the Capsid Proteins of Positive-strand RNA Viruses

    PubMed Central

    Ni, Peng; Kao, C. Cheng

    2013-01-01

    Viral capsid proteins (CPs) are characterized by their role in forming protective shells around viral genomes. However, CPs have additional and important roles in the virus infection cycles and in the cellular response to infection. These activities involve CP binding to RNAs in both sequence-specific and nonspecific manners as well as association with other proteins. This review focuses on CPs of both plant and animal-infecting viruses with positive-strand RNA genomes. We summarize the structural features of CPs and describe their modulatory roles in viral translation, RNA-dependent RNA synthesis, and host defense responses. PMID:24074574

  11. PRince: a web server for structural and physicochemical analysis of protein-RNA interface.

    PubMed

    Barik, Amita; Mishra, Abhishek; Bahadur, Ranjit Prasad

    2012-07-01

    We have developed a web server, PRince, which analyzes the structural features and physicochemical properties of the protein-RNA interface. Users need to submit a PDB file containing the atomic coordinates of both the protein and the RNA molecules in complex form (in '.pdb' format). They should also mention the chain identifiers of interacting protein and RNA molecules. The size of the protein-RNA interface is estimated by measuring the solvent accessible surface area buried in contact. For a given protein-RNA complex, PRince calculates structural, physicochemical and hydration properties of the interacting surfaces. All these parameters generated by the server are presented in a tabular format. The interacting surfaces can also be visualized with software plug-in like Jmol. In addition, the output files containing the list of the atomic coordinates of the interacting protein, RNA and interface water molecules can be downloaded. The parameters generated by PRince are novel, and users can correlate them with the experimentally determined biophysical and biochemical parameters for better understanding the specificity of the protein-RNA recognition process. This server will be continuously upgraded to include more parameters. PRince is publicly accessible and free for use. Available at http://www.facweb.iitkgp.ernet.in/~rbahadur/prince/home.html.

  12. Functional implications from the Cid1 poly(U) polymerase crystal structure.

    PubMed

    Munoz-Tello, Paola; Gabus, Caroline; Thore, Stéphane

    2012-06-06

    In eukaryotes, mRNA degradation begins with poly(A) tail removal, followed by decapping, and the mRNA body is degraded by exonucleases. In recent years, the major influence of 3'-end uridylation as a regulatory step within several RNA degradation pathways has generated significant attention toward the responsible enzymes, which are called poly(U) polymerases (PUPs). We determined the atomic structure of the Cid1 protein, the founding member of the PUP family, in its UTP-bound form, allowing unambiguous positioning of the UTP molecule. Our data also suggest that the RNA substrate accommodation and product translocation by the Cid1 protein rely on local and global movements of the enzyme. Supplemented by point mutations, the atomic model is used to propose a catalytic cycle. Our study underlines the Cid1 RNA binding properties, a feature with critical implications for miRNAs, histone mRNAs, and, more generally, cellular RNA degradation. Copyright © 2012 Elsevier Ltd. All rights reserved.

  13. The crystal structure of Zika virus NS5 reveals conserved drug targets.

    PubMed

    Duan, Wenqian; Song, Hao; Wang, Haiyuan; Chai, Yan; Su, Chao; Qi, Jianxun; Shi, Yi; Gao, George F

    2017-04-03

    Zika virus (ZIKV) has emerged as a major health concern, as ZIKV infection has been shown to be associated with microcephaly, severe neurological disease and possibly male sterility. As the largest protein component within the ZIKV replication complex, NS5 plays key roles in the life cycle and survival of the virus through its N-terminal methyltransferase (MTase) and C-terminal RNA-dependent RNA polymerase (RdRp) domains. Here, we present the crystal structures of ZIKV NS5 MTase in complex with an RNA cap analogue (m7GpppA) and the free NS5 RdRp. We have identified the conserved features of ZIKV NS5 MTase and RdRp structures that could lead to development of current antiviral inhibitors being used against flaviviruses, including dengue virus and West Nile virus, to treat ZIKV infection. These results should inform and accelerate the structure-based design of antiviral compounds against ZIKV. © 2017 The Authors.

  14. A Sequence-Independent, Unstructured Internal Ribosome Entry Site Is Responsible for Internal Expression of the Coat Protein of Turnip Crinkle Virus

    PubMed Central

    May, Jared; Johnson, Philip; Saleem, Huma

    2017-01-01

    ABSTRACT To maximize the coding potential of viral genomes, internal ribosome entry sites (IRES) can be used to bypass the traditional requirement of a 5′ cap and some/all of the associated translation initiation factors. Although viral IRES typically contain higher-order RNA structure, an unstructured sequence of about 84 nucleotides (nt) immediately upstream of the Turnip crinkle virus (TCV) coat protein (CP) open reading frame (ORF) has been found to promote internal expression of the CP from the genomic RNA (gRNA) both in vitro and in vivo. An absence of extensive RNA structure was predicted using RNA folding algorithms and confirmed by selective 2′-hydroxyl acylation analyzed by primer extension (SHAPE) RNA structure probing. Analysis of the IRES region in vitro by use of both the TCV gRNA and reporter constructs did not reveal any sequence-specific elements but rather suggested that an overall lack of structure was an important feature for IRES activity. The CP IRES is A-rich, independent of orientation, and strongly conserved among viruses in the same genus. The IRES was dependent on eIF4G, but not eIF4E, for activity. Low levels of CP accumulated in vivo in the absence of detectable TCV subgenomic RNAs, strongly suggesting that the IRES was active in the gRNA in vivo. Since the TCV CP also serves as the viral silencing suppressor, early translation of the CP from the viral gRNA is likely important for countering host defenses. Cellular mRNA IRES also lack extensive RNA structures or sequence conservation, suggesting that this viral IRES and cellular IRES may have similar strategies for internal translation initiation. IMPORTANCE Cap-independent translation is a common strategy among positive-sense, single-stranded RNA viruses for bypassing the host cell requirement of a 5′ cap structure. Viral IRES, in general, contain extensive secondary structure that is critical for activity. 
In contrast, we demonstrate that a region of viral RNA devoid of extensive secondary structure has IRES activity and produces low levels of viral coat protein in vitro and in vivo. Our findings may be applicable to cellular mRNA IRES that also have little or no sequences/structures in common. PMID:28179526

  15. Polyadenylation of RNA transcribed from mammalian SINEs by RNA polymerase III: Complex requirements for nucleotide sequences.

    PubMed

    Borodulina, Olga R; Golubchikova, Julia S; Ustyantsev, Ilia G; Kramerov, Dmitri A

    2016-02-01

    It is generally accepted that only transcripts synthesized by RNA polymerase II (e.g., mRNA) were subject to AAUAAA-dependent polyadenylation. However, we previously showed that RNA transcribed by RNA polymerase III (pol III) from mouse B2 SINE could be polyadenylated in an AAUAAA-dependent manner. Many species of mammalian SINEs end with the pol III transcriptional terminator (TTTTT) and contain hexamers AATAAA in their A-rich tail. Such SINEs were united into Class T(+), whereas SINEs lacking the terminator and AATAAA sequences were classified as T(-). Here we studied the structural features of SINE pol III transcripts that are necessary for their polyadenylation. Eight and six SINE families from classes T(+) and T(-), respectively, were analyzed. The replacement of AATAAA with AACAAA in T(+) SINEs abolished the RNA polyadenylation. Interestingly, insertion of the polyadenylation signal (AATAAA) and pol III transcription terminator in T(-) SINEs did not result in polyadenylation. The detailed analysis of three T(+) SINEs (B2, DIP, and VES) revealed areas important for the polyadenylation of their pol III transcripts: the polyadenylation signal and terminator in A-rich tail, β region positioned immediately downstream of the box B of pol III promoter, and τ region located upstream of the tail. In DIP and VES (but not in B2), the τ region is a polypyrimidine motif which is also characteristic of many other T(+) SINEs. Most likely, SINEs of different mammals acquired these structural features independently as a result of parallel evolution. Copyright © 2015 Elsevier B.V. All rights reserved.

  16. Inhibitor-induced structural change in the HCV IRES domain IIa RNA

    PubMed Central

    Paulsen, Ryan B.; Seth, Punit P.; Swayze, Eric E.; Griffey, Richard H.; Skalicky, Jack J.; Cheatham, Thomas E.; Davis, Darrell R.

    2010-01-01

    Translation of the hepatitis C virus (HCV) RNA is initiated from a highly structured internal ribosomal entry site (IRES) in the 5′ untranslated region (5′ UTR) of the RNA genome. An important structural feature of the native RNA is an approximately 90° helical bend localized to domain IIa that positions the apical loop of domain IIb of the IRES near the 40S ribosomal E-site to promote eIF2-GDP release, facilitating 80S ribosome assembly. We report here the NMR structure of a domain IIa construct in complex with a potent small-molecule inhibitor of HCV replication. Molecular dynamics refinement in explicit solvent and subsequent energetic analysis indicated that each inhibitor stereoisomer bound with comparable affinity and in an equivalent binding mode. The in silico analysis was substantiated by fluorescence-based assays showing that the relative binding free energies differed by only 0.7 kcal/mol. Binding of the inhibitor displaces key nucleotide residues within the bulge region, effecting a major conformational change that eliminates the bent RNA helical trajectory, providing a mechanism for the antiviral activity of this inhibitor class. PMID:20360559
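
    To put the 0.7 kcal/mol difference reported in the abstract above into perspective, a free-energy difference can be converted into a ratio of dissociation constants with the standard relation ratio = exp(ΔΔG / RT). The sketch below is illustrative and not taken from the cited paper; the temperature of 298.15 K is an assumption (the abstract does not state one), and the function name `affinity_ratio` is hypothetical.

```python
import math

# Gas constant in kcal/(mol*K).
R_KCAL_PER_MOL_K = 1.987204e-3


def affinity_ratio(ddg_kcal_per_mol: float, temperature_k: float = 298.15) -> float:
    """Ratio of dissociation constants implied by a binding free-energy difference.

    Uses ratio = exp(ddG / (R * T)). The default temperature is an assumption.
    """
    return math.exp(ddg_kcal_per_mol / (R_KCAL_PER_MOL_K * temperature_k))


ratio = affinity_ratio(0.7)
print(f"Kd ratio for a 0.7 kcal/mol difference: {ratio:.1f}")
```

    At the assumed temperature the ratio is roughly threefold, which is consistent with the abstract's description of the stereoisomers as binding with comparable affinity.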

  17. Affinity maturation of a portable Fab–RNA module for chaperone-assisted RNA crystallography

    PubMed Central

    Koirala, Deepak; Shelke, Sandip A; Dupont, Marcel; Ruiz, Stormy; DasGupta, Saurja; Bailey, Lucas J; Benner, Steven A; Piccirilli, Joseph A

    2018-01-01

    Antibody fragments such as Fabs possess properties that can enhance protein and RNA crystallization and therefore can facilitate macromolecular structure determination. In particular, Fab BL3–6 binds to an AAACA RNA pentaloop closed by a GC pair with ∼100 nM affinity. The Fab and hairpin have served as a portable module for RNA crystallization. The potential for general application makes it desirable to adjust the properties of this crystallization module in a manner that facilitates its use for RNA structure determination, such as ease of purification, surface entropy or binding affinity. In this work, we used both in vitro RNA selection and phage display selection to alter the epitope and paratope sides of the binding interface, respectively, for improved binding affinity. We identified a 5′-GNGACCC-3′ consensus motif in the RNA and an S97N mutation in complementarity-determining region L3 of the Fab that independently impart about an order of magnitude improvement in affinity, resulting from new hydrogen bonding interactions. Using a model RNA, these modifications facilitated crystallization under a wider range of conditions and improved diffraction. The improved features of the Fab–RNA module may facilitate its use as an affinity tag for RNA purification and imaging and as a chaperone for RNA crystallography. PMID:29309709

  18. Structural basis of DNA folding and recognition in an AMP-DNA aptamer complex: distinct architectures but common recognition motifs for DNA and RNA aptamers complexed to AMP.

    PubMed

    Lin, C H; Patel, D J

    1997-11-01

    Structural studies by nuclear magnetic resonance (NMR) of RNA and DNA aptamer complexes identified through in vitro selection and amplification have provided a wealth of information on RNA and DNA tertiary structure and molecular recognition in solution. The RNA and DNA aptamers that target ATP (and AMP) with micromolar affinity exhibit distinct binding site sequences and secondary structures. We report below on the tertiary structure of the AMP-DNA aptamer complex in solution and compare it with the previously reported tertiary structure of the AMP-RNA aptamer complex in solution. The solution structure of the AMP-DNA aptamer complex shows, surprisingly, that two AMP molecules are intercalated at adjacent sites within a rectangular widened minor groove. Complex formation involves adaptive binding where the asymmetric internal bubble of the free DNA aptamer zippers up through formation of a continuous six-base mismatch segment which includes a pair of adjacent three-base platforms. The AMP molecules pair through their Watson-Crick edges with the minor groove edges of guanine residues. These recognition G.A mismatches are flanked by sheared G.A and reversed Hoogsteen G.G mismatch pairs. The AMP-DNA aptamer and AMP-RNA aptamer complexes have distinct tertiary structures and binding stoichiometries. Nevertheless, both complexes have similar structural features and recognition alignments in their binding pockets. Specifically, AMP targets both DNA and RNA aptamers by intercalating between purine bases and through identical G.A mismatch formation. The recognition G.A mismatch stacks with a reversed Hoogsteen G.G mismatch in one direction and with an adenine base in the other direction in both complexes. It is striking that DNA and RNA aptamers selected independently from libraries of 10^14 molecules in each case utilize identical mismatch alignments for molecular recognition with micromolar affinity within binding-site pockets containing common structural elements.

  19. Deformability in the cleavage site of primary microRNA is not sensed by the double-stranded RNA binding domains in the microprocessor component DGCR8.

    PubMed

    Quarles, Kaycee A; Chadalavada, Durga; Showalter, Scott A

    2015-06-01

    The prevalence of double-stranded RNA (dsRNA) in eukaryotic cells has only recently been appreciated. Of interest here, RNA silencing begins with dsRNA substrates that are bound by the dsRNA-binding domains (dsRBDs) of their processing proteins. Specifically, processing of microRNA (miRNA) in the nucleus minimally requires the enzyme Drosha and its dsRBD-containing cofactor protein, DGCR8. The smallest recombinant construct of DGCR8 that is sufficient for in vitro dsRNA binding, referred to as DGCR8-Core, consists of its two dsRBDs and a C-terminal tail. As dsRBDs rarely recognize the nucleotide sequence of dsRNA, it is reasonable to hypothesize that DGCR8 function is dependent on the recognition of specific structural features in the miRNA precursor. Previously, we demonstrated that noncanonical structural elements that promote RNA flexibility within the stem of miRNA precursors are necessary for efficient in vitro cleavage by reconstituted Microprocessor complexes. Here, we combine gel shift assays with in vitro processing assays to demonstrate that neither the N-terminal dsRBD of DGCR8 in isolation nor the DGCR8-Core construct is sensitive to the presence of noncanonical structural elements within the stem of miRNA precursors, or to single-stranded segments flanking the stem. Extending DGCR8-Core to include an N-terminal heme-binding region does not change our conclusions. Thus, our data suggest that although the DGCR8-Core region is necessary for dsRNA binding and recruitment to the Microprocessor, it is not sufficient to establish the previously observed connection between RNA flexibility and processing efficiency. © 2015 Wiley Periodicals, Inc.

  20. Well-characterized sequence features of eukaryote genomes and implications for ab initio gene prediction.

    PubMed

    Huang, Ying; Chen, Shi-Yi; Deng, Feilong

    2016-01-01

    In silico analysis of DNA sequences is an important area of computational biology in the post-genomic era. Over the past two decades, computational approaches for ab initio prediction of gene structure from genome sequence alone have greatly advanced our understanding of a variety of biological questions. Although the computational prediction of protein-coding genes is already well established, robustly finding non-coding RNA genes, such as miRNA and lncRNA, remains a challenge. The two main aspects of ab initio gene prediction are the computed values that describe sequence features and the algorithm used to train the discriminant function; different combinations of the two are employed in various bioinformatic tools. Herein, we briefly review these well-characterized sequence features in eukaryote genomes and their applications to ab initio gene prediction. The main purpose of this article is to provide an overview for beginners who aim to develop related bioinformatic tools.

  1. Ricin - inhibitor design. Annual report, 15 April 1994-14 April 1995

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Schramm, V.L.

    1995-05-14

    Substrates for ricin A-chain include short RNA stem-loop structures which have been synthesized with radioactive labels for ease of catalytic assay and for kinetic isotope effects. Ricin A-chain from several sources is incapable of completing multiple catalytic cycles using these substrates. A family of ricin substrate analogue molecules has been synthesized and tested which are specific for transition states with oxycarbonium character or for enzymatic mechanisms involving protonation of the adenine leaving group. Formycin analogues were incorporated into RNA oligomeric structures and tested for binding to ricin A-chain or as inhibitors of the ricin-inactivation of in vitro translation using rabbit reticulocyte lysates. Ribo-oxycarbonium ion analogues containing iminoribitol analogues of ribose were synthetically incorporated into RNA oligomeric structures. Neither formycin nor ribo-oxycarbonium analogues, either singly or in RNA oligomers, caused significant inhibition of ricin A-chain when assayed in reticulocyte lysate translation assays. The results indicate a novel transition state mechanism for ricin A-chain, or a requirement for additional features of 28S rRNA to bind transition state analogues.

  2. A definition of the domains Archaea, Bacteria and Eucarya in terms of small subunit ribosomal RNA characteristics

    NASA Technical Reports Server (NTRS)

    Winker, S.; Woese, C. R.

    1991-01-01

    The number of small subunit rRNA sequences is now great enough that the three domains Archaea, Bacteria and Eucarya (Woese et al., 1990) can be reliably defined in terms of their sequence "signatures". Approximately 50 homologous positions (or nucleotide pairs) in the small subunit rRNA characterize and distinguish among the three. In addition, the three can be recognized by a variety of nonhomologous rRNA characters, either individual positions and/or higher-order structural features. The Crenarchaeota and the Euryarchaeota, the two archaeal kingdoms, can also be defined and distinguished by their characteristic compositions at approximately fifteen positions in the small subunit rRNA molecule.

  3. Analytical study of avian reticuloendotheliosis virus dimeric RNA generated in vivo and in vitro.

    PubMed

    Darlix, J L; Gabus, C; Allain, B

    1992-12-01

    The retroviral genome consists of two identical RNA molecules associated at their 5' ends by a stable structure called the dimer linkage structure. The dimer linkage structure, while maintaining the dimer state of the retroviral genome, might also be involved in packaging and reverse transcription, as well as recombination during proviral DNA synthesis. To study the dimer structure of the retroviral genome and the mechanism of dimerization, we analyzed features of the dimeric genome of reticuloendotheliosis virus (REV) type A and identified elements required for its dimerization. Here we report that the REV dimeric genome extracted from virions and infected cells, as well as that synthesized in vitro, is more resistant to heat denaturation than avian sarcoma and leukemia virus, murine leukemia virus, or human immunodeficiency virus type 1 dimeric RNA. The minimal domain required to form a stable REV RNA dimer in vitro was found to map between positions 268 and 452 (KpnI and SalI sites), thus corresponding to the E encapsidation sequence (J. E. Embretson and H. M. Temin, J. Virol. 61:2675-2683, 1987). In addition, both the 5' and 3' halves of E are necessary in cis for RNA dimerization and the extent of RNA dimerization is influenced by viral sequences flanking E. Rapid and efficient dimerization of REV RNA containing gag sequences in addition to the E sequences and annealing of replication primer tRNA(Pro) to the primer-binding site necessitate the nucleocapsid protein.

  4. Analytical study of avian reticuloendotheliosis virus dimeric RNA generated in vivo and in vitro.

    PubMed Central

    Darlix, J L; Gabus, C; Allain, B

    1992-01-01

    The retroviral genome consists of two identical RNA molecules associated at their 5' ends by a stable structure called the dimer linkage structure. The dimer linkage structure, while maintaining the dimer state of the retroviral genome, might also be involved in packaging and reverse transcription, as well as recombination during proviral DNA synthesis. To study the dimer structure of the retroviral genome and the mechanism of dimerization, we analyzed features of the dimeric genome of reticuloendotheliosis virus (REV) type A and identified elements required for its dimerization. Here we report that the REV dimeric genome extracted from virions and infected cells, as well as that synthesized in vitro, is more resistant to heat denaturation than avian sarcoma and leukemia virus, murine leukemia virus, or human immunodeficiency virus type 1 dimeric RNA. The minimal domain required to form a stable REV RNA dimer in vitro was found to map between positions 268 and 452 (KpnI and SalI sites), thus corresponding to the E encapsidation sequence (J. E. Embretson and H. M. Temin, J. Virol. 61:2675-2683, 1987). In addition, both the 5' and 3' halves of E are necessary in cis for RNA dimerization and the extent of RNA dimerization is influenced by viral sequences flanking E. Rapid and efficient dimerization of REV RNA containing gag sequences in addition to the E sequences and annealing of replication primer tRNA(Pro) to the primer-binding site necessitate the nucleocapsid protein. PMID:1331519

  5. Structure of a bacterial RNA polymerase holoenzyme open promoter complex

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bae, Brian; Feklistov, Andrey; Lass-Napiorkowska, Agnieszka

    2015-09-08

    Initiation of transcription is a primary means for controlling gene expression. In bacteria, the RNA polymerase (RNAP) holoenzyme binds and unwinds promoter DNA, forming the transcription bubble of the open promoter complex (RPo). We have determined crystal structures, refined to 4.14 Å resolution, of RPo containing Thermus aquaticus RNAP holoenzyme and promoter DNA that includes the full transcription bubble. The structures, combined with biochemical analyses, reveal key features supporting the formation and maintenance of the double-strand/single-strand DNA junction at the upstream edge of the -10 element where bubble formation initiates. The results also reveal RNAP interactions with duplex DNA just upstream of the -10 element and potential protein/DNA interactions that direct the DNA template strand into the RNAP active site. Addition of an RNA primer to yield a 4 base-pair post-translocated RNA:DNA hybrid mimics an initially transcribing complex at the point where steric clash initiates abortive initiation and σA dissociation.

  6. Structure of a bacterial RNA polymerase holoenzyme open promoter complex

    DOE PAGES

    Bae, Brian; Feklistov, Andrey; Lass-Napiorkowska, Agnieszka; ...

    2015-09-08

    Initiation of transcription is a primary means for controlling gene expression. In bacteria, the RNA polymerase (RNAP) holoenzyme binds and unwinds promoter DNA, forming the transcription bubble of the open promoter complex (RPo). We have determined crystal structures, refined to 4.14 Å resolution, of RPo containing Thermus aquaticus RNAP holoenzyme and promoter DNA that includes the full transcription bubble. The structures, combined with biochemical analyses, reveal key features supporting the formation and maintenance of the double-strand/single-strand DNA junction at the upstream edge of the -10 element where bubble formation initiates. The results also reveal RNAP interactions with duplex DNA just upstream of the -10 element and potential protein/DNA interactions that direct the DNA template strand into the RNAP active site. Addition of an RNA primer to yield a 4 base-pair post-translocated RNA:DNA hybrid mimics an initially transcribing complex at the point where steric clash initiates abortive initiation and σA dissociation.

  7. Structures of two aptamers with differing ligand specificity reveal ruggedness in the functional landscape of RNA

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Knappenberger, Andrew John; Reiss, Caroline Wetherington; Strobel, Scott A.

    Two classes of riboswitches related to the ykkC guanidine-I riboswitch bind phosphoribosyl pyrophosphate (PRPP) and guanosine tetraphosphate (ppGpp). Here we report the co-crystal structure of the PRPP aptamer and its ligand. We also report the structure of the G96A point mutant that prefers ppGpp over PRPP with a dramatic 40,000-fold switch in specificity. The ends of the aptamer form a helix that is not present in the guanidine aptamer and is involved in the expression platform. In the mutant, the base of ppGpp replaces G96 in three-dimensional space. This disrupts the S-turn, which is a primary structural feature of the ykkC RNA motif. These dramatic differences in ligand specificity are achieved with minimal mutations. ykkC aptamers are therefore a prime example of an RNA fold with a rugged fitness landscape. The ease with which the ykkC aptamer acquires new specificity represents a striking case of evolvability in RNA.

  8. Structures of two aptamers with differing ligand specificity reveal ruggedness in the functional landscape of RNA.

    PubMed

    Knappenberger, Andrew John; Reiss, Caroline Wetherington; Strobel, Scott A

    2018-06-07

    Two classes of riboswitches related to the ykkC guanidine-I riboswitch bind phosphoribosyl pyrophosphate (PRPP) and guanosine tetraphosphate (ppGpp). Here we report the co-crystal structure of the PRPP aptamer and its ligand. We also report the structure of the G96A point mutant that prefers ppGpp over PRPP with a dramatic 40,000-fold switch in specificity. The ends of the aptamer form a helix that is not present in the guanidine aptamer and is involved in the expression platform. In the mutant, the base of ppGpp replaces G96 in three-dimensional space. This disrupts the S-turn, which is a primary structural feature of the ykkC RNA motif. These dramatic differences in ligand specificity are achieved with minimal mutations. ykkC aptamers are therefore a prime example of an RNA fold with a rugged fitness landscape. The ease with which the ykkC aptamer acquires new specificity represents a striking case of evolvability in RNA. © 2018, Knappenberger et al.

  9. Structural analyses of the CRISPR protein Csc2 reveal the RNA-binding interface of the type I-D Cas7 family.

    PubMed

    Hrle, Ajla; Maier, Lisa-Katharina; Sharma, Kundan; Ebert, Judith; Basquin, Claire; Urlaub, Henning; Marchfelder, Anita; Conti, Elena

    2014-01-01

    Upon pathogen invasion, bacteria and archaea activate an RNA-interference-like mechanism termed CRISPR (clustered regularly interspaced short palindromic repeats). A large family of Cas (CRISPR-associated) proteins mediates the different stages of this sophisticated immune response. Bioinformatic studies have classified the Cas proteins into families, according to their sequences and respective functions. These range from the insertion of the foreign genetic elements into the host genome to the activation of the interference machinery as well as target degradation upon attack. Cas7 family proteins are central to the type I and type III interference machineries as they constitute the backbone of the large interference complexes. Here we report the crystal structure of Thermofilum pendens Csc2, a Cas7 family protein of type I-D. We found that Csc2 forms a core RRM-like domain, flanked by three peripheral insertion domains: a lid domain, a Zinc-binding domain and a helical domain. Comparison with other Cas7 family proteins reveals a set of similar structural features both in the core and in the peripheral domains, despite the absence of significant sequence similarity. T. pendens Csc2 binds single-stranded RNA in vitro in a sequence-independent manner. Using a crosslinking - mass-spectrometry approach, we mapped the RNA-binding surface to a positively charged surface patch on T. pendens Csc2. Thus our analysis of the key structural and functional features of T. pendens Csc2 highlights recurring themes and evolutionary relationships in type I and type III Cas proteins.

  10. Riboswitches: emerging themes in RNA structure and function.

    PubMed

    Montange, Rebecca K; Batey, Robert T

    2008-01-01

    Riboswitches are RNAs capable of binding cellular metabolites using a diverse array of secondary and tertiary structures to modulate gene expression. The recent determination of the three-dimensional structures of parts of six different riboswitches illuminates common features that allow riboswitches to be grouped into one of two types. Type I riboswitches, as exemplified by the purine riboswitch, are characterized by a single, localized binding pocket supported by a largely pre-established global fold. This arrangement limits ligand-induced conformational changes in the RNA to a small region. In contrast, Type II riboswitches, such as the thiamine pyrophosphate riboswitch, contain binding pockets split into at least two spatially distinct sites. As a result, binding induces both local changes to the binding pocket and global architecture. Similar organizational themes are found in other noncoding RNAs, making it possible to begin to build a hierarchical classification of RNA structure based on the spatial organization of their active sites and associated secondary structural elements.

  11. Cloning and determination of the transcription termination site of ribosomal RNA gene of the mouse.

    PubMed Central

    Kominami, R; Mishima, Y; Urano, Y; Sakai, M; Muramatsu, M

    1982-01-01

    An Eco RI 6.6 kb DNA fragment containing the 3'-end of 28S ribosomal RNA gene of the mouse was detected by Southern blot hybridization, and cloned in a lambda-phage vector. The site of transcription termination and the processed 3'-end of 28S RNA were determined on the cloned fragment and the surrounding nucleotide sequence determined. The 3'-terminal nucleotides of mouse 28S RNA are similar to those of yeast, Drosophila and Xenopus although the homology was lost drastically beyond the 3'-end of 28S RNA. 45S precursor RNA terminated at 30 nucleotides downstream from the 3'-end of 28S RNA gene. A structure of a dyad symmetry with a loop was found immediately prior to the termination site of 45S RNA. The rDNA termination site thus shares some common features with termination sites recognized by other RNA polymerases. PMID:6281727

  12. Crystallographic and Computational Analyses of AUUCU Repeating RNA That Causes Spinocerebellar Ataxia Type 10 (SCA10).

    PubMed

    Park, HaJeung; González, Àlex L; Yildirim, Ilyas; Tran, Tuan; Lohman, Jeremy R; Fang, Pengfei; Guo, Min; Disney, Matthew D

    2015-06-23

    Spinocerebellar ataxia type 10 (SCA10) is caused by a pentanucleotide repeat expansion of r(AUUCU) within intron 9 of the ATXN10 pre-mRNA. The RNA causes disease by a gain-of-function mechanism in which it inactivates proteins involved in RNA biogenesis. Spectroscopic studies showed that r(AUUCU) repeats form a hairpin structure; however, there were no high-resolution structural models prior to this work. Herein, we report the first crystal structure of model r(AUUCU) repeats refined to 2.8 Å and analysis of the structure via molecular dynamics simulations. The r(AUUCU) tracts adopt an overall A-form geometry in which 3 × 3 nucleotide (5')UCU(3')/(3')UCU(5') internal loops are closed by AU pairs. Helical parameters of the refined structure as well as the corresponding electron density map on the crystallographic model reflect dynamic features of the internal loop. The computational analyses captured dynamic motion of the loop closing pairs, which can form single-stranded conformations with relatively low energies. Overall, the results presented here suggest the possibility for r(AUUCU) repeats to form metastable A-form structures, which can rearrange into single-stranded conformations and attract proteins such as heterogeneous nuclear ribonucleoprotein K (hnRNP K). The information presented here may aid in the rational design of therapeutics targeting this RNA.

  13. Crystallographic and Computational Analyses of AUUCU Repeating RNA That Causes Spinocerebellar Ataxia Type 10 (SCA10)

    PubMed Central

    Park, HaJeung; González, Àlex L.; Yildirim, Ilyas; Tran, Tuan; Lohman, Jeremy R.; Fang, Pengfei; Guo, Min; Disney, Matthew D.

    2016-01-01

    Spinocerebellar ataxia type 10 (SCA10) is caused by a pentanucleotide repeat expansion of r(AUUCU) within intron 9 of the ATXN10 pre-mRNA. The RNA causes disease by a gain-of-function mechanism in which it inactivates proteins involved in RNA biogenesis. Spectroscopic studies showed that r(AUUCU) repeats form a hairpin structure; however, there were no high-resolution structural models prior to this work. Herein, we report the first crystal structure of model r(AUUCU) repeats refined to 2.8 Å and analysis of the structure via molecular dynamics simulations. The r(AUUCU) tracts adopt an overall A-form geometry in which 3 × 3 nucleotide 5′UCU3′/3′UCU5′ internal loops are closed by AU pairs. Helical parameters of the refined structure as well as the corresponding electron density map on the crystallographic model reflect dynamic features of the internal loop. The computational analyses captured dynamic motion of the loop closing pairs, which can form single-stranded conformations with relatively low energies. Overall, the results presented here suggest the possibility for r(AUUCU) repeats to form metastable A-form structures, which can rearrange into single-stranded conformations and attract proteins such as heterogeneous nuclear ribonucleoprotein K (hnRNP K). The information presented here may aid in the rational design of therapeutics targeting this RNA. PMID:26039897

  14. Pharmacological Characterization of Chemically Synthesized Monomeric phi29 pRNA Nanoparticles for Systemic Delivery

    PubMed Central

    Abdelmawla, Sherine; Guo, Songchuan; Zhang, Limin; Pulukuri, Sai M; Patankar, Prithviraj; Conley, Patrick; Trebley, Joseph; Guo, Peixuan; Li, Qi-Xiang

    2011-01-01

    Previous studies have shown that the packaging RNA (pRNA) of bacteriophage phi29 DNA packaging motor folds into a compact structure, constituting an RNA nanoparticle that can be modularized with functional groups as a nanodelivery system. pRNA nanoparticles can also be self-assembled by the bipartite approach without altering folding property. The present study demonstrated that 2′-F-modified pRNA nanoparticles were readily manufactured through this scalable bipartite strategy, featuring total chemical synthesis and permitting diverse functional modularizations. The RNA nanoparticles were chemically and metabolically stable and demonstrated a favorable pharmacokinetic (PK) profile in mice (half-life (T1/2): 5–10 hours, clearance (Cl): <0.13 l/kg/hour, volume of distribution (Vd): 1.2 l/kg). They did not induce an interferon (IFN) response, nor did they induce cytokine production in mice. Repeat intravenous administrations in mice up to 30 mg/kg did not result in any toxicity. Fluorescent folate-pRNA nanoparticles efficiently and specifically bound and internalized to folate receptor (FR)-bearing cancer cells in vitro. They also specifically and dose-dependently targeted to FR+ xenograft tumor in mice with minimal accumulation in normal tissues. This first comprehensive pharmacological study suggests that the pRNA nanoparticle had all the preferred pharmacological features to serve as an efficient nanodelivery platform for broad medical applications. PMID:21468004

  15. Comparative analyses of the thermodynamic RNA binding signatures of different types of RNA recognition motifs

    PubMed Central

    Cléry, Antoine; Allain, Frédéric H-T

    2017-01-01

    RNA recognition motifs (RRMs) are structurally versatile domains important in regulation of alternative splicing. Structural mechanisms of sequence-specific recognition of single-stranded RNAs (ssRNAs) by RRMs are well understood. The thermodynamic strategies are however unclear. Therefore, we utilized microcalorimetry and semi-empirical analyses to comparatively analyze the cognate ssRNA binding thermodynamics of four different RRM domains, each with a different RNA binding mode. The different binding modes are: canonical binding to the β-sheet surface; canonical binding with involvement of N- and C-termini; binding to conserved loops; and binding to an α-helix. Our results identify enthalpy as the sole and general force driving association at physiological temperatures. Also, networks of weak interactions are a general feature regulating stability of the different RRM–ssRNA complexes. In agreement, non-polyelectrolyte effects contributed between ∼75 and 90% of the overall free energy of binding in the considered complexes. The various RNA binding modes also displayed enormous heat capacity differences, that upon dissection revealed large differential changes in hydration, conformations and dynamics upon binding RNA. Altogether, different modes employed by RRMs to bind cognate ssRNAs utilize various thermodynamics strategies during the association process. PMID:28334819

  16. Identification of a conserved branched RNA structure that functions as a factor-independent terminator.

    PubMed

    Johnson, Christopher M; Chen, Yuqing; Lee, Heejin; Ke, Ailong; Weaver, Keith E; Dunny, Gary M

    2014-03-04

    Anti-Q is a small RNA encoded on pCF10, an antibiotic resistance plasmid of Enterococcus faecalis, which negatively regulates conjugation of the plasmid. In this study we sought to understand how Anti-Q is generated relative to larger transcripts of the same operon. We found that Anti-Q folds into a branched structure that functions as a factor-independent terminator. In vitro and in vivo, termination is dependent on the integrity of this structure as well as the presence of a 3' polyuridine tract, but is not dependent on other downstream sequences. In vitro, terminated transcripts are released from RNA polymerase after synthesis. In vivo, a mutant with reduced termination efficiency demonstrated loss of tight control of conjugation function. A search of bacterial genomes revealed the presence of sequences that encode Anti-Q-like RNA structures. In vitro and in vivo experiments demonstrated that one of these functions as a terminator. This work reveals a previously unappreciated flexibility in the structure of factor-independent terminators and identifies a mechanism for generation of functional small RNAs; it should also inform annotation of bacterial sequence features, such as terminators, functional sRNAs, and operons.

  17. Identification of a conserved branched RNA structure that functions as a factor-independent terminator

    PubMed Central

    Johnson, Christopher M.; Chen, Yuqing; Lee, Heejin; Ke, Ailong; Weaver, Keith E.; Dunny, Gary M.

    2014-01-01

    Anti-Q is a small RNA encoded on pCF10, an antibiotic resistance plasmid of Enterococcus faecalis, which negatively regulates conjugation of the plasmid. In this study we sought to understand how Anti-Q is generated relative to larger transcripts of the same operon. We found that Anti-Q folds into a branched structure that functions as a factor-independent terminator. In vitro and in vivo, termination is dependent on the integrity of this structure as well as the presence of a 3′ polyuridine tract, but is not dependent on other downstream sequences. In vitro, terminated transcripts are released from RNA polymerase after synthesis. In vivo, a mutant with reduced termination efficiency demonstrated loss of tight control of conjugation function. A search of bacterial genomes revealed the presence of sequences that encode Anti-Q–like RNA structures. In vitro and in vivo experiments demonstrated that one of these functions as a terminator. This work reveals a previously unappreciated flexibility in the structure of factor-independent terminators and identifies a mechanism for generation of functional small RNAs; it should also inform annotation of bacterial sequence features, such as terminators, functional sRNAs, and operons. PMID:24550474

  18. Analysis of Nearly One Thousand Mammalian Mirtrons Reveals Novel Features of Dicer Substrates

    PubMed Central

    Shenker, Sol; Mohammed, Jaaved; Lai, Eric C.

    2015-01-01

    Mirtrons are microRNA (miRNA) substrates that utilize the splicing machinery to bypass the necessity of Drosha cleavage for their biogenesis. Expanding our recent efforts for mammalian mirtron annotation, we use meta-analysis of aggregate datasets to identify ~500 novel mouse and human introns that confidently generate diced small RNA duplexes. These comprise nearly 1000 total loci distributed in four splicing-mediated biogenesis subclasses, with 5'-tailed mirtrons as, by far, the dominant subtype. Thus, mirtrons surprisingly comprise a substantial fraction of endogenous Dicer substrates in mammalian genomes. Although mirtron-derived small RNAs exhibit overall expression correlation with their host mRNAs, we observe a subset with substantial differences that suggest regulated processing or accumulation. We identify characteristic sequence, length, and structural features of mirtron loci that distinguish them from bulk introns, and find that mirtrons preferentially emerge from genes with larger numbers of introns. While mirtrons generate miRNA-class regulatory RNAs, we also find that mirtrons exhibit many features that distinguish them from canonical miRNAs. We observe that conventional mirtron hairpins are substantially longer than Drosha-generated pre-miRNAs, indicating that the characteristic length of canonical pre-miRNAs is not a general feature of Dicer substrate hairpins. In addition, mammalian mirtrons exhibit unique patterns of ordered 5' and 3' heterogeneity, which reveal hidden complexity in miRNA processing pathways. These include broad 3'-uridylation of mirtron hairpins, atypically heterogeneous 5' termini that may result from exonucleolytic processing, and occasionally robust decapitation of the 5' guanine (G) of mirtron-5p species defined by splicing. Altogether, this study reveals that this extensive class of non-canonical miRNA bears a multitude of characteristic properties, many of which raise general mechanistic questions regarding the processing of endogenous hairpin transcripts. PMID:26325366

  19. tRNomics: analysis of tRNA genes from 50 genomes of Eukarya, Archaea, and Bacteria reveals anticodon-sparing strategies and domain-specific features.

    PubMed Central

    Marck, Christian; Grosjean, Henri

    2002-01-01

    From 50 genomes of the three domains of life (7 eukarya, 13 archaea, and 30 bacteria), we extracted, analyzed, and compared over 4,000 sequences corresponding to cytoplasmic, nonorganellar tRNAs. For each genome, the complete set of tRNAs required to read the 61 sense codons was identified, which permitted revelation of three major anticodon-sparing strategies. Other features and sequence peculiarities analyzed are the following: (1) fit to the standard cloverleaf structure, (2) characteristic consensus sequences for elongator and initiator tDNAs, (3) frequencies of bases at each sequence position, (4) type and frequencies of conserved 2D and 3D base pairs, (5) anticodon/tDNA usages and anticodon-sparing strategies, (6) identification of the tRNA-Ile with anticodon CAU reading AUA, (7) size of variable arm, (8) occurrence and location of introns, (9) occurrence of 3'-CCA and 5'-extra G encoded at the tDNA level, and (10) distribution of the tRNA genes in genomes and their mode of transcription. Among all tRNA isoacceptors, we found that initiator tDNA-iMet is the most conserved across the three domains, yet domain-specific signatures exist. Also, according to which tRNA feature is considered (5'-extra G encoded in tDNAs-His, AUA codon read by tRNA-Ile with anticodon CAU, presence of intron, absence of "two-out-of-three" reading mode and short V-arm in tDNA-Tyr) Archaea sequester either with Bacteria or Eukarya. No common features between Eukarya and Bacteria not shared with Archaea could be unveiled. Thus, from the tRNomic point of view, Archaea appears as an "intermediate domain" between Eukarya and Bacteria. PMID:12403461

  20. Modular architecture of eukaryotic RNase P and RNase MRP revealed by electron microscopy.

    PubMed

    Hipp, Katharina; Galani, Kyriaki; Batisse, Claire; Prinz, Simone; Böttcher, Bettina

    2012-04-01

    Ribonuclease P (RNase P) and RNase MRP are closely related ribonucleoprotein enzymes, which process RNA substrates including tRNA precursors for RNase P and 5.8 S rRNA precursors, as well as some mRNAs, for RNase MRP. The structures of RNase P and RNase MRP have not yet been solved, so it is unclear how the proteins contribute to the structure of the complexes and how substrate specificity is determined. Using electron microscopy and image processing we show that eukaryotic RNase P and RNase MRP have a modular architecture, where proteins stabilize the RNA fold and contribute to cavities, channels and chambers between the modules. Such features are located at strategic positions for substrate recognition by shape and coordination of the cleaved-off sequence. These are also the sites of greatest difference between RNase P and RNase MRP, highlighting the importance of the adaptation of this region to the different substrates.

  1. Identification and characterization of circular RNAs in zebrafish.

    PubMed

    Shen, Yudong; Guo, Xianwu; Wang, Weimin

    2017-01-01

    Circular RNA (circRNA), a class of RNAs with circular structure, has received little attention until recently, when some new features and functions were discovered. In the present study, we sequenced circRNAs in zebrafish (Danio rerio) and identified 3868 circRNAs using three algorithms (find_circ, CIRI, segemehl). The analysis of microRNA target sites on circRNAs shows that some circRNAs may function as miRNA sponges. Furthermore, we identified the existence of reverse complementary sequences in the flanking regions of only 25 (2.64%) exonic circRNAs, indicating that the mechanism of zebrafish exonic circRNA biogenesis might be different from that in mammals. Moreover, 1122 (29%) zebrafish circRNA sequences showed homology with human, mouse and coelacanth circRNAs. © 2016 Federation of European Biochemical Societies.

  2. Unconventional features of C9ORF72 expanded repeat in amyotrophic lateral sclerosis and frontotemporal lobar degeneration.

    PubMed

    Vatovec, Sabina; Kovanda, Anja; Rogelj, Boris

    2014-10-01

    Amyotrophic lateral sclerosis (ALS) and frontotemporal lobar degeneration (FTLD) are devastating neurodegenerative diseases that form two ends of a complex disease spectrum. Aggregation of RNA binding proteins is one of the hallmark pathologic features of ALS and FTLD and suggests perturbance of the RNA metabolism in their etiology. Recent identification of the disease-associated expansions of the intronic hexanucleotide repeat GGGGCC in the C9ORF72 gene further substantiates the case for RNA involvement. The expanded repeat, which has turned out to be the single most common genetic cause of ALS and FTLD, may enable the formation of complex DNA and RNA structures, changes in RNA transcription, and processing and formation of toxic RNA foci, which may sequester and inactivate RNA binding proteins. Additionally, the transcribed expanded repeat can undergo repeat-associated non-ATG-initiated translation resulting in accumulation of a series of dipeptide repeat proteins. Understanding the basis of the proposed mechanisms and shared pathways, as well as interactions with known key proteins such as TAR DNA-binding protein (TDP-43), is needed to clarify the pathology of ALS and/or FTLD, and make possible steps toward therapy development. Copyright © 2014 Elsevier Inc. All rights reserved.

  3. Synthesis, base pairing and structure studies of geranylated RNA.

    PubMed

    Wang, Rui; Vangaveti, Sweta; Ranganathan, Srivathsan V; Basanta-Sanchez, Maria; Haruehanroengra, Phensinee; Chen, Alan; Sheng, Jia

    2016-07-27

    Natural RNAs utilize extensive chemical modifications to diversify their structures and functions. 2-Thiouridine geranylation is a special hydrophobic tRNA modification that has been discovered very recently in several bacteria, such as Escherichia coli, Enterobacter aerogenes, Pseudomonas aeruginosa and Salmonella Typhimurium. The geranylated residues are located in the first anticodon position of tRNAs specific for lysine, glutamine and glutamic acid. This big hydrophobic terpene functional group affects the codon recognition patterns and reduces frameshifting errors during translation. We aimed to systematically study the structure, function and biosynthesis mechanism of this geranylation pathway, as well as answer the question of why nature uses such a hydrophobic modification in hydrophilic RNA systems. Recently, we have synthesized the deoxy-analog of S-geranyluridine and showed the geranylated T-G pair is much stronger than the geranylated T-A pair and other mismatched pairs in the B-form DNA duplex context, which is consistent with the observation that the geranylated tRNA(Glu) UUC recognizes GAG more efficiently than GAA. In this manuscript we report the synthesis and base pairing specificity studies of geranylated RNA oligos. We also report extensive molecular simulation studies to explore the structural features of the geranyl group in the context of A-form RNA and its effect on codon-anticodon interaction during ribosome binding. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  4. Diverse RNA-binding proteins interact with functionally related sets of RNAs, suggesting an extensive regulatory system.

    PubMed

    Hogan, Daniel J; Riordan, Daniel P; Gerber, André P; Herschlag, Daniel; Brown, Patrick O

    2008-10-28

    RNA-binding proteins (RBPs) have roles in the regulation of many post-transcriptional steps in gene expression, but relatively few RBPs have been systematically studied. We searched for the RNA targets of 40 proteins in the yeast Saccharomyces cerevisiae: a selective sample of the approximately 600 annotated and predicted RBPs, as well as several proteins not annotated as RBPs. At least 33 of these 40 proteins, including three of the four proteins that were not previously known or predicted to be RBPs, were reproducibly associated with specific sets of a few to several hundred RNAs. Remarkably, many of the RBPs we studied bound mRNAs whose protein products share identifiable functional or cytotopic features. We identified specific sequences or predicted structures significantly enriched in target mRNAs of 16 RBPs. These potential RNA-recognition elements were diverse in sequence, structure, and location: some were found predominantly in 3'-untranslated regions, others in 5'-untranslated regions, some in coding sequences, and many in two or more of these features. Although this study only examined a small fraction of the universe of yeast RBPs, 70% of the mRNA transcriptome had significant associations with at least one of these RBPs, and on average, each distinct yeast mRNA interacted with three of the RBPs, suggesting the potential for a rich, multidimensional network of regulation. These results strongly suggest that combinatorial binding of RBPs to specific recognition elements in mRNAs is a pervasive mechanism for multi-dimensional regulation of their post-transcriptional fate.

  5. Terminal Duplex Stability and Nucleotide Identity Differentially Control siRNA Loading and Activity in RNA Interference

    PubMed Central

    Angart, Phillip A.; Carlson, Rebecca J.; Adu-Berchie, Kwasi

    2016-01-01

    Efficient short interfering RNA (siRNA)-mediated gene silencing requires selection of a sequence that is complementary to the intended target and possesses sequence and structural features that encourage favorable functional interactions with the RNA interference (RNAi) pathway proteins. In this study, we investigated how terminal sequence and structural characteristics of siRNAs contribute to siRNA strand loading and silencing activity and how these characteristics ultimately result in a functionally asymmetric duplex in cultured HeLa cells. Our results reiterate that the most important characteristic in determining siRNA activity is the 5′ terminal nucleotide identity. Our findings further suggest that siRNA loading is controlled principally by the hybridization stability of the 5′ terminus (Nucleotides: 1–2) of each siRNA strand, independent of the opposing terminus. Postloading, RNA-induced silencing complex (RISC)–specific activity was found to be improved by lower hybridization stability in the 5′ terminus (Nucleotides: 3–4) of the loaded siRNA strand and greater hybridization stability toward the 3′ terminus (Nucleotides: 17–18). Concomitantly, specific recognition of the 5′ terminal nucleotide sequence by human Argonaute 2 (Ago2) improves RISC half-life. These findings indicate that careful selection of siRNA sequences can maximize both the loading and the specific activity of the intended guide strand. PMID:27399870

  6. DEAD-box Helicases as Integrators of RNA, Nucleotide and Protein Binding

    PubMed Central

    Putnam, Andrea A.

    2013-01-01

    DEAD-box helicases perform diverse cellular functions in virtually all steps of RNA metabolism from Bacteria to Humans. Although DEAD-box helicases share a highly conserved core domain, the enzymes catalyze a wide range of biochemical reactions. In addition to the well established RNA unwinding and corresponding ATPase activities, DEAD-box helicases promote duplex formation and displace proteins from RNA. They can also function as assembly platforms for larger ribonucleoprotein complexes, and as metabolite sensors. This review aims to provide a perspective on the diverse biochemical features of DEAD-box helicases and connections to structural information. We discuss these data in the context of a model that views the enzymes as integrators of RNA, nucleotide, and protein binding. PMID:23416748

  7. Computational and Experimental Characterization of Ribosomal DNA and RNA G-Quadruplexes

    NASA Astrophysics Data System (ADS)

    Cho, Samuel

    DNA G-quadruplexes in human telomeres and gene promoters are being extensively studied for their role in controlling the growth of cancer cells. Recent studies strongly suggest that guanine (G)-rich genes encoding pre-ribosomal RNA (pre-rRNA) are a potential anticancer target through the inhibition of RNA polymerase I (Pol I) in ribosome biogenesis. However, the structures of ribosomal G-quadruplexes at atomic resolution are unknown, and very little biophysical characterization has been performed on them to date. Here, we have modeled two putative rDNA G-quadruplex structures, NUC 19P and NUC 23P, which we observe via circular dichroism (CD) spectroscopy to adopt a predominantly parallel topology, and their counterpart rRNA. To validate and refine the putative ribosomal G-quadruplex structures, we performed all-atom molecular dynamics (MD) simulations using the CHARMM36 force field in the presence and absence of stabilizing K+ or Na+ ions. We optimized the CHARMM36 force field K+ parameters to be more consistent with quantum mechanical calculations (and the polarizable Drude model force field) so that the K+ ion is predominantly in the G-quadruplex channel. Our MD simulations show that the rDNA G-quadruplexes have more well-defined, predominantly parallel-topology structures than rRNA and that NUC 19P is more structured than NUC 23P, which features extended loops. Our study demonstrates that they are both potential targets for the design of novel chemotherapeutics.

  8. Using secondary structure to identify ribosomal numts: cautionary examples from the human genome.

    PubMed

    Olson, Link E; Yoder, Anne D

    2002-01-01

    The identification of inadvertently sequenced mitochondrial pseudogenes (numts) is critical to any study employing mitochondrial DNA sequence data. Failure to discriminate numts correctly can confound phylogenetic reconstruction and studies of molecular evolution. This is especially problematic for ribosomal mtDNA genes. Unlike protein-coding loci, whose pseudogenes tend to accumulate diagnostic frameshift or premature stop mutations, functional ribosomal genes are not constrained to maintain a reading frame and can accumulate insertion-deletion events of varying length, particularly in nonpairing regions. Several authors have advocated using structural features of the transcribed rRNA molecule to differentiate functional mitochondrial rRNA genes from their nuclear paralogs. We explored this approach using the mitochondrial 12S rRNA gene and three known 12S numts from the human genome in the context of anthropoid phylogeny and the inferred secondary structure of primate 12S rRNA. Contrary to expectation, each of the three human numts exhibits striking concordance with secondary structure models, with little, if any, indication of their pseudogene status, and would likely escape detection based on structural criteria alone. Furthermore, we show that the unwitting inclusion of a particularly ancient (18-25 Myr old) and surprisingly cryptic human numt in a phylogenetic analysis would yield a well-supported but dramatically incorrect conclusion regarding anthropoid relationships. Though we endorse the use of secondary structure models for inferring positional homology wholeheartedly, we caution against reliance on structural criteria for the discrimination of rRNA numts, given the potential fallibility of this approach.

  9. Functional Information Stored in the Conserved Structural RNA Domains of Flavivirus Genomes

    PubMed Central

    Fernández-Sanlés, Alba; Ríos-Marco, Pablo; Romero-López, Cristina; Berzal-Herranz, Alfredo

    2017-01-01

    The genus Flavivirus comprises a large number of small, positive-sense single-stranded, RNA viruses able to replicate in the cytoplasm of certain arthropod and/or vertebrate host cells. The genus, which has some 70 member species, includes a number of emerging and re-emerging pathogens responsible for outbreaks of human disease around the world, such as the West Nile, dengue, Zika, yellow fever, Japanese encephalitis, St. Louis encephalitis, and tick-borne encephalitis viruses. Like other RNA viruses, flaviviruses have a compact RNA genome that efficiently stores all the information required for the completion of the infectious cycle. The efficiency of this storage system is attributable to supracoding elements, i.e., discrete, structural units with essential functions. This information storage system overlaps and complements the protein coding sequence and is highly conserved across the genus. It therefore offers interesting potential targets for novel therapeutic strategies. This review summarizes our knowledge of the features of flavivirus genome functional RNA domains. It also provides a brief overview of the main achievements reported in the design of antiviral nucleic acid-based drugs targeting functional genomic RNA elements. PMID:28421048

  10. Applicability of PM3 to transphosphorylation reaction path: Toward designing a minimal ribozyme

    NASA Technical Reports Server (NTRS)

    Manchester, John I.; Shibata, Masayuki; Setlik, Robert F.; Ornstein, Rick L.; Rein, Robert

    1993-01-01

    A growing body of evidence shows that RNA can catalyze many of the reactions necessary both for replication of genetic material and the possible transition into the modern protein-based world. However, contemporary ribozymes are too large to have self-assembled from a prebiotic oligonucleotide pool. Still, it is likely that the major features of the earliest ribozymes have been preserved as molecular fossils in the catalytic RNA of today. Therefore, the search for a minimal ribozyme has been aimed at finding the necessary structural features of a modern ribozyme (Beaudry and Joyce, 1990). Both a three-dimensional model and quantum chemical calculations are required to quantitatively determine the effects of structural features of the ribozyme on the reaction it catalyzes. Previous studies of the reaction path have been conducted at the ab initio level, but these methods are limited to small models due to enormous computational requirements. Semiempirical methods have been applied to large systems in the past; however, the accuracy of these methods depends largely on a simple model of the ribozyme-catalyzed reaction, or hydrolysis of phosphoric acid. We find that the results are qualitatively similar to ab initio results using large basis sets. Therefore, PM3 is suitable for studying the reaction path of the ribozyme-catalyzed reaction.

  11. Conserved and divergent features of the structure and function of La and La-related proteins (LARPs)

    PubMed Central

    Bayfield, Mark A.; Yang, Ruiqing; Maraia, Richard J.

    2010-01-01

    Genuine La proteins contain two RNA binding motifs, a La motif (LAM) followed by an RNA recognition motif (RRM), arranged in a unique way to bind RNA. These proteins interact with an extensive variety of cellular RNAs and exhibit activities in two broad categories: i) to promote the metabolism of nascent pol III transcripts, including precursor-tRNAs, by binding to their common, UUU-3’OH containing ends, and ii) to modulate the translation of certain mRNAs involving an unknown binding mechanism. Characterization of several La-RNA crystal structures as well as biochemical studies reveal insight into their unique two-motif domain architecture and how the LAM recognizes UUU-3’OH while the RRM binds other parts of a pre-tRNA. Recent studies of members of distinct families of conserved La-related proteins (LARPs) indicate that some of these harbor activity related to genuine La proteins, suggesting that their UUU-3’OH binding mode has been appropriated for the assembly and regulation of a specific snRNP (e.g., 7SK snRNP assembly by hLARP7/PIP7S). Analyses of other LARP family members (i.e., hLARP4, hLARP6) suggest more diverged RNA binding modes and specialization for cytoplasmic mRNA-related functions. Thus it appears that while genuine La proteins exhibit broad general involvement in both snRNA-related and mRNA-related functions, different LARP families may have evolved specialized activities in either snRNA or mRNA related functions. In this review, we summarize recent progress that has led to greater understanding of the structure and function of La proteins and their roles in tRNA processing and RNP assembly dynamics, as well as progress on the different LARPs. PMID:20138158

  12. Design of antisense RNA constructs for downregulation of the acetone formation pathway of Clostridium acetobutylicum.

    PubMed

    Tummala, Seshu B; Welker, Neil E; Papoutsakis, Eleftherios T

    2003-03-01

    We investigated the effect of antisense RNA (asRNA) structural properties on the downregulation efficacy of enzymes in the acetone-formation pathway (acetoacetate decarboxylase [AADC] and coenzyme A-transferase [CoAT]) of Clostridium acetobutylicum strain ATCC 824. First, we generated three strains, C. acetobutylicum ATCC 824 (pADC38AS), 824(pADC68AS), and 824(pADC100AS), which contain plasmids that produce asRNAs of various lengths against the AADC (adc) transcript. Western analysis showed that all three strains exhibit low levels of AADC compared to the plasmid control [ATCC 824(pSOS95del)]. By using computational algorithms, the three different asRNAs directed toward AADC, along with previously reported clostridial asRNAs, were examined for structural features (free nucleotides and components). When the normalized metrics of these structural features were plotted against percent downregulation, only the component/nucleotide ratio correlated well with in vivo asRNA effectiveness. Despite the significant downregulation of AADC in these strains, there were no concomitant effects on acetone formation. These findings suggest that AADC does not limit acetone formation and, thus, we targeted next the CoAT. Using the component/nucleotide ratio as a selection parameter, we developed three strains [ATCC 824 (pCTFA2AS), 824(pCTFB1AS), and 824(pCOAT11AS)] which express asRNAs to downregulate either or both of the CoAT subunits. Compared to the plasmid control strain, these strains produced substantially low levels of acetone and butanol and Western blot analyses showed significantly low levels of both CoAT subunits. These results show that CoAT is the rate-limiting enzyme in acetone formation and strengthen the hypothesis that the component/nucleotide ratio is a predictive indicator of asRNA effectiveness.

  13. Design of Antisense RNA Constructs for Downregulation of the Acetone Formation Pathway of Clostridium acetobutylicum

    PubMed Central

    Tummala, Seshu B.; Welker, Neil E.; Papoutsakis, Eleftherios T.

    2003-01-01

    We investigated the effect of antisense RNA (asRNA) structural properties on the downregulation efficacy of enzymes in the acetone-formation pathway (acetoacetate decarboxylase [AADC] and coenzyme A-transferase [CoAT]) of Clostridium acetobutylicum strain ATCC 824. First, we generated three strains, C. acetobutylicum ATCC 824 (pADC38AS), 824(pADC68AS), and 824(pADC100AS), which contain plasmids that produce asRNAs of various lengths against the AADC (adc) transcript. Western analysis showed that all three strains exhibit low levels of AADC compared to the plasmid control [ATCC 824(pSOS95del)]. By using computational algorithms, the three different asRNAs directed toward AADC, along with previously reported clostridial asRNAs, were examined for structural features (free nucleotides and components). When the normalized metrics of these structural features were plotted against percent downregulation, only the component/nucleotide ratio correlated well with in vivo asRNA effectiveness. Despite the significant downregulation of AADC in these strains, there were no concomitant effects on acetone formation. These findings suggest that AADC does not limit acetone formation and, thus, we targeted next the CoAT. Using the component/nucleotide ratio as a selection parameter, we developed three strains [ATCC 824 (pCTFA2AS), 824(pCTFB1AS), and 824(pCOAT11AS)] which express asRNAs to downregulate either or both of the CoAT subunits. Compared to the plasmid control strain, these strains produced substantially low levels of acetone and butanol and Western blot analyses showed significantly low levels of both CoAT subunits. These results show that CoAT is the rate-limiting enzyme in acetone formation and strengthen the hypothesis that the component/nucleotide ratio is a predictive indicator of asRNA effectiveness. PMID:12618456

  14. Synthesis, base pairing and structure studies of geranylated RNA

    PubMed Central

    Wang, Rui; Vangaveti, Sweta; Ranganathan, Srivathsan V.; Basanta-Sanchez, Maria; Haruehanroengra, Phensinee; Chen, Alan; Sheng, Jia

    2016-01-01

    Natural RNAs utilize extensive chemical modifications to diversify their structures and functions. 2-Thiouridine geranylation is a special hydrophobic tRNA modification that has been discovered very recently in several bacteria, such as Escherichia coli, Enterobacter aerogenes, Pseudomonas aeruginosa and Salmonella Typhimurium. The geranylated residues are located in the first anticodon position of tRNAs specific for lysine, glutamine and glutamic acid. This big hydrophobic terpene functional group affects the codon recognition patterns and reduces frameshifting errors during translation. We aimed to systematically study the structure, function and biosynthesis mechanism of this geranylation pathway, as well as answer the question of why nature uses such a hydrophobic modification in hydrophilic RNA systems. Recently, we have synthesized the deoxy-analog of S-geranyluridine and showed the geranylated T-G pair is much stronger than the geranylated T-A pair and other mismatched pairs in the B-form DNA duplex context, which is consistent with the observation that the geranylated tRNAGluUUC recognizes GAG more efficiently than GAA. In this manuscript we report the synthesis and base pairing specificity studies of geranylated RNA oligos. We also report extensive molecular simulation studies to explore the structural features of the geranyl group in the context of A-form RNA and its effect on codon–anticodon interaction during ribosome binding. PMID:27307604

  15. NMR studies of two spliced leader RNAs using isotope labeling

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lapham, J.; Crothers, D.M.

    1994-12-01

    Spliced leader RNAs are a class of RNA molecules (<200 nts) involved in the trans splicing of messenger RNA found in trypanosomes, nematodes, and other lower eukaryotes. The spliced leader RNA from the trypanosome Leptomonas collosoma exists in two alternate structural forms with similar thermal stabilities. The 54 nucleotides on the 5′ end of the SL molecule is structurally independent from the 3′ half of the RNA, and displays the two structural forms. Furthermore, the favored of the two structures was shown to contain anomalous nuclease sensitivity and thermal stability features, which suggests that there may be tertiary interactions between the splice site and other nucleotides in the 5′ end. Multidimensional NMR studies are underway to elucidate the structural elements present in the SL RNAs that give rise to their physical properties. Two spliced leader sequences have been studied. The first, the 54 nucleotides on the 5′ end of the L. collosoma sequence, was selected because of earlier studies in our laboratory. The second sequence is the 5′ end of the trypanosome Crithidia fasciculata, which was chosen because of its greater sequence homology to other SL sequences. Given the complexity of the NMR spectra for RNA molecules of this size, we have incorporated ¹⁵N/¹³C-labeled nucleotides into the RNA. One of the techniques we have developed to simplify the spectra of these RNA molecules is isotope labeling of specific regions of the RNA. This has been especially helpful in assigning the secondary structure of molecules that may be able to adopt multiple conformations. Using this technique one can examine a part of the molecule without spectral interference from the unlabeled portion. We hope this approach will promote an avenue for studying the structure of larger RNAs in their native surroundings.

  16. Diverse activities of viral cis-acting RNA regulatory elements revealed using multicolor, long-term, single-cell imaging

    PubMed Central

    Pocock, Ginger M.; Zimdars, Laraine L.; Yuan, Ming; Eliceiri, Kevin W.; Ahlquist, Paul; Sherer, Nathan M.

    2017-01-01

    Cis-acting RNA structural elements govern crucial aspects of viral gene expression. How these structures and other posttranscriptional signals affect RNA trafficking and translation in the context of single cells is poorly understood. Herein we describe a multicolor, long-term (>24 h) imaging strategy for measuring integrated aspects of viral RNA regulatory control in individual cells. We apply this strategy to demonstrate differential mRNA trafficking behaviors governed by RNA elements derived from three retroviruses (HIV-1, murine leukemia virus, and Mason-Pfizer monkey virus), two hepadnaviruses (hepatitis B virus and woodchuck hepatitis virus), and an intron-retaining transcript encoded by the cellular NXF1 gene. Striking behaviors include “burst” RNA nuclear export dynamics regulated by HIV-1’s Rev response element and the viral Rev protein; transient aggregations of RNAs into discrete foci at or near the nuclear membrane triggered by multiple elements; and a novel, pulsiform RNA export activity regulated by the hepadnaviral posttranscriptional regulatory element. We incorporate single-cell tracking and a data-mining algorithm into our approach to obtain RNA element–specific, high-resolution gene expression signatures. Together these imaging assays constitute a tractable, systems-based platform for studying otherwise difficult to access spatiotemporal features of viral and cellular gene regulation. PMID:27903772

  17. Nop9 is a PUF-like protein that prevents premature cleavage to correctly process pre-18S rRNA

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zhang, Jun; McCann, Kathleen L.; Qiu, Chen

    Numerous factors direct eukaryotic ribosome biogenesis, and defects in a single ribosome assembly factor may be lethal or produce tissue-specific human ribosomopathies. Pre-ribosomal RNAs (pre-rRNAs) must be processed stepwise and at the correct subcellular locations to produce the mature rRNAs. Nop9 is a conserved small ribosomal subunit biogenesis factor, essential in yeast. Here we report a 2.1-Å crystal structure of Nop9 and a small-angle X-ray-scattering model of a Nop9:RNA complex that reveals a ‘C’-shaped fold formed from 11 Pumilio repeats. We show that Nop9 recognizes sequence and structural features of the 20S pre-rRNA near the cleavage site of the nuclease, Nob1. We further demonstrate that Nop9 inhibits Nob1 cleavage, the final processing step to produce mature small ribosomal subunit 18S rRNA. Together, our results suggest that Nop9 is critical for timely cleavage of the 20S pre-rRNA. Moreover, the Nop9 structure exemplifies a new class of Pumilio repeat proteins.

  18. Identification of 15 candidate structured noncoding RNA motifs in fungi by comparative genomics.

    PubMed

    Li, Sanshu; Breaker, Ronald R

    2017-10-13

    With the development of rapid and inexpensive DNA sequencing, the genome sequences of more than 100 fungal species have been made available. This dataset provides an excellent resource for comparative genomics analyses, which can be used to discover genetic elements, including noncoding RNAs (ncRNAs). Bioinformatics tools similar to those used to uncover novel ncRNAs in bacteria, likewise, should be useful for searching fungal genomic sequences, and the relative ease of genetic experiments with some model fungal species could facilitate experimental validation studies. We have adapted a bioinformatics pipeline for discovering bacterial ncRNAs to systematically analyze many fungal genomes. This comparative genomics pipeline integrates information on conserved RNA sequence and structural features with alternative splicing information to reveal fungal RNA motifs that are candidate regulatory domains, or that might have other possible functions. A total of 15 prominent classes of structured ncRNA candidates were identified, including variant HDV self-cleaving ribozyme representatives, atypical snoRNA candidates, and possible structured antisense RNA motifs. Candidate regulatory motifs were also found associated with genes for ribosomal proteins, S-adenosylmethionine decarboxylase (SDC), amidase, and HexA protein involved in Woronin body formation. We experimentally confirm that the variant HDV ribozymes undergo rapid self-cleavage, and we demonstrate that the SDC RNA motif reduces the expression of SAM decarboxylase by translational repression. Furthermore, we provide evidence that several other motifs discovered in this study are likely to be functional ncRNA elements. Systematic screening of fungal genomes using a computational discovery pipeline has revealed the existence of a variety of novel structured ncRNAs. Genome contexts and similarities to known ncRNA motifs provide strong evidence for the biological and biochemical functions of some newly found ncRNA motifs. Although initial examinations of several motifs provide evidence for their likely functions, other motifs will require more in-depth analysis to reveal their functions.

  19. An empirical strategy to detect bacterial transcript structure from directional RNA-seq transcriptome data.

    PubMed

    Wang, Yejun; MacKenzie, Keith D; White, Aaron P

    2015-05-07

    As sequencing costs are being lowered continuously, RNA-seq has gradually been adopted as the first choice for comparative transcriptome studies with bacteria. Unlike microarrays, RNA-seq can directly detect cDNA derived from mRNA transcripts at a single nucleotide resolution. Not only does this allow researchers to determine the absolute expression level of genes, but it also conveys information about transcript structure. Few automatic software tools have yet been established to investigate large-scale RNA-seq data for bacterial transcript structure analysis. In this study, 54 directional RNA-seq libraries from Salmonella serovar Typhimurium (S. Typhimurium) 14028s were examined for potential relationships between read mapping patterns and transcript structure. We developed an empirical method, combined with statistical tests, to automatically detect key transcript features, including transcriptional start sites (TSSs), transcriptional termination sites (TTSs) and operon organization. Using our method, we obtained 2,764 TSSs and 1,467 TTSs for 1,331 and 844 different genes, respectively. Identification of TSSs facilitated further discrimination of 215 putative sigma 38 regulons and 863 potential sigma 70 regulons. Combining the TSSs and TTSs with intergenic distance and co-expression information, we comprehensively annotated the operon organization in S. Typhimurium 14028s. Our results show that directional RNA-seq can be used to detect transcriptional borders at an acceptable resolution of ±10-20 nucleotides. Technical limitations of the RNA-seq procedure may prevent single nucleotide resolution. The automatic transcript border detection methods, statistical models and operon organization pipeline that we have described could be widely applied to RNA-seq studies in other bacteria. Furthermore, the TSSs, TTSs, operons, promoters and untranslated regions that we have defined for S. Typhimurium 14028s may constitute valuable resources that can be used for comparative analyses with other Salmonella serotypes.
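
The abstract above does not spell out its empirical procedure, so the following is only a minimal sketch of the general idea of reading transcript borders from read-mapping patterns, not the authors' method. It assumes a per-nucleotide coverage list for one strand and flags positions where mean coverage rises sharply. The function name `candidate_start_sites` and the parameters `window`, `min_fold` and `min_depth` are hypothetical and chosen for illustration.

```python
from typing import List, Tuple


def candidate_start_sites(coverage: List[int], window: int = 5,
                          min_fold: float = 4.0,
                          min_depth: float = 10.0) -> List[int]:
    """Flag positions where strand-specific coverage jumps sharply.

    Hypothetical illustration only: thresholds are arbitrary assumptions,
    and no statistical test is applied.
    """
    hits: List[Tuple[int, float]] = []  # (position, size of the jump)
    for pos in range(window, len(coverage) - window + 1):
        upstream = sum(coverage[pos - window:pos]) / window
        downstream = sum(coverage[pos:pos + window]) / window
        # Require a minimum depth and a fold increase over upstream coverage
        # (a pseudocount of 1 avoids division-like blowups at zero coverage).
        if downstream >= min_depth and downstream >= min_fold * (upstream + 1.0):
            hits.append((pos, downstream - upstream))

    # Collapse each run of adjacent hits to the position with the largest jump.
    sites: List[int] = []
    run: List[Tuple[int, float]] = []
    for pos, jump in hits:
        if run and pos != run[-1][0] + 1:
            sites.append(max(run, key=lambda h: h[1])[0])
            run = []
        run.append((pos, jump))
    if run:
        sites.append(max(run, key=lambda h: h[1])[0])
    return sites


if __name__ == "__main__":
    # Toy coverage track: no reads for 20 nt, then a transcript at depth 50.
    toy = [0] * 20 + [50] * 20
    print(candidate_start_sites(toy))
```

A termination-site candidate could be sketched the same way by scanning the reversed coverage track. Real data would also need replicate libraries and a significance test, which this sketch leaves out.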

  20. Prediction of plant pre-microRNAs and their microRNAs in genome-scale sequences using structure-sequence features and support vector machine.

    PubMed

    Meng, Jun; Liu, Dong; Sun, Chao; Luan, Yushi

    2014-12-30

    MicroRNAs (miRNAs) are a family of non-coding RNAs approximately 21 nucleotides in length that play pivotal roles at the post-transcriptional level in animals, plants and viruses. These molecules silence their target genes by degrading transcription or suppressing translation. Studies have shown that miRNAs are involved in biological responses to a variety of biotic and abiotic stresses. Identification of these molecules and their targets can aid the understanding of regulatory processes. Recently, prediction methods based on machine learning have been widely used for miRNA prediction. However, most of these methods were designed for mammalian miRNA prediction, and few are available for predicting miRNAs in the pre-miRNAs of specific plant species. Although the complete Solanum lycopersicum genome has been published, only 77 Solanum lycopersicum miRNAs have been identified, far less than the estimated number. Therefore, it is essential to develop a prediction method based on machine learning to identify new plant miRNAs. A novel classification model based on a support vector machine (SVM) was trained to identify real and pseudo plant pre-miRNAs together with their miRNAs. An initial set of 152 novel features related to sequential structures was used to train the model. By applying feature selection, we obtained the best subset of 47 features for use with the Back Support Vector Machine-Recursive Feature Elimination (B-SVM-RFE) method for the classification of plant pre-miRNAs. Using this method, 63 features were obtained for plant miRNA classification. We then developed an integrated classification model, miPlantPreMat, which comprises MiPlantPre and MiPlantMat, to identify plant pre-miRNAs and their miRNAs. This model achieved approximately 90% accuracy using plant datasets from nine plant species, including Arabidopsis thaliana, Glycine max, Oryza sativa, Physcomitrella patens, Medicago truncatula, Sorghum bicolor, Arabidopsis lyrata, Zea mays and Solanum lycopersicum. Using miPlantPreMat, 522 Solanum lycopersicum miRNAs were identified in the Solanum lycopersicum genome sequence. We developed an integrated classification model, miPlantPreMat, based on structure-sequence features and SVM. MiPlantPreMat was used to identify both plant pre-miRNAs and the corresponding mature miRNAs. An improved feature selection method was proposed, resulting in high classification accuracy, sensitivity and specificity.

  1. The complete mitochondrial genome of the gall-forming fly, Fergusonina taylori Nelson and Yeates (Diptera: Fergusoninidae).

    PubMed

    Nelson, Leigh A; Cameron, Stephen L; Yeates, David K

    2011-10-01

    The monogeneric family Fergusoninidae consists of gall-forming flies that, together with Fergusobia (Tylenchida: Neotylenchidae) nematodes, form the only known mutualistic association between insects and nematodes. In this study, the entire 16,000 bp mitochondrial genome of Fergusonina taylori Nelson and Yeates was sequenced. The circular genome contains one encoding region including 27 genes and one non-coding A+T-rich region. The arrangement of the protein-coding, ribosomal RNA (rRNA) and transfer RNA (tRNA) genes was the same as that found in the ancestral insect. Nucleotide composition is highly A+T biased. All of the protein initiation codons are ATN, except for nad1 which begins with TTT. All 22 tRNA anticodons of F. taylori match those observed in Drosophila yakuba, and all form the typical cloverleaf structure except for tRNA-Ser(AGN), which lacks a dihydrouridine (DHU) arm. Secondary structural features of the rRNA genes of Fergusonina are similar to those proposed for other insects, with minor modifications. The mitochondrial genome of Fergusonina presented here may prove valuable for resolving the sister group to the Fergusoninidae, and expands the available mtDNA data sources for acalyptrates overall.

  2. Conserved and divergent features of the structure and function of La and La-related proteins (LARPs).

    PubMed

    Bayfield, Mark A; Yang, Ruiqing; Maraia, Richard J

    2010-01-01

    Genuine La proteins contain two RNA binding motifs, a La motif (LAM) followed by an RNA recognition motif (RRM), arranged in a unique way to bind RNA. These proteins interact with an extensive variety of cellular RNAs and exhibit activities in two broad categories: i) to promote the metabolism of nascent pol III transcripts, including precursor-tRNAs, by binding to their common, UUU-3'OH containing ends, and ii) to modulate the translation of certain mRNAs involving an unknown binding mechanism. Characterization of several La-RNA crystal structures as well as biochemical studies reveal insight into their unique two-motif domain architecture and how the LAM recognizes UUU-3'OH while the RRM binds other parts of a pre-tRNA. Recent studies of members of distinct families of conserved La-related proteins (LARPs) indicate that some of these harbor activity related to genuine La proteins, suggesting that their UUU-3'OH binding mode has been appropriated for the assembly and regulation of a specific snRNP (e.g., 7SK snRNP assembly by hLARP7/PIP7S). Analyses of other LARP family members suggest more diverged RNA binding modes and specialization for cytoplasmic mRNA-related functions. Thus it appears that while genuine La proteins exhibit broad general involvement in both snRNA-related and mRNA-related functions, different LARP families may have evolved specialized activities in either snRNA or mRNA-related functions. In this review, we summarize recent progress that has led to greater understanding of the structure and function of La proteins and their roles in tRNA processing and RNP assembly dynamics, as well as progress on the different LARPs.

  3. Pseudoscorpion mitochondria show rearranged genes and genome-wide reductions of RNA gene sizes and inferred structures, yet typical nucleotide composition bias

    PubMed Central

    2012-01-01

    Background: Pseudoscorpions are chelicerates and have historically been viewed as being most closely related to solifuges, harvestmen, and scorpions. No mitochondrial genomes of pseudoscorpions have been published, but the mitochondrial genomes of some lineages of Chelicerata possess unusual features, including short rRNA genes and tRNA genes that lack sequence to encode arms of the canonical cloverleaf-shaped tRNA. Additionally, some chelicerates possess an atypical guanine-thymine nucleotide bias on the major coding strand of their mitochondrial genomes. Results: We sequenced the mitochondrial genomes of two divergent taxa from the chelicerate order Pseudoscorpiones. We find that these genomes possess unusually short tRNA genes that do not encode cloverleaf-shaped tRNA structures. Indeed, in one genome, all 22 tRNA genes lack sequence to encode canonical cloverleaf structures. We also find that the large ribosomal RNA genes are substantially shorter than those of most arthropods. We inferred secondary structures of the LSU rRNAs from both pseudoscorpions, and find that they have lost multiple helices. Based on comparisons with the crystal structure of the bacterial ribosome, two of these helices were likely contact points with tRNA T-arms or D-arms as they pass through the ribosome during protein synthesis. The mitochondrial gene arrangements of both pseudoscorpions differ from the ancestral chelicerate gene arrangement. One genome is rearranged with respect to the location of protein-coding genes, the small rRNA gene, and at least 8 tRNA genes. The other genome contains 6 tRNA genes in novel locations. Most chelicerates with rearranged mitochondrial genes show a genome-wide reversal of the CA nucleotide bias typical for arthropods on their major coding strand, and instead possess a GT bias. Yet despite their extensive rearrangement, these pseudoscorpion mitochondrial genomes possess a CA bias on the major coding strand. Phylogenetic analyses of all 13 mitochondrial protein-coding gene sequences consistently yield trees that place pseudoscorpions as sister to acariform mites. Conclusion: The well-supported phylogenetic placement of pseudoscorpions as sister to Acariformes differs from some previous analyses based on morphology. However, these two lineages share multiple molecular evolutionary traits, including substantial mitochondrial genome rearrangements, extensive nucleotide substitution, and loss of helices in their inferred tRNA and rRNA structures. PMID:22409411

  4. Solenopsis invicta virus 3: mapping of structural proteins, ribosomal frameshifting, and similarities to Acyrthosiphon pisum virus and Kelp fly virus.

    PubMed

    Valles, Steven M; Bell, Susanne; Firth, Andrew E

    2014-01-01

    Solenopsis invicta virus 3 (SINV-3) is a positive-sense single-stranded RNA virus that infects the red imported fire ant, Solenopsis invicta. We show that the second open reading frame (ORF) of the dicistronic genome is expressed via a frameshifting mechanism and that the sequences encoding the structural proteins map to both ORF2 and the 3' end of ORF1, downstream of the sequence that encodes the RNA-dependent RNA polymerase. The genome organization and structural protein expression strategy resemble those of Acyrthosiphon pisum virus (APV), an aphid virus. The capsid protein that is encoded by the 3' end of ORF1 in SINV-3 and APV is predicted to have a jelly-roll fold similar to the capsid proteins of picornaviruses and caliciviruses. The capsid-extension protein that is produced by frameshifting includes the jelly-roll fold domain encoded by ORF1 as its N-terminus, while the C-terminus encoded by the 5' half of ORF2 has no clear homology with other viral structural proteins. A third protein, encoded by the 3' half of ORF2, is associated with purified virions at sub-stoichiometric ratios. Although the structural proteins can be translated from the genomic RNA, we show that SINV-3 also produces a subgenomic RNA encoding the structural proteins. Circumstantial evidence suggests that APV may also produce such a subgenomic RNA. Both SINV-3 and APV are unclassified picorna-like viruses distantly related to members of the order Picornavirales and the family Caliciviridae. Within this grouping, features of the genome organization and capsid domain structure of SINV-3 and APV appear more similar to caliciviruses, perhaps suggesting the basis for a "Calicivirales" order.

  5. Avilamycin and evernimicin induce structural changes in rProteins uL16 and CTC that enhance the inhibition of A-site tRNA binding

    PubMed Central

    Krupkin, Miri; Wekselman, Itai; Matzov, Donna; Eyal, Zohar; Diskin Posner, Yael; Rozenberg, Haim; Zimmerman, Ella; Bashan, Anat; Yonath, Ada

    2016-01-01

    Two structurally unique ribosomal antibiotics belonging to the orthosomycin family, avilamycin and evernimicin, possess activity against Enterococci, Staphylococci, and Streptococci, and other Gram-positive bacteria. Here, we describe the high-resolution crystal structures of the eubacterial large ribosomal subunit in complex with them. Their extended binding sites span the A-tRNA entrance corridor, thus inhibiting protein biosynthesis by blocking the binding site of the A-tRNA elbow, a mechanism not shared with other known antibiotics. Along with using the ribosomal components that bind and discriminate the A-tRNA—namely, ribosomal RNA (rRNA) helices H89, H91, and ribosomal proteins (rProtein) uL16—these structures revealed novel interactions with domain 2 of the CTC protein, a feature typical to various Gram-positive bacteria. Furthermore, analysis of these structures explained how single nucleotide mutations and methylations in helices H89 and H91 confer resistance to orthosomycins and revealed the sequence variations in 23S rRNA nucleotides alongside the difference in the lengths of the eukaryotic and prokaryotic α1 helix of protein uL16 that play a key role in the selectivity of those drugs. The accurate interpretation of the crystal structures that could be performed beyond that recently reported in cryo-EM models provide structural insights that may be useful for the design of novel pathogen-specific antibiotics, and for improving the potency of orthosomycins. Because both drugs are extensively metabolized in vivo, their environmental toxicity is very low, thus placing them at the frontline of drugs with reduced ecological hazards. PMID:27791159

  6. Multiple capsid-stabilizing interactions revealed in a high-resolution structure of an emerging picornavirus causing neonatal sepsis

    NASA Astrophysics Data System (ADS)

    Shakeel, Shabih; Westerhuis, Brenda M.; Domanska, Ausra; Koning, Roman I.; Matadeen, Rishi; Koster, Abraham J.; Bakker, Arjen Q.; Beaumont, Tim; Wolthers, Katja C.; Butcher, Sarah J.

    2016-07-01

    The poorly studied picornavirus, human parechovirus 3 (HPeV3) causes neonatal sepsis with no therapies available. Our 4.3-Å resolution structure of HPeV3 on its own and at 15 Å resolution in complex with human monoclonal antibody Fabs demonstrates the expected picornavirus capsid structure with three distinct features. First, 25% of the HPeV3 RNA genome in 60 sites is highly ordered as confirmed by asymmetric reconstruction, and interacts with conserved regions of the capsid proteins VP1 and VP3. Second, the VP0 N terminus stabilizes the capsid inner surface, in contrast to other picornaviruses where on expulsion as VP4, it forms an RNA translocation channel. Last, VP1's hydrophobic pocket, the binding site for the antipicornaviral drug, pleconaril, is blocked and thus inappropriate for antiviral development. Together, these results suggest a direction for development of neutralizing antibodies, antiviral drugs based on targeting the RNA-protein interactions and dissection of virus assembly on the basis of RNA nucleation.

  7. The impact of feature selection on one and two-class classification performance for plant microRNAs.

    PubMed

    Khalifa, Waleed; Yousef, Malik; Saçar Demirci, Müşerref Duygu; Allmer, Jens

    2016-01-01

    MicroRNAs (miRNAs) are short nucleotide sequences that form a typical hairpin structure which is recognized by a complex enzyme machinery. This recognition ultimately leads to the incorporation of 18-24 nt long mature miRNAs into RISC, where they act as recognition keys to aid in regulation of target mRNAs. Determining miRNAs experimentally is involved and, therefore, machine learning is used to complement such endeavors. The success of machine learning mostly depends on proper input data and appropriate features for parameterization of the data. Although two-class classification (TCC) is generally used in the field, one-class classification (OCC) has been tried for pre-miRNA detection because negative examples are hard to come by. Since both positive and negative examples are currently somewhat limited, feature selection can prove to be vital for furthering the field of pre-miRNA detection. In this study, we compare the performance of OCC and TCC using eight feature selection methods and seven different plant species providing positive pre-miRNA examples. Feature selection was very successful for OCC, where the best feature selection method achieved an average accuracy of 95.6%, thereby being ∼29% better than the worst method, which achieved 66.9% accuracy. While the performance is comparable to TCC, which performs up to 3% better than OCC, TCC is much less affected by feature selection and its largest performance gap is ∼13%, which only occurs for two of the feature selection methodologies. We conclude that feature selection is crucially important for OCC and that it can perform on par with TCC given the proper set of features.

  8. In silico design of ligand triggered RNA switches.

    PubMed

    Findeiß, Sven; Hammer, Stefan; Wolfinger, Michael T; Kühnl, Felix; Flamm, Christoph; Hofacker, Ivo L

    2018-04-13

    This contribution sketches a workflow to design an RNA switch that is able to adopt two structural conformations in a ligand-dependent way. A well-characterized RNA aptamer, i.e., one whose Kd and adaptive structural features are known, is an essential ingredient of the described design process. We exemplify the principles using the well-known theophylline aptamer throughout this work. The aptamer in its ligand-binding competent structure represents one structural conformation of the switch, while an alternative fold that disrupts the binding-competent structure forms the other conformation. To keep it simple, we do not incorporate any regulatory mechanism to control transcription or translation. We elucidate a commonly used design process by explicitly dissecting and explaining the necessary steps in detail. We developed a novel objective function which specifies the mechanistics of this simple, ligand-triggered riboswitch and describe an extensive in silico analysis pipeline to evaluate important kinetic properties of the designed sequences. This protocol and the developed software can be easily extended or adapted to fit novel design scenarios and thus can serve as a template for future needs. Copyright © 2018. Published by Elsevier Inc.

  9. Structural Chemistry of Human RNA Methyltransferases.

    PubMed

    Schapira, Matthieu

    2016-03-18

    RNA methyltransferases (RNMTs) play important roles in RNA stability, splicing, and epigenetic mechanisms. They constitute a promising target class that is underexplored by the medicinal chemistry community. Information of relevance to drug design can be extracted from the rich structural coverage of human RNMTs. In this work, the structural chemistry of this protein family is analyzed in depth. Unlike most methyltransferases, RNMTs generally feature a substrate-binding site that is largely open on the cofactor-binding pocket, favoring the design of bisubstrate inhibitors. Substrate purine or pyrimidines are often sandwiched between hydrophobic walls that can accommodate planar ring systems. When the substrate base is laying on a shallow surface, a 5' flanking base is sometimes anchored in a druggable cavity. The cofactor-binding site is structurally more diverse than in protein methyltransferases and more druggable in SPOUT than in Rossman-fold enzymes. Finally, conformational plasticity observed both at the substrate and cofactor binding sites may be a challenge for structure-based drug design. The landscape drawn here may inform ongoing efforts toward the discovery of the first human RNMT inhibitors.

  10. Mechanism for Coordinated RNA Packaging and Genome Replication by Rotavirus Polymerase VP1

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lu, Xiaohui; McDonald, Sarah M.; Tortorici, M. Alejandra

    2009-04-08

    Rotavirus RNA-dependent RNA polymerase VP1 catalyzes RNA synthesis within a subviral particle. This activity depends on core shell protein VP2. A conserved sequence at the 3' end of plus-strand RNA templates is important for polymerase association and genome replication. We have determined the structure of VP1 at 2.9 Å resolution, as apoenzyme and in complex with RNA. The cage-like enzyme is similar to reovirus λ3, with four tunnels leading to or from a central, catalytic cavity. A distinguishing characteristic of VP1 is specific recognition, by conserved features of the template-entry channel, of four bases, UGUG, in the conserved 3' sequence. Well-defined interactions with these bases position the RNA so that its 3' end overshoots the initiating register, producing a stable but catalytically inactive complex. We propose that specific 3' end recognition selects rotavirus RNA for packaging and that VP2 activates the autoinhibited VP1/RNA complex to coordinate packaging and genome replication.

  11. Deep sequencing of foot-and-mouth disease virus reveals RNA sequences involved in genome packaging.

    PubMed

    Logan, Grace; Newman, Joseph; Wright, Caroline F; Lasecka-Dykes, Lidia; Haydon, Daniel T; Cottam, Eleanor M; Tuthill, Tobias J

    2017-10-18

    Non-enveloped viruses protect their genomes by packaging them into an outer shell or capsid of virus-encoded proteins. Packaging and capsid assembly in RNA viruses can involve interactions between capsid proteins and secondary structures in the viral genome as exemplified by the RNA bacteriophage MS2 and as proposed for other RNA viruses of plants, animals and human. In the picornavirus family of non-enveloped RNA viruses, the requirements for genome packaging remain poorly understood. Here we show a novel and simple approach to identify predicted RNA secondary structures involved in genome packaging in the picornavirus foot-and-mouth disease virus (FMDV). By interrogating deep sequencing data generated from both packaged and unpackaged populations of RNA we have determined multiple regions of the genome with constrained variation in the packaged population. Predicted secondary structures of these regions revealed stem loops with conservation of structure and a common motif at the loop. Disruption of these features resulted in attenuation of virus growth in cell culture due to a reduction in assembly of mature virions. This study provides evidence for the involvement of predicted RNA structures in picornavirus packaging and offers a readily transferable methodology for identifying packaging requirements in many other viruses. Importance In order to transmit their genetic material to a new host, non-enveloped viruses must protect their genomes by packaging them into an outer shell or capsid of virus-encoded proteins. For many non-enveloped RNA viruses the requirements for this critical part of the viral life cycle remain poorly understood. We have identified RNA sequences involved in genome packaging of the picornavirus foot-and-mouth disease virus. This virus causes an economically devastating disease of livestock affecting both the developed and developing world. 
The experimental methods developed to carry out this work are novel, simple and transferable to the study of packaging signals in other RNA viruses. Improved understanding of RNA packaging may lead to novel vaccine approaches or targets for antiviral drugs with broad spectrum activity. Copyright © 2017 Logan et al.

  12. Sequence features associated with the cleavage efficiency of CRISPR/Cas9 system.

    PubMed

    Liu, Xiaoxi; Homma, Ayaka; Sayadi, Jamasb; Yang, Shu; Ohashi, Jun; Takumi, Toru

    2016-01-27

    The CRISPR-Cas9 system has recently emerged as a versatile tool for biological and medical research. In this system, a single guide RNA (sgRNA) directs the endonuclease Cas9 to a targeted DNA sequence for site-specific manipulation. In addition to this targeting function, the sgRNA has also been shown to play a role in activating the endonuclease activity of Cas9. This dual function of the sgRNA likely underlies observations that different sgRNAs have varying on-target activities. Currently, our understanding of the relationship between sequence features of sgRNAs and their on-target cleavage efficiencies remains limited, largely due to difficulties in assessing the cleavage capacity of a large number of sgRNAs. In this study, we evaluated the cleavage activities of 218 sgRNAs using in vitro Surveyor assays. We found that nucleotides at both PAM-distal and PAM-proximal regions of the sgRNA are significantly correlated with on-target efficiency. Furthermore, we also demonstrated that the genomic context of the targeted DNA, the GC percentage, and the secondary structure of sgRNA are critical factors contributing to cleavage efficiency. In summary, our study reveals important parameters for the design of sgRNAs with high on-target efficiencies, especially in the context of high throughput applications.
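    One of the sequence features named in the abstract above, GC percentage, is simple to compute. The following is a minimal sketch only; the function name and the 20-nt example sequence are hypothetical and are not taken from the study.

```python
def gc_percentage(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("sequence must not be empty")
    # Count guanine and cytosine; every other symbol counts as non-GC.
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Hypothetical 20-nt guide sequence, for illustration only.
example_guide = "GACGTTACCGGATCAAGCTT"
print(f"GC content: {gc_percentage(example_guide):.1f}%")
```

    How GC percentage relates to cleavage efficiency is the study's empirical finding; the sketch computes the feature and nothing more.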

  13. Structural and biochemical studies of RIG-I antiviral signaling.

    PubMed

    Feng, Miao; Ding, Zhanyu; Xu, Liang; Kong, Liangliang; Wang, Wenjia; Jiao, Shi; Shi, Zhubing; Greene, Mark I; Cong, Yao; Zhou, Zhaocai

    2013-02-01

    Retinoic acid-inducible gene I (RIG-I) is an important pattern recognition receptor that detects viral RNA and triggers the production of type-I interferons through the downstream adaptor MAVS (also called IPS-1, CARDIF, or VISA). A series of structural studies have elaborated some of the mechanisms of dsRNA recognition and activation of RIG-I. Recent studies have proposed that K63-linked ubiquitination of, or unanchored K63-linked polyubiquitin binding to RIG-I positively regulates MAVS-mediated antiviral signaling. Conversely phosphorylation of RIG-I appears to play an inhibitory role in controlling RIG-I antiviral signal transduction. Here we performed a combined structural and biochemical study to further define the regulatory features of RIG-I signaling. ATP and dsRNA binding triggered dimerization of RIG-I with conformational rearrangements of the tandem CARD domains. Full length RIG-I appeared to form a complex with dsRNA in a 2:2 molar ratio. Compared with the previously reported crystal structures of RIG-I in inactive state, our electron microscopic structure of full length RIG-I in complex with blunt-ended dsRNA, for the first time, revealed an exposed active conformation of the CARD domains. Moreover, we found that purified recombinant RIG-I proteins could bind to the CARD domain of MAVS independently of dsRNA, while S8E and T170E phosphorylation-mimicking mutants of RIG-I were defective in binding E3 ligase TRIM25, unanchored K63-linked polyubiquitin, and MAVS regardless of dsRNA. These findings suggested that phosphorylation of RIG inhibited downstream signaling by impairing RIG-I binding with polyubiquitin and its interaction with MAVS.

  14. New computational methods reveal tRNA identity element divergence between Proteobacteria and Cyanobacteria.

    PubMed

    Freyhult, Eva; Cui, Yuanyuan; Nilsson, Olle; Ardell, David H

    2007-10-01

    There are at least 21 subfunctional classes of tRNAs in most cells that, despite a very highly conserved and compact common structure, must interact specifically with different cliques of proteins or cause grave organismal consequences. Protein recognition of specific tRNA substrates is achieved in part through class-restricted tRNA features called tRNA identity determinants. In earlier work we used TFAM, a statistical classifier of tRNA function, to show evidence of unexpectedly large diversity among bacteria in tRNA identity determinants. We also created a data reduction technique called function logos to visualize identity determinants for a given taxon. Here we show evidence that determinants for lysylated isoleucine tRNAs are not the same in Proteobacteria as in other bacterial groups including the Cyanobacteria. Consistent with this, the lysylating biosynthetic enzyme TilS lacks a C-terminal domain in Cyanobacteria that is present in Proteobacteria. We present here, using function logos, a map estimating all potential identity determinants generally operational in Cyanobacteria and Proteobacteria. To further isolate the differences in potential tRNA identity determinants between Proteobacteria and Cyanobacteria, we created two new data reduction visualizations to contrast sequence and function logos between two taxa. One, called Information Difference logos (ID logos), shows the evolutionary gain or retention of functional information associated to features in one lineage. The other, Kullback-Leibler divergence Difference logos (KLD logos), shows recruitments or shifts in the functional associations of features, especially those informative in both lineages. We used these new logos to specifically isolate and visualize the differences in potential tRNA identity determinants between Proteobacteria and Cyanobacteria. Our graphical results point to numerous differences in potential tRNA identity determinants between these groups. 
Although more differences in general are explained by shifts in functional association rather than gains or losses, the apparent identity differences in lysylated isoleucine tRNAs appear to have evolved through both mechanisms.
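    For readers unfamiliar with the quantity behind the KLD logos, the sketch below computes the standard Kullback-Leibler divergence between two discrete distributions, in bits. It is a generic illustration, assuming both inputs are probability vectors over the same outcomes; it is not the authors' logo-construction procedure, and the example frequencies are hypothetical.

```python
import math


def kl_divergence(p, q):
    """Kullback-Leibler divergence D(p || q) in bits for discrete distributions.

    Terms with p_i == 0 contribute nothing; q_i == 0 with p_i > 0 gives infinity.
    """
    if len(p) != len(q):
        raise ValueError("distributions must have the same length")
    total = 0.0
    for p_i, q_i in zip(p, q):
        if p_i == 0:
            continue
        if q_i == 0:
            return math.inf
        total += p_i * math.log2(p_i / q_i)
    return total


# Hypothetical base frequencies (A, C, G, U) at one tRNA position in two taxa.
taxon_a = [0.7, 0.1, 0.1, 0.1]
taxon_b = [0.25, 0.25, 0.25, 0.25]
print(f"D(a || b) = {kl_divergence(taxon_a, taxon_b):.3f} bits")
```

    The divergence is asymmetric, so D(a || b) generally differs from D(b || a); which direction the published logos use is not stated in the abstract.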

  15. Computing the origin and evolution of the ribosome from its structure — Uncovering processes of macromolecular accretion benefiting synthetic biology

    PubMed Central

    Caetano-Anollés, Gustavo; Caetano-Anollés, Derek

    2015-01-01

    Accretion occurs pervasively in nature at widely different timeframes. The process also manifests in the evolution of macromolecules. Here we review recent computational and structural biology studies of evolutionary accretion that make use of the ideographic (historical, retrodictive) and nomothetic (universal, predictive) scientific frameworks. Computational studies uncover explicit timelines of accretion of structural parts in molecular repertoires and molecules. Phylogenetic trees of protein structural domains and proteomes and their molecular functions were built from a genomic census of millions of encoded proteins and associated terminal Gene Ontology terms. Trees reveal a ‘metabolic-first’ origin of proteins, the late development of translation, and a patchwork distribution of proteins in biological networks mediated by molecular recruitment. Similarly, the natural history of ancient RNA molecules inferred from trees of molecular substructures built from a census of molecular features shows patchwork-like accretion patterns. Ideographic analyses of ribosomal history uncover the early appearance of structures supporting mRNA decoding and tRNA translocation, the coevolution of ribosomal proteins and RNA, and a first evolutionary transition that brings ribosomal subunits together into a processive protein biosynthetic complex. Nomothetic structural biology studies of tertiary interactions and ancient insertions in rRNA complement these findings, once concentric layering assumptions are removed. Patterns of coaxial helical stacking reveal a frustrated dynamics of outward and inward ribosomal growth possibly mediated by structural grafting. The early rise of the ribosomal ‘turnstile’ suggests an evolutionary transition in natural biological computation. Results make explicit the need to understand processes of molecular growth and information transfer of macromolecules. PMID:27096056

  16. Innovative approaches to the use of polyamines for DNA nanoparticle preparation for gene therapy.

    PubMed

    Vijayanathan, Veena; Agostinelli, Enzo; Thomas, Thresia; Thomas, T J

    2014-03-01

    Advances in genomic technologies, such as next generation sequencing and disease specific gene targeting through anti-sense, anti-gene, siRNA and microRNA approaches require the transport of nucleic acid drugs through the cell membrane. Membrane transport of DNA/RNA drugs is an inefficient process, and the mechanism(s) by which this process occurs is not clear. A pre-requisite for effective transport of DNA and RNA in cells is their condensation to nanoparticles of ~100 nm size. Although viral vectors are effective in gene therapy, the immune response elicited by viral proteins poses a major challenge. Multivalent cations, such as natural polyamines are excellent promoters of DNA/RNA condensation to nanoparticles. During the past 20 years, our laboratory has synthesized and tested several analogs of the natural polyamine, spermine, for their efficacy to provoke DNA condensation to nanoparticles. We determined the thermodynamics of polyamine-mediated DNA condensation, measured the structural specificity effects of polyamine analogs in facilitating the cellular uptake of oligonucleotides, and evaluated the gene silencing activity of DNA nanoparticles in breast cancer cells. Polyamine-complexed oligonucleotides showed a synergistic effect on target gene inhibition at the mRNA level compared to the use of polyamines and oligonucleotides as single agents. Ionic and structural specificity effects were evident in DNA condensation and cellular transportation effects of polyamines. In condensed DNA structures, correlation exists between the attractive and repulsive forces with structurally different polyamines and cobalt hexamine, indicating the existence of a common force in stabilizing the condensed structures. Future studies aimed at defining the mechanism(s) of DNA compaction and structural features of DNA nanoparticles might aid in the development of novel gene delivery vehicles.

  17. RNABindRPlus: a predictor that combines machine learning and sequence homology-based methods to improve the reliability of predicted RNA-binding residues in proteins.

    PubMed

    Walia, Rasna R; Xue, Li C; Wilkins, Katherine; El-Manzalawy, Yasser; Dobbs, Drena; Honavar, Vasant

    2014-01-01

    Protein-RNA interactions are central to essential cellular processes such as protein synthesis and regulation of gene expression and play roles in human infectious and genetic diseases. Reliable identification of protein-RNA interfaces is critical for understanding the structural bases and functional implications of such interactions and for developing effective approaches to rational drug design. Sequence-based computational methods offer a viable, cost-effective way to identify putative RNA-binding residues in RNA-binding proteins. Here we report two novel approaches: (i) HomPRIP, a sequence homology-based method for predicting RNA-binding sites in proteins; (ii) RNABindRPlus, a new method that combines predictions from HomPRIP with those from an optimized Support Vector Machine (SVM) classifier trained on a benchmark dataset of 198 RNA-binding proteins. Although highly reliable, HomPRIP cannot make predictions for the unaligned parts of query proteins and its coverage is limited by the availability of close sequence homologs of the query protein with experimentally determined RNA-binding sites. RNABindRPlus overcomes these limitations. We compared the performance of HomPRIP and RNABindRPlus with that of several state-of-the-art predictors on two test sets, RB44 and RB111. On a subset of proteins for which homologs with experimentally determined interfaces could be reliably identified, HomPRIP outperformed all other methods achieving an MCC of 0.63 on RB44 and 0.83 on RB111. RNABindRPlus was able to predict RNA-binding residues of all proteins in both test sets, achieving an MCC of 0.55 and 0.37, respectively, and outperforming all other methods, including those that make use of structure-derived features of proteins. More importantly, RNABindRPlus outperforms all other methods for any choice of tradeoff between precision and recall. 
An important advantage of both HomPRIP and RNABindRPlus is that they rely on readily available sequence and sequence-derived features of RNA-binding proteins. A webserver implementation of both methods is freely available at http://einstein.cs.iastate.edu/RNABindRPlus/.
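    The MCC values quoted above are Matthews correlation coefficients, computed from the counts of a binary confusion matrix. As a minimal sketch, the standard formula can be written as follows; the function name and the example counts are hypothetical and do not reproduce any result from the study.

```python
import math


def matthews_corrcoef(tp: int, tn: int, fp: int, fn: int) -> float:
    """Matthews correlation coefficient from confusion-matrix counts.

    Returns 0.0 when the denominator is zero, a common convention.
    """
    numerator = tp * tn - fp * fn
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denominator == 0:
        return 0.0
    return numerator / denominator


# Hypothetical per-residue counts: binding residues are the positive class.
print(f"MCC = {matthews_corrcoef(tp=40, tn=900, fp=30, fn=30):.2f}")
```

    MCC ranges from -1 to 1 and stays informative when classes are imbalanced, as they typically are for RNA-binding residues, which make up a minority of a protein's residues.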

  18. Computational identification of binding energy hot spots in protein-RNA complexes using an ensemble approach.

    PubMed

    Pan, Yuliang; Wang, Zixiang; Zhan, Weihua; Deng, Lei

    2018-05-01

    Identifying RNA-binding residues, especially energetically favored hot spots, can provide valuable clues for understanding the mechanisms and functional importance of protein-RNA interactions. Yet, limited availability of experimentally recognized energy hot spots in protein-RNA crystal structures leads to the difficulties in developing empirical identification approaches. Computational prediction of RNA-binding hot spot residues is still in its infant stage. Here, we describe a computational method, PrabHot (Prediction of protein-RNA binding hot spots), that can effectively detect hot spot residues on protein-RNA binding interfaces using an ensemble of conceptually different machine learning classifiers. Residue interaction network features and new solvent exposure characteristics are combined together and selected for classification with the Boruta algorithm. In particular, two new reference datasets (benchmark and independent) have been generated containing 107 hot spots from 47 known protein-RNA complex structures. In 10-fold cross-validation on the training dataset, PrabHot achieves promising performances with an AUC score of 0.86 and a sensitivity of 0.78, which are significantly better than that of the pioneer RNA-binding hot spot prediction method HotSPRing. We also demonstrate the capability of our proposed method on the independent test dataset and gain a competitive advantage as a result. The PrabHot webserver is freely available at http://denglab.org/PrabHot/. Contact: leideng@csu.edu.cn. Supplementary data are available at Bioinformatics online.

  19. Modular architecture of eukaryotic RNase P and RNase MRP revealed by electron microscopy

    PubMed Central

    Hipp, Katharina; Galani, Kyriaki; Batisse, Claire; Prinz, Simone; Böttcher, Bettina

    2012-01-01

    Ribonuclease P (RNase P) and RNase MRP are closely related ribonucleoprotein enzymes, which process RNA substrates including tRNA precursors for RNase P and 5.8 S rRNA precursors, as well as some mRNAs, for RNase MRP. The structures of RNase P and RNase MRP have not yet been solved, so it is unclear how the proteins contribute to the structure of the complexes and how substrate specificity is determined. Using electron microscopy and image processing we show that eukaryotic RNase P and RNase MRP have a modular architecture, where proteins stabilize the RNA fold and contribute to cavities, channels and chambers between the modules. Such features are located at strategic positions for substrate recognition by shape and coordination of the cleaved-off sequence. These are also the sites of greatest difference between RNase P and RNase MRP, highlighting the importance of the adaptation of this region to the different substrates. PMID:22167472

  20. Probing RNA Native Conformational Ensembles with Structural Constraints.

    PubMed

    Fonseca, Rasmus; van den Bedem, Henry; Bernauer, Julie

    2016-05-01

    Noncoding ribonucleic acids (RNA) play a critical role in a wide variety of cellular processes, ranging from regulating gene expression to post-translational modification and protein synthesis. Their activity is modulated by highly dynamic exchanges between three-dimensional conformational substates, which are difficult to characterize experimentally and computationally. Here, we present an innovative, entirely kinematic computational procedure to efficiently explore the native ensemble of RNA molecules. Our procedure projects degrees of freedom onto a subspace of conformation space defined by distance constraints in the tertiary structure. The dimensionality reduction enables efficient exploration of conformational space. We show that the conformational distributions obtained with our method broadly sample the conformational landscape observed in NMR experiments. Compared to normal mode analysis-based exploration, our procedure diffuses faster through the experimental ensemble while also accessing conformational substates to greater precision. Our results suggest that conformational sampling with a highly reduced but fully atomistic representation of noncoding RNA expresses key features of their dynamic nature.

  1. Integrative analysis of Arabidopsis thaliana transcriptomics reveals intuitive splicing mechanism for circular RNA.

    PubMed

    Sun, Xiaoyong; Wang, Lin; Ding, Jiechao; Wang, Yanru; Wang, Jiansheng; Zhang, Xiaoyang; Che, Yulei; Liu, Ziwei; Zhang, Xinran; Ye, Jiazhen; Wang, Jie; Sablok, Gaurav; Deng, Zhiping; Zhao, Hongwei

    2016-10-01

    A new regulatory class of small endogenous RNAs called circular RNAs (circRNAs) has been described as miRNA sponges in animals. Using 16 Arabidopsis thaliana RNA-Seq data sets, we identified 803 circRNAs in RNase R-/non-RNase R-treated samples. The results revealed the following features: Canonical and noncanonical splicing can generate circRNAs; chloroplasts are a hotspot for circRNA generation; furthermore, limited complementary sequences exist not only in introns, but also in the sequences flanking splice sites. The latter finding suggests that multiple combinations between complementary sequences may facilitate the formation of the circular structure. Our results contribute to a better understanding of this novel class of plant circRNAs. © 2016 Federation of European Biochemical Societies.

  2. Structural variations of single and tandem mismatches in RNA duplexes: a joint MD simulation and crystal structure database analysis.

    PubMed

    Halder, Sukanya; Bhattacharyya, Dhananjay

    2012-10-04

    Internal loops within RNA duplex regions are formed by single or tandem basepairing mismatches with flanking canonical Watson-Crick basepairs on both sides. They are the most common motif observed in RNA secondary structures and play integral functional and structural roles. In this report, we have studied the structural features of 1 × 1, 2 × 2, and 3 × 3 internal loops using all-atom molecular dynamics (MD) simulation technique with explicit solvent model. As MD simulation is intricately dependent on the choice of force-field and these are often rather approximate, we have used both the most popular force-fields for nucleic acids-CHARMM27 and AMBER94-for a comparative analysis. We find that tandem noncanonical basepairs forming 2 × 2 and 3 × 3 internal loops are considerably more stable than the single mismatches forming 1 × 1 internal loops, irrespective of the force field. We have also analyzed crystal structure database to study the conservation of these helical fragments in the corresponding sets of RNA structures. We observe that the nature of stability in MD simulations mimic their fluctuating natures in crystal data sets also, probably indicating reliable natures of both the force fields to reproduce experimental results. We also notice significant structural changes in the wobble G:U basepairs present in these double helical stretches, leading to a biphasic stability for these wobble pairs to release the deformational strains introduced by internal loops within duplex regions.

  3. The standard operating procedure of the DOE-JGI Microbial Genome Annotation Pipeline (MGAP v.4).

    PubMed

    Huntemann, Marcel; Ivanova, Natalia N; Mavromatis, Konstantinos; Tripp, H James; Paez-Espino, David; Palaniappan, Krishnaveni; Szeto, Ernest; Pillay, Manoj; Chen, I-Min A; Pati, Amrita; Nielsen, Torben; Markowitz, Victor M; Kyrpides, Nikos C

    2015-01-01

    The DOE-JGI Microbial Genome Annotation Pipeline performs structural and functional annotation of microbial genomes that are further included into the Integrated Microbial Genome comparative analysis system. MGAP is applied to assembled nucleotide sequence datasets that are provided via the IMG submission site. Dataset submission for annotation first requires project and associated metadata description in GOLD. The MGAP sequence data processing consists of feature prediction including identification of protein-coding genes, non-coding RNAs and regulatory RNA features, as well as CRISPR elements. Structural annotation is followed by assignment of protein product names and functions.

  4. The cell's nucleolus: an emerging target for chemotherapeutic intervention.

    PubMed

    Pickard, Amanda J; Bierbach, Ulrich

    2013-09-01

    The transient nucleolus plays a central role in the up-regulated synthesis of ribosomal RNA (rRNA) to sustain ribosome biogenesis, a hallmark of aberrant cell growth. This function, in conjunction with its unique pathohistological features in malignant cells and its ability to mediate apoptosis, renders this sub-nuclear structure a potential target for chemotherapeutic agents. In this Minireview, structurally and functionally diverse small molecules are discussed that have been reported to either interact with the nucleolus directly or perturb its function indirectly by acting on its dynamic components. These molecules include all major classes of nucleic-acid-targeted agents, antimetabolites, kinase inhibitors, anti-inflammatory drugs, natural product antibiotics, oligopeptides, as well as nanoparticles. Together, these molecules are invaluable probes of structure and function of the nucleolus. They also provide a unique opportunity to develop novel strategies for more selective and therefore better-tolerated chemotherapeutic intervention. In this regard, inhibition of RNA polymerase-I-mediated rRNA synthesis appears to be a promising mechanism for killing cancer cells. The recent development of molecules targeted at G-quadruplex-forming rRNA gene sequences, which are currently undergoing clinical trials, seems to attest to the success of this approach. Copyright © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  5. Interaction of zanamivir with DNA and RNA: Models for drug DNA and drug RNA bindings

    NASA Astrophysics Data System (ADS)

    Nafisi, Shohreh; Kahangi, Fatemeh Ghoreyshi; Azizi, Ebrahim; Zebarjad, Nader; Tajmir-Riahi, Heidar-Ali

    2007-03-01

    Zanamivir (ZAN) is the first of a new generation of influenza virus-specific drugs known as neuraminidase inhibitors, which acts by interfering with life cycles of influenza viruses A and B. It prevents the virus spreading infection to other cells by blocking the neuraminidase enzyme present on the surface of the virus. The aim of this study was to examine the stability and structural features of calf thymus DNA and yeast RNA complexes with zanamivir in aqueous solution, using constant DNA or RNA concentration (12.5 mM) and various zanamivir/polynucleotide (P) ratios of 1/20, 1/10, 1/4, and 1/2. FTIR and UV-visible spectroscopy are used to determine the drug external binding modes, the binding constant and the stability of zanamivir-DNA and RNA complexes in aqueous solution. Structural analysis showed major interaction of zanamivir with G-C (major groove) and A-T (minor groove) base pairs and minor perturbations of the backbone PO2 group with overall binding constants of K(zanamivir-DNA) = 1.30 × 10⁴ M⁻¹ and K(zanamivir-RNA) = 1.38 × 10⁴ M⁻¹. The drug interaction induces a partial B to A-DNA transition, while RNA remains in A-conformation.

  6. Escherichia coli Ribosomal Protein S1 Unfolds Structured mRNAs Onto the Ribosome for Active Translation Initiation

    PubMed Central

    Duval, Mélodie; Korepanov, Alexey; Fuchsbauer, Olivier; Fechter, Pierre; Haller, Andrea; Fabbretti, Attilio; Choulier, Laurence; Micura, Ronald; Klaholz, Bruno P.; Romby, Pascale; Springer, Mathias; Marzi, Stefano

    2013-01-01

    Regulation of translation initiation is well appropriate to adapt cell growth in response to stress and environmental changes. Many bacterial mRNAs adopt structures in their 5′ untranslated regions that modulate the accessibility of the 30S ribosomal subunit. Structured mRNAs interact with the 30S in a two-step process where the docking of a folded mRNA precedes an accommodation step. Here, we used a combination of experimental approaches in vitro (kinetic of mRNA unfolding and binding experiments to analyze mRNA–protein or mRNA–ribosome complexes, toeprinting assays to follow the formation of ribosomal initiation complexes) and in vivo (genetic) to monitor the action of ribosomal protein S1 on the initiation of structured and regulated mRNAs. We demonstrate that r-protein S1 endows the 30S with an RNA chaperone activity that is essential for the docking and the unfolding of structured mRNAs, and for the correct positioning of the initiation codon inside the decoding channel. The first three OB-fold domains of S1 retain all its activities (mRNA and 30S binding, RNA melting activity) on the 30S subunit. S1 is not required for all mRNAs and acts differently on mRNAs according to the signals present at their 5′ ends. This work shows that S1 confers to the ribosome dynamic properties to initiate translation of a large set of mRNAs with diverse structural features. PMID:24339747

  7. Structure and Engineering of Francisella novicida Cas9

    PubMed Central

    Hirano, Hisato; Gootenberg, Jonathan S.; Horii, Takuro; Abudayyeh, Omar O.; Kimura, Mika; Hsu, Patrick D.; Nakane, Takanori; Ishitani, Ryuichiro; Hatada, Izuho; Zhang, Feng; Nishimasu, Hiroshi; Nureki, Osamu

    2016-01-01

    The RNA-guided endonuclease Cas9 cleaves double-stranded DNA targets complementary to the guide RNA, and has been applied to programmable genome editing. Cas9-mediated cleavage requires a protospacer adjacent motif (PAM) juxtaposed with the DNA target sequence, thus constricting the range of targetable sites. Here, we report the 1.7 Å resolution crystal structures of Cas9 from Francisella novicida (FnCas9), one of the largest Cas9 orthologs, in complex with a guide RNA and its PAM-containing DNA targets. A structural comparison of FnCas9 with other Cas9 orthologs revealed striking conserved and divergent features among distantly related CRISPR-Cas9 systems. We found that FnCas9 recognizes the 5′-NGG-3′ PAM, and used the structural information to create a variant that can recognize the more relaxed 5′-YG-3′ PAM. Furthermore, we demonstrated that pre-assembled FnCas9 ribonucleoprotein complexes can be microinjected into mouse zygotes to edit endogenous sites with the 5′-YG-3′ PAMs, thus expanding the target space of the CRISPR-Cas9 toolbox. PMID:26875867

  8. Structure and Engineering of Francisella novicida Cas9.

    PubMed

    Hirano, Hisato; Gootenberg, Jonathan S; Horii, Takuro; Abudayyeh, Omar O; Kimura, Mika; Hsu, Patrick D; Nakane, Takanori; Ishitani, Ryuichiro; Hatada, Izuho; Zhang, Feng; Nishimasu, Hiroshi; Nureki, Osamu

    2016-02-25

    The RNA-guided endonuclease Cas9 cleaves double-stranded DNA targets complementary to the guide RNA and has been applied to programmable genome editing. Cas9-mediated cleavage requires a protospacer adjacent motif (PAM) juxtaposed with the DNA target sequence, thus constricting the range of targetable sites. Here, we report the 1.7 Å resolution crystal structures of Cas9 from Francisella novicida (FnCas9), one of the largest Cas9 orthologs, in complex with a guide RNA and its PAM-containing DNA targets. A structural comparison of FnCas9 with other Cas9 orthologs revealed striking conserved and divergent features among distantly related CRISPR-Cas9 systems. We found that FnCas9 recognizes the 5'-NGG-3' PAM, and used the structural information to create a variant that can recognize the more relaxed 5'-YG-3' PAM. Furthermore, we demonstrated that the FnCas9-ribonucleoprotein complex can be microinjected into mouse zygotes to edit endogenous sites with the 5'-YG-3' PAM, thus expanding the target space of the CRISPR-Cas9 toolbox. Copyright © 2016 Elsevier Inc. All rights reserved.

  9. Probing the structural dynamics of the CRISPR-Cas9 RNA-guided DNA-cleavage system by coarse-grained modeling.

    PubMed

    Zheng, Wenjun

    2017-02-01

    In the adaptive immune systems of many bacteria and archaea, the Cas9 endonuclease forms a complex with specific guide/scaffold RNA to identify and cleave complementary target sequences in foreign DNA. This DNA targeting machinery has been exploited in numerous applications of genome editing and transcription control. However, the molecular mechanism of the Cas9 system is still obscure. Recently, high-resolution structures have been solved for Cas9 in different structural forms (e.g., unbound forms, RNA-bound binary complexes, and RNA-DNA-bound tertiary complexes, corresponding to an inactive state, a pre-target-bound state, and a cleavage-competent or product state), which offered key structural insights to the Cas9 mechanism. To further probe the structural dynamics of Cas9 interacting with RNA and DNA at the amino-acid level of details, we have performed systematic coarse-grained modeling using an elastic network model and related analyses. Our normal mode analysis predicted a few key modes of collective motions that capture the observed conformational changes featuring large domain motions triggered by binding of RNA and DNA. Our flexibility analysis identified specific regions with high or low flexibility that coincide with key functional sites (such as DNA/RNA-binding sites, nuclease cleavage sites, and key hinges). We also identified a small set of hotspot residues that control the energetics of functional motions, which overlap with known functional sites and offer promising targets for future mutagenesis efforts to improve the specificity of Cas9. Finally, we modeled the conformational transitions of Cas9 from the unbound form to the binary complex and then the tertiary complex, and predicted a distinct sequence of domain motions. In sum, our findings have offered rich structural and dynamic details relevant to the Cas9 machinery, and will guide future investigation and engineering of the Cas9 systems. Proteins 2017; 85:342-353. 
© 2016 Wiley Periodicals, Inc.

  10. LRSSLMDA: Laplacian Regularized Sparse Subspace Learning for MiRNA-Disease Association prediction

    PubMed Central

    Huang, Li

    2017-01-01

    Predicting novel microRNA (miRNA)-disease associations is clinically significant due to miRNAs’ potential roles of diagnostic biomarkers and therapeutic targets for various human diseases. Previous studies have demonstrated the viability of utilizing different types of biological data to computationally infer new disease-related miRNAs. Yet researchers face the challenge of how to effectively integrate diverse datasets and make reliable predictions. In this study, we presented a computational model named Laplacian Regularized Sparse Subspace Learning for MiRNA-Disease Association prediction (LRSSLMDA), which projected miRNAs/diseases’ statistical feature profile and graph theoretical feature profile to a common subspace. It used Laplacian regularization to preserve the local structures of the training data and a L1-norm constraint to select important miRNA/disease features for prediction. The strength of dimensionality reduction enabled the model to be easily extended to much higher dimensional datasets than those exploited in this study. Experimental results showed that LRSSLMDA outperformed ten previous models: the AUC of 0.9178 in global leave-one-out cross validation (LOOCV) and the AUC of 0.8418 in local LOOCV indicated the model’s superior prediction accuracy; and the average AUC of 0.9181+/-0.0004 in 5-fold cross validation justified its accuracy and stability. In addition, three types of case studies further demonstrated its predictive power. Potential miRNAs related to Colon Neoplasms, Lymphoma, Kidney Neoplasms, Esophageal Neoplasms and Breast Neoplasms were predicted by LRSSLMDA. Respectively, 98%, 88%, 96%, 98% and 98% out of the top 50 predictions were validated by experimental evidences. Therefore, we conclude that LRSSLMDA would be a valuable computational tool for miRNA-disease association prediction. PMID:29253885

  11. Evolution of introns in the archaeal world.

    PubMed

    Tocchini-Valentini, Giuseppe D; Fruscoloni, Paolo; Tocchini-Valentini, Glauco P

    2011-03-22

    The self-splicing group I introns are removed by an autocatalytic mechanism that involves a series of transesterification reactions. They require RNA binding proteins to act as chaperones to correctly fold the RNA into an active intermediate structure in vivo. Pre-tRNA introns in Bacteria and in higher eukaryote plastids are typical examples of self-splicing group I introns. By contrast, two striking features characterize RNA splicing in the archaeal world. First, self-splicing group I introns cannot be found, to this date, in that kingdom. Second, the RNA splicing scenario in Archaea is uniform: All introns, whether in pre-tRNA or elsewhere, are removed by tRNA splicing endonucleases. We suggest that in Archaea, the protein recruited for splicing is the preexisting tRNA splicing endonuclease and that this enzyme, together with the ligase, takes over the task of intron removal in a more efficient fashion than the ribozyme. The extinction of group I introns in Archaea would then be a consequence of recruitment of the tRNA splicing endonuclease. We deal here with comparative genome analysis, focusing specifically on the integration of introns into genes coding for 23S rRNA molecules, and how this newly acquired intron has to be removed to regenerate a functional RNA molecule. We show that all known oligomeric structures of the endonuclease can recognize and cleave a ribosomal intron, even when the endonuclease derives from a strain lacking rRNA introns. The persistence of group I introns in mitochondria and chloroplasts would be explained by the inaccessibility of these introns to the endonuclease.

  12. Insights into the Structural Dynamics of Nucleocytoplasmic Transport of tRNA by Exportin-t

    PubMed Central

    Gupta, Asmita; Kailasam, Senthilkumar; Bansal, Manju

    2016-01-01

    Exportin-t (Xpot) transports mature 5′- and 3′-end processed tRNA from the nucleus to the cytoplasm by associating with a small G-protein Ran (RAs-related nuclear protein), in the nucleus. The release of tRNA in cytoplasm involves RanGTP hydrolysis. Despite the availability of crystal structures of nuclear and cytosolic forms of Xpot, the molecular details regarding the sequential events leading to tRNA release and subsequent conformational changes occurring in Xpot remain unknown. We have performed a combination of classical all-atom and accelerated molecular dynamics simulations on a set of complexes involving Xpot to study a range of features including conformational flexibility of free and cargo-bound Xpot and functionally critical contacts between Xpot and its cargo. The systems investigated include free Xpot and its different complexes, bound either to Ran (GTP/GDP) or tRNA or both. This approach provided a statistically reliable estimate of structural dynamics of Xpot after cargo release. The mechanistic basis for Xpot opening after cargo release has been explained in terms of dynamic structural hinges, about which neighboring region could be displaced to facilitate the nuclear to cytosolic state transition. Post-RanGTP hydrolysis, a cascade of events including local conformational change in RanGTP and loss of critical contacts at Xpot/tRNA interface suggest factors responsible for eventual release of tRNA. The level of flexibility in different Xpot complexes varied depending on the arrangement of individual HEAT repeats. Current study provides one of the most comprehensive and robust analysis carried out on this protein using molecular dynamics schemes. PMID:27028637

  13. Topological Structure of the Space of Phenotypes: The Case of RNA Neutral Networks

    PubMed Central

    Aguirre, Jacobo; Buldú, Javier M.; Stich, Michael; Manrubia, Susanna C.

    2011-01-01

    The evolution and adaptation of molecular populations is constrained by the diversity accessible through mutational processes. RNA is a paradigmatic example of biopolymer where genotype (sequence) and phenotype (approximated by the secondary structure fold) are identified in a single molecule. The extreme redundancy of the genotype-phenotype map leads to large ensembles of RNA sequences that fold into the same secondary structure and can be connected through single-point mutations. These ensembles define neutral networks of phenotypes in sequence space. Here we analyze the topological properties of neutral networks formed by 12-nucleotide RNA sequences, obtained through the exhaustive folding of sequence space. A total of 4¹² sequences fragments into 645 subnetworks that correspond to 57 different secondary structures. The topological analysis reveals that each subnetwork is far from being random: it has a degree distribution with a well-defined average and a small dispersion, a high clustering coefficient, and an average shortest path between nodes close to its minimum possible value, i.e. the Hamming distance between sequences. RNA neutral networks are assortative due to the correlation in the composition of neighboring sequences, a feature that together with the symmetries inherent to the folding process explains the existence of communities. Several topological relationships can be analytically derived attending to structural restrictions and generic properties of the folding process. The average degree of these phenotypic networks grows logarithmically with their size, such that abundant phenotypes have the additional advantage of being more robust to mutations. This property prevents fragmentation of neutral networks and thus enhances the navigability of sequence space. In summary, RNA neutral networks show unique topological properties, unknown to other networks previously described. PMID:22028856
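
    The sequence-space notions used in this abstract (single-point mutations, Hamming distance, the size of the 12-nucleotide space) can be made concrete with a minimal sketch. It is an illustration only, not the authors' code: the function names `hamming` and `point_mutants` are hypothetical, and no RNA folding is performed, so it says nothing about which neighbors share a secondary structure.

```python
ALPHABET = "ACGU"  # the four RNA nucleotides


def hamming(a: str, b: str) -> int:
    """Number of positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return sum(x != y for x, y in zip(a, b))


def point_mutants(seq: str) -> list[str]:
    """All sequences reachable from seq by exactly one point mutation."""
    return [
        seq[:i] + base + seq[i + 1:]
        for i, old in enumerate(seq)
        for base in ALPHABET
        if base != old
    ]


# Size of the full sequence space for length-12 RNAs: 4**12.
sequence_space_size = len(ALPHABET) ** 12

# Each length-12 sequence has 12 * 3 = 36 single-mutation neighbors.
neighbors = point_mutants("ACGUACGUACGU")
```

    In a neutral network, two sequences would be linked only if they are point mutants of each other and also fold into the same secondary structure; the folding step is deliberately left out of this sketch.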

  14. Solution structure of a modified 2′,5′-linked RNA hairpin involved in an equilibrium with duplex

    PubMed Central

    Plevnik, Miha; Gdaniec, Zofia; Plavec, Janez

    2005-01-01

    The isomerization of phosphodiester functionality of nucleic acids from 3′,5′- to a less common 2′,5′-linkage influences the complex interplay of stereoelectronic effects that drive pseudorotational equilibrium of sugar rings and thus affect the conformational propensities for compact or more extended structures. The present study highlights the subtle balance of non-covalent forces at play in structural equilibrium of 2′,5′-linked RNA analogue, 3′-O-(2-methoxyethyl) substituted dodecamer *CG*CGAA*U*U*CG*CG, 3′-MOE-2′,5′-RNA, where all cytosines and uracils are methylated at C5. The NMR and UV spectroscopic studies have shown that 3′-MOE-2′,5′-RNA adopts both hairpin and duplex secondary structures, which are involved in a dynamic exchange that is slow on the NMR timescale and exhibits strand and salt concentration as well as pH dependence. Unusual effect of pH over a narrow physiological range is observed for imino proton resonances with exchange broadening observed at lower pH and relatively sharp lines observed at higher pH. The solution structure of 3′-MOE-2′,5′-RNA hairpin displays a unique and well-defined loop, which is stabilized by Watson–Crick A5·*U8 base pair and by n → π* stacking interactions of O4′ lone-pair electrons of A6 and *U8 with aromatic rings of A5 and *U7, respectively. In contrast, the stem region of 3′-MOE-2′,5′-RNA hairpin is more flexible. Our data highlight the important feature of backbone modifications that can have pronounced effects on interstrand association of nucleic acids. PMID:15788747

  15. MultiMiTar: a novel multi objective optimization based miRNA-target prediction method.

    PubMed

    Mitra, Ramkrishna; Bandyopadhyay, Sanghamitra

    2011-01-01

    Machine learning based miRNA-target prediction algorithms often fail to obtain a balanced prediction accuracy in terms of both sensitivity and specificity due to lack of the gold standard of negative examples, miRNA-targeting site context specific relevant features and efficient feature selection process. Moreover, all the sequence, structure and machine learning based algorithms are unable to distribute the true positive predictions preferentially at the top of the ranked list; hence the algorithms become unreliable to the biologists. In addition, these algorithms fail to obtain considerable combination of precision and recall for the target transcripts that are translationally repressed at protein level. In the proposed article, we introduce an efficient miRNA-target prediction system MultiMiTar, a Support Vector Machine (SVM) based classifier integrated with a multiobjective metaheuristic based feature selection technique. The robust performance of the proposed method is mainly the result of using high quality negative examples and selection of biologically relevant miRNA-targeting site context specific features. The features are selected by using a novel feature selection technique AMOSA-SVM, that integrates the multi objective optimization technique Archived Multi-Objective Simulated Annealing (AMOSA) and SVM. MultiMiTar is found to achieve much higher Matthew's correlation coefficient (MCC) of 0.583 and average class-wise accuracy (ACA) of 0.8 compared to the others target prediction methods for a completely independent test data set. The obtained MCC and ACA values of these algorithms range from -0.269 to 0.155 and 0.321 to 0.582, respectively. Moreover, it shows a more balanced result in terms of precision and sensitivity (recall) for the translationally repressed data set as compared to all the other existing methods. 
An important aspect is that the true positive predictions are distributed preferentially at the top of the ranked list that makes MultiMiTar reliable for the biologists. MultiMiTar is now available as an online tool at www.isical.ac.in/~bioinfo_miu/multimitar.htm. MultiMiTar software can be downloaded from www.isical.ac.in/~bioinfo_miu/multimitar-download.htm.
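
    For readers unfamiliar with the two scores quoted in this abstract, the following is a minimal sketch of how they can be computed from a binary confusion matrix. The Matthews correlation coefficient follows its standard definition. The abstract does not define average class-wise accuracy (ACA); the sketch assumes it is the mean of sensitivity and specificity, which may differ from the authors' exact definition. The function names are hypothetical.

```python
import math


def mcc(tp: int, tn: int, fp: int, fn: int) -> float:
    """Matthews correlation coefficient from confusion-matrix counts."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        # Common convention when any marginal total is zero.
        return 0.0
    return (tp * tn - fp * fn) / denom


def average_classwise_accuracy(tp: int, tn: int, fp: int, fn: int) -> float:
    """Assumed definition: mean of per-class accuracies
    (sensitivity for positives, specificity for negatives)."""
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("both classes need at least one example")
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return (sensitivity + specificity) / 2
```

    MCC ranges from -1 to 1, with 0 corresponding to chance-level prediction, which is why negative values such as those reported for some competing methods indicate worse-than-random behavior on that test set.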

  16. Uncultivated Microbial Eukaryotic Diversity: A Method to Link ssu rRNA Gene Sequences with Morphology

    PubMed Central

    Hirst, Marissa B.; Kita, Kelley N.; Dawson, Scott C.

    2011-01-01

    Protists have traditionally been identified by cultivation and classified taxonomically based on their cellular morphologies and behavior. In the past decade, however, many novel protist taxa have been identified using cultivation independent ssu rRNA sequence surveys. New rRNA “phylotypes” from uncultivated eukaryotes have no connection to the wealth of prior morphological descriptions of protists. To link phylogenetically informative sequences with taxonomically informative morphological descriptions, we demonstrate several methods for combining whole cell rRNA-targeted fluorescent in situ hybridization (FISH) with cytoskeletal or organellar immunostaining. Either eukaryote or ciliate-specific ssu rRNA probes were combined with an anti-α-tubulin antibody or phalloidin, a common actin stain, to define cytoskeletal features of uncultivated protists in several environmental samples. The eukaryote ssu rRNA probe was also combined with Mitotracker® or a hydrogenosomal-specific anti-Hsp70 antibody to localize mitochondria and hydrogenosomes, respectively, in uncultivated protists from different environments. Using rRNA probes in combination with immunostaining, we linked ssu rRNA phylotypes with microtubule structure to describe flagellate and ciliate morphology in three diverse environments, and linked Naegleria spp. to their amoeboid morphology using actin staining in hay infusion samples. We also linked uncultivated ciliates to morphologically similar Colpoda-like ciliates using tubulin immunostaining with a ciliate-specific rRNA probe. Combining rRNA-targeted FISH with cytoskeletal immunostaining or stains targeting specific organelles provides a fast, efficient, high throughput method for linking genetic sequences with morphological features in uncultivated protists. 
When linked to phylotype, morphological descriptions of protists can both complement and vet the increasing number of sequences from uncultivated protists, including those of novel lineages, identified in diverse environments. PMID:22174774

  17. Post-transcriptional trafficking and regulation of neuronal gene expression.

    PubMed

    Goldie, Belinda J; Cairns, Murray J

    2012-02-01

    Intracellular messenger RNA (mRNA) traffic and translation must be highly regulated, both temporally and spatially, within eukaryotic cells to support the complex functional partitioning. This capacity is essential in neurons because it provides a mechanism for rapid input-restricted activity-dependent protein synthesis in individual dendritic spines. While this feature is thought to be important for synaptic plasticity, the structures and mechanisms that support this capability are largely unknown. Certainly specialized RNA binding proteins and binding elements in the 3' untranslated region (UTR) of translationally regulated mRNA are important, but the subtlety and complexity of this system suggests that an intermediate "specificity" component is also involved. Small non-coding microRNA (miRNA) are essential for CNS development and may fulfill this role by acting as the guide strand for mediating complex patterns of post-transcriptional regulation. In this review we examine post-synaptic gene regulation, mRNA trafficking and the emerging role of post-transcriptional gene silencing in synaptic plasticity.

  18. [Analysis of the primary and secondary structure of the mitochondrial serine transfer RNA in seven species of Lutzomyia].

    PubMed

    Vivero, Rafael José; Contreras-Gutiérrez, Maria Angélica; Bejarano, Eduar Elías

    2007-09-01

    Lutzomyia sand flies are involved in the transmission of the parasite Leishmania spp. in America. The taxonomy of these vectors is traditionally based on morphological features of the adult stage, particularly the paired structures of the head and genitalia. Although these characters are useful to distinguish most species of Lutzomyia, morphological identification may be complicated by the similarities within subgenera and species groups. The objective was to evaluate the utility of mitochondrial serine transfer RNA (tRNA Ser) for taxonomic identification of Lutzomyia. Seven sand fly species, each representing one of the 27 taxonomic subdivisions in genus Lutzomyia, were analyzed including L. trinidadensis (Oswaldoi group), L. (Psychodopygus) panamensis, L. (Micropygomyia) cayennensis cayennensis, L. dubitans (Migonei group), L. (Lutzomyia) gomezi, L. rangeliana (ungrouped) and L. evansi (Verrucarum group). The mitochondrial tRNA Ser gene, flanked by the cytochrome b and NAD dehydrogenase subunit one genes, was extracted, amplified and sequenced from each specimen. Secondary structure of the tRNA Ser was predicted by comparisons with previously described homologous structures from other dipteran species. The tRNA Ser gene ranged in size from 66 base pairs in L. gomezi to 69 base pairs in L. trinidadensis. Fourteen polymorphic sites, including four insertion-deletion events, were observed in the aligned 70 nucleotide positions. The majority of the substitutions were located in the dihydrouridine, ribothymidine-pseudouridine-cytosine and variable loops, as well as in the basal extreme of the anticodon arm. Changes of primary sequence of the tRNA Ser provided useful molecular characters for taxonomic identification of the sand fly species under consideration.

  19. Structural and functional properties of the HIV-1 RNA-tRNA(Lys)3 primer complex annealed by the nucleocapsid protein: comparison with the heat-annealed complex.

    PubMed Central

    Brulé, Fabienne; Marquet, Roland; Rong, Liwei; Wainberg, Mark A; Roques, Bernard P; Le Grice, Stuart F J; Ehresmann, Bernard; Ehresmann, Chantal

    2002-01-01

    The conversion of the single-stranded RNA genome into double-stranded DNA by virus-coded reverse transcriptase (RT) is an essential step of the retrovirus life cycle. In human immunodeficiency virus type 1 (HIV-1), RT uses the cellular tRNA(Lys)3 to initiate the (-) strand DNA synthesis. Placement of the primer tRNA(Lys)3 involves binding of its 3'-terminal 18 nt to a complementary region of genomic RNA termed PBS. However, the PBS sequence is not the unique determinant of primer usage and additional contacts are important. This placement is believed to be achieved in vivo by the nucleocapsid domain of Gag or by the mature protein NCp. Up to now, structural information essentially arose from heat-annealed primer-template complexes (Isel et al., J Mol Biol, 1995, 247:236-250; Isel et al., EMBO J, 1999, 18:1038-1048). Here, we investigated the formation of the primer-template complex mediated by NCp and compared structural and functional properties of heat- and NCp-annealed complexes. We showed that both heat- and NCp-mediated procedures allow comparable high yields of annealing. Then, we investigated structural features of both kinds of complexes by enzymatic probing, and we compared their relative efficiency in (-) strong stop DNA synthesis. We did not find any significant differences between these complexes, suggesting that information derived from the heat-annealed complex can be transposed to the NCp-mediated complex and most likely to complexes formed in vivo. PMID:11873759

  20. Interactions of a Pop5/Rpp1 heterodimer with the catalytic domain of RNase MRP.

    PubMed

    Perederina, Anna; Khanova, Elena; Quan, Chao; Berezin, Igor; Esakova, Olga; Krasilnikov, Andrey S

    2011-10-01

    Ribonuclease (RNase) MRP is a multicomponent ribonucleoprotein complex closely related to RNase P. RNase MRP and eukaryotic RNase P share most of their protein components, as well as multiple features of their catalytic RNA moieties, but have distinct substrate specificities. While RNase P is practically universally found in all three domains of life, RNase MRP is essential in eukaryotes. The structural organizations of eukaryotic RNase P and RNase MRP are poorly understood. Here, we show that Pop5 and Rpp1, protein components found in both RNase P and RNase MRP, form a heterodimer that binds directly to the conserved area of the putative catalytic domain of RNase MRP RNA. The Pop5/Rpp1 binding site corresponds to the protein binding site in bacterial RNase P RNA. Structural and evolutionary roles of the Pop5/Rpp1 heterodimer in RNases P and MRP are discussed.

  1. Interactions of a Pop5/Rpp1 heterodimer with the catalytic domain of RNase MRP

    PubMed Central

    Perederina, Anna; Khanova, Elena; Quan, Chao; Berezin, Igor; Esakova, Olga; Krasilnikov, Andrey S.

    2011-01-01

    Ribonuclease (RNase) MRP is a multicomponent ribonucleoprotein complex closely related to RNase P. RNase MRP and eukaryotic RNase P share most of their protein components, as well as multiple features of their catalytic RNA moieties, but have distinct substrate specificities. While RNase P is practically universally found in all three domains of life, RNase MRP is essential in eukaryotes. The structural organizations of eukaryotic RNase P and RNase MRP are poorly understood. Here, we show that Pop5 and Rpp1, protein components found in both RNase P and RNase MRP, form a heterodimer that binds directly to the conserved area of the putative catalytic domain of RNase MRP RNA. The Pop5/Rpp1 binding site corresponds to the protein binding site in bacterial RNase P RNA. Structural and evolutionary roles of the Pop5/Rpp1 heterodimer in RNases P and MRP are discussed. PMID:21878546

  2. Packaging and structural phenotype of brome mosaic virus capsid protein with altered N-terminal β-hexamer structure

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wispelaere, Melissanne de; Chaturvedi, Sonali; Wilkens, Stephan

    2011-10-10

    The first 45 amino acid region of brome mosaic virus (BMV) capsid protein (CP) contains RNA binding and structural domains that are implicated in the assembly of infectious virions. One such important structural domain encompassing amino acids ²⁸QPVIV³², highly conserved between BMV and cowpea chlorotic mottle virus (CCMV), exhibits a β-hexamer structure. In this study we report that alteration of the β-hexamer structure by mutating ²⁸QPVIV³² to ²⁸AAAAA³² had no effect either on symptom phenotype, local and systemic movement in Chenopodium quinoa and RNA profile of in vivo assembled virions. However, sensitivity to RNase and assembly phenotypes distinguished virions assembled with CP subunits having β-hexamer from those of wild type. A comparison of 3-D models obtained by cryo electron microscopy revealed overall similar structural features for wild type and mutant virions, with small but significant differences near the 3-fold axes of symmetry.

  3. A multigene locus containing the Manx and bobcat genes is required for development of chordate features in the ascidian tadpole larva.

    PubMed

    Swalla, B J; Just, M A; Pederson, E L; Jeffery, W R

    1999-04-01

    The Manx gene is required for the development of the tail and other chordate features in the ascidian tadpole larva. To determine the structure of the Manx gene, we isolated and sequenced genomic clones from the tailed ascidian Molgula oculata. The Manx gene contains 9 exons and encodes both major and minor Manx mRNAs, which differ in the length of their 5' untranslated regions. The coding region of the single-copy bobcat gene, which encodes a DEAD-box RNA helicase, is embedded within the first Manx intron. The organization of the bobcat and Manx transcription units was determined by comparing genomic and cDNA clones. The Manx-bobcat gene locus has an unusual organization in which a non-coding first exon is alternatively spliced at the 5' end of two different mRNAs. The bobcat and Manx genes are expressed coordinately during oogenesis and embryogenesis, but not during spermatogenesis, in which bobcat mRNA accumulates independently of Manx mRNA. Similar to Manx, zygotic bobcat transcripts accumulate in the embryonic primordia responsible for generating chordate features, including the dorsal neural tube and notochord, are downregulated during embryogenesis in the tailless species Molgula occulta and are upregulated in M. occulta X M. oculata hybrids, which restore these chordate features. Antisense experiments indicate that zygotic bobcat expression is required for development of the same suite of chordate features as Manx. The results show that the Manx-bobcat gene complex has a role in the development of chordate features in ascidian tadpole larvae.

  4. Small-angle X-ray solution scattering study of the multi-aminoacyl-tRNA synthetase complex reveals an elongated and multi-armed particle.

    PubMed

    Dias, José; Renault, Louis; Pérez, Javier; Mirande, Marc

    2013-08-16

    In animal cells, nine aminoacyl-tRNA synthetases are associated with the three auxiliary proteins p18, p38, and p43 to form a stable and conserved large multi-aminoacyl-tRNA synthetase complex (MARS), whose molecular mass has been proposed to be between 1.0 and 1.5 MDa. The complex acts as a molecular hub for coordinating protein synthesis and diverse regulatory signal pathways. Electron microscopy studies defined its low resolution molecular envelope as an overall rather compact, asymmetric triangular shape. Here, we have analyzed the composition and homogeneity of the native mammalian MARS isolated from rabbit liver and characterized its overall internal structure, size, and shape at low resolution by hydrodynamic methods and small-angle x-ray scattering in solution. Our data reveal that the MARS exhibits a much more elongated and multi-armed shape than expected from previous reports. The hydrodynamic and structural features of the MARS are large compared with other supramolecular assemblies involved in translation, including ribosome. The large dimensions and non-compact structural organization of MARS favor a large protein surface accessibility for all its components. This may be essential to allow structural rearrangements between the catalytic and cis-acting tRNA binding domains of the synthetases required for binding the bulky tRNA substrates. This non-compact architecture may also contribute to the spatiotemporal controlled release of some of its components, which participate in non-canonical functions after dissociation from the complex.

  5. A New Direction of Cancer Classification: Positive Effect of Low-Ranking MicroRNAs.

    PubMed

    Li, Feifei; Piao, Minghao; Piao, Yongjun; Li, Meijing; Ryu, Keun Ho

    2014-10-01

    Many studies based on microRNA (miRNA) expression profiles have shown a new aspect of cancer classification. Because miRNA expression data are high-dimensional, feature selection methods have been used to facilitate dimensionality reduction. These feature selection methods have one shortcoming thus far: they consider only the cases in which the feature-to-class relationship is 1:1 or n:1. However, because one miRNA may influence more than one type of cancer, such miRNAs are ranked low by traditional feature selection methods and are removed most of the time. In view of the limited number of miRNAs, low-ranking miRNAs are also important to cancer classification. We considered both high- and low-ranking features to cover all cases (1:1, n:1, 1:n, and m:n) in cancer classification. First, we used the correlation-based feature selection method to select the high-ranking miRNAs, and chose the support vector machine, Bayes network, decision tree, k-nearest-neighbor, and logistic classifiers to construct the cancer classification. Then, we chose the Chi-square test, information gain, gain ratio, and Pearson's correlation feature selection methods to build the m:n feature subset, and used the selected miRNAs to determine cancer classification. The low-ranking miRNA expression profiles achieved higher classification accuracy than the high-ranking miRNAs alone as selected by traditional feature selection methods. Our results demonstrate that, in the m:n feature subset, low-ranking miRNAs have a positive effect on cancer classification.
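
    Information gain, one of the ranking criteria named in the abstract above, can be illustrated with a minimal sketch. The discretized expression levels ("hi"/"lo"), the miRNA names, and the class labels below are hypothetical, and this is not the authors' implementation; it only shows how a feature is scored by how much it reduces class entropy.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def information_gain(feature_values, labels):
    """Reduction in class entropy after splitting on a discrete feature."""
    n = len(labels)
    groups = {}
    for value, label in zip(feature_values, labels):
        groups.setdefault(value, []).append(label)
    conditional = sum(len(g) / n * entropy(g) for g in groups.values())
    return entropy(labels) - conditional


def rank_features(table, labels):
    """Return (feature name, information gain) pairs, highest gain first."""
    scores = {name: information_gain(vals, labels) for name, vals in table.items()}
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Hypothetical toy data: four samples, two cancer classes, two miRNAs.
labels = ["A", "A", "B", "B"]
table = {
    "miR-x": ["hi", "hi", "lo", "lo"],  # separates the classes perfectly
    "miR-y": ["hi", "lo", "hi", "lo"],  # carries no class information
}
ranking = rank_features(table, labels)
```

    In this toy table the first miRNA ranks at the top and the second at the bottom; a filter that keeps only the top of such a ranking is what discards the low-ranking features the abstract argues for retaining.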

  6. Circular RNA biogenesis can proceed through an exon-containing lariat precursor.

    PubMed

    Barrett, Steven P; Wang, Peter L; Salzman, Julia

    2015-06-09

    Pervasive expression of circular RNA is a recently discovered feature of eukaryotic gene expression programs, yet its function remains largely unknown. The presumed biogenesis of these RNAs involves a non-canonical 'backsplicing' event. Recent studies in mammalian cell culture posit that backsplicing is facilitated by inverted repeats flanking the circularized exon(s). Although such sequence elements are common in mammals, they are rare in lower eukaryotes, making current models insufficient to describe circularization. Through systematic splice site mutagenesis and the identification of splicing intermediates, we show that circular RNA in Schizosaccharomyces pombe is generated through an exon-containing lariat precursor. Furthermore, we have performed high-throughput and comprehensive mutagenesis of a circle-forming exon, which enabled us to discover a systematic effect of exon length on RNA circularization. Our results uncover a mechanism for circular RNA biogenesis that may account for circularization in genes that lack noticeable flanking intronic secondary structure.

  7. Exploring the molecular basis of dsRNA recognition by NS1 protein of influenza A virus using molecular dynamics simulation and free energy calculation.

    PubMed

    Pan, Dabo; Sun, Huijun; Shen, Yulin; Liu, Huanxiang; Yao, Xiaojun

    2011-12-01

    The frequent outbreak of influenza pandemic and the limited available anti-influenza drugs highlight the urgent need for the development of new antiviral drugs. The dsRNA-binding surface of nonstructural protein 1 of influenza A virus (NS1A) is a promising target. The detailed understanding of NS1A-dsRNA interaction will be valuable for structure-based anti-influenza drug discovery. To characterize and explore the key interaction features between dsRNA and NS1A, molecular dynamics simulation combined with MM-GBSA calculations were performed. Based on the MM-GBSA calculations, we find that the intermolecular van der Waals interaction and the nonpolar solvation term provide the main driving force for the binding process. Meanwhile, 17 key residues from NS1A were identified to be responsible for the dsRNA binding. Compared with the wild type NS1A, all the studied mutants S42A, T49A, R38A, R35AR46A have obvious reduced binding free energies with dsRNA reflecting in the reduction of the polar and/or nonpolar interactions. In addition, the structural and energy analysis indicate the mutations have a small effect to the backbone structures but the loss of side chain interactions is responsible for the decrease of the binding affinity. The uncovering of NS1A-dsRNA recognition mechanism will provide some useful insights and new chances for the development of anti-influenza drugs. Copyright © 2011 Elsevier B.V. All rights reserved.

  8. The standard operating procedure of the DOE-JGI Microbial Genome Annotation Pipeline (MGAP v.4)

    DOE PAGES

    Huntemann, Marcel; Ivanova, Natalia N.; Mavromatis, Konstantinos; ...

    2015-10-26

    The DOE-JGI Microbial Genome Annotation Pipeline performs structural and functional annotation of microbial genomes that are further included into the Integrated Microbial Genome comparative analysis system. MGAP is applied to assembled nucleotide sequence datasets that are provided via the IMG submission site. Dataset submission for annotation first requires project and associated metadata description in GOLD. The MGAP sequence data processing consists of feature prediction including identification of protein-coding genes, non-coding RNAs and regulatory RNA features, as well as CRISPR elements. In conclusion, structural annotation is followed by assignment of protein product names and functions.

  9. The standard operating procedure of the DOE-JGI Microbial Genome Annotation Pipeline (MGAP v.4)

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Huntemann, Marcel; Ivanova, Natalia N.; Mavromatis, Konstantinos

    The DOE-JGI Microbial Genome Annotation Pipeline performs structural and functional annotation of microbial genomes that are further included into the Integrated Microbial Genome comparative analysis system. MGAP is applied to assembled nucleotide sequence datasets that are provided via the IMG submission site. Dataset submission for annotation first requires project and associated metadata description in GOLD. The MGAP sequence data processing consists of feature prediction including identification of protein-coding genes, non-coding RNAs and regulatory RNA features, as well as CRISPR elements. In conclusion, structural annotation is followed by assignment of protein product names and functions.

  10. Circular RNA Expression: Its Potential Regulation and Function.

    PubMed

    Salzman, Julia

    2016-05-01

    In 2012, a new feature of eukaryotic gene expression emerged: ubiquitous expression of circular RNA (circRNA) from genes traditionally thought to express messenger or linear noncoding (nc)RNA only. CircRNAs are covalently closed, circular RNA molecules that typically comprise exonic sequences and are spliced at canonical splice sites. This feature of gene expression was first recognized in humans and mouse, but it quickly emerged that it was common across essentially all eukaryotes studied by molecular biologists. CircRNA abundance, and even which alternatively spliced circRNA isoforms are expressed, varies by cell type and can exceed the abundance of the traditional linear mRNA or ncRNA transcript. CircRNAs are enriched in the brain and increase in abundance during fetal development. Together, these features raise fundamental questions regarding the regulation of circRNA in cis and in trans, and its function. Copyright © 2016. Published by Elsevier Ltd.

  11. The role of RNA structure in the interaction of U1A protein with U1 hairpin II RNA

    PubMed Central

    Law, Michael J.; Rice, Andrew J.; Lin, Patti; Laird-Offringa, Ite A.

    2006-01-01

    The N-terminal RNA Recognition Motif (RRM1) of the spliceosomal protein U1A interacting with its target U1 hairpin II (U1hpII) has been used as a paradigm for RRM-containing proteins interacting with their RNA targets. U1A binds to U1hpII via direct interactions with a 7-nucleotide (nt) consensus binding sequence at the 5′ end of a 10-nt loop, and via hydrogen bonds with the closing C–G base pair at the top of the RNA stem. Using surface plasmon resonance (Biacore), we have examined the role of structural features of U1hpII in binding to U1A RRM1. Mutational analysis of the closing base pair suggests it plays a minor role in binding and mainly prevents “breathing” of the loop. Lengthening the stem and nontarget part of the loop suggests that the increased negative charge of the RNA might slightly aid association. However, this is offset by an increase in dissociation, which may be caused by attraction of the RRM to nontarget parts of the RNA. Studies of a single stranded target and RNAs with untethered loops indicate that structure is not very relevant for association but is important for complex stability. In particular, breaking the link between the stem and the 5′ side of the loop greatly increases complex dissociation, presumably by hindering simultaneous contacts between the RRM and stem and loop nucleotides. While binding of U1A to a single stranded target is much weaker than to U1hpII, it occurs with nanomolar affinity, supporting recent evidence that binding of unstructured RNA by U1A has physiological significance. PMID:16738410

  12. The role of RNA structure in the interaction of U1A protein with U1 hairpin II RNA.

    PubMed

    Law, Michael J; Rice, Andrew J; Lin, Patti; Laird-Offringa, Ite A

    2006-07-01

    The N-terminal RNA Recognition Motif (RRM1) of the spliceosomal protein U1A interacting with its target U1 hairpin II (U1hpII) has been used as a paradigm for RRM-containing proteins interacting with their RNA targets. U1A binds to U1hpII via direct interactions with a 7-nucleotide (nt) consensus binding sequence at the 5' end of a 10-nt loop, and via hydrogen bonds with the closing C-G base pair at the top of the RNA stem. Using surface plasmon resonance (Biacore), we have examined the role of structural features of U1hpII in binding to U1A RRM1. Mutational analysis of the closing base pair suggests it plays a minor role in binding and mainly prevents "breathing" of the loop. Lengthening the stem and nontarget part of the loop suggests that the increased negative charge of the RNA might slightly aid association. However, this is offset by an increase in dissociation, which may be caused by attraction of the RRM to nontarget parts of the RNA. Studies of a single stranded target and RNAs with untethered loops indicate that structure is not very relevant for association but is important for complex stability. In particular, breaking the link between the stem and the 5' side of the loop greatly increases complex dissociation, presumably by hindering simultaneous contacts between the RRM and stem and loop nucleotides. While binding of U1A to a single stranded target is much weaker than to U1hpII, it occurs with nanomolar affinity, supporting recent evidence that binding of unstructured RNA by U1A has physiological significance.

  13. microRNA-122 target sites in the hepatitis C virus RNA NS5B coding region and 3' untranslated region: function in replication and influence of RNA secondary structure.

    PubMed

    Gerresheim, Gesche K; Dünnes, Nadia; Nieder-Röhrmann, Anika; Shalamova, Lyudmila A; Fricke, Markus; Hofacker, Ivo; Höner Zu Siederdissen, Christian; Marz, Manja; Niepmann, Michael

    2017-02-01

    We have analyzed the binding of the liver-specific microRNA-122 (miR-122) to three conserved target sites of hepatitis C virus (HCV) RNA, two in the non-structural protein 5B (NS5B) coding region and one in the 3' untranslated region (3'UTR). miR-122 binding efficiency strongly depends on target site accessibility under conditions when the range of flanking sequences available for the formation of local RNA secondary structures changes. Our results indicate that the particular sequence feature that contributes most to the correlation between target site accessibility and binding strength varies between different target sites. This suggests that the dynamics of miRNA/Ago2 binding not only depends on the target site itself but also on flanking sequence context to a considerable extent, in particular in a small viral genome in which strong selection constraints act on coding sequence and overlapping cis-signals and model the accessibility of cis-signals. In full-length genomes, single and combination mutations in the miR-122 target sites reveal that site 5B.2 is positively involved in regulating overall genome replication efficiency, whereas mutation of site 5B.3 showed a weaker effect. Mutation of the 3'UTR site and double or triple mutants showed no significant overall effect on genome replication, whereas in a translation reporter RNA, the 3'UTR target site inhibits translation directed by the HCV 5'UTR. Thus, the miR-122 target sites in the 3'-region of the HCV genome are involved in a complex interplay in regulating different steps of the HCV replication cycle.

  14. An effective tumor-targeting strategy utilizing hypoxia-sensitive siRNA delivery system for improved anti-tumor outcome.

    PubMed

    Kang, Lin; Fan, Bo; Sun, Ping; Huang, Wei; Jin, Mingji; Wang, Qiming; Gao, Zhonggao

    2016-10-15

    Hypoxia is a feature of most solid tumors, and targeting hypoxia is considered the best validated yet not extensively exploited strategy in cancer therapy. Here, we report a novel tumor-targeting strategy using a hypoxia-sensitive siRNA delivery system. In the study, 2-nitroimidazole (NI), a hydrophobic component that can be converted to hydrophilic 2-aminoimidazole (AI) through bioreduction under hypoxic conditions, was conjugated to the alkylated polyethyleneimine (bPEI1.8k-C6) to form amphiphilic bPEI1.8k-C6-NI polycations. bPEI1.8k-C6-NI could self-assemble into micelle-like aggregates in aqueous solution, which contributed to the improved stability of the bPEI1.8k-C6-NI/siRNA polyplexes and resulted in increased cellular uptake. After transport into the hypoxic tumor cells, the selective nitro-to-amino reduction would cause a structural change and elicit a relatively loose structure to facilitate siRNA dissociation in the cytoplasm, ultimately enhancing gene silencing efficiency. Therefore, the conflict between the extracellular stability and the intracellular siRNA release ability of the polyplexes was solved by introducing the hypoxia-responsive unit. Consequently, the survivin-targeted siRNA-loaded polyplexes showed a remarkable anti-tumor effect not only in hypoxic cells, but also in tumor spheroids and tumor-bearing mice, indicating that the hypoxia-sensitive siRNA delivery system had great potential for tumor-targeted therapy. Hypoxia is one of the most remarkable features of most solid tumors, and targeting hypoxia is considered the best validated strategy in cancer therapy. However, in the past decades, there were few reports about using this strategy in drug delivery systems, especially in siRNA delivery systems. Therefore, we constructed a hypoxia-sensitive siRNA delivery system utilizing a hypoxia-responsive unit, 2-nitroimidazole, by which the unavoidable conflict between improved extracellular stability and promoted intracellular siRNA release in the same delivery system could be effectively solved, resulting in enhanced siRNA silencing efficiency in tumor cells. To our knowledge, the described work is the first demonstration of an siRNA delivery system using a hypoxia trigger for regulation of siRNA release, which represents a new strategy for tumor-targeted therapy, and it is expected that this strategy will be widely applied in the future. Copyright © 2016 Acta Materialia Inc. Published by Elsevier Ltd. All rights reserved.

  15. ToNER: A tool for identifying nucleotide enrichment signals in feature-enriched RNA-seq data.

    PubMed

    Promworn, Yuttachon; Kaewprommal, Pavita; Shaw, Philip J; Intarapanich, Apichart; Tongsima, Sissades; Piriyapongsa, Jittima

    2017-01-01

    Biochemical methods are available for enriching 5' ends of RNAs in prokaryotes, which are employed in the differential RNA-seq (dRNA-seq) and the more recent Cappable-seq protocols. Computational methods are needed to locate RNA 5' ends from these data by statistical analysis of the enrichment. Although statistical-based analysis methods have been developed for dRNA-seq, they may not be suitable for Cappable-seq data. The more efficient enrichment method employed in Cappable-seq compared with dRNA-seq could affect data distribution and thus algorithm performance. We present Transformation of Nucleotide Enrichment Ratios (ToNER), a tool for statistical modeling of enrichment from RNA-seq data obtained from enriched and unenriched libraries. The tool calculates nucleotide enrichment scores and determines the global transformation for fitting to the normal distribution using the Box-Cox procedure. From the transformed distribution, sites of significant enrichment are identified. To increase power of detection, meta-analysis across experimental replicates is offered. We tested the tool on Cappable-seq and dRNA-seq data for identifying Escherichia coli transcript 5' ends and compared the results with those from the TSSAR tool, which is designed for analyzing dRNA-seq data. When combining results across Cappable-seq replicates, ToNER detects more known transcript 5' ends than TSSAR. In general, the transcript 5' ends detected by ToNER but not TSSAR occur in regions which cannot be locally modeled by TSSAR. ToNER uses a simple yet robust statistical modeling approach, which can be used for detecting RNA 5'ends from Cappable-seq data, in particular when combining information from experimental replicates. The ToNER tool could potentially be applied for analyzing other RNA-seq datasets in which enrichment for other structural features of RNA is employed. 
The program is freely available for download at ToNER webpage (http://www4a.biotec.or.th/GI/tools/toner) and GitHub repository (https://github.com/PavitaKae/ToNER).
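
    The workflow described in the abstract above (per-nucleotide enrichment ratios, a global Box-Cox transformation toward normality, then selection of significantly enriched sites) can be sketched as follows. This is a minimal illustration under stated assumptions, not the ToNER code: the pseudocount, the lambda grid, the one-sided normal p-value, the absence of multiple-testing correction, and all function names are choices made here for the example.

```python
import math
from statistics import NormalDist, fmean, pvariance


def box_cox(values, lam):
    """Box-Cox transform of strictly positive values."""
    if abs(lam) < 1e-12:
        return [math.log(v) for v in values]
    return [(v ** lam - 1.0) / lam for v in values]


def box_cox_loglik(values, lam):
    """Profile log-likelihood of lambda under a normal model."""
    n = len(values)
    var = pvariance(box_cox(values, lam))
    return -0.5 * n * math.log(var) + (lam - 1.0) * sum(math.log(v) for v in values)


def best_lambda(values, grid=None):
    """Grid search for the lambda that maximizes the profile log-likelihood."""
    if grid is None:
        grid = [i / 20.0 for i in range(-40, 41)]  # -2.0 to 2.0 in steps of 0.05
    return max(grid, key=lambda lam: box_cox_loglik(values, lam))


def enriched_sites(enriched, unenriched, alpha=0.01, pseudocount=1.0):
    """Return (indices of significantly enriched positions, chosen lambda)."""
    # Assumed enrichment score: ratio of pseudocount-adjusted read counts.
    ratios = [(e + pseudocount) / (u + pseudocount)
              for e, u in zip(enriched, unenriched)]
    lam = best_lambda(ratios)
    transformed = box_cox(ratios, lam)
    normal = NormalDist(fmean(transformed), math.sqrt(pvariance(transformed)))
    # One-sided test: only unusually high transformed ratios count as enriched.
    p_values = [1.0 - normal.cdf(t) for t in transformed]
    return [i for i, p in enumerate(p_values) if p < alpha], lam


# Hypothetical read counts: mild background variation plus one strong 5' end.
unenriched = [10] * 100
enriched = [8 + (i % 5) for i in range(100)]
enriched[50] = 500
sites, lam = enriched_sites(enriched, unenriched)
```

    With these toy counts only position 50 is reported. The input must show some variation in the ratios, because a zero variance would make the log-likelihood undefined.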

  16. SARNAclust: Semi-automatic detection of RNA protein binding motifs from immunoprecipitation data

    PubMed Central

    Dotu, Ivan; Adamson, Scott I.; Coleman, Benjamin; Fournier, Cyril; Ricart-Altimiras, Emma; Eyras, Eduardo

    2018-01-01

    RNA-protein binding is critical to gene regulation, controlling fundamental processes including splicing, translation, localization and stability, and aberrant RNA-protein interactions are known to play a role in a wide variety of diseases. However, molecular understanding of RNA-protein interactions remains limited; in particular, identification of RNA motifs that bind proteins has long been challenging, especially when such motifs depend on both sequence and structure. Moreover, although RNA binding proteins (RBPs) often contain more than one binding domain, algorithms capable of identifying more than one binding motif simultaneously have not been developed. In this paper we present a novel pipeline to determine binding peaks in crosslinking immunoprecipitation (CLIP) data, to discover multiple possible RNA sequence/structure motifs among them, and to experimentally validate such motifs. At the core is a new semi-automatic algorithm SARNAclust, the first unsupervised method to identify and deconvolve multiple sequence/structure motifs simultaneously. SARNAclust computes similarity between sequence/structure objects using a graph kernel, providing the ability to isolate the impact of specific features through the bulge graph formalism. Application of SARNAclust to synthetic data shows its capability of clustering 5 motifs at once with a V-measure value of over 0.95, while GraphClust achieves only a V-measure of 0.083 and RNAcontext cannot detect any of the motifs. When applied to existing eCLIP sets, SARNAclust finds known motifs for SLBP and HNRNPC and novel motifs for several other RBPs such as AGGF1, AKAP8L and ILF3. We demonstrate an experimental validation protocol, a targeted Bind-n-Seq-like high-throughput sequencing approach that relies on RNA inverse folding for oligo pool design, that can validate the components within the SLBP motif. Finally, we use this protocol to experimentally interrogate the SARNAclust motif predictions for protein ILF3. 
Our results support a newly identified partially double-stranded UUUUUGAGA motif similar to that known for the splicing factor HNRNPC. PMID:29596423
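
    The V-measure quoted in the abstract above is the harmonic mean of homogeneity (each cluster contains a single class) and completeness (each class falls in a single cluster). A minimal sketch of that standard definition follows; the label lists are hypothetical and the code is not taken from SARNAclust.

```python
import math
from collections import Counter


def _entropy(counts, n):
    """Shannon entropy (natural log) from a collection of counts."""
    return -sum((c / n) * math.log(c / n) for c in counts if c > 0)


def _conditional_entropy(targets, given):
    """H(targets | given) for two equally long label sequences."""
    n = len(targets)
    joint = Counter(zip(given, targets))
    given_counts = Counter(given)
    return -sum((c / n) * math.log(c / given_counts[g])
                for (g, _t), c in joint.items())


def v_measure(true_labels, cluster_labels):
    """Harmonic mean of homogeneity and completeness, in [0, 1]."""
    n = len(true_labels)
    h_class = _entropy(Counter(true_labels).values(), n)
    h_cluster = _entropy(Counter(cluster_labels).values(), n)
    homogeneity = 1.0 if h_class == 0 else (
        1.0 - _conditional_entropy(true_labels, cluster_labels) / h_class)
    completeness = 1.0 if h_cluster == 0 else (
        1.0 - _conditional_entropy(cluster_labels, true_labels) / h_cluster)
    if homogeneity + completeness == 0:
        return 0.0
    return 2 * homogeneity * completeness / (homogeneity + completeness)


# Hypothetical motif labels: cluster ids need not match the class names.
perfect = v_measure(["m1", "m1", "m2", "m2"], [1, 1, 0, 0])
lumped = v_measure(["m1", "m1", "m2", "m2"], [0, 0, 0, 0])
```

    A clustering that recovers every motif scores 1 regardless of how the clusters are numbered, while lumping all sequences into one cluster scores 0, which gives a sense of the gap between the 0.95 and 0.083 values reported.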

  17. Identification of a novel box C/D snoRNA from mouse nucleolar cDNA library.

    PubMed

    Zhou, Hui; Zhao, Jin; Yu, Chuan-He; Luo, Qing-Jun; Chen, Yue-Qin; Xiao, Yu; Qu, Liang-Hu

    2004-02-18

    By constructing and screening a mouse nucleolar cDNA library, a novel mammalian small nucleolar RNA (snoRNA) was identified. The novel snoRNA, 70 nt in length, displays structural features typical of the C/D box snoRNA family. The snoRNA possesses an 11-nt-long rRNA antisense element and is predicted to guide the 2'-O-methylation of mouse 28S rRNA at G4043, a site unknown so far to be modified in vertebrates. The comparison of functional elements of snoRNA guides among eukaryotes reveals that the novel snoRNA is a mammalian counterpart of yeast snR38 despite the highly divergent sequences between them. Mouse and human snR38 and other cognates in distant vertebrates were positively detected with slight length variability. As expected, the rRNA ribose-methylation site predicted by mouse snR38 was precisely mapped by specific-primer extension assay. Furthermore, our analyses show that mouse and human snR38 genes have multiple variants and are nested in the introns of different host genes with unknown function. Thus, snR38 is a phylogenetically conserved methylation guide but exhibits different genomic organization in eukaryotes.

  18. Mutational analysis of the RNA-binding domain of the Prunus necrotic ringspot virus (PNRSV) movement protein reveals its requirement for cell-to-cell movement

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Carmen Herranz, Ma; Sanchez-Navarro, Jesus-Angel; Sauri, Ana

    2005-08-15

    The movement protein (MP) of Prunus necrotic ringspot virus (PNRSV) is required for cell-to-cell movement. MP subcellular localization studies using a GFP fusion protein revealed highly punctate structures between neighboring cells, believed to represent plasmodesmata. Deletion of the RNA-binding domain (RBD) of PNRSV MP abolishes the cell-to-cell movement. A mutational analysis on this RBD was performed in order to identify in vivo the features that govern viral transport. Loss of positive charges prevented the cell-to-cell movement even though all mutants showed a similar accumulation level in protoplasts to those observed with the wild-type (wt) MP. Synthetic peptides representing the mutants and wild-type RBDs were used to study RNA-binding affinities by EMSA assays being approximately 20-fold lower in the mutants. Circular dichroism analyses revealed that the secondary structure of the peptides was not significantly affected by mutations. The involvement of the affinity changes between the viral RNA and the MP in the viral cell-to-cell movement is discussed.

  19. A tick-borne segmented RNA virus contains genome segments derived from unsegmented viral ancestors

    PubMed Central

    Qin, Xin-Cheng; Shi, Mang; Tian, Jun-Hua; Lin, Xian-Dan; Gao, Dong-Ya; He, Jin-Rong; Wang, Jian-Bo; Li, Ci-Xiu; Kang, Yan-Jun; Yu, Bin; Zhou, Dun-Jin; Xu, Jianguo; Plyusnin, Alexander; Holmes, Edward C.; Zhang, Yong-Zhen

    2014-01-01

    Although segmented and unsegmented RNA viruses are commonplace, the evolutionary links between these two very different forms of genome organization are unclear. We report the discovery and characterization of a tick-borne virus—Jingmen tick virus (JMTV)—that reveals an unexpected connection between segmented and unsegmented RNA viruses. The JMTV genome comprises four segments, two of which are related to the nonstructural protein genes of the genus Flavivirus (family Flaviviridae), whereas the remaining segments are unique to this virus, have no known homologs, and contain a number of features indicative of structural protein genes. Remarkably, homology searching revealed that sequences related to JMTV were present in the cDNA library from Toxocara canis (dog roundworm; Nematoda), and that shared strong sequence and structural resemblances. Epidemiological studies showed that JMTV is distributed in tick populations across China, especially Rhipicephalus and Haemaphysalis spp., and experiences frequent host-switching and genomic reassortment. To our knowledge, JMTV is the first example of a segmented RNA virus with a genome derived in part from unsegmented viral ancestors. PMID:24753611

  20. mRNA bound to the 30S subunit is a HigB toxin substrate

    PubMed Central

    Schureck, Marc A.; Maehigashi, Tatsuya; Miles, Stacey J.; Marquez, Jhomar; Dunham, Christine M.

    2016-01-01

    Activation of bacterial toxins during stress results in cleavage of mRNAs in the context of the ribosome. These toxins are thought to function as global translational inhibitors yet recent studies suggest each may have distinct mRNA specificities that result in selective translation for bacterial survival. Here we demonstrate that mRNA in the context of a bacterial 30S subunit is sufficient for ribosome-dependent toxin HigB endonucleolytic activity, suggesting that HigB interferes with the initiation step of translation. We determined the X-ray crystal structure of HigB bound to the 30S, revealing that two solvent-exposed clusters of HigB basic residues directly interact with 30S 16S rRNA helices 18, 30, and 31. We further show that these HigB residues are essential for ribosome recognition and function. Comparison with other ribosome-dependent toxins RelE and YoeB reveals that each interacts with similar features of the 30S aminoacyl (A) site yet does so through presentation of diverse structural motifs. PMID:27307497

  1. Mutational analysis of the RNA-binding domain of the Prunus necrotic ringspot virus (PNRSV) movement protein reveals its requirement for cell-to-cell movement.

    PubMed

    Carmen Herranz, Ma; Sanchez-Navarro, Jesús-Angel; Saurí, Ana; Mingarro, Ismael; Pallás, Vicente

    2005-08-15

    The movement protein (MP) of Prunus necrotic ringspot virus (PNRSV) is required for cell-to-cell movement. MP subcellular localization studies using a GFP fusion protein revealed highly punctate structures between neighboring cells, believed to represent plasmodesmata. Deletion of the RNA-binding domain (RBD) of PNRSV MP abolishes the cell-to-cell movement. A mutational analysis on this RBD was performed in order to identify in vivo the features that govern viral transport. Loss of positive charges prevented the cell-to-cell movement even though all mutants showed a similar accumulation level in protoplasts to those observed with the wild-type (wt) MP. Synthetic peptides representing the mutants and wild-type RBDs were used to study RNA-binding affinities by EMSA assays being approximately 20-fold lower in the mutants. Circular dichroism analyses revealed that the secondary structure of the peptides was not significantly affected by mutations. The involvement of the affinity changes between the viral RNA and the MP in the viral cell-to-cell movement is discussed.

  2. Optimized approach for Ion Proton RNA sequencing reveals details of RNA splicing and editing features of the transcriptome.

    PubMed

    Brown, Roger B; Madrid, Nathaniel J; Suzuki, Hideaki; Ness, Scott A

    2017-01-01

    RNA-sequencing (RNA-seq) has become the standard method for unbiased analysis of gene expression but also provides access to more complex transcriptome features, including alternative RNA splicing, RNA editing, and even detection of fusion transcripts formed through chromosomal translocations. However, differences in library methods can adversely affect the ability to recover these different types of transcriptome data. For example, some methods have bias for one end of transcripts or rely on low-efficiency steps that limit the complexity of the resulting library, making detection of rare transcripts less likely. We tested several commonly used methods of RNA-seq library preparation and found vast differences in the detection of advanced transcriptome features, such as alternatively spliced isoforms and RNA editing sites. By comparing several different protocols available for the Ion Proton sequencer and by utilizing detailed bioinformatics analysis tools, we were able to develop an optimized random primer based RNA-seq technique that is reliable at uncovering rare transcript isoforms and RNA editing features, as well as fusion reads from oncogenic chromosome rearrangements. The combination of optimized libraries and rapid Ion Proton sequencing provides a powerful platform for the transcriptome analysis of research and clinical samples.

  3. An anionic phthalocyanine decreases NRAS expression by breaking down its RNA G-quadruplex.

    PubMed

    Kawauchi, Keiko; Sugimoto, Wataru; Yasui, Takatoshi; Murata, Kohei; Itoh, Katsuhiko; Takagi, Kazuki; Tsuruoka, Takaaki; Akamatsu, Kensuke; Tateishi-Karimata, Hisae; Sugimoto, Naoki; Miyoshi, Daisuke

    2018-06-11

    Aberrant activation of RAS signalling pathways contributes to aggressive phenotypes of cancer cells. The RAS-targeted therapies for cancer, therefore, have been recognised to be effective; however, current developments on targeting RAS have not advanced due to structural features of the RAS protein. Here, we show that expression of NRAS, a major isoform of RAS, can be controlled by photo-irradiation with an anionic phthalocyanine, ZnAPC, targeting NRAS mRNA. In vitro experiments reveal that ZnAPC binds to a G-quadruplex-forming oligonucleotide derived from the 5'-untranslated region of NRAS mRNA even in the presence of excess double-stranded RNA, which is abundant in cells, resulting in selective cleavage of the target RNA's G-quadruplex upon photo-irradiation. In line with these results, upon photo-irradiation, ZnAPC decreases NRAS mRNA and NRAS expression and thus viability of cancer cells. These results indicate that ZnAPC may be a prominent photosensitiser for a molecularly targeted photodynamic therapy for cancer.

  4. The first phlebo-like virus infecting plants: a case study on the adaptation of negative-stranded RNA viruses to new hosts.

    PubMed

    Navarro, Beatriz; Minutolo, Maria; De Stradis, Angelo; Palmisano, Francesco; Alioto, Daniela; Di Serio, Francesco

    2018-05-01

    A novel negative-stranded (ns) RNA virus associated with a severe citrus disease reported more than 80 years ago has been identified. Transmission electron microscopy showed that this novel virus, tentatively named citrus concave gum-associated virus, is flexuous and non-enveloped. Notwithstanding, its two genomic RNAs share structural features with members of the genus Phlebovirus, which are enveloped arthropod-transmitted viruses infecting mammals, and with a group of still unclassified phlebo-like viruses mainly infecting arthropods. CCGaV genomic RNAs code for an RNA-dependent RNA polymerase, a nucleocapsid protein and a putative movement protein showing structural and phylogenetic relationships with phlebo-like viruses, phleboviruses and the unrelated ophioviruses, respectively, thus providing intriguing evidence of a modular genome evolution. Phylogenetic reconstructions identified an invertebrate-restricted virus as the most likely ancestor of this virus, revealing that its adaptation to plants was independent from and possibly predated that of the other nsRNA plant viruses. These data are consistent with an evolutionary scenario in which trans-kingdom adaptation occurred several times during the history of nsRNA viruses and followed different evolutionary pathways, in which genomic RNA segments were gained or lost. The need to create a new genus for this bipartite nsRNA virus and the impact of the rapid and specific detection methods developed here on citrus sanitation and certification are also discussed. © 2017 BSPP AND JOHN WILEY & SONS LTD.

  5. Visualizing bacterial tRNA identity determinants and antideterminants using function logos and inverse function logos

    PubMed Central

    Freyhult, Eva; Moulton, Vincent; Ardell, David H.

    2006-01-01

    Sequence logos are stacked bar graphs that generalize the notion of consensus sequence. They employ entropy statistics very effectively to display variation in a structural alignment of sequences of a common function, while emphasizing its over-represented features. Yet sequence logos cannot display features that distinguish functional subclasses within a structurally related superfamily nor do they display under-represented features. We introduce two extensions to address these needs: function logos and inverse logos. Function logos display subfunctions that are over-represented among sequences carrying a specific feature. Inverse logos generalize both sequence logos and function logos by displaying under-represented, rather than over-represented, features or functions in structural alignments. To make inverse logos, a compositional inverse is applied to the feature or function frequency distributions before logo construction, where a compositional inverse is a mathematical transform that makes common features or functions rare and vice versa. We applied these methods to a database of structurally aligned bacterial tDNAs to create highly condensed, bird's-eye views of potentially all so-called identity determinants and antideterminants that confer specific amino acid charging or initiator function on tRNAs in bacteria. We recovered both known and a few potentially novel identity elements. Function logos and inverse logos are useful tools for exploratory bioinformatic analysis of structure–function relationships in sequence families and superfamilies. PMID:16473848
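
    The entropy statistic and the compositional inverse mentioned in this abstract can be illustrated with a minimal sketch. It assumes the standard compositional-data definition of the inverse (take reciprocals, then renormalize so the values sum to 1), which may differ in detail from the authors' implementation. The function names, the pseudocount of 0.5, the ACGT alphabet, and the example column are all hypothetical choices made for illustration, not values from the paper.

```python
import math
from collections import Counter


def column_frequencies(column, alphabet="ACGT", pseudocount=0.5):
    """Symbol frequencies for one alignment column.

    The pseudocount (an assumption of this sketch) keeps every
    frequency above zero so that reciprocals are always defined.
    """
    counts = Counter(column)
    total = len(column) + pseudocount * len(alphabet)
    return {s: (counts.get(s, 0) + pseudocount) / total for s in alphabet}


def information_content(freqs):
    """Bits of information: log2(alphabet size) minus Shannon entropy."""
    entropy = -sum(p * math.log2(p) for p in freqs.values() if p > 0)
    return math.log2(len(freqs)) - entropy


def compositional_inverse(freqs):
    """Reciprocal of each frequency, renormalized to sum to 1."""
    reciprocals = {s: 1.0 / p for s, p in freqs.items()}
    norm = sum(reciprocals.values())
    return {s: r / norm for s, r in reciprocals.items()}


# Hypothetical alignment column dominated by G.
column = "GGGGGGGA"
freqs = column_frequencies(column)
inverse = compositional_inverse(freqs)

print("most common symbol:", max(freqs, key=freqs.get))
print("rarest symbol after inversion:", min(inverse, key=inverse.get))
print("information (bits):", round(information_content(freqs), 3))
```

    In this sketch the symbol that dominates the original column becomes the rarest one after inversion, which is the behaviour the abstract describes: common features become rare and vice versa.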

  6. Analysis and Prediction of Exon Skipping Events from RNA-Seq with Sequence Information Using Rotation Forest.

    PubMed

    Du, Xiuquan; Hu, Changlin; Yao, Yu; Sun, Shiwei; Zhang, Yanping

    2017-12-12

    In bioinformatics, exon skipping (ES) event prediction is an essential part of alternative splicing (AS) event analysis. Although many methods have been developed to predict ES events, a solution has yet to be found. In this study, given the limitations of machine learning algorithms with RNA-Seq data or genome sequences, a new feature set, called RS (RNA-seq and sequence) features, was constructed. These features include RNA-Seq features derived from the RNA-Seq data and sequence features derived from genome sequences. We propose a novel Rotation Forest classifier to predict ES events with the RS features (RotaF-RSES). To validate the efficacy of RotaF-RSES, a dataset from two human tissues was used, and RotaF-RSES achieved an accuracy of 98.4%, a specificity of 99.2%, a sensitivity of 94.1%, and an area under the curve (AUC) of 98.6%. When compared to the other available methods, the results indicate that RotaF-RSES is efficient and can predict ES events with RS features.
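
    For readers unfamiliar with the performance measures quoted in this abstract, the following minimal sketch shows how accuracy, sensitivity, and specificity are conventionally computed from a binary confusion matrix. The function name and the counts are hypothetical and are not taken from the study; they do not reproduce the reported figures.

```python
def binary_metrics(tp, fp, tn, fn):
    """Standard binary-classification metrics from confusion-matrix counts."""
    return {
        # Fraction of all predictions that are correct.
        "accuracy": (tp + tn) / (tp + fp + tn + fn),
        # Fraction of true exon-skipping events that are detected.
        "sensitivity": tp / (tp + fn),
        # Fraction of non-events that are correctly rejected.
        "specificity": tn / (tn + fp),
    }


# Hypothetical counts, chosen only to illustrate the arithmetic.
metrics = binary_metrics(tp=80, fp=10, tn=990, fn=5)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

    When negatives greatly outnumber positives, as in these made-up counts, accuracy tracks specificity closely, which is one reason sensitivity is usually reported alongside it.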

  7. Structural features of LC8-induced self-association of swallow.

    PubMed

    Kidane, Ariam I; Song, Yujuan; Nyarko, Afua; Hall, Justin; Hare, Michael; Löhr, Frank; Barbar, Elisar

    2013-09-03

    Cell functions depend on the collective activity of protein networks within which a few proteins, called hubs, participate in a large number of interactions. Dynein light chain LC8, first discovered as a subunit of the motor protein dynein, is considered to have a role broader than that of dynein, and its participation in diverse systems fits the description of a hub. Among its partners is Swallow with which LC8 is essential for proper localization of bicoid mRNA at the anterior cortex of Drosophila oocytes. Why LC8 is essential in this process is not clear, but emerging evidence suggests that LC8 functions by promoting self-association and/or structural organization of its diverse binding partners. This work addresses the energetics and structural features of LC8-induced Swallow self-association distant from LC8 binding. Mutational design based on a hypothetical helical wheel, intermonomer nuclear Overhauser effects assigned to residues expected at interface positions, and circular dichroism spectral characteristics indicate that the LC8-promoted dimer of Swallow is a coiled coil. Secondary chemical shifts and (15)N backbone relaxation identify the boundaries and distinguishing structural features of the coiled coil. Thermodynamic analysis of Swallow polypeptides designed to decouple self-association from LC8 binding reveals that the higher binding affinity of the engineered bivalent Swallow is of purely entropic origin and that the linker separating the coiled coil from the LC8 binding site remains disordered. We speculate that the LC8-promoted coiled coil is critical for bicoid mRNA localization because it favors structural organization of Swallow, which except for the central LC8-promoted coiled coil is primarily disordered.

  8. Structural Features of LC8-Induced Self Association of Swallow

    PubMed Central

    Kidane, Ariam I.; Song, Yujuan; Nyarko, Afua; Hall, Justin; Hare, Michael; Löhr, Frank; Barbar, Elisar

    2013-01-01

    Cell function depends on the collective activity of protein networks within which a few proteins, called hubs, participate in a large number of interactions. Dynein light chain LC8, first discovered as a subunit of the motor protein dynein, is considered to have a role broader than dynein and its participation in diverse systems fits the description of a hub. Among its partners is Swallow with which LC8 is essential for proper localization of bicoid mRNA at the anterior cortex of Drosophila oocytes. Why LC8 is essential in this process is not clear, but emerging evidence suggests that LC8 functions by promoting self-association and/or structural organization of its diverse binding partners. This work addresses the mechanistic and structural features of LC8-induced Swallow self-association distant from LC8 binding. Mutational design based on a hypothetical helical wheel, inter-monomer NOEs assigned to residues expected at interface positions and circular dichroism spectral characteristics indicate that the LC8-promoted dimer of Swallow is a coiled-coil. Secondary chemical shifts and 15N backbone relaxation identify the boundaries and distinguishing structural features of the coiled-coil. Thermodynamic analysis of Swallow polypeptides designed to decouple self-association from LC8 binding reveals that the higher binding affinity of the engineered bivalent Swallow is of purely entropic origin and that the linker separating the coiled-coil from the LC8 binding site remains disordered. We speculate that the LC8-promoted coiled-coil is critical for bicoid mRNA localization because it could induce structural organization of Swallow, which except for the central LC8-promoted coiled-coil is primarily disordered. PMID:23914803

  9. The Structural Basis for Recognition of the PreQ0 Metabolite by an Unusually Small Riboswitch Aptamer Domain

    PubMed Central

    Spitale, Robert C.; Torelli, Andrew T.; Krucinska, Jolanta; Bandarian, Vahe; Wedekind, Joseph E.

    2009-01-01

    Riboswitches are RNA elements that control gene expression through metabolite binding. The preQ1 riboswitch exhibits the smallest known ligand-binding domain and is of interest for its economical organization and high affinity interactions with guanine-derived metabolites required to confer tRNA wobbling. Here we present the crystal structure of a preQ1 aptamer domain in complex with its precursor metabolite preQ0. The structure is highly compact with a core that features a stem capped by a well organized decaloop. The metabolite is recognized within a deep pocket via Watson-Crick pairing with C15. Additional hydrogen bonds are made to invariant bases U6 and A29. The ligand-bound state confers continuous helical stacking throughout the core fold, thus providing a platform to promote Watson-Crick base pairing between C9 of the decaloop and the first base of the ribosome-binding site, G33. The structure offers insight into the mode of ribosome-binding site sequestration by a minimal RNA fold stabilized by metabolite binding and has implications for understanding the molecular basis by which bacterial genes are regulated. PMID:19261617

  10. ATP-dependent human RISC assembly pathways.

    PubMed

    Yoda, Mayuko; Kawamata, Tomoko; Paroo, Zain; Ye, Xuecheng; Iwasaki, Shintaro; Liu, Qinghua; Tomari, Yukihide

    2010-01-01

    The assembly of RNA-induced silencing complex (RISC) is a key process in small RNA-mediated gene silencing. In humans, small interfering RNAs (siRNAs) and microRNAs (miRNAs) are incorporated into RISCs containing the Argonaute (AGO) subfamily proteins Ago1-4. Previous studies have proposed that, unlike Drosophila melanogaster RISC assembly pathways, human RISC assembly is coupled with dicing and is independent of ATP. Here we show by careful reexamination that, in humans, RISC assembly and dicing are uncoupled, and ATP greatly facilitates RISC loading of small-RNA duplexes. Moreover, all four human AGO proteins show remarkably similar structural preferences for small-RNA duplexes: central mismatches promote RISC loading, and seed or 3'-mid (guide position 12-15) mismatches facilitate unwinding. All these features of human AGO proteins are highly reminiscent of fly Ago1 but not fly Ago2.

  11. Engineering RNA for Targeted siRNA Delivery and Medical Application

    PubMed Central

    Guo, Peixuan; Coban, Oana; Snead, Nick; Trebley, Joe; Hoeprich, Steve; Guo, Songchuan; Shu, Yi

    2010-01-01

    RNA engineering for nanotechnology and medical applications is an exciting emerging research field. RNA has intrinsically defined features on the nanometer scale and is a particularly interesting candidate for such applications due to its amazing diversity, flexibility and versatility in structure and function. Specifically, the current use of siRNA to silence target genes involved in disease has generated much excitement in the scientific community. The intrinsic ability to sequence-specifically down-regulate gene expression in a temporally- and spatially-controlled fashion has led to heightened interest and rapid development of siRNA-based therapeutics. Though methods for gene silencing with high efficacy and specificity have been achieved in vitro, the effective delivery of nucleic acids to specific cells in vivo has been a hurdle for RNA therapeutics. This review covers different RNA-based approaches for diagnosis, prevention and treatment of human disease, with a focus on the latest developments of nonviral carriers of siRNA for delivery in vivo. The applications and challenges of siRNA therapy, as well as potential solutions to these problems, the approaches for using phi29 pRNA-based vectors as polyvalent vehicles for specific delivery of siRNA, ribozymes, drugs or other therapeutic agents to specific cells for therapy will also be addressed. PMID:20230868

  12. RNA nanotechnology for computer design and in vivo computation

    PubMed Central

    Qiu, Meikang; Khisamutdinov, Emil; Zhao, Zhengyi; Pan, Cheryl; Choi, Jeong-Woo; Leontis, Neocles B.; Guo, Peixuan

    2013-01-01

    Molecular-scale computing has been explored since 1989 owing to the foreseeable limitation of Moore's law for silicon-based computation devices. With the potential of massive parallelism, low energy consumption and capability of working in vivo, molecular-scale computing promises a new computational paradigm. Inspired by the concepts from the electronic computer, DNA computing has realized basic Boolean functions and has progressed into multi-layered circuits. Recently, RNA nanotechnology has emerged as an alternative approach. Owing to the newly discovered thermodynamic stability of a special RNA motif (Shu et al. 2011 Nat. Nanotechnol. 6, 658–667 (doi:10.1038/nnano.2011.105)), RNA nanoparticles are emerging as another promising medium for nanodevice and nanomedicine as well as molecular-scale computing. Like DNA, RNA sequences can be designed to form desired secondary structures in a straightforward manner, but RNA is structurally more versatile and more thermodynamically stable owing to its non-canonical base-pairing, tertiary interactions and base-stacking property. A 90-nucleotide RNA can exhibit 4⁹⁰ nanostructures, and its loops and tertiary architecture can serve as a mounting dovetail that eliminates the need for external linking dowels. Its enzymatic and fluorogenic activity creates diversity in computational design. Varieties of small RNA can work cooperatively, synergistically or antagonistically to carry out computational logic circuits. The riboswitch and enzymatic ribozyme activities and its special in vivo attributes offer a great potential for in vivo computation. Unique features in transcription, termination, self-assembly, self-processing and acid resistance enable in vivo production of RNA nanoparticles that harbour various regulators for intracellular manipulation. With all these advantages, RNA computation is promising, but it is still in its infancy. Many challenges still exist. Collaborations between RNA nanotechnologists and computer scientists are necessary to advance this nascent technology. PMID:24000362

  13. RNA nanotechnology for computer design and in vivo computation.

    PubMed

    Qiu, Meikang; Khisamutdinov, Emil; Zhao, Zhengyi; Pan, Cheryl; Choi, Jeong-Woo; Leontis, Neocles B; Guo, Peixuan

    2013-10-13

    Molecular-scale computing has been explored since 1989 owing to the foreseeable limitation of Moore's law for silicon-based computation devices. With the potential of massive parallelism, low energy consumption and capability of working in vivo, molecular-scale computing promises a new computational paradigm. Inspired by the concepts from the electronic computer, DNA computing has realized basic Boolean functions and has progressed into multi-layered circuits. Recently, RNA nanotechnology has emerged as an alternative approach. Owing to the newly discovered thermodynamic stability of a special RNA motif (Shu et al. 2011 Nat. Nanotechnol. 6, 658-667 (doi:10.1038/nnano.2011.105)), RNA nanoparticles are emerging as another promising medium for nanodevice and nanomedicine as well as molecular-scale computing. Like DNA, RNA sequences can be designed to form desired secondary structures in a straightforward manner, but RNA is structurally more versatile and more thermodynamically stable owing to its non-canonical base-pairing, tertiary interactions and base-stacking property. A 90-nucleotide RNA can exhibit 4⁹⁰ nanostructures, and its loops and tertiary architecture can serve as a mounting dovetail that eliminates the need for external linking dowels. Its enzymatic and fluorogenic activity creates diversity in computational design. Varieties of small RNA can work cooperatively, synergistically or antagonistically to carry out computational logic circuits. The riboswitch and enzymatic ribozyme activities and its special in vivo attributes offer a great potential for in vivo computation. Unique features in transcription, termination, self-assembly, self-processing and acid resistance enable in vivo production of RNA nanoparticles that harbour various regulators for intracellular manipulation. With all these advantages, RNA computation is promising, but it is still in its infancy. Many challenges still exist. Collaborations between RNA nanotechnologists and computer scientists are necessary to advance this nascent technology.
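
    The figure of 4 to the power 90 quoted in this abstract can be put in perspective with a short calculation. The sketch below assumes the number arises from four possible bases at each of 90 positions, which is an interpretation rather than something the abstract states; the variable names are hypothetical.

```python
# Assumption of this sketch: one of four bases (A, C, G, U)
# at each of 90 independent positions.
n_bases = 4
n_positions = 90

n_sequences = n_bases ** n_positions  # exact integer arithmetic

print(f"4^90 has {len(str(n_sequences))} decimal digits")
print(f"approximately {n_sequences:.1e}")
```

    Under that assumption the count is on the order of 10^54, far more variants than could ever be enumerated experimentally.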

  14. Poliovirus Polymerase Leu420 Facilitates RNA Recombination and Ribavirin Resistance

    PubMed Central

    Kempf, Brian J.; Peersen, Olve B.

    2016-01-01

    RNA recombination is important in the formation of picornavirus species groups and the ongoing evolution of viruses within species groups. In this study, we examined the structure and function of poliovirus polymerase, 3Dpol, as it relates to RNA recombination. Recombination occurs when nascent RNA products exchange one viral RNA template for another during RNA replication. Because recombination is a natural aspect of picornavirus replication, we hypothesized that some features of 3Dpol may exist, in part, to facilitate RNA recombination. Furthermore, we reasoned that alanine substitution mutations that disrupt 3Dpol-RNA interactions within the polymerase elongation complex might increase and/or decrease the magnitudes of recombination. We found that an L420A mutation in 3Dpol decreased the frequency of RNA recombination, whereas alanine substitutions at other sites in 3Dpol increased the frequency of recombination. The 3Dpol Leu420 side chain interacts with a ribose in the nascent RNA product 3 nucleotides from the active site of the polymerase. Notably, the L420A mutation that reduced recombination also rendered the virus more susceptible to inhibition by ribavirin, coincident with the accumulation of ribavirin-induced G→A and C→U mutations in viral RNA. We conclude that 3Dpol Leu420 is critically important for RNA recombination and that RNA recombination contributes to ribavirin resistance. Importance: Recombination contributes to the formation of picornavirus species groups and the emergence of circulating vaccine-derived polioviruses (cVDPVs). The recombinant viruses that arise in nature are occasionally more fit than either parental strain, especially when the two partners in recombination are closely related, i.e., members of characteristic species groups, such as enterovirus species groups A to H or rhinovirus species groups A to C. Our study shows that RNA recombination requires conserved features of the viral polymerase. Furthermore, a polymerase mutation that disables recombination renders the virus more susceptible to the antiviral drug ribavirin, suggesting that recombination contributes to ribavirin resistance. Elucidating the molecular mechanisms of RNA replication and recombination may help mankind achieve and maintain poliovirus eradication. PMID:27412593

  15. Efficient Translation of Pelargonium line pattern virus RNAs Relies on a TED-Like 3´-Translational Enhancer that Communicates with the Corresponding 5´-Region through a Long-Distance RNA-RNA Interaction

    PubMed Central

    Blanco-Pérez, Marta; Pérez-Cañamás, Miryam; Ruiz, Leticia; Hernández, Carmen

    2016-01-01

    Cap-independent translational enhancers (CITEs) have been identified at the 3´-terminal regions of distinct plant positive-strand RNA viruses belonging to families Tombusviridae and Luteoviridae. On the basis of their structural and/or functional requirements, at least six classes of CITEs have been defined whose distribution does not correlate with taxonomy. The so-called TED class has been relatively under-studied and its functionality only confirmed in the case of Satellite tobacco necrosis virus, a parasitic subviral agent. The 3´-untranslated region of the monopartite genome of Pelargonium line pattern virus (PLPV), the recommended type member of a tentative new genus (Pelarspovirus) in the family Tombusviridae, was predicted to contain a TED-like CITE. Similar CITEs can be anticipated in some other related viruses though none has been experimentally verified. Here, we first reassessed the structure of the putative PLPV-TED through in silico predictions and in vitro SHAPE analysis with the full-length PLPV genome, which indicated that the presumed TED element is larger than previously proposed. The extended conformation of the TED is strongly supported by the pattern of natural sequence variation, thus providing comparative structural evidence in support of the structural data obtained by in silico and in vitro approaches. Next, we have obtained experimental evidence demonstrating the in vivo activity of the PLPV-TED in the genomic (g) RNA, and also in the subgenomic (sg) RNA that the virus produces to express 3´-proximal genes. Besides other structural features, the results have highlighted the key role of long-distance kissing-loop interactions between the 3´-CITE and 5´-proximal hairpins for gRNA and sgRNA translation. Bioassays of CITE mutants have confirmed the importance of the identified 5´-3´ RNA communication for viral infectivity and, moreover, have underlined the strong evolutionary constraints that may operate on genome stretches with both regulatory and coding functions. PMID:27043436

  16. Efficient Translation of Pelargonium line pattern virus RNAs Relies on a TED-Like 3´-Translational Enhancer that Communicates with the Corresponding 5´-Region through a Long-Distance RNA-RNA Interaction.

    PubMed

    Blanco-Pérez, Marta; Pérez-Cañamás, Miryam; Ruiz, Leticia; Hernández, Carmen

    2016-01-01

    Cap-independent translational enhancers (CITEs) have been identified at the 3´-terminal regions of distinct plant positive-strand RNA viruses belonging to families Tombusviridae and Luteoviridae. On the basis of their structural and/or functional requirements, at least six classes of CITEs have been defined whose distribution does not correlate with taxonomy. The so-called TED class has been relatively under-studied and its functionality only confirmed in the case of Satellite tobacco necrosis virus, a parasitic subviral agent. The 3´-untranslated region of the monopartite genome of Pelargonium line pattern virus (PLPV), the recommended type member of a tentative new genus (Pelarspovirus) in the family Tombusviridae, was predicted to contain a TED-like CITE. Similar CITEs can be anticipated in some other related viruses though none has been experimentally verified. Here, we first reassessed the structure of the putative PLPV-TED through in silico predictions and in vitro SHAPE analysis with the full-length PLPV genome, which indicated that the presumed TED element is larger than previously proposed. The extended conformation of the TED is strongly supported by the pattern of natural sequence variation, thus providing comparative structural evidence in support of the structural data obtained by in silico and in vitro approaches. Next, we have obtained experimental evidence demonstrating the in vivo activity of the PLPV-TED in the genomic (g) RNA, and also in the subgenomic (sg) RNA that the virus produces to express 3´-proximal genes. Besides other structural features, the results have highlighted the key role of long-distance kissing-loop interactions between the 3´-CITE and 5´-proximal hairpins for gRNA and sgRNA translation. Bioassays of CITE mutants have confirmed the importance of the identified 5´-3´ RNA communication for viral infectivity and, moreover, have underlined the strong evolutionary constraints that may operate on genome stretches with both regulatory and coding functions.

  17. Systematic analysis and evolution of 5S ribosomal DNA in metazoans.

    PubMed

    Vierna, J; Wehner, S; Höner zu Siederdissen, C; Martínez-Lage, A; Marz, M

    2013-11-01

    Several studies on 5S ribosomal DNA (5S rDNA) have been focused on a subset of the following features in mostly one organism: number of copies, pseudogenes, secondary structure, promoter and terminator characteristics, genomic arrangements, types of non-transcribed spacers and evolution. In this work, we systematically analyzed 5S rDNA sequence diversity in available metazoan genomes, and showed organism-specific and evolutionary-conserved features. Putatively functional sequences (12,766) from 97 organisms allowed us to identify general features of this multigene family in animals. Interestingly, we show that each mammal species has a highly conserved (housekeeping) 5S rRNA type and many variable ones. The genomic organization of 5S rDNA is still under debate. Here, we report the occurrence of several paralog 5S rRNA sequences in 58 of the examined species, and a flexible genome organization of 5S rDNA in animals. We found heterogeneous 5S rDNA clusters in several species, supporting the hypothesis of an exchange of 5S rDNA from one locus to another. A rather high degree of variation of upstream, internal and downstream putative regulatory regions appears to characterize metazoan 5S rDNA. We systematically studied the internal promoters and described three different types of termination signals, as well as variable distances between the coding region and the typical termination signal. Finally, we present a statistical method for detection of linkage among noncoding RNA (ncRNA) gene families. This method showed no evolutionary-conserved linkage among 5S rDNAs and any other ncRNA genes within Metazoa, even though we found 5S rDNA to be linked to various ncRNAs in several clades.

  18. Systematic analysis and evolution of 5S ribosomal DNA in metazoans

    PubMed Central

    Vierna, J; Wehner, S; Höner zu Siederdissen, C; Martínez-Lage, A; Marz, M

    2013-01-01

    Several studies on 5S ribosomal DNA (5S rDNA) have been focused on a subset of the following features in mostly one organism: number of copies, pseudogenes, secondary structure, promoter and terminator characteristics, genomic arrangements, types of non-transcribed spacers and evolution. In this work, we systematically analyzed 5S rDNA sequence diversity in available metazoan genomes, and showed organism-specific and evolutionary-conserved features. Putatively functional sequences (12 766) from 97 organisms allowed us to identify general features of this multigene family in animals. Interestingly, we show that each mammal species has a highly conserved (housekeeping) 5S rRNA type and many variable ones. The genomic organization of 5S rDNA is still under debate. Here, we report the occurrence of several paralog 5S rRNA sequences in 58 of the examined species, and a flexible genome organization of 5S rDNA in animals. We found heterogeneous 5S rDNA clusters in several species, supporting the hypothesis of an exchange of 5S rDNA from one locus to another. A rather high degree of variation of upstream, internal and downstream putative regulatory regions appears to characterize metazoan 5S rDNA. We systematically studied the internal promoters and described three different types of termination signals, as well as variable distances between the coding region and the typical termination signal. Finally, we present a statistical method for detection of linkage among noncoding RNA (ncRNA) gene families. This method showed no evolutionary-conserved linkage among 5S rDNAs and any other ncRNA genes within Metazoa, even though we found 5S rDNA to be linked to various ncRNAs in several clades. PMID:23838690

  19. Regulation of gene expression by the BLM helicase correlates with the presence of G-quadruplex DNA motifs

    PubMed Central

    Nguyen, Giang Huong; Tang, Weiliang; Robles, Ana I.; Beyer, Richard P.; Gray, Lucas T.; Welsh, Judith A.; Schetter, Aaron J.; Kumamoto, Kensuke; Wang, Xin Wei; Hickson, Ian D.; Maizels, Nancy; Monnat, Raymond J.; Harris, Curtis C.

    2014-01-01

    Bloom syndrome is a rare autosomal recessive disorder characterized by genetic instability and cancer predisposition, and caused by mutations in the gene encoding the Bloom syndrome, RecQ helicase-like (BLM) protein. To determine whether altered gene expression might be responsible for pathological features of Bloom syndrome, we analyzed mRNA and microRNA (miRNA) expression in fibroblasts from individuals with Bloom syndrome and in BLM-depleted control fibroblasts. We identified mRNA and miRNA expression differences in Bloom syndrome patient and BLM-depleted cells. Differentially expressed mRNAs are connected with cell proliferation, survival, and molecular mechanisms of cancer, and differentially expressed miRNAs target genes involved in cancer and in immune function. These and additional altered functions or pathways may contribute to the proportional dwarfism, elevated cancer risk, immune dysfunction, and other features observed in Bloom syndrome individuals. BLM binds to G-quadruplex (G4) DNA, and G4 motifs were enriched at transcription start sites (TSS) and especially within first introns (false discovery rate ≤ 0.001) of differentially expressed mRNAs in Bloom syndrome compared with normal cells, suggesting that G-quadruplex structures formed at these motifs are physiologic targets for BLM. These results identify a network of mRNAs and miRNAs that may drive the pathogenesis of Bloom syndrome. PMID:24958861

  20. Regulation of gene expression by the BLM helicase correlates with the presence of G-quadruplex DNA motifs.

    PubMed

    Nguyen, Giang Huong; Tang, Weiliang; Robles, Ana I; Beyer, Richard P; Gray, Lucas T; Welsh, Judith A; Schetter, Aaron J; Kumamoto, Kensuke; Wang, Xin Wei; Hickson, Ian D; Maizels, Nancy; Monnat, Raymond J; Harris, Curtis C

    2014-07-08

    Bloom syndrome is a rare autosomal recessive disorder characterized by genetic instability and cancer predisposition, and caused by mutations in the gene encoding the Bloom syndrome, RecQ helicase-like (BLM) protein. To determine whether altered gene expression might be responsible for pathological features of Bloom syndrome, we analyzed mRNA and microRNA (miRNA) expression in fibroblasts from individuals with Bloom syndrome and in BLM-depleted control fibroblasts. We identified mRNA and miRNA expression differences in Bloom syndrome patient and BLM-depleted cells. Differentially expressed mRNAs are connected with cell proliferation, survival, and molecular mechanisms of cancer, and differentially expressed miRNAs target genes involved in cancer and in immune function. These and additional altered functions or pathways may contribute to the proportional dwarfism, elevated cancer risk, immune dysfunction, and other features observed in Bloom syndrome individuals. BLM binds to G-quadruplex (G4) DNA, and G4 motifs were enriched at transcription start sites (TSS) and especially within first introns (false discovery rate ≤ 0.001) of differentially expressed mRNAs in Bloom syndrome compared with normal cells, suggesting that G-quadruplex structures formed at these motifs are physiologic targets for BLM. These results identify a network of mRNAs and miRNAs that may drive the pathogenesis of Bloom syndrome.

  1. Type and Level of RMRP Functional Impairment Predicts Phenotype in the Cartilage Hair Hypoplasia–Anauxetic Dysplasia Spectrum

    PubMed Central

    Thiel, Christian T.; Mortier, Geert; Kaitila, Ilkka; Reis, André; Rauch, Anita

    2007-01-01

    Mutations in the RMRP gene lead to a wide spectrum of autosomal recessive skeletal dysplasias, ranging from the milder phenotypes metaphyseal dysplasia without hypotrichosis and cartilage hair hypoplasia (CHH) to the severe anauxetic dysplasia (AD). This clinical spectrum includes different degrees of short stature, hair hypoplasia, defective erythrogenesis, and immunodeficiency. The RMRP gene encodes the untranslated RNA component of the mitochondrial RNA–processing ribonuclease, RNase MRP. We recently demonstrated that mutations may affect both messenger RNA (mRNA) and ribosomal RNA (rRNA) cleavage and thus cell-cycle regulation and protein synthesis. To investigate the genotype-phenotype correlation, we analyzed the position and the functional effect of 13 mutations in patients with variable features of the CHH-AD spectrum. Those at the end of the spectrum include a novel patient with anauxetic dysplasia who was compound heterozygous for the null mutation g.254_263delCTCAGCGCGG and the mutation g.195C→T, which was previously described in patients with milder phenotypes. Mapping of nucleotide conservation to the two-dimensional structure of the RMRP gene revealed that disease-causing mutations either affect evolutionarily conserved nucleotides or are likely to alter secondary structure through mispairing in stem regions. In vitro testing of RNase MRP multiprotein-specific mRNA and rRNA cleavage of different mutations revealed a strong correlation between the decrease in rRNA cleavage in ribosomal assembly and the degree of bone dysplasia, whereas reduced mRNA cleavage, and thus cell-cycle impairment, predicts the presence of hair hypoplasia, immunodeficiency, and hematological abnormalities and thus increased cancer risk. PMID:17701897

  2. Diverse activities of viral cis-acting RNA regulatory elements revealed using multicolor, long-term, single-cell imaging.

    PubMed

    Pocock, Ginger M; Zimdars, Laraine L; Yuan, Ming; Eliceiri, Kevin W; Ahlquist, Paul; Sherer, Nathan M

    2017-02-01

    Cis-acting RNA structural elements govern crucial aspects of viral gene expression. How these structures and other posttranscriptional signals affect RNA trafficking and translation in the context of single cells is poorly understood. Herein we describe a multicolor, long-term (>24 h) imaging strategy for measuring integrated aspects of viral RNA regulatory control in individual cells. We apply this strategy to demonstrate differential mRNA trafficking behaviors governed by RNA elements derived from three retroviruses (HIV-1, murine leukemia virus, and Mason-Pfizer monkey virus), two hepadnaviruses (hepatitis B virus and woodchuck hepatitis virus), and an intron-retaining transcript encoded by the cellular NXF1 gene. Striking behaviors include "burst" RNA nuclear export dynamics regulated by HIV-1's Rev response element and the viral Rev protein; transient aggregations of RNAs into discrete foci at or near the nuclear membrane triggered by multiple elements; and a novel, pulsiform RNA export activity regulated by the hepadnaviral posttranscriptional regulatory element. We incorporate single-cell tracking and a data-mining algorithm into our approach to obtain RNA element-specific, high-resolution gene expression signatures. Together these imaging assays constitute a tractable, systems-based platform for studying otherwise difficult to access spatiotemporal features of viral and cellular gene regulation. © 2017 Pocock et al. This article is distributed by The American Society for Cell Biology under license from the author(s). Two months after publication it is available to the public under an Attribution–Noncommercial–Share Alike 3.0 Unported Creative Commons License (http://creativecommons.org/licenses/by-nc-sa/3.0).

  3. Staufen1 dimerizes via a conserved motif and a degenerate dsRNA-binding domain to promote mRNA decay

    PubMed Central

    Gleghorn, Michael L.; Gong, Chenguang; Kielkopf, Clara L.; Maquat, Lynne E.

    2014-01-01

    Staufen (STAU)1-mediated mRNA decay (SMD) degrades mammalian-cell mRNAs that bind the double-stranded (ds)RNA-binding protein STAU1 in their 3′-untranslated region. We report a new motif, which typifies STAU homologs from all vertebrate classes, that is responsible for human (h)STAU1 homodimerization. Our crystal structure and mutagenesis analyses reveal that this motif, now named the Staufen-swapping motif (SSM), and dsRNA-binding domain 5 (‘RBD’5) mediate protein dimerization: the two SSM α-helices of one molecule interact primarily through a hydrophobic patch with the two ‘RBD’5 α-helices of a second molecule. ‘RBD’5 adopts the canonical α-β-β-β-α fold of a functional RBD, but it lacks residues and features needed to bind duplex RNA. In cells, SSM-mediated hSTAU1 dimerization increases the efficiency of SMD by augmenting hSTAU1 binding to the ATP-dependent RNA helicase hUPF1. Dimerization regulates keratinocyte-mediated wound-healing and, undoubtedly, many other cellular processes. PMID:23524536

  4. Molecular Phylogenetics and Systematics of the Bivalve Family Ostreidae Based on rRNA Sequence-Structure Models and Multilocus Species Tree

    PubMed Central

    Salvi, Daniele; Macali, Armando; Mariottini, Paolo

    2014-01-01

    The bivalve family Ostreidae has a worldwide distribution and includes species of high economic importance. Phylogenetics and systematics of oysters based on morphology have proved difficult because of their high phenotypic plasticity. In this study we explore the phylogenetic information of the DNA sequence and secondary structure of the nuclear, fast-evolving, ITS2 rRNA and the mitochondrial 16S rRNA genes from the Ostreidae and we implemented a multi-locus framework based on four loci for oyster phylogenetics and systematics. Sequence-structure rRNA models aided sequence alignment and improved accuracy and nodal support of phylogenetic trees. In agreement with previous molecular studies, our phylogenetic results indicate that none of the currently recognized subfamilies, Crassostreinae, Ostreinae, and Lophinae, is monophyletic. Single gene trees based on maximum likelihood (ML) and Bayesian (BA) methods and on sequence-structure ML were congruent with multilocus trees based on concatenated (ML and BA) and coalescent-based (BA) approaches and consistently supported three main clades: (i) Crassostrea, (ii) Saccostrea, and (iii) an Ostreinae-Lophinae lineage. Therefore, the subfamily Crassostreinae (including Crassostrea), Saccostreinae subfam. nov. (including Saccostrea and tentatively Striostrea) and Ostreinae (including Ostreinae and Lophinae taxa) are recognized. Based on phylogenetic and biogeographical evidence the Asian species of Crassostrea from the Pacific Ocean are assigned to Magallana gen. nov., whereas an integrative taxonomic revision is required for the genera Ostrea and Dendostrea. This study pointed out the suitability of the ITS2 marker for DNA barcoding of oysters and the relevance of using sequence-structure rRNA models and features of the ITS2 folding in molecular phylogenetics and taxonomy. The multilocus approach allowed inferring a robust phylogeny of Ostreidae providing a broad molecular perspective on their systematics. PMID:25250663

  5. Molecular phylogenetics and systematics of the bivalve family Ostreidae based on rRNA sequence-structure models and multilocus species tree.

    PubMed

    Salvi, Daniele; Macali, Armando; Mariottini, Paolo

    2014-01-01

    The bivalve family Ostreidae has a worldwide distribution and includes species of high economic importance. Phylogenetics and systematics of oysters based on morphology have proved difficult because of their high phenotypic plasticity. In this study we explore the phylogenetic information of the DNA sequence and secondary structure of the nuclear, fast-evolving, ITS2 rRNA and the mitochondrial 16S rRNA genes from the Ostreidae and we implemented a multi-locus framework based on four loci for oyster phylogenetics and systematics. Sequence-structure rRNA models aided sequence alignment and improved accuracy and nodal support of phylogenetic trees. In agreement with previous molecular studies, our phylogenetic results indicate that none of the currently recognized subfamilies, Crassostreinae, Ostreinae, and Lophinae, is monophyletic. Single gene trees based on maximum likelihood (ML) and Bayesian (BA) methods and on sequence-structure ML were congruent with multilocus trees based on concatenated (ML and BA) and coalescent-based (BA) approaches and consistently supported three main clades: (i) Crassostrea, (ii) Saccostrea, and (iii) an Ostreinae-Lophinae lineage. Therefore, the subfamily Crassostreinae (including Crassostrea), Saccostreinae subfam. nov. (including Saccostrea and tentatively Striostrea) and Ostreinae (including Ostreinae and Lophinae taxa) are recognized [corrected]. Based on phylogenetic and biogeographical evidence the Asian species of Crassostrea from the Pacific Ocean are assigned to Magallana gen. nov., whereas an integrative taxonomic revision is required for the genera Ostrea and Dendostrea. This study pointed out the suitability of the ITS2 marker for DNA barcoding of oysters and the relevance of using sequence-structure rRNA models and features of the ITS2 folding in molecular phylogenetics and taxonomy. The multilocus approach allowed inferring a robust phylogeny of Ostreidae providing a broad molecular perspective on their systematics.

  6. Small-angle X-ray Solution Scattering Study of the Multi-aminoacyl-tRNA Synthetase Complex Reveals an Elongated and Multi-armed Particle

    PubMed Central

    Dias, José; Renault, Louis; Pérez, Javier; Mirande, Marc

    2013-01-01

    In animal cells, nine aminoacyl-tRNA synthetases are associated with the three auxiliary proteins p18, p38, and p43 to form a stable and conserved large multi-aminoacyl-tRNA synthetase complex (MARS), whose molecular mass has been proposed to be between 1.0 and 1.5 MDa. The complex acts as a molecular hub for coordinating protein synthesis and diverse regulatory signal pathways. Electron microscopy studies defined its low resolution molecular envelope as an overall rather compact, asymmetric triangular shape. Here, we have analyzed the composition and homogeneity of the native mammalian MARS isolated from rabbit liver and characterized its overall internal structure, size, and shape at low resolution by hydrodynamic methods and small-angle X-ray scattering in solution. Our data reveal that the MARS exhibits a much more elongated and multi-armed shape than expected from previous reports. The hydrodynamic and structural features of the MARS are large compared with other supramolecular assemblies involved in translation, including the ribosome. The large dimensions and non-compact structural organization of MARS favor a large protein surface accessibility for all its components. This may be essential to allow structural rearrangements between the catalytic and cis-acting tRNA binding domains of the synthetases required for binding the bulky tRNA substrates. This non-compact architecture may also contribute to the spatiotemporal controlled release of some of its components, which participate in non-canonical functions after dissociation from the complex. PMID:23836901

  7. Reducing the worst case running times of a family of RNA and CFG problems, using Valiant's approach.

    PubMed

    Zakov, Shay; Tsur, Dekel; Ziv-Ukelson, Michal

    2011-08-18

    RNA secondary structure prediction is a mainstream bioinformatic domain, and is key to computational analysis of functional RNA. In more than 30 years, much research has been devoted to defining different variants of RNA structure prediction problems, and to developing techniques for improving prediction quality. Nevertheless, most of the algorithms in this field follow a similar dynamic programming approach as that presented by Nussinov and Jacobson in the late 70's, which typically yields cubic worst case running time algorithms. Recently, some algorithmic approaches were applied to improve the complexity of these algorithms, motivated by new discoveries in the RNA domain and by the need to efficiently analyze the increasing amount of accumulated genome-wide data. We study Valiant's classical algorithm for Context Free Grammar recognition in sub-cubic time, and extract features that are common to problems on which Valiant's approach can be applied. Based on this, we describe several problem templates, and formulate generic algorithms that use Valiant's technique and can be applied to all problems which abide by these templates, including many problems within the world of RNA Secondary Structures and Context Free Grammars. The algorithms presented in this paper improve the theoretical asymptotic worst case running time bounds for a large family of important problems. It is also possible that the suggested techniques could be applied to yield a practical speedup for these problems. For some of the problems (such as computing the RNA partition function and base-pair binding probabilities), the presented techniques are the only ones which are currently known for reducing the asymptotic running time bounds of the standard algorithms.
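
    For orientation, the cubic-time dynamic program that the abstract attributes to Nussinov and Jacobson can be sketched as follows. This is a minimal base-pair maximization variant, not the sub-cubic Valiant-based algorithm of the paper. The function name, the allowed pair set (Watson-Crick plus G-U wobble), and the minimum hairpin loop length of 3 are assumptions chosen for illustration.

```python
def nussinov_max_pairs(seq, min_loop=3):
    """Maximum number of nested base pairs in `seq` (Nussinov-style DP).

    dp[i][j] holds the best pair count for the subsequence seq[i..j].
    Three nested loops over (span, i, k) give the cubic worst-case time
    mentioned in the abstract.
    """
    # Assumed pairing rules: Watson-Crick pairs plus G-U wobble.
    pairs = {("A", "U"), ("U", "A"), ("G", "C"),
             ("C", "G"), ("G", "U"), ("U", "G")}
    n = len(seq)
    if n == 0:
        return 0
    dp = [[0] * n for _ in range(n)]
    for span in range(min_loop + 1, n):
        for i in range(n - span):
            j = i + span
            best = dp[i][j - 1]              # case 1: position j is unpaired
            for k in range(i, j - min_loop):  # case 2: j pairs with some k
                if (seq[k], seq[j]) in pairs:
                    left = dp[i][k - 1] if k > i else 0
                    best = max(best, left + 1 + dp[k + 1][j - 1])
            dp[i][j] = best
    return dp[0][n - 1]


if __name__ == "__main__":
    # Hypothetical hairpin: three stacked pairs closing a 3-nt loop.
    print(nussinov_max_pairs("GGGAAAUCC"))
```

    The inner maximization over the split point k is the step that Valiant-style approaches reorganize into matrix-product-like operations; the sketch keeps the plain triple loop so the cubic cost is visible.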

  8. Reducing the worst case running times of a family of RNA and CFG problems, using Valiant's approach

    PubMed Central

    2011-01-01

    Background: RNA secondary structure prediction is a mainstream bioinformatic domain, and is key to computational analysis of functional RNA. In more than 30 years, much research has been devoted to defining different variants of RNA structure prediction problems, and to developing techniques for improving prediction quality. Nevertheless, most of the algorithms in this field follow a similar dynamic programming approach as that presented by Nussinov and Jacobson in the late 70's, which typically yields cubic worst case running time algorithms. Recently, some algorithmic approaches were applied to improve the complexity of these algorithms, motivated by new discoveries in the RNA domain and by the need to efficiently analyze the increasing amount of accumulated genome-wide data. Results: We study Valiant's classical algorithm for Context Free Grammar recognition in sub-cubic time, and extract features that are common to problems on which Valiant's approach can be applied. Based on this, we describe several problem templates, and formulate generic algorithms that use Valiant's technique and can be applied to all problems which abide by these templates, including many problems within the world of RNA Secondary Structures and Context Free Grammars. Conclusions: The algorithms presented in this paper improve the theoretical asymptotic worst case running time bounds for a large family of important problems. It is also possible that the suggested techniques could be applied to yield a practical speedup for these problems. For some of the problems (such as computing the RNA partition function and base-pair binding probabilities), the presented techniques are the only ones which are currently known for reducing the asymptotic running time bounds of the standard algorithms. PMID:21851589

  9. GFFview: A Web Server for Parsing and Visualizing Annotation Information of Eukaryotic Genome.

    PubMed

    Deng, Feilong; Chen, Shi-Yi; Wu, Zhou-Lin; Hu, Yongsong; Jia, Xianbo; Lai, Song-Jia

    2017-10-01

    Owing to the wide application of RNA sequencing (RNA-seq) technology, more and more eukaryotic genomes have been extensively annotated with features such as gene structure, alternative splicing, and noncoding loci. Genome annotation is commonly stored as plain text in General Feature Format (GFF), in files that can be hundreds or thousands of Mb in size. Manipulating GFF files is therefore a challenge for biologists without bioinformatics skills. In this study, we provide a web server (GFFview) for parsing the annotation information of a eukaryotic genome and then generating a statistical description of six indices for visualization. GFFview is very useful for investigating the quality and differences of de novo assembled transcriptomes in RNA-seq studies.
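
    To give a sense of what parsing a GFF file involves, here is a minimal sketch that streams a GFF file line by line and tallies features by type. It relies only on the standard nine tab-separated GFF columns with 1-based inclusive coordinates. The function name `summarize_gff` and the two statistics it reports are illustrative assumptions; they are not the six indices computed by GFFview, which the abstract does not enumerate.

```python
from collections import Counter


def summarize_gff(lines):
    """Count features and total covered length per feature type.

    `lines` is any iterable of text lines (for example an open file),
    so large files are processed without loading them into memory.
    """
    counts = Counter()
    total_length = Counter()
    for line in lines:
        line = line.rstrip("\n")
        if not line or line.startswith("#"):
            continue  # skip blank lines, comments, and directives
        fields = line.split("\t")
        if len(fields) != 9:
            continue  # skip malformed records in this simple sketch
        feature_type = fields[2]
        start, end = int(fields[3]), int(fields[4])
        counts[feature_type] += 1
        total_length[feature_type] += end - start + 1  # inclusive coords
    return counts, total_length


if __name__ == "__main__":
    # Hypothetical three-record annotation used only for illustration.
    example = [
        "##gff-version 3",
        "chr1\tdemo\tgene\t100\t500\t.\t+\t.\tID=gene1",
        "chr1\tdemo\texon\t100\t200\t.\t+\t.\tParent=gene1",
        "chr1\tdemo\texon\t300\t500\t.\t+\t.\tParent=gene1",
    ]
    print(summarize_gff(example))
```

    With a real file, the same function could be called as `summarize_gff(open(path))`, where `path` is a placeholder for the annotation file.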

  10. Structure of the 30 kDa HIV-1 RNA Dimerization Signal by a Hybrid Cryo-EM, NMR, and Molecular Dynamics Approach.

    PubMed

    Zhang, Kaiming; Keane, Sarah C; Su, Zhaoming; Irobalieva, Rossitza N; Chen, Muyuan; Van, Verna; Sciandra, Carly A; Marchant, Jan; Heng, Xiao; Schmid, Michael F; Case, David A; Ludtke, Steven J; Summers, Michael F; Chiu, Wah

    2018-03-06

    Cryoelectron microscopy (cryo-EM) and nuclear magnetic resonance (NMR) spectroscopy are routinely used to determine structures of macromolecules with molecular weights over 65 and under 25 kDa, respectively. We combined these techniques to study a 30 kDa HIV-1 dimer initiation site RNA ([DIS]₂; 47 nt/strand). A 9 Å cryo-EM map clearly shows major groove features of the double helix and a right-handed superhelical twist. Simulated cryo-EM maps generated from time-averaged molecular dynamics trajectories (10 ns) exhibited levels of detail similar to those in the experimental maps, suggesting internal structural flexibility limits the cryo-EM resolution. Simultaneous inclusion of the cryo-EM map and ²H-edited NMR-derived distance restraints during structure refinement generates a structure consistent with both datasets and supporting a flipped-out base within a conserved purine-rich bulge. Our findings demonstrate the power of combining global and local structural information from these techniques for structure determination of modest-sized RNAs. Copyright © 2018 Elsevier Ltd. All rights reserved.

  11. Interdomain Contacts Control Native State Switching of RfaH on a Dual-Funneled Landscape

    PubMed Central

    Ramírez-Sarmiento, César A.; Noel, Jeffrey K.; Valenzuela, Sandro L.; Artsimovitch, Irina

    2015-01-01

    RfaH is a virulence factor from Escherichia coli whose C-terminal domain (CTD) undergoes a dramatic α-to-β conformational transformation. The CTD in its α-helical fold is stabilized by interactions with the N-terminal domain (NTD), masking an RNA polymerase binding site until a specific recruitment site is encountered. Domain dissociation is triggered upon binding to DNA, allowing the NTD to interact with RNA polymerase to facilitate transcription while the CTD refolds into the β-barrel conformation that interacts with the ribosome to activate translation. However, structural details of this transformation process in the context of the full protein remain to be elucidated. Here, we explore the mechanism of the α-to-β conformational transition of RfaH in the full-length protein using a dual-basin structure-based model. Our simulations capture several features described experimentally, such as the requirement of disruption of interdomain contacts to trigger the α-to-β transformation, confirm the roles of previously indicated residues E48 and R138, and suggest a new important role for F130 in the stability of the interdomain interaction. These native basins are connected through an intermediate state that builds up upon binding to the NTD and shares features from both folds, in agreement with previous in silico studies of the isolated CTD. We also examine the effect of RNA polymerase binding on the stabilization of the β fold. Our study shows that native-biased models are appropriate for interrogating the detailed mechanisms of structural rearrangements during the dramatic transformation process of RfaH. PMID:26230837

  12. Untangling the origin of viruses and their impact on cellular evolution.

    PubMed

    Nasir, Arshan; Sun, Feng-Jie; Kim, Kyung Mo; Caetano-Anollés, Gustavo

    2015-04-01

    The origin and evolution of viruses remain mysterious. Here, we focus on the distribution of viral replicons in host organisms, their morphological features, and the evolution of highly conserved protein and nucleic acid structures. The apparent inability of RNA viral replicons to infect contemporary akaryotic species suggests an early origin of RNA viruses and their subsequent loss in akaryotes. A census of virion morphotypes reveals that advanced forms were unique to viruses infecting a specific supergroup, while simpler forms were observed in viruses infecting organisms in all forms of cellular life. Results hint toward an ancient origin of viruses from an ancestral virus harboring either filamentous or spherical virions. Finally, phylogenetic trees built from protein domain and tRNA structures in thousands of genomes suggest that viruses evolved via reductive evolution from ancient cells. The analysis presents a complete account of the evolutionary history of cells and viruses and identifies viruses as crucial agents influencing cellular evolution. © 2015 New York Academy of Sciences.

  13. Nucleophosmin integrates within the nucleolus via multi-modal interactions with proteins displaying R-rich linear motifs and rRNA.

    PubMed

    Mitrea, Diana M; Cika, Jaclyn A; Guy, Clifford S; Ban, David; Banerjee, Priya R; Stanley, Christopher B; Nourse, Amanda; Deniz, Ashok A; Kriwacki, Richard W

    2016-02-02

    The nucleolus is a membrane-less organelle formed through liquid-liquid phase separation of its components from the surrounding nucleoplasm. Here, we show that nucleophosmin (NPM1) integrates within the nucleolus via a multi-modal mechanism involving multivalent interactions with proteins containing arginine-rich linear motifs (R-motifs) and ribosomal RNA (rRNA). Importantly, these R-motifs are found in canonical nucleolar localization signals. Based on a novel combination of biophysical approaches, we propose a model for the molecular organization within liquid-like droplets formed by the N-terminal domain of NPM1 and R-motif peptides, thus providing insights into the structural organization of the nucleolus. We identify multivalency of acidic tracts and folded nucleic acid binding domains, mediated by N-terminal domain oligomerization, as structural features required for phase separation of NPM1 with other nucleolar components in vitro and for localization within mammalian nucleoli. We propose that one mechanism of nucleolar localization involves phase separation of proteins within the nucleolus.

  14. Ferritin iron minerals are chelator targets, antioxidants, and coated, dietary iron.

    PubMed

    Theil, Elizabeth C

    2010-08-01

    Cellular ferritin is central for iron balance during transfusion therapies; serum ferritin is a small fraction of body ferritin, albeit a convenient reporter. Iron overload induces extra ferritin protein synthesis but the protein is overfilled with the extra iron that damages ferritin, with conversion to toxic hemosiderin. Three new approaches that manipulate ferritin to address excess iron, hemosiderin, and associated oxidative damage in Cooley's Anemia and other iron overload conditions are faster removal of ferritin iron with chelators guided to ferritin gated pores by peptides; more ferritin protein synthesis using ferritin mRNA activators, by metal complexes that target mRNA 3D structures; and determining if endocytotic absorption of iron from legumes, which is mostly ferritin, is regulated during iron overload to prevent excess iron entry while providing protein. More of a focus on ferritin features, including protein cage structure, iron mineral, regulatable mRNA, and specific gut absorption properties, will achieve the three novel experimental goals for managing iron homeostasis with transfusion therapies.

  15. Structure of Ljungan virus provides insight into genome packaging of this picornavirus

    NASA Astrophysics Data System (ADS)

    Zhu, Ling; Wang, Xiangxi; Ren, Jingshan; Porta, Claudine; Wenham, Hannah; Ekström, Jens-Ola; Panjwani, Anusha; Knowles, Nick J.; Kotecha, Abhay; Siebert, C. Alistair; Lindberg, A. Michael; Fry, Elizabeth E.; Rao, Zihe; Tuthill, Tobias J.; Stuart, David I.

    2015-10-01

    Picornaviruses are responsible for a range of human and animal diseases, but how their RNA genome is packaged remains poorly understood. A particularly poorly studied group within this family are those that lack the internal coat protein, VP4. Here we report the atomic structure of one such virus, Ljungan virus, the type member of the genus Parechovirus B, which has been linked to diabetes and myocarditis in humans. The 3.78-Å resolution cryo-electron microscopy structure shows remarkable features, including an extended VP1 C terminus, forming a major protuberance on the outer surface of the virus, and a basic motif at the N terminus of VP3, binding to which orders some 12% of the viral genome. This apparently charge-driven RNA attachment suggests that this branch of the picornaviruses uses a different mechanism of genome encapsidation, perhaps explored early in the evolution of picornaviruses.

  16. Structure of Ljungan virus provides insight into genome packaging of this picornavirus.

    PubMed

    Zhu, Ling; Wang, Xiangxi; Ren, Jingshan; Porta, Claudine; Wenham, Hannah; Ekström, Jens-Ola; Panjwani, Anusha; Knowles, Nick J; Kotecha, Abhay; Siebert, C Alistair; Lindberg, A Michael; Fry, Elizabeth E; Rao, Zihe; Tuthill, Tobias J; Stuart, David I

    2015-10-08

    Picornaviruses are responsible for a range of human and animal diseases, but how their RNA genome is packaged remains poorly understood. A particularly poorly studied group within this family are those that lack the internal coat protein, VP4. Here we report the atomic structure of one such virus, Ljungan virus, the type member of the genus Parechovirus B, which has been linked to diabetes and myocarditis in humans. The 3.78-Å resolution cryo-electron microscopy structure shows remarkable features, including an extended VP1 C terminus, forming a major protuberance on the outer surface of the virus, and a basic motif at the N terminus of VP3, binding to which orders some 12% of the viral genome. This apparently charge-driven RNA attachment suggests that this branch of the picornaviruses uses a different mechanism of genome encapsidation, perhaps explored early in the evolution of picornaviruses.

  17. Evidence that viral RNAs have evolved for efficient, two-stage packaging.

    PubMed

    Borodavka, Alexander; Tuma, Roman; Stockley, Peter G

    2012-09-25

    Genome packaging is an essential step in virus replication and a potential drug target. Single-stranded RNA viruses have been thought to encapsidate their genomes by gradual co-assembly with capsid subunits. In contrast, using a single molecule fluorescence assay to monitor RNA conformation and virus assembly in real time, with two viruses from differing structural families, we have discovered that packaging is a two-stage process. Initially, the genomic RNAs undergo rapid and dramatic (approximately 20-30%) collapse of their solution conformations upon addition of cognate coat proteins. The collapse occurs with a substoichiometric ratio of coat protein subunits and is followed by a gradual increase in particle size, consistent with the recruitment of additional subunits to complete a growing capsid. Equivalently sized nonviral RNAs, including high copy potential in vivo competitor mRNAs, do not collapse. They do support particle assembly, however, but yield many aberrant structures in contrast to viral RNAs that make only capsids of the correct size. The collapse is specific to viral RNA fragments, implying that it depends on a series of specific RNA-protein interactions. For bacteriophage MS2, we have shown that collapse is driven by subsequent protein-protein interactions, consistent with the RNA-protein contacts occurring in defined spatial locations. Conformational collapse appears to be a distinct feature of viral RNA that has evolved to facilitate assembly. Aspects of this process mimic those seen in ribosome assembly.

  18. Thermodynamic and spectroscopic investigations of TMPyP4 association with guanine- and cytosine-rich DNA and RNA repeats of C9orf72.

    PubMed

    Alniss, Hasan; Zamiri, Bita; Khalaj, Melisa; Pearson, Christopher E; Macgregor, Robert B

    2018-01-22

    An expansion of the hexanucleotide repeat (GGGGCC)n·(GGCCCC)n in the C9orf72 promoter has been shown to be the cause of amyotrophic lateral sclerosis and frontotemporal dementia (ALS-FTD). The C9orf72 repeat can form four-stranded structures; the cationic porphyrin (TMPyP4) binds and distorts these structures. Isothermal titration calorimetry (ITC) and circular dichroism (CD) were used to study the binding of TMPyP4 to the C-rich and G-rich DNA and RNA oligos containing the hexanucleotide repeat at pH 7.5 and 0.1 M K⁺. The CD spectra of G-rich DNA and RNA TMPyP4 complexes showed features of antiparallel and parallel G-quadruplexes, respectively. The shoulder at 260 nm in the CD spectrum becomes more intense upon formation of complexes between TMPyP4 and the C-rich DNA. The peak at 290 nm becomes more intense in the C-rich RNA molecules, suggesting induction of an i-motif structure. The ITC data showed that TMPyP4 binds at two independent sites for all DNA and RNA molecules. For DNA, the data are consistent with TMPyP4 stacking on the terminal tetrads and intercalation. For RNA, the thermodynamics of the two binding modes are consistent with groove binding and intercalation. In both cases, intercalation is the weaker binding mode. These findings are considered with respect to the structural differences of the folded DNA and RNA molecules and the energetics of the processes that drive site-specific recognition by TMPyP4; these data will be helpful in efforts to optimize the specificity and affinity of the binding of porphyrin-like molecules. Copyright © 2018 Elsevier Inc. All rights reserved.

  19. Bioinformatics analysis of plant orthologous introns: identification of an intronic tRNA-like sequence.

    PubMed

    Akkuratov, Evgeny E; Walters, Lorraine; Saha-Mandal, Arnab; Khandekar, Sushant; Crawford, Erin; Zirbel, Craig L; Leisner, Scott; Prakash, Ashwin; Fedorova, Larisa; Fedorov, Alexei

    2014-09-10

    Orthologous introns have identical positions relative to the coding sequence in orthologous genes of different species. By analyzing the complete genomes of five plants we generated a database of 40,512 orthologous intron groups of dicotyledonous plants, 28,519 orthologous intron groups of angiosperms, and 15,726 of land plants (moss and angiosperms). Multiple sequence alignments of each orthologous intron group were obtained using the Mafft algorithm. The number of conserved regions in plant introns appeared to be hundreds of times fewer than that in mammals or vertebrates. Approximately three quarters of conserved intronic regions among angiosperms and dicots, in particular, correspond to alternatively-spliced exonic sequences. We registered only a handful of conserved intronic ncRNAs of flowering plants. However, the most evolutionarily conserved intronic region, which is ubiquitous for all plants examined in this study, including moss, possessed multiple structural features of tRNAs, which caused us to classify it as a putative tRNA-like ncRNA. Intronic sequences encoding tRNA-like structures are not unique to plants. Bioinformatics examination of the presence of tRNA inside introns revealed an unusually long-term association of four glycine tRNAs inside the Vac14 gene of fish, amniotes, and mammals. Copyright © 2014 Elsevier B.V. All rights reserved.

  20. Cancer survival classification using integrated data sets and intermediate information.

    PubMed

    Kim, Shinuk; Park, Taesung; Kon, Mark

    2014-09-01

    Although numerous studies related to cancer survival have been published, increasing the prediction accuracy of survival classes still remains a challenge. Integration of different data sets, such as microRNA (miRNA) and mRNA, might increase the accuracy of survival class prediction. Therefore, we suggested a machine learning (ML) approach to integrate different data sets, and developed a novel method based on feature selection with Cox proportional hazard regression model (FSCOX) to improve the prediction of cancer survival time. FSCOX provides us with intermediate survival information, which is usually discarded when separating survival into 2 groups (short- and long-term), and allows us to perform survival analysis. We used an ML-based protocol for feature selection, integrating information from miRNA and mRNA expression profiles at the feature level. To predict survival phenotypes, we used the following classifiers, first, existing ML methods, support vector machine (SVM) and random forest (RF), second, a new median-based classifier using FSCOX (FSCOX_median), and third, an SVM classifier using FSCOX (FSCOX_SVM). We compared these methods using 3 types of cancer tissue data sets: (i) miRNA expression, (ii) mRNA expression, and (iii) combined miRNA and mRNA expression. The latter data set included features selected either from the combined miRNA/mRNA profile or independently from miRNAs and mRNAs profiles (IFS). In the ovarian data set, the accuracy of survival classification using the combined miRNA/mRNA profiles with IFS was 75% using RF, 86.36% using SVM, 84.09% using FSCOX_median, and 88.64% using FSCOX_SVM with a balanced 22 short-term and 22 long-term survivor data set. These accuracies are higher than those using miRNA alone (70.45%, RF; 75%, SVM; 75%, FSCOX_median; and 75%, FSCOX_SVM) or mRNA alone (65.91%, RF; 63.64%, SVM; 72.73%, FSCOX_median; and 70.45%, FSCOX_SVM). 
    Similarly, in the glioblastoma multiforme data, the accuracy of miRNA/mRNA using IFS was 75.51% (RF), 87.76% (SVM), 85.71% (FSCOX_median), and 85.71% (FSCOX_SVM). These results are higher than the results of using miRNA expression and mRNA expression alone. In addition, we predicted 16 hsa-miR-23b and hsa-miR-27b target genes in ovarian cancer data sets, obtained by SVM-based feature selection through integration of sequence information and gene expression profiles. Among the approaches used, the integrated miRNA and mRNA data set yielded better results than the individual data sets. The best performance was achieved using the FSCOX_SVM method with independent feature selection, which uses intermediate survival information between short-term and long-term survival time and the combination of the 2 different data sets. The results obtained using the combined data set suggest that there are some strong interactions between miRNA and mRNA features that are not detectable in the individual analyses. Copyright © 2014 Elsevier B.V. All rights reserved.
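
    The ovarian accuracies quoted above can be sanity-checked with simple arithmetic. The sketch below assumes that accuracy means correct calls divided by the 44 samples of the balanced set (22 short-term plus 22 long-term survivors); the abstract does not spell out this definition, and the function name is hypothetical.

```python
# Assumption: accuracy = correct / total on the balanced ovarian set
# (22 short-term + 22 long-term survivors = 44 samples).
TOTAL = 22 + 22

# Combined miRNA/mRNA profiles with IFS, percentages as quoted in the abstract.
reported = {
    "RF": 75.0,
    "SVM": 86.36,
    "FSCOX_median": 84.09,
    "FSCOX_SVM": 88.64,
}


def implied_correct(accuracy_pct, total=TOTAL):
    """Return the whole number of correct calls that reproduces a percentage."""
    count = round(accuracy_pct / 100 * total)
    # The count must round back to the quoted two-decimal percentage.
    if abs(100 * count / total - accuracy_pct) >= 0.005:
        raise ValueError("percentage is not a whole count out of total")
    return count


counts = {name: implied_correct(pct) for name, pct in reported.items()}
for name, count in counts.items():
    print(f"{name}: {count}/{TOTAL} correct")
```

    Under that assumption each quoted percentage corresponds to a whole number of samples, so the differences between methods amount to a handful of patients.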

  1. New support vector machine-based method for microRNA target prediction.

    PubMed

    Li, L; Gao, Q; Mao, X; Cao, Y

    2014-06-09

    MicroRNA (miRNA) plays important roles in cell differentiation, proliferation, growth, mobility, and apoptosis. An accurate list of precise target genes is necessary in order to fully understand the importance of miRNAs in animal development and disease. Several computational methods have been proposed for miRNA target-gene identification. However, these methods still have limitations with respect to their sensitivity and accuracy. Thus, we developed a new miRNA target-prediction method based on the support vector machine (SVM) model. The model supplies information of two binding sites (primary and secondary) for a radial basis function kernel as a similarity measure for SVM features. The information is categorized based on structural, thermodynamic, and sequence conservation. Using high-confidence datasets selected from public miRNA target databases, we obtained a human miRNA target SVM classifier model with high performance and provided an efficient tool for human miRNA target gene identification. Experiments have shown that our method is a reliable tool for miRNA target-gene prediction, and a successful application of an SVM classifier. Compared with other methods, the method proposed here improves the sensitivity and accuracy of miRNA prediction. Its performance can be further improved by providing more training examples.
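
    For readers unfamiliar with the radial basis function kernel mentioned above, here is a minimal sketch of how it scores the similarity of two feature vectors. The feature layout (one structural, one thermodynamic and one conservation value per site) and the gamma value are illustrative assumptions, not the authors' actual encoding.

```python
import math


def rbf_kernel(x, y, gamma=0.5):
    """Radial basis function kernel: exp(-gamma * ||x - y||^2)."""
    squared_distance = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * squared_distance)


# Hypothetical candidate sites: [structural, thermodynamic, conservation] scores.
site_a = [0.8, -1.2, 0.9]
site_b = [0.7, -1.0, 0.9]
site_c = [0.1, 0.5, 0.2]

print(rbf_kernel(site_a, site_b))  # close to 1: similar sites
print(rbf_kernel(site_a, site_c))  # closer to 0: dissimilar sites
```

    The kernel equals 1 for identical vectors and decays toward 0 as they move apart, which is what lets an SVM draw non-linear boundaries between targets and non-targets.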

  2. MicroRNA-Target Network Inference and Local Network Enrichment Analysis Identify Two microRNA Clusters with Distinct Functions in Head and Neck Squamous Cell Carcinoma

    PubMed Central

    Sass, Steffen; Pitea, Adriana; Unger, Kristian; Hess, Julia; Mueller, Nikola S.; Theis, Fabian J.

    2015-01-01

    MicroRNAs represent ~22 nt long endogenous small RNA molecules that have been experimentally shown to regulate gene expression post-transcriptionally. One main interest in miRNA research is the investigation of their functional roles, which can typically be accomplished by identification of mi-/mRNA interactions and functional annotation of target gene sets. We here present a novel method “miRlastic”, which infers miRNA-target interactions using transcriptomic data as well as prior knowledge and performs functional annotation of target genes by exploiting the local structure of the inferred network. For the network inference, we applied linear regression modeling with elastic net regularization on matched microRNA and messenger RNA expression profiling data to perform feature selection on prior knowledge from sequence-based target prediction resources. The novelty of miRlastic inference originates in predicting data-driven intra-transcriptome regulatory relationships through feature selection. With synthetic data, we showed that miRlastic outperformed commonly used methods and was suitable even for low sample sizes. To gain insight into the functional role of miRNAs and to determine joint functional properties of miRNA clusters, we introduced a local enrichment analysis procedure. The principle of this procedure lies in identifying regions of high functional similarity by evaluating the shortest paths between genes in the network. We can finally assign functional roles to the miRNAs by taking their regulatory relationships into account. We thoroughly evaluated miRlastic on a cohort of head and neck cancer (HNSCC) patients provided by The Cancer Genome Atlas. We inferred an mi-/mRNA regulatory network for human papilloma virus (HPV)-associated miRNAs in HNSCC. The resulting network best enriched for experimentally validated miRNA-target interaction, when compared to common methods. 
    Finally, the local enrichment step identified two functional clusters of miRNAs that were predicted to mediate HPV-associated dysregulation in HNSCC. Our novel approach was able to characterize distinct pathway regulations from matched miRNA and mRNA data. An R package of miRlastic was made available through: http://icb.helmholtz-muenchen.de/mirlastic. PMID:26694379
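
    The elastic net regularization named in this abstract combines an L1 and an L2 penalty on the regression coefficients. The sketch below evaluates one common form of the objective (the glmnet-style parametrization with a mixing weight alpha); that parametrization, the function name and the toy data are assumptions, since the abstract does not give the formula.

```python
def elastic_net_objective(X, y, beta, lam, alpha):
    """Squared-error loss plus a mixed L1/L2 penalty.

    Assumed form: (1 / 2n) * ||y - X beta||^2
                  + lam * (alpha * ||beta||_1 + (1 - alpha) / 2 * ||beta||_2^2)
    """
    n = len(y)
    residual_sq = sum(
        (yi - sum(xij * bj for xij, bj in zip(xi, beta))) ** 2
        for xi, yi in zip(X, y)
    )
    l1 = sum(abs(b) for b in beta)
    l2 = sum(b * b for b in beta)
    return residual_sq / (2 * n) + lam * (alpha * l1 + (1 - alpha) / 2 * l2)


# Toy data: rows are samples, columns are candidate miRNA regulators.
X = [[1.0, 0.0], [0.0, 1.0]]
y = [1.0, 2.0]
print(elastic_net_objective(X, y, beta=[1.0, 2.0], lam=1.0, alpha=0.5))
```

    The L1 part drives some coefficients exactly to zero, which is how a regression of this kind can act as feature selection over candidate miRNA-target pairs.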

  3. MicroRNA-Target Network Inference and Local Network Enrichment Analysis Identify Two microRNA Clusters with Distinct Functions in Head and Neck Squamous Cell Carcinoma.

    PubMed

    Sass, Steffen; Pitea, Adriana; Unger, Kristian; Hess, Julia; Mueller, Nikola S; Theis, Fabian J

    2015-12-18

    MicroRNAs represent ~22 nt long endogenous small RNA molecules that have been experimentally shown to regulate gene expression post-transcriptionally. One main interest in miRNA research is the investigation of their functional roles, which can typically be accomplished by identification of mi-/mRNA interactions and functional annotation of target gene sets. We here present a novel method "miRlastic", which infers miRNA-target interactions using transcriptomic data as well as prior knowledge and performs functional annotation of target genes by exploiting the local structure of the inferred network. For the network inference, we applied linear regression modeling with elastic net regularization on matched microRNA and messenger RNA expression profiling data to perform feature selection on prior knowledge from sequence-based target prediction resources. The novelty of miRlastic inference originates in predicting data-driven intra-transcriptome regulatory relationships through feature selection. With synthetic data, we showed that miRlastic outperformed commonly used methods and was suitable even for low sample sizes. To gain insight into the functional role of miRNAs and to determine joint functional properties of miRNA clusters, we introduced a local enrichment analysis procedure. The principle of this procedure lies in identifying regions of high functional similarity by evaluating the shortest paths between genes in the network. We can finally assign functional roles to the miRNAs by taking their regulatory relationships into account. We thoroughly evaluated miRlastic on a cohort of head and neck cancer (HNSCC) patients provided by The Cancer Genome Atlas. We inferred an mi-/mRNA regulatory network for human papilloma virus (HPV)-associated miRNAs in HNSCC. The resulting network best enriched for experimentally validated miRNA-target interaction, when compared to common methods. 
    Finally, the local enrichment step identified two functional clusters of miRNAs that were predicted to mediate HPV-associated dysregulation in HNSCC. Our novel approach was able to characterize distinct pathway regulations from matched miRNA and mRNA data. An R package of miRlastic was made available through: http://icb.helmholtz-muenchen.de/mirlastic.
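
    The local enrichment procedure is described as evaluating shortest paths between genes in the inferred network. As a rough illustration only, the sketch below computes unweighted shortest-path lengths with breadth-first search on a toy network; the node names are invented, and the actual miRlastic procedure may weight or score paths differently.

```python
from collections import deque


def shortest_path_lengths(graph, source):
    """Breadth-first search: hop counts from source to every reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        for neighbour in graph.get(node, ()):
            if neighbour not in dist:
                dist[neighbour] = dist[node] + 1
                queue.append(neighbour)
    return dist


# Hypothetical undirected miRNA/gene network stored as adjacency lists.
network = {
    "miR-X": ["geneA", "geneB"],
    "geneA": ["miR-X", "geneC"],
    "geneB": ["miR-X"],
    "geneC": ["geneA"],
    "geneD": [],
}

print(shortest_path_lengths(network, "miR-X"))
```

    Genes separated by few hops would be treated as close in the network; unreachable nodes such as the isolated geneD simply do not appear in the result.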

  4. Reverse Transcription of a Self-Primed Retrotransposon Requires an RNA Structure Similar to the U5-IR Stem-Loop of Retroviruses

    PubMed Central

    Lin, Jia-Hwei; Levin, Henry L.

    1998-01-01

    An inverted repeat (IR) within the U5 region of the Rous sarcoma virus (RSV) mRNA forms a structure composed of a 7-bp stem and a 5-nucleotide (nt) loop. This U5-IR structure has been shown to be required for the initiation of reverse transcription. The mRNA of Tf1, long terminal repeat-containing retrotransposon from fission yeast (Schizosaccharomyces pombe) contains nucleotides with the potential to form a U5-IR stem-loop that is strikingly similar to that of RSV. The putative U5-IR stem-loop of Tf1 consists of a 7-bp stem and a 25-nt loop. Results from mutagenesis studies indicate that the U5-IR stem-loop in the mRNA of Tf1 does form and that it is required for Tf1 transposition. Although the loop is required for transposition, we were surprised that the specific sequence of the nucleotides within the loop was unimportant for function. Additional investigation indicates that the loss of transposition activity due to a reduction in the loop size to 6 nt could be rescued by increasing the GC content of the stem. This result indicates that the large loop in the Tf1 mRNA relative to that of the RSV allows the formation of the relatively weak U5-IR stem. The levels of Tf1 proteins expressed and the amounts of Tf1 RNA packaged into the virus-like particles were not affected by mutations in the U5-IR structure. However, all of the mutations in the U5-IR structure that caused defects in transposition produced low amounts of reverse transcripts. A unique feature in the initiation of Tf1 reverse transcription is that, instead of a tRNA, the first 11 nt of the Tf1 mRNA serve as the minus-strand primer. Analysis of the 5′ end of Tf1 mRNA revealed that the mutations in the U5-IR stem-loop that resulted in defects in reverse transcription caused a reduction in the cleavage activity required to generate the Tf1 primer. Our results indicate that the U5-IR stems of Tf1 and RSV are conserved in size, position, and function. PMID:9774699
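
    The stem-loop geometry described above (a 7-bp stem closing a loop of a given size) can be expressed as a simple check on a sequence. The sketch below uses an invented sequence and considers Watson-Crick pairs only; it is not the RSV or Tf1 sequence and ignores G-U wobble pairs and thermodynamic stability.

```python
WATSON_CRICK = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G")}


def forms_stem_loop(seq, stem_len, loop_len):
    """True if seq is exactly a stem of stem_len pairs around loop_len bases."""
    if len(seq) != 2 * stem_len + loop_len:
        return False
    # Base i of the 5' arm must pair with base -(i + 1) of the 3' arm.
    return all(
        (seq[i], seq[-(i + 1)]) in WATSON_CRICK for i in range(stem_len)
    )


# Invented example: 7-nt 5' arm, 5-nt loop, 7-nt 3' arm (an inverted repeat).
example = "GCGCGAU" + "AAAAA" + "AUCGCGC"
print(forms_stem_loop(example, stem_len=7, loop_len=5))
```

    Because only the arms are compared, the loop sequence can be changed freely without affecting the result, which mirrors the observation that the specific loop nucleotides were unimportant for function.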

  5. The Diversity of Ribonuclease P: Protein and RNA Catalysts with Analogous Biological Functions

    PubMed Central

    Klemm, Bradley P.; Wu, Nancy; Chen, Yu; Liu, Xin; Kaitany, Kipchumba J.; Howard, Michael J.; Fierke, Carol A.

    2016-01-01

    Ribonuclease P (RNase P) is an essential endonuclease responsible for catalyzing 5’ end maturation in precursor transfer RNAs. Since its discovery in the 1970s, RNase P enzymes have been identified and studied throughout the three domains of life. Interestingly, RNase P is either RNA-based, with a catalytic RNA subunit, or a protein-only (PRORP) enzyme with differential evolutionary distribution. The available structural data, including the active site data, provides insight into catalysis and substrate recognition. The hydrolytic and kinetic mechanisms of the two forms of RNase P enzymes are similar, yet features unique to the RNA-based and PRORP enzymes are consistent with different evolutionary origins. The various RNase P enzymes, in addition to their primary role in tRNA 5’ maturation, catalyze cleavage of a variety of alternative substrates, indicating a diversification of RNase P function in vivo. The review concludes with a discussion of recent advances and interesting research directions in the field. PMID:27187488

  6. Circular RNA biogenesis can proceed through an exon-containing lariat precursor

    PubMed Central

    Barrett, Steven P; Wang, Peter L; Salzman, Julia

    2015-01-01

    Pervasive expression of circular RNA is a recently discovered feature of eukaryotic gene expression programs, yet its function remains largely unknown. The presumed biogenesis of these RNAs involves a non-canonical ‘backsplicing’ event. Recent studies in mammalian cell culture posit that backsplicing is facilitated by inverted repeats flanking the circularized exon(s). Although such sequence elements are common in mammals, they are rare in lower eukaryotes, making current models insufficient to describe circularization. Through systematic splice site mutagenesis and the identification of splicing intermediates, we show that circular RNA in Schizosaccharomyces pombe is generated through an exon-containing lariat precursor. Furthermore, we have performed high-throughput and comprehensive mutagenesis of a circle-forming exon, which enabled us to discover a systematic effect of exon length on RNA circularization. Our results uncover a mechanism for circular RNA biogenesis that may account for circularization in genes that lack noticeable flanking intronic secondary structure. DOI: http://dx.doi.org/10.7554/eLife.07540.001 PMID:26057830

  7. Post-transcriptional control by bacteriophage T4: mRNA decay and inhibition of translation initiation

    PubMed Central

    2010-01-01

    Over 50 years of biological research with bacteriophage T4 includes notable discoveries in post-transcriptional control, including the genetic code, mRNA, and tRNA; the very foundations of molecular biology. In this review we compile the past 10 - 15 year literature on RNA-protein interactions with T4 and some of its related phages, with particular focus on advances in mRNA decay and processing, and on translational repression. Binding of T4 proteins RegB, RegA, gp32 and gp43 to their cognate target RNAs has been characterized. For several of these, further study is needed for an atomic-level perspective, where resolved structures of RNA-protein complexes are awaiting investigation. Other features of post-transcriptional control are also summarized. These include: RNA structure at translation initiation regions that either inhibit or promote translation initiation; programmed translational bypassing, where T4 orchestrates ribosome bypass of a 50 nucleotide mRNA sequence; phage exclusion systems that involve T4-mediated activation of a latent endoribonuclease (PrrC) and cofactor-assisted activation of EF-Tu proteolysis (Gol-Lit); and potentially important findings on ADP-ribosylation (by Alt and Mod enzymes) of ribosome-associated proteins that might broadly impact protein synthesis in the infected cell. Many of these problems can continue to be addressed with T4, whereas the growing database of T4-related phage genome sequences provides new resources and potentially new phage-host systems to extend the work into a broader biological, evolutionary context. PMID:21129205

  8. SU-E-T-338: Ultrastable PRNA 3WJ Nanoparticles as Potential I-125 and C-131 Carriers for Targeted Radiation Therapy

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Luo, W; Li, H; Guo, P

    2014-06-01

    Purpose: To study the feasibility of using the pRNA 3WJ nanoparticles to carry I-125 or Cs-131 to target and treat cancer. As the first step, we investigated the stabilities of pRNA 3WJ nanoparticles that are essential for cancer targeting and treatment in this study. Methods: The thermodynamic stability of assembled RNA 3WJ nanoparticles was studied using the TGGE system. The nanoparticles were irradiated with I-125 or Cs-131 radioactive sources that were immersed in the RNA nanoparticle/DNA structure sample liquid contained in a small vial. The irradiation of the RNA samples was performed for different time periods and doses. The purpose was to distinguish the effects of radiation on DNA and RNA structures. Unradiated samples were used as control. Results: RNA nanoparticles were formed by mixing three pieces of oligos, 3WJa, 3WJb, and 3WJc at 1:1:1 molar ratio. Figure 4 demonstrates that 2′-F modified 3WJ nanoparticles remained stable at temperatures as high as 66.8 ± 2°C, and exhibited melting temperatures of 71 ± 2°C. The radiation stability test was performed with I-125 and Cs-131 irradiation. Several DNA structures including plasmids were included as control. The first test introduced I-125 and a low dose of 1 Gy to both RNA and DNA samples, but no change was observed. When the dose was increased to 30 Gy, DNA was damaged while RNA remained unchanged. Three tests were also conducted with Cs-131 with 7 Gy, 21 Gy, 30 Gy, and 89 Gy, and the results were similar to those with I-125. Conclusion: pRNA 3WJ nanoparticles are able to form efficiently by one-pot self-assembly. They remained stable at high temperatures and high therapeutic doses over a long time. These unique features suggest that RNA 3WJ nanoparticles have the potential to be used for targeted radiation therapy for cancer treatment.

  9. Prediction of bacterial small RNAs in the RsmA (CsrA) and ToxT pathways: a machine learning approach.

    PubMed

    Fakhry, Carl Tony; Kulkarni, Prajna; Chen, Ping; Kulkarni, Rahul; Zarringhalam, Kourosh

    2017-08-22

    Small RNAs (sRNAs) constitute an important class of post-transcriptional regulators that control critical cellular processes in bacteria. Recent research using high-throughput transcriptomic approaches has led to a dramatic increase in the discovery of bacterial sRNAs. However, it is generally believed that the currently identified sRNAs constitute a limited subset of the bacterial sRNA repertoire. In several cases, sRNAs belonging to a specific class are already known and the challenge is to identify additional sRNAs belonging to the same class. In such cases, machine-learning approaches can be used to predict novel sRNAs in a given class. In this work, we develop novel bioinformatics approaches that integrate sequence and structure-based features to train machine-learning models for the discovery of bacterial sRNAs. We show that features derived from recurrent structural motifs in the ensemble of low energy secondary structures can distinguish the RNA classes with high accuracy. We apply this approach to predict new members in two broad classes of bacterial small RNAs: 1) sRNAs that bind to the RNA-binding protein RsmA/CsrA in diverse bacterial species and 2) sRNAs regulated by the master regulator of virulence, ToxT, in Vibrio cholerae. The involvement of sRNAs in bacterial adaptation to changing environments is an increasingly recurring theme in current research in microbiology. It is likely that future research, combining experimental and computational approaches, will discover many more examples of sRNAs as components of critical regulatory pathways in bacteria. We have developed a novel approach for prediction of small RNA regulators in important bacterial pathways. This approach can be applied to specific classes of sRNAs for which several members have been identified and the challenge is to identify additional sRNAs.

  10. [Bioinformatics Analysis of Clustered Regularly Interspaced Short Palindromic Repeats in the Genomes of Shigella].

    PubMed

    Wang, Pengfei; Wang, Yingfang; Duan, Guangcai; Xue, Zerun; Wang, Linlin; Guo, Xiangjiao; Yang, Haiyan; Xi, Yuanlin

    2015-04-01

    This study aimed to explore the features of clustered regularly interspaced short palindromic repeats (CRISPR) structures in Shigella by using bioinformatics. We used bioinformatics methods, including BLAST, alignment and RNA structure prediction, to analyze the CRISPR structures of Shigella genomes. The results showed that CRISPRs existed in the four groups of Shigella, and the flanking sequences upstream of the CRISPRs could be classified into the same groups as those downstream. We also found some relatively conserved palindromic motifs in the leader sequences. Repeat sequences fell into the same groups as their corresponding flanking sequences, and could be classified into two different types by their RNA secondary structures, which contain "stem" and "ring". Some spacers were found to be homologous to partial sequences of plasmids or phages. The study indicated that there were correlations between repeat sequences and flanking sequences, and the repeats might act as a kind of recognition mechanism to mediate the interaction between foreign genetic elements and Cas proteins.

  11. Macromolecular structures probed by combining single-shot free-electron laser diffraction with synchrotron coherent X-ray imaging.

    PubMed

    Gallagher-Jones, Marcus; Bessho, Yoshitaka; Kim, Sunam; Park, Jaehyun; Kim, Sangsoo; Nam, Daewoong; Kim, Chan; Kim, Yoonhee; Noh, Do Young; Miyashita, Osamu; Tama, Florence; Joti, Yasumasa; Kameshima, Takashi; Hatsui, Takaki; Tono, Kensuke; Kohmura, Yoshiki; Yabashi, Makina; Hasnain, S Samar; Ishikawa, Tetsuya; Song, Changyong

    2014-05-02

    Nanostructures formed from biological macromolecular complexes utilizing the self-assembly properties of smaller building blocks such as DNA and RNA hold promise for many applications, including sensing and drug delivery. New tools are required for their structural characterization. Intense, femtosecond X-ray pulses from X-ray free-electron lasers enable single-shot imaging allowing for instantaneous views of nanostructures at ambient temperatures. When combined judiciously with synchrotron X-rays of a complementary nature, suitable for observing steady-state features, it is possible to perform ab initio structural investigation. Here we demonstrate a successful combination of femtosecond X-ray single-shot diffraction with an X-ray free-electron laser and coherent diffraction imaging with synchrotron X-rays to provide an insight into the nanostructure formation of a biological macromolecular complex: RNA interference microsponges. This newly introduced multimodal analysis with coherent X-rays can be applied to unveil nano-scale structural motifs from functional nanomaterials or biological nanocomplexes, without requiring a priori knowledge.

  12. Structure of the Triatoma virus capsid

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Squires, Gaëlle; Pous, Joan; Agirre, Jon

    The crystallographic structure of TrV shows specific morphological and functional features that clearly distinguish it from the type species of the Cripavirus genus, CrPV. The members of the Dicistroviridae family are non-enveloped positive-sense single-stranded RNA (+ssRNA) viruses pathogenic to beneficial arthropods as well as insect pests of medical importance. Triatoma virus (TrV), a member of this family, infects several species of triatomine insects (popularly named kissing bugs), which are vectors for human trypanosomiasis, more commonly known as Chagas disease. The potential use of dicistroviruses as biological control agents has drawn considerable attention in the past decade, and several viruses of this family have been identified, with their targets covering honey bees, aphids and field crickets, among others. Here, the crystal structure of the TrV capsid at 2.5 Å resolution is reported, showing that as expected it is very similar to that of Cricket paralysis virus (CrPV). Nevertheless, a number of distinguishing structural features support the introduction of a new genus (Triatovirus; type species TrV) under the Dicistroviridae family. The most striking differences are the absence of icosahedrally ordered VP4 within the infectious particle and the presence of prominent projections that surround the fivefold axis. Furthermore, the structure identifies a second putative autoproteolytic DDF motif in protein VP3, in addition to the conserved one in VP1 which is believed to be responsible for VP0 cleavage during capsid maturation. The potential meaning of these new findings is discussed.

  13. Insights into peptide nucleic acid (PNA) structural features: The crystal structure of a d-lysine-based chiral PNA–DNA duplex

    PubMed Central

    Menchise, Valeria; De Simone, Giuseppina; Tedeschi, Tullia; Corradini, Roberto; Sforza, Stefano; Marchelli, Rosangela; Capasso, Domenica; Saviano, Michele; Pedone, Carlo

    2003-01-01

    Peptide nucleic acids (PNAs) are oligonucleotide analogues in which the sugar-phosphate backbone has been replaced by a pseudopeptide skeleton. They bind DNA and RNA with high specificity and selectivity, leading to PNA–RNA and PNA–DNA hybrids more stable than the corresponding nucleic acid complexes. The binding affinity and selectivity of PNAs for nucleic acids can be modified by the introduction of stereogenic centers (such as d-Lys-based units) into the PNA backbone. To investigate the structural features of chiral PNAs, the structure of a PNA decamer containing three d-Lys-based monomers (namely H-GpnTpnApnGpnAdlTdlCdlApnCpnTpn-NH2, in which pn represents a pseudopeptide link and dl represents a d-Lys analogue) hybridized with its complementary antiparallel DNA has been solved at a 1.66-Å resolution by means of a single-wavelength anomalous diffraction experiment on a brominated derivative. The d-Lys-based chiral PNA–DNA (LPD) heteroduplex adopts the so-called P-helix conformation. From the substantial similarity between the PNA conformation in LPD and the conformations observed in other PNA structures, it can be concluded that PNAs possess intrinsic conformational preferences for the P-helix, and that their flexibility is rather restricted. The conformational rigidity of PNAs is enhanced by the presence of the chiral centers, limiting the ability of PNA strands to adopt other conformations and, ultimately, increasing the selectivity in molecular recognition. PMID:14512516

  14. Parallel computation of genome-scale RNA secondary structure to detect structural constraints on human genome.

    PubMed

    Kawaguchi, Risa; Kiryu, Hisanori

    2016-05-06

    RNA secondary structure around splice sites is known to assist normal splicing by promoting spliceosome recognition. However, analyzing the structural properties of entire intronic regions or pre-mRNA sequences has been difficult hitherto, owing to serious experimental and computational limitations, such as low read coverage and numerical problems. Our novel software, "ParasoR", is designed to run on a computer cluster and enables the exact computation of various structural features of long RNA sequences under the constraint of maximal base-pairing distance. ParasoR divides dynamic programming (DP) matrices into smaller pieces, such that each piece can be computed by a separate computer node without losing the connectivity information between the pieces. ParasoR directly computes the ratios of DP variables to avoid the reduction of numerical precision caused by the cancellation of a large number of Boltzmann factors. The structural preferences of mRNAs computed by ParasoR show a high concordance with those determined by high-throughput sequencing analyses. Using ParasoR, we investigated the global structural preferences of transcribed regions in the human genome. A genome-wide folding simulation indicated that transcribed regions are significantly more structural than intergenic regions after removing repeat sequences and k-mer frequency bias. In particular, we observed a highly significant preference for base pairing over entire intronic regions as compared to their antisense sequences, as well as to intergenic regions. A comparison between pre-mRNAs and mRNAs showed that coding regions become more accessible after splicing, indicating constraints for translational efficiency. Such changes are correlated with gene expression levels, as well as GC content, and are enriched among genes associated with cytoskeleton and kinase functions.
    We have shown that ParasoR is very useful for analyzing the structural properties of long RNA sequences such as mRNAs, pre-mRNAs, and long non-coding RNAs whose lengths can be more than a million bases in the human genome. In our analyses, transcribed regions including introns are indicated to be subject to various types of structural constraints that cannot be explained from simple sequence composition biases. ParasoR is freely available at https://github.com/carushi/ParasoR.
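
    To make the idea of a maximal base-pairing distance concrete, the sketch below applies such a constraint to a much simpler algorithm than the one ParasoR uses: Nussinov-style base-pair maximization. It is not ParasoR's partition-function computation and does not reproduce its ratio-based numerics; the function name, the pairing rules and the minimum loop size are assumptions chosen for illustration.

```python
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}


def max_pairs(seq, max_span, min_loop=3):
    """Maximum number of nested base pairs (k, j) with j - k <= max_span.

    min_loop is the minimum number of unpaired bases enclosed by a pair.
    """
    n = len(seq)
    if n == 0:
        return 0
    dp = [[0] * n for _ in range(n)]  # dp[i][j]: best score on seq[i..j]
    for length in range(1, n):
        for i in range(n - length):
            j = i + length
            best = dp[i][j - 1]  # case 1: base j stays unpaired
            # case 2: base j pairs with k, respecting both distance limits
            for k in range(max(i, j - max_span), j - min_loop):
                if (seq[k], seq[j]) in PAIRS:
                    left = dp[i][k - 1] if k > i else 0
                    best = max(best, left + 1 + dp[k + 1][j - 1])
            dp[i][j] = best
    return dp[0][n - 1]


hairpin = "GGGAAACCC"
print(max_pairs(hairpin, max_span=9))  # full stem allowed
print(max_pairs(hairpin, max_span=4))  # only the innermost pair fits
```

    Bounding the pairing distance means each position only needs to look back a fixed window, which is the property that makes it feasible to split very long sequences into pieces for separate compute nodes.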

  15. The crystal structure of mammalian inositol 1,3,4,5,6-pentakisphosphate 2-kinase reveals a new zinc-binding site and key features for protein function

    PubMed Central

    Franco-Echevarría, Elsa; Sanz-Aparicio, Julia; Brearley, Charles A.; González-Rubio, Juana M.; González, Beatriz

    2017-01-01

    Inositol 1,3,4,5,6-pentakisphosphate 2-kinases (IP5 2-Ks) are part of a family of enzymes in charge of synthesizing inositol hexakisphosphate (IP6) in eukaryotic cells. This protein and its product IP6 present many roles in cells, participating in mRNA export, embryonic development, and apoptosis. We reported previously that the full-length IP5 2-K from Arabidopsis thaliana is a zinc metallo-enzyme, including two separated lobes (the N- and C-lobes). We have also shown conformational changes in IP5 2-K and have identified the residues involved in substrate recognition and catalysis. However, the specific features of mammalian IP5 2-Ks remain unknown. To this end, we report here the first structure for a murine IP5 2-K in complex with ATP/IP5 or IP6. Our structural findings indicated that the general folding in N- and C-lobes is conserved with A. thaliana IP5 2-K. A helical scaffold in the C-lobe constitutes the inositol phosphate-binding site, which, along with the participation of the N-lobe, endows high specificity to this protein. However, we also noted large structural differences between the orthologues from these two eukaryotic kingdoms. These differences include a novel zinc-binding site and regions unique to the mammalian IP5 2-K, as an unexpected basic patch on the protein surface. In conclusion, our findings have uncovered distinct features of a mammalian IP5 2-K and set the stage for investigations into protein-protein or protein-RNA interactions important for IP5 2-K function and activity. PMID:28450399

  16. Phylogenetic origins of the plant mitochondrion based on a comparative analysis of 5S ribosomal RNA sequences

    NASA Technical Reports Server (NTRS)

    Villanueva, E.; Delihas, N.; Luehrsen, K. R.; Fox, G. E.; Gibson, J.

    1985-01-01

    The complete nucleotide sequences of 5S ribosomal RNAs from Rhodocyclus gelatinosa, Rhodobacter sphaeroides, and Pseudomonas cepacia were determined. Comparisons of these 5S RNA sequences show that rather than being phylogenetically related to one another, the two photosynthetic bacterial 5S RNAs share more sequence and signature homology with the RNAs of two nonphotosynthetic strains. Rhodobacter sphaeroides is specifically related to Paracoccus denitrificans and Rc. gelatinosa is related to Ps. cepacia. These results support earlier 16S ribosomal RNA studies and add two important groups to the 5S RNA data base. Unique 5S RNA structural features previously found in P. denitrificans are present also in the 5S RNA of Rb. sphaeroides; these provide the basis for subdivisional signatures. The immediate consequence of obtaining these new sequences is that it is possible to clarify the phylogenetic origins of the plant mitochondrion. In particular, a close phylogenetic relationship is found between the plant mitochondria and members of the alpha subdivision of the purple photosynthetic bacteria, namely, Rb. sphaeroides, P. denitrificans, and Rhodospirillum rubrum.

  17. LBSizeCleav: improved support vector machine (SVM)-based prediction of Dicer cleavage sites using loop/bulge length.

    PubMed

    Bao, Yu; Hayashida, Morihiro; Akutsu, Tatsuya

    2016-11-25

    Dicer is necessary for the process of mature microRNA (miRNA) formation because the Dicer enzyme cleaves pre-miRNA correctly to generate miRNA with correct seed regions. Nonetheless, the mechanism underlying the selection of a Dicer cleavage site is still not fully understood. To date, several studies have been conducted to solve this problem, for example, a recent discovery indicates that the loop/bulge structure plays a central role in the selection of Dicer cleavage sites. In accordance with this breakthrough, a support vector machine (SVM)-based method called PHDCleav was developed to predict Dicer cleavage sites which outperforms other methods based on random forest and naive Bayes. PHDCleav, however, tests only whether a position in the shift window belongs to a loop/bulge structure. In this paper, we used the length of loop/bulge structures (in addition to their presence or absence) to develop an improved method, LBSizeCleav, for predicting Dicer cleavage sites. To evaluate our method, we used 810 empirically validated sequences of human pre-miRNAs and performed fivefold cross-validation. In both 5p and 3p arms of pre-miRNAs, LBSizeCleav showed greater prediction accuracy than PHDCleav did. This result suggests that the length of loop/bulge structures is useful for prediction of Dicer cleavage sites. We developed a novel algorithm for feature space mapping based on the length of a loop/bulge for predicting Dicer cleavage sites. The better performance of our method indicates the usefulness of the length of loop/bulge structures for such predictions.
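
    The loop/bulge-length idea described above can be illustrated with a minimal sketch. It assumes the pre-miRNA secondary structure is available in dot-bracket notation (the abstract does not state the input format), and the function names and window size are hypothetical. For each position in a window around a candidate cleavage site, the feature is the length of the unpaired run containing that position (0 if the position is paired), rather than a binary paired/unpaired flag. The sketch does not distinguish hairpin loops from bulges and does not include the SVM itself.

```python
def unpaired_run_lengths(structure):
    """For each position, return the length of the contiguous unpaired ('.')
    run that contains it, or 0 if the position is base-paired."""
    lengths = [0] * len(structure)
    i = 0
    while i < len(structure):
        if structure[i] == ".":
            j = i
            while j < len(structure) and structure[j] == ".":
                j += 1
            for k in range(i, j):
                lengths[k] = j - i  # every position in the run gets the run length
            i = j
        else:
            i += 1
    return lengths


def window_features(structure, site, half_width=3):
    """Hypothetical feature vector: unpaired-run lengths in a window centred on
    a candidate cleavage site; positions outside the sequence are padded with 0."""
    lengths = unpaired_run_lengths(structure)
    return [
        lengths[pos] if 0 <= pos < len(lengths) else 0
        for pos in range(site - half_width, site + half_width + 1)
    ]
```

    A vector like this could be fed to any standard classifier; the encoding actually used by LBSizeCleav may differ.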

  18. High-Throughput Genetic Identification of Functionally Important Regions of the Yeast DEAD-Box Protein Mss116p

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Mohr, Georg; Del Campo, Mark; Turner, Kathryn G.

    The Saccharomyces cerevisiae DEAD-box protein Mss116p is a general RNA chaperone that functions in splicing mitochondrial group I and group II introns. Recent X-ray crystal structures of Mss116p in complex with ATP analogs and single-stranded RNA show that the helicase core induces a bend in the bound RNA, as in other DEAD-box proteins, while a C-terminal extension (CTE) induces a second bend, resulting in RNA crimping. Here, we illuminate these structures by using high-throughput genetic selections, unigenic evolution, and analyses of in vivo splicing activity to comprehensively identify functionally important regions and permissible amino acid substitutions throughout Mss116p. The functionally important regions include those containing conserved sequence motifs involved in ATP and RNA binding or interdomain interactions, as well as previously unidentified regions, including surface loops that may function in protein-protein interactions. The genetic selections recapitulate major features of the conserved helicase motifs seen in other DEAD-box proteins but also show surprising variations, including multiple novel variants of motif III (SAT). Patterns of amino acid substitutions indicate that the RNA bend induced by the helicase core depends on ionic and hydrogen-bonding interactions with the bound RNA; identify a subset of critically interacting residues; and indicate that the bend induced by the CTE results primarily from a steric block. Finally, we identified two conserved regions - one the previously noted post II region in the helicase core and the other in the CTE - that may help displace or sequester the opposite RNA strand during RNA unwinding.

  19. Deriving quantitative dynamics information for proteins and RNAs using ROTDIF with a graphical user interface.

    PubMed

    Berlin, Konstantin; Longhini, Andrew; Dayie, T Kwaku; Fushman, David

    2013-12-01

    To facilitate rigorous analysis of molecular motions in proteins, DNA, and RNA, we present a new version of ROTDIF, a program for determining the overall rotational diffusion tensor from single- or multiple-field nuclear magnetic resonance relaxation data. We introduce four major features that expand the program's versatility and usability. The first feature is the ability to analyze, separately or together, (13)C and/or (15)N relaxation data collected at a single or multiple fields. A significant improvement in the accuracy compared to direct analysis of R2/R1 ratios, especially critical for analysis of (13)C relaxation data, is achieved by subtracting high-frequency contributions to relaxation rates. The second new feature is an improved method for computing the rotational diffusion tensor in the presence of biased errors, such as large conformational exchange contributions, that significantly enhances the accuracy of the computation. The third new feature is the integration of the domain alignment and docking module for relaxation-based structure determination of multi-domain systems. Finally, to improve accessibility to all the program features, we introduced a graphical user interface that simplifies and speeds up the analysis of the data. Written in Java, the new ROTDIF can run on virtually any computer platform. In addition, the new ROTDIF achieves an order of magnitude speedup over the previous version by implementing a more efficient deterministic minimization algorithm. We not only demonstrate the improvement in accuracy and speed of the new algorithm for synthetic and experimental (13)C and (15)N relaxation data for several proteins and nucleic acids, but also show that careful analysis required especially for characterizing RNA dynamics allowed us to uncover subtle conformational changes in RNA as a function of temperature that were opaque to previous analysis.

  20. ToNER: A tool for identifying nucleotide enrichment signals in feature-enriched RNA-seq data

    PubMed Central

    Promworn, Yuttachon; Kaewprommal, Pavita; Shaw, Philip J.; Intarapanich, Apichart; Tongsima, Sissades

    2017-01-01

    Background: Biochemical methods are available for enriching 5′ ends of RNAs in prokaryotes, which are employed in the differential RNA-seq (dRNA-seq) and the more recent Cappable-seq protocols. Computational methods are needed to locate RNA 5′ ends from these data by statistical analysis of the enrichment. Although statistical-based analysis methods have been developed for dRNA-seq, they may not be suitable for Cappable-seq data. The more efficient enrichment method employed in Cappable-seq compared with dRNA-seq could affect data distribution and thus algorithm performance. Results: We present Transformation of Nucleotide Enrichment Ratios (ToNER), a tool for statistical modeling of enrichment from RNA-seq data obtained from enriched and unenriched libraries. The tool calculates nucleotide enrichment scores and determines the global transformation for fitting to the normal distribution using the Box-Cox procedure. From the transformed distribution, sites of significant enrichment are identified. To increase power of detection, meta-analysis across experimental replicates is offered. We tested the tool on Cappable-seq and dRNA-seq data for identifying Escherichia coli transcript 5′ ends and compared the results with those from the TSSAR tool, which is designed for analyzing dRNA-seq data. When combining results across Cappable-seq replicates, ToNER detects more known transcript 5′ ends than TSSAR. In general, the transcript 5′ ends detected by ToNER but not TSSAR occur in regions which cannot be locally modeled by TSSAR. Conclusion: ToNER uses a simple yet robust statistical modeling approach, which can be used for detecting RNA 5′ ends from Cappable-seq data, in particular when combining information from experimental replicates. The ToNER tool could potentially be applied for analyzing other RNA-seq datasets in which enrichment for other structural features of RNA is employed. The program is freely available for download at the ToNER webpage (http://www4a.biotec.or.th/GI/tools/toner) and GitHub repository (https://github.com/PavitaKae/ToNER). PMID:28542466
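
    The enrichment-ratio and Box-Cox steps named in this abstract can be sketched as follows. This is not the ToNER implementation: the pseudocount, the lambda grid, the z-score cutoff, and all function names are assumptions made for illustration, and a plain z-score threshold stands in for whatever significance test the tool applies. The Box-Cox parameter is chosen by maximizing the standard profile log-likelihood over a fixed grid.

```python
import math


def box_cox(values, lam):
    """Box-Cox transform of positive values; lam == 0 reduces to the natural log."""
    if abs(lam) < 1e-12:
        return [math.log(v) for v in values]
    return [(v ** lam - 1.0) / lam for v in values]


def log_likelihood(values, lam):
    """Profile log-likelihood of the Box-Cox parameter under a normal model."""
    n = len(values)
    y = box_cox(values, lam)
    mean = sum(y) / n
    var = sum((t - mean) ** 2 for t in y) / n
    return -0.5 * n * math.log(var) + (lam - 1.0) * sum(math.log(v) for v in values)


def best_lambda(values, grid=None):
    """Grid search for lambda (grid of -2.0 .. 2.0 in steps of 0.1 is an assumption)."""
    grid = grid or [i / 10.0 for i in range(-20, 21)]
    return max(grid, key=lambda lam: log_likelihood(values, lam))


def enriched_sites(enriched, unenriched, z_cutoff=2.0, pseudocount=1.0):
    """Indices whose transformed enrichment ratio lies more than z_cutoff
    sample standard deviations above the mean."""
    ratios = [(e + pseudocount) / (u + pseudocount) for e, u in zip(enriched, unenriched)]
    y = box_cox(ratios, best_lambda(ratios))
    mean = sum(y) / len(y)
    sd = math.sqrt(sum((t - mean) ** 2 for t in y) / (len(y) - 1))
    return [i for i, t in enumerate(y) if (t - mean) / sd > z_cutoff]
```

    The inputs are per-nucleotide read counts from the enriched and unenriched libraries; the sketch requires the ratios to vary, since a constant ratio gives zero variance.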

  1. Genetic and biochemical findings in Chinese children with Leigh syndrome.

    PubMed

    Ma, Yan-Yan; Wu, Tong-Fei; Liu, Yu-Peng; Wang, Qiao; Song, Jin-Qing; Li, Xi-Yuan; Shi, Xiu-Yu; Zhang, Wei-Na; Zhao, Meng; Hu, Lin-Yan; Yang, Yan-Ling; Zou, Li-Ping

    2013-11-01

    This study investigated the genetic and enzymological features of Leigh syndrome due to respiratory chain complex deficiency in Chinese patients. The clinical features of 75 patients were recorded. Mitochondrial respiratory chain enzyme activities were determined via spectrophotometry. Mitochondrial gene sequence analysis was performed in 23 patients. Five core pedigrees were investigated via restriction fragment length polymorphism and gene sequencing. Psychomotor retardation (55%), motor regression (20%), weakness (29%), and epilepsy (25%) were the most frequent manifestations. Sixty-four patients (85.3%) had isolated respiratory complex deficiencies: complex I was seen in 28 patients (37.3%); complex II, seven (9.3%); complex III, six (8%); complex IV, ten (13.3%); and complex V, 13 patients (17.3%). Eleven patients (14.7%) had combined complex deficiencies. Mitochondrial DNA mutations were detected in 10 patients. Eight point mutations were found in mitochondrial structural genes: m.4833A>G in ND2, m.10191T>C in ND3, m.12338T>C and m.13513G>A in ND5, m.14502T>C and m.14487T>C in ND6, m.8108A>G in COXII, and m.8993T>G in ATPase6. Three mutations were found in tRNA genes: m.4395A>G in tRNA-Gln, m.10454T>C in tRNA-Arg, and m.5587T>C in tRNA-Ala. One patient and their mother both had the m.12338T>C and m.8993T>C mutations. In conclusion, mitochondrial respiratory chain complex I deficiency and structural gene mutations frequently occur in Chinese Leigh syndrome patients. Copyright © 2013 Elsevier Ltd. All rights reserved.

  2. ConSurf 2016: an improved methodology to estimate and visualize evolutionary conservation in macromolecules

    PubMed Central

    Ashkenazy, Haim; Abadi, Shiran; Martz, Eric; Chay, Ofer; Mayrose, Itay; Pupko, Tal; Ben-Tal, Nir

    2016-01-01

    The degree of evolutionary conservation of an amino acid in a protein or a nucleic acid in DNA/RNA reflects a balance between its natural tendency to mutate and the overall need to retain the structural integrity and function of the macromolecule. The ConSurf web server (http://consurf.tau.ac.il), established over 15 years ago, analyses the evolutionary pattern of the amino/nucleic acids of the macromolecule to reveal regions that are important for structure and/or function. Starting from a query sequence or structure, the server automatically collects homologues, infers their multiple sequence alignment and reconstructs a phylogenetic tree that reflects their evolutionary relations. These data are then used, within a probabilistic framework, to estimate the evolutionary rates of each sequence position. Here we introduce several new features into ConSurf, including automatic selection of the best evolutionary model used to infer the rates, the ability to homology-model query proteins, prediction of the secondary structure of query RNA molecules from sequence, the ability to view the biological assembly of a query (in addition to the single chain), mapping of the conservation grades onto 2D RNA models and an advanced view of the phylogenetic tree that enables interactively rerunning ConSurf with the taxa of a sub-tree. PMID:27166375

  3. Novel determinants of mammalian primary microRNA processing revealed by systematic evaluation of hairpin-containing transcripts and human genetic variation

    PubMed Central

    Roden, Christine; Gaillard, Jonathan; Kanoria, Shaveta; Rennie, William; Barish, Syndi; Cheng, Jijun; Pan, Wen; Liu, Jun; Cotsapas, Chris; Ding, Ye; Lu, Jun

    2017-01-01

    Mature microRNAs (miRNAs) are processed from hairpin-containing primary miRNAs (pri-miRNAs). However, rules that distinguish pri-miRNAs from other hairpin-containing transcripts in the genome are incompletely understood. By developing a computational pipeline to systematically evaluate 30 structural and sequence features of mammalian RNA hairpins, we report several new rules that are preferentially utilized in miRNA hairpins and govern efficient pri-miRNA processing. We propose that a hairpin stem length of 36 ± 3 nt is optimal for pri-miRNA processing. We identify two bulge-depleted regions on the miRNA stem, located ∼16–21 nt and ∼28–32 nt from the base of the stem, that are less tolerant of unpaired bases. We further show that the CNNC primary sequence motif selectively enhances the processing of optimal-length hairpins. We predict that a small but significant fraction of human single-nucleotide polymorphisms (SNPs) alter pri-miRNA processing, and confirm several predictions experimentally including a disease-causing mutation. Our study enhances the rules governing mammalian pri-miRNA processing and suggests a diverse impact of human genetic variation on miRNA biogenesis. PMID:28087842
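
    Two of the rules reported above, the CNNC sequence motif and the 36 ± 3 nt optimal stem length, are simple enough to express directly. The sketch below is illustrative only: the function names are hypothetical, the stem length is assumed to have been measured elsewhere, and no assumption is made about where the motif must lie relative to the hairpin.

```python
import re


def cnnc_positions(seq):
    """0-based start positions of every CNNC occurrence (overlaps included).
    DNA input is accepted and converted to RNA."""
    seq = seq.upper().replace("T", "U")
    # The lookahead lets overlapping matches such as CCCCC report both starts.
    return [m.start() for m in re.finditer(r"(?=C[ACGU]{2}C)", seq)]


def stem_length_is_optimal(stem_len_nt, center=36, tolerance=3):
    """True if a hairpin stem length falls within center ± tolerance nt."""
    return abs(stem_len_nt - center) <= tolerance
```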

  4. Role of RNA interference (RNAi) in the Moss Physcomitrella patens.

    PubMed

    Arif, Muhammad Asif; Frank, Wolfgang; Khraiwesh, Basel

    2013-01-14

    RNA interference (RNAi) is a mechanism that regulates genes by either transcriptional (TGS) or posttranscriptional gene silencing (PTGS), required for genome maintenance and proper development of an organism. Small non-coding RNAs are the key players in RNAi and have been intensively studied in eukaryotes. In plants, several classes of small RNAs with specific sizes and dedicated functions have evolved. The major classes of small RNAs include microRNAs (miRNAs) and small interfering RNAs (siRNAs), which differ in their biogenesis. miRNAs are synthesized from a short hairpin structure while siRNAs are derived from long double-stranded RNAs (dsRNA). Both miRNAs and siRNAs control the expression of cognate target RNAs by binding to reverse complementary sequences, mediating cleavage or translational inhibition of the target RNA. They also act on the DNA and cause epigenetic changes such as DNA methylation and histone modifications. In recent years, the analysis of plant RNAi pathways was extended to the bryophyte Physcomitrella patens, a non-flowering, non-vascular ancient land plant that diverged from the lineage of seed plants approximately 450 million years ago. Based on a number of characteristic features and its key phylogenetic position in land plant evolution, P. patens emerged as a plant model species to address basic as well as applied topics in plant biology. Here we summarize the current knowledge on the role of RNAi in P. patens, which shows functional overlap with RNAi pathways from seed plants as well as unique features specific to this species.

  5. Feature Selection Has a Large Impact on One-Class Classification Accuracy for MicroRNAs in Plants.

    PubMed

    Yousef, Malik; Saçar Demirci, Müşerref Duygu; Khalifa, Waleed; Allmer, Jens

    2016-01-01

    MicroRNAs (miRNAs) are short RNA sequences involved in posttranscriptional gene regulation. Their experimental analysis is complicated and, therefore, needs to be supplemented with computational miRNA detection. Currently, computational miRNA detection is mainly performed using machine learning and in particular two-class classification. For machine learning, the miRNAs need to be parametrized and more than 700 features have been described. Positive training examples for machine learning are readily available, but negative data is hard to come by. Therefore, it seems preferable to use one-class classification instead of two-class classification. Previously, we were able to almost reach two-class classification accuracy using one-class classifiers. In this work, we employ feature selection procedures in conjunction with one-class classification and show that there is up to 36% difference in accuracy among these feature selection methods. The best feature set allowed the training of a one-class classifier which achieved an average accuracy of ~95.6%, thereby outperforming previous two-class-based plant miRNA detection approaches by about 0.5%. We believe that this can be improved upon in the future by rigorous filtering of the positive training examples and by improving current feature clustering algorithms to better target pre-miRNA feature selection.

  6. Inter-molecular β-sheet structure facilitates lung-targeting siRNA delivery

    NASA Astrophysics Data System (ADS)

    Zhou, Jihan; Li, Dong; Wen, Hao; Zheng, Shuquan; Su, Cuicui; Yi, Fan; Wang, Jue; Liang, Zicai; Tang, Tao; Zhou, Demin; Zhang, Li-He; Liang, Dehai; Du, Quan

    2016-03-01

    Size-dependent passive targeting based on the characteristics of tissues is a basic mechanism of drug delivery. While nanometer-sized particles are efficiently captured by the liver and spleen, micron-sized particles are most likely entrapped within the lung owing to its unique capillary structure and physiological features. To exploit this property in lung-targeting siRNA delivery, we designed and studied a multi-domain peptide named K-β, which was able to form inter-molecular β-sheet structures. Results showed that K-β peptides and siRNAs formed stable complex particles of 60 nm when mixed together. A critical property of such particles was that, after being intravenously injected into mice, they further associated into loose, micron-sized aggregates and were thus effectively entrapped within the capillaries of the lung, leading to passive accumulation and gene silencing. The large aggregates can dissociate or be broken down by the shear stress generated by blood flow, alleviating pulmonary embolism. Besides the lung, siRNA enrichment and targeted gene silencing were also observed in the liver. This drug delivery strategy, together with the low toxicity, biodegradability, and programmability of peptide carriers, shows great potential for in vivo applications.

  7. Structure of the SPRY domain of the human RNA helicase DDX1, a putative interaction platform within a DEAD-box protein

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kellner, Julian N.; Meinhart, Anton, E-mail: anton.meinhart@mpimf-heidelberg.mpg.de

    The structure of the SPRY domain of the human RNA helicase DDX1 was determined at 2.0 Å resolution. The SPRY domain provides a putative protein–protein interaction platform within DDX1 that differs from other SPRY domains in its structure and conserved regions. The human RNA helicase DDX1 in the DEAD-box family plays an important role in RNA processing and has been associated with HIV-1 replication and tumour progression. Whereas previously described DEAD-box proteins have a structurally conserved core, DDX1 shows a unique structural feature: a large SPRY-domain insertion in its RecA-like consensus fold. SPRY domains are known to function as protein–protein interaction platforms. Here, the crystal structure of the SPRY domain of human DDX1 (hDSPRY) is reported at 2.0 Å resolution. The structure reveals two layers of concave, antiparallel β-sheets that stack onto each other and a third β-sheet beneath the β-sandwich. A comparison with SPRY-domain structures from other eukaryotic proteins showed that the general β-sandwich fold is conserved; however, differences were detected in the loop regions, which were identified in other SPRY domains to be essential for interaction with cognate partners. In contrast, in hDSPRY these loop regions are not strictly conserved across species. Interestingly, though, a conserved patch of positive surface charge is found that may replace the connecting loops as a protein–protein interaction surface. The data presented here comprise the first structural information on DDX1 and provide insights into the unique domain architecture of this DEAD-box protein. By providing the structure of a putative interaction domain of DDX1, this work will serve as a basis for further studies of the interaction network within the hetero-oligomeric complexes of DDX1 and of its recruitment to the HIV-1 Rev protein as a viral replication factor.

  8. The tmRNA website

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hudson, Corey M.; Williams, Kelly P.

    We report that the transfer-messenger RNA (tmRNA) and its partner protein SmpB act together in resolving problems arising when translating bacterial ribosomes reach the end of mRNA with no stop codon. Their genes have been found in nearly all bacterial genomes and in some organelles. The tmRNA Website serves tmRNA sequences, alignments and feature annotations, and has recently moved to http://bioinformatics.sandia.gov/tmrna/. New features include software used to find the sequences, an update raising the number of unique tmRNA sequences from 492 to 1716, and a database of SmpB sequences which are served along with the tmRNA sequence from the same organism.

  9. The tmRNA website

    DOE PAGES

    Hudson, Corey M.; Williams, Kelly P.

    2014-11-05

    We report that the transfer-messenger RNA (tmRNA) and its partner protein SmpB act together in resolving problems arising when translating bacterial ribosomes reach the end of mRNA with no stop codon. Their genes have been found in nearly all bacterial genomes and in some organelles. The tmRNA Website serves tmRNA sequences, alignments and feature annotations, and has recently moved to http://bioinformatics.sandia.gov/tmrna/. New features include software used to find the sequences, an update raising the number of unique tmRNA sequences from 492 to 1716, and a database of SmpB sequences which are served along with the tmRNA sequence from the same organism.

  10. Degenerate RNA packaging signals in the genome of Satellite Tobacco Necrosis Virus: implications for the assembly of a T=1 capsid.

    PubMed

    Bunka, David H J; Lane, Stephen W; Lane, Claire L; Dykeman, Eric C; Ford, Robert J; Barker, Amy M; Twarock, Reidun; Phillips, Simon E V; Stockley, Peter G

    2011-10-14

    Using a recombinant, T=1 Satellite Tobacco Necrosis Virus (STNV)-like particle expressed in Escherichia coli, we have established conditions for in vitro disassembly and reassembly of the viral capsid. In vivo assembly is dependent on the presence of the coat protein (CP) N-terminal region, and in vitro assembly requires RNA. Using immobilised CP monomers under reassembly conditions with "free" CP subunits, we have prepared a range of partially assembled CP species for RNA aptamer selection. SELEX directed against the RNA-binding face of the STNV CP resulted in the isolation of several clones, one of which (B3) matches the STNV-1 genome in 16 out of 25 nucleotide positions, including across a statistically significant 10/10 stretch. This 10-base region folds into a stem-loop displaying the motif ACAA and has been shown to bind to STNV CP. Analysis of the other aptamer sequences reveals that the majority can be folded into stem-loops displaying versions of this motif. Using a sequence and secondary structure search motif to analyse the genomic sequence of STNV-1, we identified 30 stem-loops displaying the sequence motif AxxA. The implication is that there are many stem-loops in the genome carrying essential recognition features for binding STNV CP. Secondary structure predictions of the genomic RNA using Mfold showed that only 8 out of 30 of these stem-loops would be formed in the lowest-energy structure. These results are consistent with an assembly mechanism based on kinetically driven folding of the RNA. Copyright © 2011 Elsevier Ltd. All rights reserved.
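
    The kind of combined sequence and secondary-structure search mentioned above can be sketched in a few lines. This is a simplified illustration, not the authors' search motif: it assumes the loop consists of exactly the four nucleotides AxxA, counts G-U wobble pairs as valid, and uses a hypothetical minimum stem length of three base pairs.

```python
# Watson-Crick pairs plus G-U wobble (allowing wobble pairs is an assumption).
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}


def axxa_stem_loops(seq, min_stem=3):
    """Return (loop_start, stem_length) for every 4-nt AxxA loop that is closed
    by at least min_stem consecutive base pairs. Positions are 0-based."""
    seq = seq.upper().replace("T", "U")
    hits = []
    for i in range(len(seq) - 3):
        if seq[i] == "A" and seq[i + 3] == "A":
            left, right, stem = i - 1, i + 4, 0
            # Extend the stem outwards from the loop while the flanks pair.
            while left >= 0 and right < len(seq) and (seq[left], seq[right]) in PAIRS:
                stem += 1
                left -= 1
                right += 1
            if stem >= min_stem:
                hits.append((i, stem))
    return hits
```

    A search like this only identifies candidates that could form; as the abstract notes, whether a given stem-loop appears in the lowest-energy fold is a separate question.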

  11. Cell-type specific features of circular RNA expression.

    PubMed

    Salzman, Julia; Chen, Raymond E; Olsen, Mari N; Wang, Peter L; Brown, Patrick O

    2013-01-01

    Thousands of loci in the human and mouse genomes give rise to circular RNA transcripts; at many of these loci, the predominant RNA isoform is a circle. Using an improved computational approach for circular RNA identification, we found widespread circular RNA expression in Drosophila melanogaster and estimate that in humans, circular RNA may account for 1% as many molecules as poly(A) RNA. Analysis of data from the ENCODE consortium revealed that the repertoire of genes expressing circular RNA, the ratio of circular to linear transcripts for each gene, and even the pattern of splice isoforms of circular RNAs from each gene were cell-type specific. These results suggest that biogenesis of circular RNA is an integral, conserved, and regulated feature of the gene expression program.

  12. Sequence-Based Prediction of RNA-Binding Proteins Using Random Forest with Minimum Redundancy Maximum Relevance Feature Selection.

    PubMed

    Ma, Xin; Guo, Jing; Sun, Xiao

    2015-01-01

    The prediction of RNA-binding proteins is one of the most challenging problems in computational biology. Although some studies have investigated this problem, the accuracy of prediction is still not sufficient. In this study, a highly accurate method was developed to predict RNA-binding proteins from amino acid sequences using random forests with the minimum redundancy maximum relevance (mRMR) method, followed by incremental feature selection (IFS). We incorporated conjoint triad features and three novel features: binding propensity (BP), nonbinding propensity (NBP), and evolutionary information combined with physicochemical properties (EIPP). The results showed that these novel features have important roles in improving the performance of the predictor. Using the mRMR-IFS method, our predictor achieved the best performance (86.62% accuracy and 0.737 Matthews correlation coefficient). High prediction accuracy and successful prediction performance suggested that our method can be a useful approach to identify RNA-binding proteins from sequence information.
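
    Of the feature types listed above, the conjoint triad encoding is easy to show concretely. The sketch below uses the commonly cited seven-class grouping of amino acids; whether this study used exactly this grouping and normalization is an assumption, and the function name is hypothetical. Each residue is mapped to its class, and every window of three consecutive residues increments one of 7 × 7 × 7 = 343 counters. Counts are scaled by the maximum count, which is a simplification.

```python
from itertools import product

# Seven amino-acid classes (assumed grouping, based on side-chain properties).
CLASSES = ["AGV", "ILFP", "YMTS", "HNQW", "RK", "DE", "C"]
AA_TO_CLASS = {aa: str(idx) for idx, group in enumerate(CLASSES) for aa in group}
# All 343 ordered class triples, e.g. "000", "001", ..., "666".
TRIADS = ["".join(t) for t in product("0123456", repeat=3)]


def conjoint_triad(seq):
    """343-dimensional conjoint triad vector for a protein sequence.
    Residues outside the 20 standard amino acids are skipped."""
    counts = dict.fromkeys(TRIADS, 0)
    encoded = [AA_TO_CLASS[aa] for aa in seq.upper() if aa in AA_TO_CLASS]
    for i in range(len(encoded) - 2):
        counts["".join(encoded[i:i + 3])] += 1
    top = max(counts.values()) or 1  # avoid division by zero for short input
    return [counts[t] / top for t in TRIADS]
```

    Vectors of this form, concatenated with other features, are the kind of input on which feature selection such as mRMR followed by IFS would operate.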

  13. Advanced Design of Dumbbell-shaped Genetic Minimal Vectors Improves Non-coding and Coding RNA Expression.

    PubMed

    Jiang, Xiaoou; Yu, Han; Teo, Cui Rong; Tan, Genim Siu Xian; Goh, Sok Chin; Patel, Parasvi; Chua, Yiqiang Kevin; Hameed, Nasirah Banu Sahul; Bertoletti, Antonio; Patzel, Volker

    2016-09-01

    Dumbbell-shaped DNA minimal vectors lacking nontherapeutic genes and bacterial sequences are considered a stable, safe alternative to viral, nonviral, and naked plasmid-based gene-transfer systems. We investigated novel molecular features of dumbbell vectors aiming to reduce vector size and to improve the expression of noncoding or coding RNA. We minimized small hairpin RNA (shRNA) or microRNA (miRNA) expressing dumbbell vectors in size down to 130 bp generating the smallest genetic expression vectors reported. This was achieved by using a minimal H1 promoter with integrated transcriptional terminator transcribing the RNA hairpin structure around the dumbbell loop. Such vectors were generated with high conversion yields using a novel protocol. Minimized shRNA-expressing dumbbells showed accelerated kinetics of delivery and transcription leading to enhanced gene silencing in human tissue culture cells. In primary human T cells, minimized miRNA-expressing dumbbells revealed higher stability and triggered stronger target gene suppression as compared with plasmids and miRNA mimics. Dumbbell-driven gene expression was enhanced up to 56- or 160-fold by implementation of an intron and the SV40 enhancer compared with control dumbbells or plasmids. Advanced dumbbell vectors may represent one option to close the gap between durable expression that is achievable with integrating viral vectors and short-term effects triggered by naked RNA.

  14. Cassandra retrotransposons carry independently transcribed 5S RNA

    PubMed Central

    Kalendar, Ruslan; Tanskanen, Jaakko; Chang, Wei; Antonius, Kristiina; Sela, Hanan; Peleg, Ofer; Schulman, Alan H.

    2008-01-01

    We report a group of TRIMs (terminal-repeat retrotransposons in miniature), which are small nonautonomous retrotransposons. These elements, named Cassandra, universally carry conserved 5S RNA sequences and associated RNA polymerase (pol) III promoters and terminators in their long terminal repeats (LTRs). They were found in all vascular plants investigated. Uniquely for LTR retrotransposons, Cassandra produces noncapped, polyadenylated transcripts from the 5S pol III promoter. Capped, read-through transcripts containing Cassandra sequences can also be detected in RNA and in EST databases. The predicted Cassandra RNA 5S secondary structures resemble those for cellular 5S rRNA, with high information content specifically in the pol III promoter region. Genic integration sites are common for Cassandra, an unusual feature for abundant retrotransposons. The 5S in each LTR produces a tandem 5S arrangement with an inter-5S spacing resembling that of cellular 5S. The distribution of 5S genes is very variable in flowering plants and may be partially explained by Cassandra activity. Cassandra thus appears both to have adapted a ubiquitous cellular gene for ribosomal RNA for use as a promoter and to parasitize an as-yet-unidentified group of retrotransposons for the proteins needed in its lifecycle. PMID:18408163

  15. On the structural features of hairpin triloops in rRNA: from nucleotide to global conformational change upon ligand binding.

    PubMed

    Mitrasinovic, Petar M

    2006-03-01

    RNA structure can be viewed as both a construct composed of various structural motifs and a flexible polymer that is substantially influenced by its environment. In this light, the present paper represents an attempt to reconcile the two standpoints. By using the 3D structures both of four (16S and 23S) portions of unbound 50S, H50S, and T30S ribosomal subunits and of 38 large ribonucleoligand complexes as the starting point, the ligand-binding-induced behavior of 73 hairpin triloops with closing g-c and c-g base pairs was investigated using the root-mean-square deviation (RMSD) approach and the pseudotorsional (eta,theta) convention at the nucleotide-by-nucleotide level. Triloops were annotated in accordance with a recent proposal of geometric nomenclature. A simple measure for the determination of the strain of a triloop is introduced. It is believed that a possible classification of the interior triloops, based on the 2D eta-theta unique path, will help in understanding their local behavior upon ligand binding. All rRNA residues in contact with ligands as well as regions of considerable conformational changes upon complex formation were identified. The analysis addresses the question of how close to, and how far from, the actual ligand-binding sites the structural changes occur.
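
    The RMSD comparison referred to above reduces to a short formula, sketched here for two sets of matched atomic coordinates. The sketch assumes the two structures have already been superimposed and that atoms are listed in the same order; it does not perform the alignment, and the function name is hypothetical.

```python
import math


def rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two equal-length lists of (x, y, z)
    coordinates that are already superimposed and in matching order."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    squared = sum(
        (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
        for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b)
    )
    return math.sqrt(squared / len(coords_a))
```

    Applied per nucleotide to the unbound and ligand-bound forms of a triloop, a value like this gives one number per residue for the size of the conformational change.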

  16. Three-dimensional super-resolution microscopy of the inactive X chromosome territory reveals a collapse of its active nuclear compartment harboring distinct Xist RNA foci

    PubMed Central

    2014-01-01

    Background: A Xist RNA decorated Barr body is the structural hallmark of the compacted inactive X territory in female mammals. Using super-resolution three-dimensional structured illumination microscopy (3D-SIM) and quantitative image analysis, we compared its ultrastructure with active chromosome territories (CTs) in human and mouse somatic cells, and explored the spatio-temporal process of Barr body formation at onset of inactivation in early differentiating mouse embryonic stem cells (ESCs). Results: We demonstrate that all CTs are composed of structurally linked chromatin domain clusters (CDCs). In active CTs the periphery of CDCs harbors low-density chromatin enriched with transcriptionally competent markers, called the perichromatin region (PR). The PR borders on a contiguous channel system, the interchromatin compartment (IC), which starts at nuclear pores and pervades CTs. We propose that the PR and macromolecular complexes in IC channels together form the transcriptionally permissive active nuclear compartment (ANC). The Barr body differs from active CTs by a partially collapsed ANC with CDCs coming significantly closer together, although a rudimentary IC channel system connected to nuclear pores is maintained. Distinct Xist RNA foci, closely adjacent to the nuclear matrix scaffold attachment factor-A (SAF-A), localize throughout Xi along the rudimentary ANC. In early differentiating ESCs initial Xist RNA spreading precedes Barr body formation, which occurs concurrent with the subsequent exclusion of RNA polymerase II (RNAP II). Induction of a transgenic autosomal Xist RNA in a male ESC triggers the formation of an ‘autosomal Barr body’ with less compacted chromatin and incomplete RNAP II exclusion. Conclusions: 3D-SIM provides experimental evidence for profound differences between the functional architecture of transcriptionally active CTs and the Barr body. Basic structural features of CT organization such as CDCs and IC channels are however still recognized, arguing against a uniform compaction of the Barr body at the nucleosome level. The localization of distinct Xist RNA foci at boundaries of the rudimentary ANC may be considered as snap-shots of a dynamic interaction with silenced genes. Enrichment of SAF-A within Xi territories and its close spatial association with Xist RNA suggests their cooperative function for structural organization of Xi. PMID:25057298

  17. RNA-Seq of Bacillus licheniformis: active regulatory RNA features expressed within a productive fermentation.

    PubMed

    Wiegand, Sandra; Dietrich, Sascha; Hertel, Robert; Bongaerts, Johannes; Evers, Stefan; Volland, Sonja; Daniel, Rolf; Liesegang, Heiko

    2013-10-01

    The production of enzymes by an industrial strain requires a complex adaption of the bacterial metabolism to the conditions within the fermenter. Regulatory events within the process result in a dynamic change of the transcriptional activity of the genome. This complex network of genes is orchestrated by proteins as well as regulatory RNA elements. Here we present an RNA-Seq based study considering selected phases of an industry-oriented fermentation of Bacillus licheniformis. A detailed analysis of 20 strand-specific RNA-Seq datasets revealed a multitude of transcriptionally active genomic regions. 3314 RNA features encoded by such active loci have been identified and sorted into ten functional classes. The identified sequences include the expected RNA features like housekeeping sRNAs, metabolic riboswitches and RNA switches well known from studies on Bacillus subtilis as well as a multitude of completely new candidates for regulatory RNAs. An unexpectedly high number of 855 RNA features are encoded antisense to annotated protein and RNA genes, in addition to 461 independently transcribed small RNAs. These antisense transcripts contain molecules with a remarkable size range variation from 38 to 6348 base pairs in length. The genome of the type strain B. licheniformis DSM13 was completely reannotated using data obtained from RNA-Seq analyses and from public databases. The hereby generated data-sets represent a solid amount of knowledge on the dynamic transcriptional activities during the investigated fermentation stages. The identified regulatory elements enable research on the understanding and the optimization of crucial metabolic activities during a productive fermentation of Bacillus licheniformis strains.

  18. RNA adducts with Na2SeO4 and Na2SeO3 - Stability and structural features

    NASA Astrophysics Data System (ADS)

    Nafisi, Shohreh; Manouchehri, Firouzeh; Montazeri, Maryam

    2011-12-01

    Selenium compounds are widely available in dietary supplements and have been extensively studied for their antioxidant and anticancer properties. Low blood Se levels were found to be associated with an increased incidence and mortality from various types of cancers. Although many in vivo and clinical trials have been conducted using these compounds, their biochemical and chemical mechanisms of efficacy are the focus of much current research. This study was designed to examine the interaction of Na2SeO4 and Na2SeO3 with RNA in aqueous solution at physiological conditions, using a constant RNA concentration (6.25 mM) and various sodium selenate and sodium selenite/polynucleotide (phosphate) ratios of 1/80, 1/40, 1/20, 1/10, 1/5, 1/2 and 1/1. Fourier transform infrared, UV-Visible spectroscopic methods were used to determine the drug binding modes, the binding constants, and the stability of Na2SeO4 and Na2SeO3-RNA complexes in aqueous solution. Spectroscopic evidence showed that Na2SeO4 and Na2SeO3 bind to the major and minor grooves of RNA (via G, A and U bases) with some degree of the Se-phosphate (PO2) interaction for both compounds with overall binding constants of K(Na2SeO4-RNA) = 8.34 × 10^3 M^-1 and K(Na2SeO3-RNA) = 4.57 × 10^3 M^-1. The order of selenium salts-biopolymer stability was Na2SeO4-RNA > Na2SeO3-RNA. RNA aggregations occurred at higher selenium concentrations. No biopolymer conformational changes were observed upon Na2SeO4 and Na2SeO3 interactions, while RNA remains in the A-family structure.
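
    As a worked illustration of the titration series in this abstract, the sketch below converts each salt/phosphate ratio into a salt concentration. It assumes that the stated 6.25 mM refers to RNA phosphate and that the ratios are molar; the names used are hypothetical.

```python
from fractions import Fraction

# Assumption: 6.25 mM is the RNA (phosphate) concentration held constant.
RNA_PHOSPHATE_MM = Fraction(625, 100)
# Salt/phosphate molar ratios listed in the abstract.
RATIOS = [Fraction(1, d) for d in (80, 40, 20, 10, 5, 2, 1)]

def selenium_concentrations_mm(rna_mm=RNA_PHOSPHATE_MM, ratios=RATIOS):
    """Return the salt concentration (mM) implied by each salt/phosphate ratio."""
    return [float(r * rna_mm) for r in ratios]

for ratio, conc in zip(RATIOS, selenium_concentrations_mm()):
    print(f"ratio {ratio}: {conc} mM")
```

    Under these assumptions the series runs from 0.078125 mM at a ratio of 1/80 up to 6.25 mM at 1/1.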

  19. Productive mRNA stem loop-mediated transcriptional slippage: Crucial features in common with intrinsic terminators.

    PubMed

    Penno, Christophe; Sharma, Virag; Coakley, Arthur; O'Connell Motherway, Mary; van Sinderen, Douwe; Lubkowska, Lucyna; Kireeva, Maria L; Kashlev, Mikhail; Baranov, Pavel V; Atkins, John F

    2015-04-21

    Escherichia coli and yeast DNA-dependent RNA polymerases are shown to mediate efficient nascent transcript stem loop formation-dependent RNA-DNA hybrid realignment. The realignment was discovered on the heteropolymeric sequence T5C5 and yields transcripts lacking a C residue within a corresponding U5C4. The sequence studied is derived from a Roseiflexus insertion sequence (IS) element where the resulting transcriptional slippage is required for transposase synthesis. The stability of the RNA structure, the proximity of the stem loop to the slippage site, the length and composition of the slippage site motif, and the identity of its 3' adjacent nucleotides (nt) are crucial for transcripts lacking a single C. In many respects, the RNA structure requirements for this slippage resemble those for hairpin-dependent transcription termination. In a purified in vitro system, the slippage efficiency ranges from 5% to 75% depending on the concentration ratios of the nucleotides specified by the slippage sequence and the 3' nt context. The only previous proposal of stem loop mediated slippage, which was in Ebola virus expression, was based on incorrect data interpretation. We propose a mechanical slippage model involving the RNAP translocation state as the main motor in slippage directionality and efficiency. It is distinct from previously described models, including the one proposed for paramyxovirus, where following random movement efficiency is mainly dependent on the stability of the new realigned hybrid. In broadening the scope for utilization of transcription slippage for gene expression, the stimulatory structure provides parallels with programmed ribosomal frameshifting at the translation level.

  20. RNA Editing During Sexual Development Occurs in Distantly Related Filamentous Ascomycetes

    PubMed Central

    Teichert, Ines; Dahlmann, Tim A.; Kück, Ulrich

    2017-01-01

    RNA editing is a post-transcriptional process that modifies RNA molecules leading to transcript sequences that differ from their template DNA. A-to-I editing was found to be widely distributed in nuclear transcripts of metazoa, but was detected in fungi only recently in a study of the filamentous ascomycete Fusarium graminearum that revealed extensive A-to-I editing of mRNAs in sexual structures (fruiting bodies). Here, we searched for putative RNA editing events in RNA-seq data from Sordaria macrospora and Pyronema confluens, two distantly related filamentous ascomycetes, and in data from the Taphrinomycete Schizosaccharomyces pombe. Like F. graminearum, S. macrospora is a member of the Sordariomycetes, whereas P. confluens belongs to the early-diverging group of Pezizomycetes. We found extensive A-to-I editing in RNA-seq data from sexual mycelium from both filamentous ascomycetes, but not in vegetative structures. A-to-I editing was not detected in different stages of meiosis of S. pombe. A comparison of A-to-I editing in S. macrospora with F. graminearum and P. confluens, respectively, revealed little conservation of individual editing sites. An analysis of RNA-seq data from two sterile developmental mutants of S. macrospora showed that A-to-I editing is strongly reduced in these strains. Sequencing of cDNA fragments containing more than one editing site from P. confluens showed that at the beginning of sexual development, transcripts were incompletely edited or unedited, whereas in later stages transcripts were more extensively edited. Taken together, these data suggest that A-to-I RNA editing is an evolutionary conserved feature during fruiting body development in filamentous ascomycetes. PMID:28338982

  1. RNA Editing During Sexual Development Occurs in Distantly Related Filamentous Ascomycetes.

    PubMed

    Teichert, Ines; Dahlmann, Tim A; Kück, Ulrich; Nowrousian, Minou

    2017-04-01

    RNA editing is a post-transcriptional process that modifies RNA molecules leading to transcript sequences that differ from their template DNA. A-to-I editing was found to be widely distributed in nuclear transcripts of metazoa, but was detected in fungi only recently in a study of the filamentous ascomycete Fusarium graminearum that revealed extensive A-to-I editing of mRNAs in sexual structures (fruiting bodies). Here, we searched for putative RNA editing events in RNA-seq data from Sordaria macrospora and Pyronema confluens, two distantly related filamentous ascomycetes, and in data from the Taphrinomycete Schizosaccharomyces pombe. Like F. graminearum, S. macrospora is a member of the Sordariomycetes, whereas P. confluens belongs to the early-diverging group of Pezizomycetes. We found extensive A-to-I editing in RNA-seq data from sexual mycelium from both filamentous ascomycetes, but not in vegetative structures. A-to-I editing was not detected in different stages of meiosis of S. pombe. A comparison of A-to-I editing in S. macrospora with F. graminearum and P. confluens, respectively, revealed little conservation of individual editing sites. An analysis of RNA-seq data from two sterile developmental mutants of S. macrospora showed that A-to-I editing is strongly reduced in these strains. Sequencing of cDNA fragments containing more than one editing site from P. confluens showed that at the beginning of sexual development, transcripts were incompletely edited or unedited, whereas in later stages transcripts were more extensively edited. Taken together, these data suggest that A-to-I RNA editing is an evolutionary conserved feature during fruiting body development in filamentous ascomycetes. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  2. A second RNA-binding protein is essential for ethanol tolerance provided by the bacterial OLE ribonucleoprotein complex.

    PubMed

    Harris, Kimberly A; Zhou, Zhiyuan; Peters, Michelle L; Wilkins, Sarah G; Breaker, Ronald R

    2018-06-18

    OLE (ornate, large, extremophilic) RNAs comprise a class of structured noncoding RNAs (ncRNAs) found in many extremophilic bacteria species. OLE RNAs constitute one of the longest and most widespread bacterial ncRNA classes whose major biochemical function remains unknown. In the Gram-positive alkaliphile Bacillus halodurans, OLE RNA is abundant, and localizes to the cell membrane by association with the transmembrane OLE-associated protein called OapA (formerly OAP). These characteristics, along with the well-conserved sequence and structural features of OLE RNAs, suggest that the OLE ribonucleoprotein (RNP) complex performs important biological functions. B. halodurans strains lacking OLE RNA (∆ole) or OapA (∆oapA) are less tolerant of cold (20 °C) and short-chain alcohols (e.g., ethanol). Here, we describe the effects of a mutant OapA (called PM1) that more strongly inhibits growth under cold or ethanol stress compared with strains lacking the oapA gene, even when wild-type OapA is present. This dominant-negative effect of PM1 is reversed by mutations that render OLE RNA nonfunctional. This finding demonstrates that the deleterious PM1 phenotype requires an intact RNP complex, and suggests that the complex has one or more additional undiscovered components. A genetic screen uncovered PM1 phenotype suppressor mutations in the ybzG gene, which codes for a putative RNA-binding protein of unknown biological function. We observe that YbzG protein (also called OapB) selectively binds OLE RNA in vitro, whereas a mutant version of the protein is not observed to bind OLE RNA. Thus, YbzG/OapB is an important component of the functional OLE RNP complex in B. halodurans.

  3. Cell-Type Specific Features of Circular RNA Expression

    PubMed Central

    Salzman, Julia; Chen, Raymond E.; Olsen, Mari N.; Wang, Peter L.; Brown, Patrick O.

    2013-01-01

    Thousands of loci in the human and mouse genomes give rise to circular RNA transcripts; at many of these loci, the predominant RNA isoform is a circle. Using an improved computational approach for circular RNA identification, we found widespread circular RNA expression in Drosophila melanogaster and estimate that in humans, circular RNA may account for 1% as many molecules as poly(A) RNA. Analysis of data from the ENCODE consortium revealed that the repertoire of genes expressing circular RNA, the ratio of circular to linear transcripts for each gene, and even the pattern of splice isoforms of circular RNAs from each gene were cell-type specific. These results suggest that biogenesis of circular RNA is an integral, conserved, and regulated feature of the gene expression program. PMID:24039610

  4. Structural features of NS3 of Dengue virus serotypes 2 and 4 in solution and insight into RNA binding and the inhibitory role of quercetin.

    PubMed

    Pan, Ankita; Saw, Wuan Geok; Subramanian Manimekalai, Malathy Sony; Grüber, Ardina; Joon, Shin; Matsui, Tsutomu; Weiss, Thomas M; Grüber, Gerhard

    2017-05-01

    Dengue virus (DENV), which has four serotypes (DENV-1 to DENV-4), is the causative agent of the viral infection dengue. DENV nonstructural protein 3 (NS3) comprises a serine protease domain and an RNA helicase domain which has nucleotide triphosphatase activities that are essential for RNA replication and viral assembly. Here, solution X-ray scattering was used to provide insight into the overall structure and flexibility of the entire NS3 and its recombinant helicase and protease domains for Dengue virus serotypes 2 and 4 in solution. The DENV-2 and DENV-4 NS3 forms are elongated and flexible in solution. The importance of the linker residues in flexibility and domain-domain arrangement was shown by the compactness of the individual protease and helicase domains. Swapping of the 174 PPAVP 179 linker stretch of the related Hepatitis C virus (HCV) NS3 into DENV-2 NS3 did not alter the elongated shape of the engineered mutant. Conformational alterations owing to RNA binding are described in the protease domain, which undergoes substantial conformational alterations that are required for the optimal catalysis of bound RNA. Finally, the effects of ATPase inhibitors on the enzymatically active DENV-2 and DENV-4 NS3 and the individual helicases are presented, and insight into the allosteric effect of the inhibitor quercetin is provided.

  5. Structure-Function Analysis of Rny1 in tRNA Cleavage and Growth Inhibition

    PubMed Central

    Luhtala, Natalie; Parker, Roy

    2012-01-01

    T2 ribonucleases are conserved nucleases that affect a variety of processes in eukaryotic cells including the regulation of self-incompatibility by S-RNases in plants, modulation of host immune cell responses by viral and schistosome T2 enzymes, and neurological development and tumor progression in humans. These roles for RNaseT2’s can be due to catalytic or catalytic-independent functions of the molecule. Despite this broad importance, the features of RNaseT2 proteins that modulate catalytic and catalytic-independent functions are poorly understood. Herein, we analyze the features of Rny1 in Saccharomyces cerevisiae to determine the requirements for cleaving tRNA in vivo and for inhibiting cellular growth in a catalytic-independent manner. We demonstrate that catalytic-independent inhibition of growth is a combinatorial property of the protein and is affected by a fungal-specific C-terminal extension, the conserved catalytic core, and the presence of a signal peptide. Catalytic functions of Rny1 are independent of the C-terminal extension, are affected by many mutations in the catalytic core, and also require a signal peptide. Biochemical flotation assays reveal that in rny1Δ cells, some tRNA molecules associate with membranes suggesting that cleavage of tRNAs by Rny1 can involve either tRNA association with, or uptake into, membrane compartments. PMID:22829915

  6. The Model-Based Study of the Effectiveness of Reporting Lists of Small Feature Sets Using RNA-Seq Data.

    PubMed

    Kim, Eunji; Ivanov, Ivan; Hua, Jianping; Lampe, Johanna W; Hullar, Meredith Aj; Chapkin, Robert S; Dougherty, Edward R

    2017-01-01

    Ranking feature sets for phenotype classification based on gene expression is a challenging issue in cancer bioinformatics. When the number of samples is small, all feature selection algorithms are known to be unreliable, producing significant error, and error estimators suffer from different degrees of imprecision. The problem is compounded by the fact that the accuracy of classification depends on the manner in which the phenomena are transformed into data by the measurement technology. Because next-generation sequencing technologies amount to a nonlinear transformation of the actual gene or RNA concentrations, they can potentially produce less discriminative data relative to the actual gene expression levels. In this study, we compare the performance of ranking feature sets derived from a model of RNA-Seq data with that of a multivariate normal model of gene concentrations using 3 measures: (1) ranking power, (2) length of extensions, and (3) Bayes features. This is the model-based study to examine the effectiveness of reporting lists of small feature sets using RNA-Seq data and the effects of different model parameters and error estimators. The results demonstrate that the general trends of the parameter effects on the ranking power of the underlying gene concentrations are preserved in the RNA-Seq data, whereas the power of finding a good feature set becomes weaker when gene concentrations are transformed by the sequencing machine.

  7. The role of MicroRNA molecules and MicroRNA-regulating machinery in the pathogenesis and progression of epithelial ovarian cancer.

    PubMed

    Wang, Xiyin; Ivan, Mircea; Hawkins, Shannon M

    2017-11-01

    MicroRNA molecules are small, single-stranded RNA molecules that function to regulate networks of genes. They play important roles in normal female reproductive tract biology, as well as in the pathogenesis and progression of epithelial ovarian cancer. DROSHA, DICER, and Argonaute proteins are components of the microRNA-regulatory machinery and mediate microRNA production and function. This review discusses aberrant expression of microRNA molecules and microRNA-regulating machinery associated with clinical features of epithelial ovarian cancer. Understanding the regulation of microRNA molecule production and function may facilitate the development of novel diagnostic and therapeutic strategies to improve the prognosis of women with epithelial ovarian cancer. Additionally, understanding microRNA molecules and microRNA-regulatory machinery associations with clinical features may influence prevention and early detection efforts. Copyright © 2017 Elsevier Inc. All rights reserved.

  8. Improved measurements of RNA structure conservation with generalized centroid estimators.

    PubMed

    Okada, Yohei; Saito, Yutaka; Sato, Kengo; Sakakibara, Yasubumi

    2011-01-01

    Identification of non-protein-coding RNAs (ncRNAs) in genomes is a crucial task for not only molecular cell biology but also bioinformatics. Secondary structures of ncRNAs are employed as a key feature of ncRNA analysis since biological functions of ncRNAs are deeply related to their secondary structures. Although the minimum free energy (MFE) structure of an RNA sequence is regarded as the most stable structure, MFE alone could not be an appropriate measure for identifying ncRNAs since the free energy is heavily biased by the nucleotide composition. Therefore, instead of MFE itself, several alternative measures for identifying ncRNAs have been proposed such as the structure conservation index (SCI) and the base pair distance (BPD), both of which employ MFE structures. However, these measurements are unfortunately not suitable for identifying ncRNAs in some cases including the genome-wide search and incur high false discovery rate. In this study, we propose improved measurements based on SCI and BPD, applying generalized centroid estimators to incorporate the robustness against low quality multiple alignments. Our experiments show that our proposed methods achieve higher accuracy than the original SCI and BPD for not only human-curated structural alignments but also low quality alignments produced by CLUSTAL W. Furthermore, the centroid-based SCI on CLUSTAL W alignments is more accurate than or comparable with that of the original SCI on structural alignments generated with RAF, a high quality structural aligner, for which twofold expensive computational time is required on average. We conclude that our methods are more suitable for genome-wide alignments which are of low quality from the point of view on secondary structures than the original SCI and BPD.
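
    For orientation, the original structure conservation index (SCI) that this abstract builds on is commonly defined as the consensus minimum free energy of an alignment divided by the mean minimum free energy of its individual sequences. The sketch below shows only that baseline ratio, not the centroid-based variants proposed in the paper; the function name and energy values are hypothetical, and in practice the energies would come from a folding program.

```python
from statistics import mean

def structure_conservation_index(consensus_mfe, single_mfes):
    """Baseline SCI: consensus MFE divided by the mean single-sequence MFE.

    Energies are in kcal/mol and normally negative; values near 1 suggest a
    conserved fold, values near 0 suggest no shared structure.
    """
    if not single_mfes:
        raise ValueError("need at least one single-sequence MFE")
    average = mean(single_mfes)
    if average == 0:
        raise ValueError("mean single-sequence MFE is zero; SCI is undefined")
    return consensus_mfe / average

# Hypothetical energies for a three-sequence alignment.
singles = [-25.0, -20.0, -15.0]
print(structure_conservation_index(-20.0, singles))  # consensus as stable as the average
print(structure_conservation_index(-10.0, singles))  # consensus half as stable
```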

  9. Nucleophosmin integrates within the nucleolus via multi-modal interactions with proteins displaying R-rich linear motifs and rRNA

    PubMed Central

    Mitrea, Diana M; Cika, Jaclyn A; Guy, Clifford S; Ban, David; Banerjee, Priya R; Stanley, Christopher B; Nourse, Amanda; Deniz, Ashok A; Kriwacki, Richard W

    2016-01-01

    The nucleolus is a membrane-less organelle formed through liquid-liquid phase separation of its components from the surrounding nucleoplasm. Here, we show that nucleophosmin (NPM1) integrates within the nucleolus via a multi-modal mechanism involving multivalent interactions with proteins containing arginine-rich linear motifs (R-motifs) and ribosomal RNA (rRNA). Importantly, these R-motifs are found in canonical nucleolar localization signals. Based on a novel combination of biophysical approaches, we propose a model for the molecular organization within liquid-like droplets formed by the N-terminal domain of NPM1 and R-motif peptides, thus providing insights into the structural organization of the nucleolus. We identify multivalency of acidic tracts and folded nucleic acid binding domains, mediated by N-terminal domain oligomerization, as structural features required for phase separation of NPM1 with other nucleolar components in vitro and for localization within mammalian nucleoli. We propose that one mechanism of nucleolar localization involves phase separation of proteins within the nucleolus. DOI: http://dx.doi.org/10.7554/eLife.13571.001 PMID:26836305

  10. Nucleophosmin integrates within the nucleolus via multi-modal interactions with proteins displaying R-rich linear motifs and rRNA

    DOE PAGES

    Mitrea, Diana M.; Cika, Jaclyn A.; Guy, Clifford S.; ...

    2016-02-02

    The nucleolus is a membrane-less organelle formed through liquid-liquid phase separation of its components from the surrounding nucleoplasm. Here, we show that nucleophosmin (NPM1) integrates within the nucleolus via a multi-modal mechanism involving multivalent interactions with proteins containing arginine-rich linear motifs (R-motifs) and ribosomal RNA (rRNA). Importantly, these R-motifs are found in canonical nucleolar localization signals. Based on a novel combination of biophysical approaches, we propose a model for the molecular organization within liquid-like droplets formed by the N-terminal domain of NPM1 and R-motif peptides, thus providing insights into the structural organization of the nucleolus. We identify multivalency of acidic tracts and folded nucleic acid binding domains, mediated by N-terminal domain oligomerization, as structural features required for phase separation of NPM1 with other nucleolar components in vitro and for localization within mammalian nucleoli. We propose that one mechanism of nucleolar localization involves phase separation of proteins within the nucleolus.

  11. Multiclass cancer classification using a feature subset-based ensemble from microRNA expression profiles.

    PubMed

    Piao, Yongjun; Piao, Minghao; Ryu, Keun Ho

    2017-01-01

    Cancer classification has been a crucial topic of research in cancer treatment. In the last decade, messenger RNA (mRNA) expression profiles have been widely used to classify different types of cancers. With the discovery of a new class of small non-coding RNAs, known as microRNAs (miRNAs), various studies have shown that the expression patterns of miRNA can also accurately classify human cancers. Therefore, there is a great demand for the development of machine learning approaches to accurately classify various types of cancers using miRNA expression data. In this article, we propose a feature subset-based ensemble method in which each model is learned from a different projection of the original feature space to classify multiple cancers. In our method, the feature relevance and redundancy are considered to generate multiple feature subsets, the base classifiers are learned from each independent miRNA subset, and the average posterior probability is used to combine the base classifiers. To test the performance of our method, we used bead-based and sequence-based miRNA expression datasets and conducted 10-fold and leave-one-out cross validations. The experimental results show that the proposed method yields good results and has higher prediction accuracy than popular ensemble methods. The Java program and source code of the proposed method and the datasets in the experiments are freely available at https://sourceforge.net/projects/mirna-ensemble/. Copyright © 2016 Elsevier Ltd. All rights reserved.
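
    The combination rule named in this abstract, averaging the posterior probabilities of the base classifiers, can be sketched as follows. This is only an illustration of the averaging step under simple assumptions; it is not the authors' Java implementation, and the class labels and probabilities are hypothetical.

```python
def average_posteriors(posteriors):
    """Average per-class probabilities over all base classifiers.

    `posteriors` holds one list of class probabilities per base classifier,
    all in the same class order.
    """
    n_models = len(posteriors)
    n_classes = len(posteriors[0])
    return [sum(p[c] for p in posteriors) / n_models for c in range(n_classes)]

def predict(posteriors, labels):
    """Return the label with the highest averaged posterior, plus the averages."""
    averaged = average_posteriors(posteriors)
    best = max(range(len(averaged)), key=averaged.__getitem__)
    return labels[best], averaged

# Hypothetical outputs of three base classifiers, each trained on a different miRNA subset.
labels = ["cancer_A", "cancer_B", "cancer_C"]
posteriors = [
    [0.6, 0.3, 0.1],
    [0.2, 0.5, 0.3],
    [0.5, 0.4, 0.1],
]
label, averaged = predict(posteriors, labels)
print(label, averaged)
```

    Averaging lets a confident classifier outweigh a weak disagreement from another, which simple majority voting cannot do.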

  12. Correlation of the lung microbiota with metabolic profiles in bronchoalveolar lavage fluid in HIV infection.

    PubMed

    Cribbs, Sushma K; Uppal, Karan; Li, Shuzhao; Jones, Dean P; Huang, Laurence; Tipton, Laura; Fitch, Adam; Greenblatt, Ruth M; Kingsley, Lawrence; Guidot, David M; Ghedin, Elodie; Morris, Alison

    2016-01-20

    While 16S ribosomal RNA (rRNA) sequencing has been used to characterize the lung's bacterial microbiota in human immunodeficiency virus (HIV)-infected individuals, taxonomic studies provide limited information on bacterial function and impact on the host. Metabolic profiles can provide functional information on host-microbe interactions in the lungs. We investigated the relationship between the respiratory microbiota and metabolic profiles in the bronchoalveolar lavage fluid of HIV-infected and HIV-uninfected outpatients. Targeted sequencing of the 16S rRNA gene was used to analyze the bacterial community structure and liquid chromatography-high-resolution mass spectrometry was used to detect features in bronchoalveolar lavage fluid. Global integration of all metabolic features with microbial species was done using sparse partial least squares regression. Thirty-nine HIV-infected subjects and 20 HIV-uninfected controls without acute respiratory symptoms were enrolled. Twelve mass-to-charge ratio (m/z) features from C18 analysis were significantly different between HIV-infected individuals and controls (false discovery rate (FDR) = 0.2); another 79 features were identified by network analysis. Further metabolite analysis demonstrated that four features were significantly overrepresented in the bronchoalveolar lavage (BAL) fluid of HIV-infected individuals compared to HIV-uninfected, including cystine, two complex carbohydrates, and 3,5-dibromo-L-tyrosine. There were 231 m/z features significantly associated with peripheral blood CD4 cell counts identified using sparse partial least squares regression (sPLS) at a variable importance on projection (VIP) threshold of 2. Twenty-five percent of these 91 m/z features were associated with various microbial species. Bacteria from families Caulobacteraceae, Staphylococcaceae, Nocardioidaceae, and genus Streptococcus were associated with the greatest number of features. Glycerophospholipid and linoleate pathways correlated with these bacteria. In bronchoalveolar lavage fluid, specific metabolic profiles correlated with bacterial organisms known to play a role in the pathogenesis of pneumonia in HIV-infected individuals. These findings suggest that microbial communities and their interactions with the host may have functional metabolic impact in the lung.

  13. MiR-205-5p and miR-342-3p cooperate in the repression of the E2F1 transcription factor in the context of anticancer chemotherapy resistance

    PubMed Central

    Lai, Xin; Gupta, Shailendra K; Schmitz, Ulf; Marquardt, Stephan; Knoll, Susanne; Spitschak, Alf; Wolkenhauer, Olaf; Pützer, Brigitte M; Vera, Julio

    2018-01-01

    High rates of lethal outcome in tumour metastasis are associated with the acquisition of invasiveness and chemoresistance. Several clinical studies indicate that E2F1 overexpression across high-grade tumours culminates in unfavourable prognosis and chemoresistance in patients. Thus, fine-tuning the expression of E2F1 could be a promising approach for treating patients showing chemoresistance. Methods: We integrated bioinformatics, structural and kinetic modelling, and experiments to study cooperative regulation of E2F1 by microRNA (miRNA) pairs in the context of anticancer chemotherapy resistance. Results: We showed that an enhanced E2F1 repression efficiency can be achieved in chemoresistant tumour cells through two cooperating miRNAs. Sequence and structural information were used to identify potential miRNA pairs that can form tertiary structures with E2F1 mRNA. We then employed molecular dynamics simulations to show that among the identified triplexes, miR-205-5p and miR-342-3p can form the most stable triplex with E2F1 mRNA. A mathematical model simulating the E2F1 regulation by the cooperative miRNAs predicted enhanced E2F1 repression, a feature that was verified by in vitro experiments. Finally, we integrated this cooperative miRNA regulation into a more comprehensive network to account for E2F1-related chemoresistance in tumour cells. The network model simulations and experimental data indicate the ability of enhanced expression of both miR-205-5p and miR-342-3p to decrease tumour chemoresistance by cooperatively repressing E2F1. Conclusions: Our results suggest that pairs of cooperating miRNAs could be used as potential RNA therapeutics to reduce E2F1-related chemoresistance. PMID:29464002

  14. Complete fold annotation of the human proteome using a novel structural feature space.

    PubMed

    Middleton, Sarah A; Illuminati, Joseph; Kim, Junhyong

    2017-04-13

    Recognition of protein structural fold is the starting point for many structure prediction tools and protein function inference. Fold prediction is computationally demanding and recognizing novel folds is difficult such that the majority of proteins have not been annotated for fold classification. Here we describe a new machine learning approach using a novel feature space that can be used for accurate recognition of all 1,221 currently known folds and inference of unknown novel folds. We show that our method achieves better than 94% accuracy even when many folds have only one training example. We demonstrate the utility of this method by predicting the folds of 34,330 human protein domains and showing that these predictions can yield useful insights into potential biological function, such as prediction of RNA-binding ability. Our method can be applied to de novo fold prediction of entire proteomes and identify candidate novel fold families.

  15. Complete fold annotation of the human proteome using a novel structural feature space

    PubMed Central

    Middleton, Sarah A.; Illuminati, Joseph; Kim, Junhyong

    2017-01-01

    Recognition of protein structural fold is the starting point for many structure prediction tools and protein function inference. Fold prediction is computationally demanding and recognizing novel folds is difficult such that the majority of proteins have not been annotated for fold classification. Here we describe a new machine learning approach using a novel feature space that can be used for accurate recognition of all 1,221 currently known folds and inference of unknown novel folds. We show that our method achieves better than 94% accuracy even when many folds have only one training example. We demonstrate the utility of this method by predicting the folds of 34,330 human protein domains and showing that these predictions can yield useful insights into potential biological function, such as prediction of RNA-binding ability. Our method can be applied to de novo fold prediction of entire proteomes and identify candidate novel fold families. PMID:28406174

  16. State-of-the-art on viral microRNAs in HPV infection and cancer development.

    PubMed

    Poltronieri, Palmiro; Sun, Binlian; Huang, Kai-Yao; Chang, Tzu-Hao; Lee, Tzong-Yi

    2018-03-27

    High-risk HPV subtypes are driving forces for human cancer development: HPV-16 and HPV-18 are responsible for most HPV-caused cancers. This review describes the present knowledge on the coding potential of HR-HPV genomes for viral miRNAs. A database of HPV subtype miRNAs, VIRmiRtar, has been constructed by applying bioinformatics and a computational method, ViralMir, that exploits structural features, the presence of hairpins, and validation by comparison with RNA sequencing datasets. Several miRNA candidates have been localised in the genomes of high-risk HPV subtypes, among them HPV-16 miR-1, miR-2 and miR-3. The database contains a list of host candidate gene targets that may be responsible for the oncogenesis in the various cellular environments. miRNA silencing therapies, based on specific cellular uptake of miRNA mimics and antagomiRs, directed towards HPV encoded miRNAs and/or microRNAs deregulated in the host cells, could be a valuable approach to support pharmaceutical interventions in the treatment of HPV dependent cancers. Copyright © Bentham Science Publishers.

  17. In silico ribozyme evolution in a metabolically coupled RNA population.

    PubMed

    Könnyű, Balázs; Szilágyi, András; Czárán, Tamás

    2015-05-27

    The RNA World hypothesis offers a plausible bridge from no-life to life on prebiotic Earth, by assuming that RNA, the only known molecule type capable of playing genetic and catalytic roles at the same time, could have been the first evolvable entity on the evolutionary path to the first living cell. We have developed the Metabolically Coupled Replicator System (MCRS), a spatially explicit simulation modelling approach to prebiotic RNA-World evolution on mineral surfaces, in which we incorporate the most important experimental facts and theoretical considerations to comply with recent knowledge on RNA and prebiotic evolution. In this paper the MCRS model framework has been extended in order to investigate the dynamical and evolutionary consequences of adding an important physico-chemical detail, namely explicit replicator structure - nucleotide sequence and 2D folding calculated from thermodynamical criteria - and their possible mutational changes, to the assumptions of a previously less detailed toy model. For each mutable nucleotide sequence the corresponding 2D folded structure with minimum free energy is calculated, which in turn is used to determine the fitness components (degradation rate, replicability and metabolic enzyme activity) of the replicator. We show that the community of such replicators providing the monomer supply for their own replication by evolving metabolic enzyme activities features an improved propensity for stable coexistence and structural adaptation. These evolutionary advantages are due to the emergent uniformity of metabolic replicator fitnesses imposed on the community by local group selection and attained through replicator trait convergence, i.e., the tendency of replicator lengths, ribozyme activities and population sizes to become similar between the coevolving replicator species that are otherwise both structurally and functionally different. 
    In the most general terms it is the surprisingly high extra viability of the metabolic replicator system that the present model adds to the MCRS concept of the origin of life. Surface-bound, metabolically coupled RNA replicators tend to evolve different, enzymatically active sites within thermodynamically stable secondary structures, and the system as a whole evolves towards the robust coexistence of a complete set of such ribozymes driving the metabolism producing monomers for their own replication.

  18. Fabrication of pRNA nanoparticles to deliver therapeutic RNAs and bioactive compounds into tumor cells

    PubMed Central

    Shu, Yi; Shu, Dan; Haque, Farzin; Guo, Peixuan

    2013-01-01

    RNA nanotechnology is a term that refers to the design, fabrication, and utilization of nanoparticles mainly composed of ribonucleic acids via bottom-up self-assembly. The packaging RNA (pRNA) of the bacteriophage phi29 DNA packaging motor has been developed into a nano-delivery platform. This protocol describes the synthesis, assembly, and functionalization of pRNA nanoparticles based on three ‘toolkits’ derived from pRNA structural features: interlocking loops for hand-in-hand interactions, palindrome sequences for foot-to-foot interactions, and an RNA three-way junction for branch-extension. siRNAs, ribozymes, aptamers, chemical ligands, fluorophores, and other functionalities can also be fused to the pRNA prior to the assembly of the nanoparticles, so as to ensure the production of homogeneous nanoparticles and the retention of appropriate folding and function of the incorporated modules. The resulting self-assembled multivalent pRNA nanoparticles are thermodynamically and chemically stable, and they remain intact at ultra-low concentrations. Gene silencing effects are progressively enhanced with increasing number of siRNA in each pRNA nanoparticle. Systemic injection of the pRNA nanoparticles into xenograft-bearing mice has revealed strong binding to tumors without accumulation in vital organs or tissues. The pRNA-based nano-delivery scaffold paves a new way towards nanotechnological application of pRNA-based nanoparticles for disease detection and treatment. The time required for completing one round of this protocol is 3–4 weeks, including in vitro functional assays, or 2–3 months including in vivo studies. PMID:23928498

  19. High-throughput annotation of full-length long noncoding RNAs with capture long-read sequencing.

    PubMed

    Lagarde, Julien; Uszczynska-Ratajczak, Barbara; Carbonell, Silvia; Pérez-Lluch, Sílvia; Abad, Amaya; Davis, Carrie; Gingeras, Thomas R; Frankish, Adam; Harrow, Jennifer; Guigo, Roderic; Johnson, Rory

    2017-12-01

    Accurate annotation of genes and their transcripts is a foundation of genomics, but currently no annotation technique combines throughput and accuracy. As a result, reference gene collections remain incomplete: many gene models are fragmentary, and thousands more remain uncataloged, particularly for long noncoding RNAs (lncRNAs). To accelerate lncRNA annotation, the GENCODE consortium has developed RNA Capture Long Seq (CLS), which combines targeted RNA capture with third-generation long-read sequencing. Here we present an experimental reannotation of the GENCODE intergenic lncRNA populations in matched human and mouse tissues that resulted in novel transcript models for 3,574 and 561 gene loci, respectively. CLS approximately doubled the annotated complexity of targeted loci, outperforming existing short-read techniques. Full-length transcript models produced by CLS enabled us to definitively characterize the genomic features of lncRNAs, including promoter and gene structure, and protein-coding potential. Thus, CLS removes a long-standing bottleneck in transcriptome annotation and generates manual-quality full-length transcript models at high-throughput scales.

  20. [Novel hybrid inhibitors of the phage T7 RNA polymerase: synthesis, docking and screening in vitro].

    PubMed

    Kostina, V H; Pal'chykovs'ka, L H; Platonov, M O; Vasyl'chenko, O V; Lysenko, N A; Alekseeva, I V

    2012-01-01

    A number of new hybrid heteroaromatic compounds, consisting of tricyclic fragments (acridone, thioxanthone and phenazine) and bicyclic fragments (benzimidazole, benzothiazole and benzoxazole), were synthesized using a method developed by the authors. Screening against the transcription model system of the phage T7 DNA-dependent RNA polymerase detected three effective inhibitors of RNA synthesis, with IC50 values of 8.9, 5.7 and 19.8 µM. To cast light on the mode of interaction between the synthesized compounds and the target, molecular docking was applied to the model pocket of the phage T7 RNA polymerase transcription complex. It was established that these ligands form networks of H-bonds with conserved amino acid residues of the pocket and a pi-interaction with the Mg2+ ion. A planar geometry of the hybrid molecules, maintained by intramolecular H-bonds, proved to be an important structural feature that correlates with efficacious inhibitory activity.

  1. CHRONOS: a time-varying method for microRNA-mediated subpathway enrichment analysis.

    PubMed

    Vrahatis, Aristidis G; Dimitrakopoulou, Konstantina; Balomenos, Panos; Tsakalidis, Athanasios K; Bezerianos, Anastasios

    2016-03-15

    In the era of network medicine and the rapid growth of paired time series mRNA/microRNA expression experiments, there is an urgent need for pathway enrichment analysis methods able to capture the time- and condition-specific 'active parts' of the biological circuitry as well as the microRNA impact. Current methods ignore the multiple dynamical 'themes' (in the form of enriched biologically relevant microRNA-mediated subpathways) that determine the functionality of signaling networks across time. To address these challenges, we developed time-vaRying enriCHment integrOmics Subpathway aNalysis tOol (CHRONOS) by integrating time series mRNA/microRNA expression data with KEGG pathway maps and microRNA-target interactions. Specifically, microRNA-mediated subpathway topologies are extracted and evaluated based on the temporal transition and the fold change activity of the linked genes/microRNAs. Further, we provide measures that capture the structural and functional features of subpathways in relation to the complete organism pathway atlas. Our application to synthetic and real data shows that CHRONOS outperforms current subpathway-based methods in unraveling the inherent dynamic properties of pathways. CHRONOS is freely available at http://biosignal.med.upatras.gr/chronos/. Contact: tassos.bezerianos@nus.edu.sg. Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.

  2. Conserved RNA binding activity of a Yin-Yang 1 homologue in the ova of the purple sea urchin Strongylocentrotus purpuratus.

    PubMed

    Belak, Zachery R; Ovsenek, Nicholas; Eskiw, Christopher H

    2018-05-23

    Yin-Yang 1 (YY1) is a highly conserved transcription factor possessing RNA-binding activity. A putative YY1 homologue was previously identified in the developmental model organism Strongylocentrotus purpuratus (the purple sea urchin) by genomic sequencing. We identified a high degree of sequence similarity with YY1 homologues of vertebrate origin which shared 100% protein sequence identity over the DNA- and RNA-binding zinc-finger region with high similarity in the N-terminal transcriptional activation domain. SpYY1 demonstrated identical DNA- and RNA-binding characteristics between Xenopus laevis and S. purpuratus indicating that it maintains similar functional and biochemical properties across widely divergent deuterostome species. SpYY1 binds to the consensus YY1 DNA element, and also to U-rich RNA sequences. Although we detected SpYY1 RNA-binding activity in ova lysates and observed cytoplasmic localization, SpYY1 was not associated with maternal mRNA in ova. SpYY1 expressed in Xenopus oocytes was excluded from the nucleus and associated with maternally expressed cytoplasmic mRNA molecules. These data demonstrate the existence of a YY1 homologue in S. purpuratus with similar structural and biochemical features to those of the well-studied vertebrate YY1; however, the data reveal major differences in the biological role of YY1 in the regulation of maternally expressed mRNA in the two species.

  3. RNAiFold 2.0: a web server and software to design custom and Rfam-based RNA molecules.

    PubMed

    Garcia-Martin, Juan Antonio; Dotu, Ivan; Clote, Peter

    2015-07-01

    Several algorithms for RNA inverse folding have been used to design synthetic riboswitches, ribozymes and thermoswitches, whose activity has been experimentally validated. The RNAiFold software is unique among approaches for inverse folding in that (exhaustive) constraint programming is used instead of heuristic methods. For that reason, RNAiFold can generate all sequences that fold into the target structure or determine that there is no solution. RNAiFold 2.0 is a complete overhaul of RNAiFold 1.0, rewritten from the now defunct COMET language to C++. The new code properly extends the capabilities of its predecessor by providing a user-friendly pipeline to design synthetic constructs having the functionality of given Rfam families. In addition, the new software supports amino acid constraints, even for proteins translated in different reading frames from overlapping coding sequences; moreover, structure compatibility/incompatibility constraints have been expanded. With these features, RNAiFold 2.0 allows the user to design single RNA molecules as well as hybridization complexes of two RNA molecules. The web server, source code and Linux binaries are publicly accessible at http://bioinformatics.bc.edu/clotelab/RNAiFold2.0. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  4. How the Sequence of a Gene Specifies Structural Symmetry in Proteins

    PubMed Central

    Shen, Xiaojuan; Huang, Tongcheng; Wang, Guanyu; Li, Guanglin

    2015-01-01

    Internal symmetry is commonly observed in the majority of fundamental protein folds. Meanwhile, sufficient evidence suggests that nascent polypeptide chains of proteins have the potential to start the co-translational folding process and this process allows mRNA to contain additional information on protein structure. In this paper, we study the relationship between gene sequences and protein structures from the viewpoint of symmetry to explore how gene sequences code for structural symmetry in proteins. We found that, for a set of two-fold symmetric proteins from left-handed beta-helix fold, intragenic symmetry always exists in their corresponding gene sequences. Meanwhile, codon usage bias and local mRNA structure might be involved in modulating translation speed for the formation of structural symmetry: a major decrease of local codon usage bias in the middle of the codon sequence can be identified as a common feature; and major or consecutive decreases in local mRNA folding energy near the boundaries of the symmetric substructures can also be observed. The results suggest that gene duplication and fusion may be an evolutionarily conserved process for this protein fold. In addition, the usage of rare codons and the formation of higher order of secondary structure near the boundaries of symmetric substructures might have coevolved as conserved mechanisms to slow down translation elongation and to facilitate effective folding of symmetric substructures. These findings provide valuable insights into our understanding of the mechanisms of translation and its evolution, as well as the design of proteins via symmetric modules. PMID:26641668

  5. A twice-as-smart synthetic G-quartet: PyroTASQ is both a smart quadruplex ligand and a smart fluorescent probe.

    PubMed

    Laguerre, Aurélien; Stefan, Loic; Larrouy, Manuel; Genest, David; Novotna, Jana; Pirrotta, Marc; Monchaud, David

    2014-09-03

    Recent and unambiguous evidence of the formation of DNA and RNA G-quadruplexes in cells has provided solid support for these structures to be considered as valuable targets in oncology. Beyond this, it has lent further credence to the anticancer strategies relying on small molecules that selectively target these higher-order DNA/RNA architectures, referred to as G-quadruplex ligands. It has also shed bright light on the necessity of designing multitasking ligands, displaying not only enticing quadruplex interacting properties (affinity, structural selectivity) but also additional features that make them usable for detecting quadruplexes in living cells, notably for determining whether, when, and where these structures fold and unfold during the cell cycle and also for better assessing the consequences of their stabilization by external agents. Herein, we report a brand new design of such multitasking ligands, whose structure experiences a quadruplex-promoted conformational switch that triggers not only its quadruplex affinity (i.e., smart ligands, which display high affinity and selectivity for DNA/RNA quadruplexes) but also its fluorescence (i.e., smart probes, which behave as selective light-up fluorescent reporters on the basis of a fluorogenic electron redistribution). The first prototype of such multifunctional ligands, termed PyroTASQ, represents a brand new generation of quadruplex ligands that can be referred to as "twice-as-smart" quadruplex ligands.

  6. Structural basis for translational surveillance by the large ribosomal subunit-associated protein quality control complex

    PubMed Central

    Lyumkis, Dmitry; Oliveira dos Passos, Dario; Tahara, Erich B.; Webb, Kristofor; Bennett, Eric J.; Vinterbo, Staal; Potter, Clinton S.; Carragher, Bridget; Joazeiro, Claudio A. P.

    2014-01-01

    All organisms have evolved mechanisms to manage the stalling of ribosomes upon translation of aberrant mRNA. In eukaryotes, the large ribosomal subunit-associated quality control complex (RQC), composed of the listerin/Ltn1 E3 ubiquitin ligase and cofactors, mediates the ubiquitylation and extraction of ribosome-stalled nascent polypeptide chains for proteasomal degradation. How RQC recognizes stalled ribosomes and performs its functions has not been understood. Using single-particle cryoelectron microscopy, we have determined the structure of the RQC complex bound to stalled 60S ribosomal subunits. The structure establishes how Ltn1 associates with the large ribosomal subunit and properly positions its E3-catalytic RING domain to mediate nascent chain ubiquitylation. The structure also reveals that a distinguishing feature of stalled 60S particles is an exposed, nascent chain-conjugated tRNA, and that the Tae2 subunit of RQC, which facilitates Ltn1 binding, is responsible for selective recognition of stalled 60S subunits. RQC components are engaged in interactions across a large span of the 60S subunit surface, connecting the tRNA in the peptidyl transferase center to the distally located nascent chain tunnel exit. This work provides insights into a mechanism linking translation and protein degradation that targets defective proteins immediately after synthesis, while ignoring nascent chains in normally translating ribosomes. PMID:25349383

  7. Identification of More Feasible MicroRNA-mRNA Interactions within Multiple Cancers Using Principal Component Analysis Based Unsupervised Feature Extraction.

    PubMed

    Taguchi, Y-H

    2016-05-10

    MicroRNA(miRNA)-mRNA interactions are important for understanding many biological processes, including development, differentiation and disease progression, but their identification is highly context-dependent. When computationally derived from sequence information alone, the identification should be verified by integrated analyses of mRNA and miRNA expression. The drawback of this strategy is the vast number of identified interactions, which prevents an experimental or detailed investigation of each pair. In this paper, we overcome this difficulty by the recently proposed principal component analysis (PCA)-based unsupervised feature extraction (FE), which reduces the number of identified miRNA-mRNA interactions that properly discriminate between patients and healthy controls without losing biological feasibility. The approach is applied to six cancers: hepatocellular carcinoma, non-small cell lung cancer, esophageal squamous cell carcinoma, prostate cancer, colorectal/colon cancer and breast cancer. In PCA-based unsupervised FE, the significance does not depend on the number of samples (as in the standard case) but on the number of features, which approximates the number of miRNAs/mRNAs. To our knowledge, we have newly identified miRNA-mRNA interactions in multiple cancers based on a single common (universal) criterion. Moreover, the number of identified interactions was sufficiently small to be sequentially curated by literature searches.
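
    The selection idea in the abstract above can be illustrated with a small sketch. It is not the author's implementation: it assumes that features (rows) are scored along the first principal component of per-sample standardized data, that the scores are treated as Gaussian so that outliers get small chi-squared p-values, and that a Benjamini-Hochberg correction with a threshold `alpha` decides what is kept. The function names, `alpha`, and the toy matrix are all hypothetical.

```python
import math
import random


def standardize_columns(X):
    """Scale each sample (column) to zero mean, unit variance across features."""
    n_feat, n_samp = len(X), len(X[0])
    out = [row[:] for row in X]
    for j in range(n_samp):
        col = [X[i][j] for i in range(n_feat)]
        mean = sum(col) / n_feat
        sd = math.sqrt(sum((x - mean) ** 2 for x in col) / n_feat) or 1.0
        for i in range(n_feat):
            out[i][j] = (X[i][j] - mean) / sd
    return out


def leading_sample_loading(X, iters=200):
    """First principal axis in sample space, by power iteration on X^T X."""
    n = len(X[0])
    gram = [[sum(r[a] * r[b] for r in X) for b in range(n)] for a in range(n)]
    v = [1.0 / (k + 1) for k in range(n)]  # arbitrary non-uniform start vector
    for _ in range(iters):
        w = [sum(gram[a][b] * v[b] for b in range(n)) for a in range(n)]
        norm = math.sqrt(sum(x * x for x in w)) or 1.0
        v = [x / norm for x in w]
    return v


def benjamini_hochberg(pvals):
    """Benjamini-Hochberg adjusted p-values, returned in the input order."""
    m = len(pvals)
    order = sorted(range(m), key=lambda i: pvals[i])
    adjusted = [0.0] * m
    running_min = 1.0
    for rank in range(m, 0, -1):  # walk from the largest p-value down
        i = order[rank - 1]
        running_min = min(running_min, pvals[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def pca_unsupervised_fe(X, alpha=0.01):
    """Return indices of features whose PC1 scores are significant outliers."""
    Z = standardize_columns(X)
    loading = leading_sample_loading(Z)
    scores = [sum(z * l for z, l in zip(row, loading)) for row in Z]
    mean = sum(scores) / len(scores)
    sd = math.sqrt(sum((s - mean) ** 2 for s in scores) / len(scores)) or 1.0
    # Upper tail of chi-squared (1 d.o.f.) at z^2 equals erfc(|z| / sqrt(2)).
    pvals = [math.erfc(abs(s - mean) / (sd * math.sqrt(2))) for s in scores]
    adjusted = benjamini_hochberg(pvals)
    return [i for i, q in enumerate(adjusted) if q < alpha]


def toy_matrix(n_features=200, n_planted=5, seed=0):
    """Hypothetical data: Gaussian noise plus a few class-dependent features."""
    rng = random.Random(seed)
    X = [[rng.gauss(0, 1) for _ in range(6)] for _ in range(n_features)]
    for i in range(n_planted):
        shift = [20.0, 20.0, 20.0, -20.0, -20.0, -20.0]
        X[i] = [x + s for x, s in zip(X[i], shift)]
    return X
```

    Calling `pca_unsupervised_fe(toy_matrix())` returns the indices of the planted rows. Because the p-values are attributed to features rather than samples, the multiple-testing burden in this sketch scales with the number of rows, which is in line with the abstract's remark that significance depends on the number of features.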

  8. A lack of Birbeck granules in Langerhans cells is associated with a naturally occurring point mutation in the human Langerin gene.

    PubMed

    Verdijk, Pauline; Dijkman, Remco; Plasmeijer, Elsemieke I; Mulder, Aat A; Zoutman, Willem H; Mieke Mommaas, A; Tensen, Cornelis P

    2005-04-01

    A heterozygous mutation in the Langerin gene corresponding to position 837 in the Langerin mRNA was identified in a person deficient in Birbeck granules (BG). This mutation results in an amino acid replacement of tryptophan by arginine at position 264 in the carbohydrate recognition domain of the Langerin protein. Expression of mutated Langerin in human fibroblasts induces tubular-like structures that are negative for BG-specific antibodies and do not resemble the characteristic structural features of BG.

  9. Carbonate Biogenic Structures in Storrs Lake, Bahamas

    NASA Technical Reports Server (NTRS)

    Byrne, Monica; Morris, Penny A.; Wentworth, Susan J.; Brigmon, Robin L.; McKay, David S.

    2001-01-01

    Storr's Lake, an inland hypersaline lake on San Salvador Island, Bahamas, contains calcium carbonate-rich lithified mats of filamentous microorganisms, diatoms, associated photosynthetic and chemotrophic bacteria, and trapped sediment. In addition, 16S rRNA analysis indicates the presence of five sulfur-reducing genera of bacteria. These microbes are potential modern-day analogs to some ancient stromatolitic structures. The goals of this study are to identify unique compositional and biogenic features, possibly correlating some of these with some of the sulfate-reducing bacteria. Additional information is contained in the original extended abstract.

  10. Functional and genetic plasticities of the poliovirus genome: quasi-infectious RNAs modified in the 5'-untranslated region yield a variety of pseudorevertants.

    PubMed Central

    Gmyl, A P; Pilipenko, E V; Maslova, S V; Belov, G A; Agol, V I

    1993-01-01

    Poliovirus RNA species with nucleotides 564 to 571 deleted or with a secondary structure domain (positions 564 to 629) replaced by a shorter irregular oligonucleotide have been engineered previously; these RNAs have been considered quasi-infectious (yielding a single late revertant plaque) and dead, respectively (E. Pilipenko, A. Gmyl, Y. Svitkin, S. Maslova, A. Sinyakov, and V. Agol, Cell 68:119-131, 1992). By using large amounts of these RNAs for transfections, revertant clones with a great variety of genetic changes (point mutations, insertions of foreign sequences, short or extended deletions) were isolated. The pattern of these changes supported the notion that an appropriately spaced oligopyrimidine-AUG tandem is important for efficient poliovirus RNA translation. Structural features within and around this tandem modulated the initiation efficiency. The functional and genetic plasticities of the poliovirus genome are briefly discussed. PMID:8396686

  11. RNA-SSPT: RNA Secondary Structure Prediction Tools.

    PubMed

    Ahmad, Freed; Mahboob, Shahid; Gulzar, Tahsin; Din, Salah U; Hanif, Tanzeela; Ahmad, Hifza; Afzal, Muhammad

    2013-01-01

    The prediction of RNA structure is useful for understanding evolution for both in silico and in vitro studies. Physical methods such as NMR for determining RNA secondary structure are expensive and difficult. Computational RNA secondary structure prediction is easier. Comparative sequence analysis provides the best solution, but secondary structure prediction of a single RNA sequence is challenging. RNA-SSPT is a tool that computationally predicts the secondary structure of a single RNA sequence. Most RNA secondary structure prediction tools do not allow pseudoknots in the structure or are unable to locate them. The Nussinov dynamic programming algorithm has been implemented in RNA-SSPT. The current study shows that only the energetically most favorable secondary structure is required, and a modification of the algorithm is also available that produces base pairs so as to lower the total free energy of the secondary structure. For visualization of RNA secondary structure, NAVIEW, written in C, is used and was modified in C# to meet the tool's requirements. RNA-SSPT is built in C# using .NET 2.0 in Microsoft Visual Studio 2005 Professional edition. The accuracy of RNA-SSPT is tested in terms of sensitivity and positive predictive value. It is a tool which serves both secondary structure prediction and secondary structure visualization purposes.
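
    For readers unfamiliar with the Nussinov algorithm named in this abstract, a minimal Python sketch of the textbook version follows. It maximizes the number of nested base pairs and does not reproduce RNA-SSPT's C# code or its free-energy modification; the allowed pairs (Watson-Crick plus G-U wobble), the `min_loop` parameter, and the function names are assumptions made for illustration.

```python
# Allowed pairs: Watson-Crick plus G-U wobble (an assumption of this sketch).
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}


def nussinov(seq, min_loop=3):
    """Return a sorted list of (i, j) pairs maximizing the nested pair count."""
    n = len(seq)
    dp = [[0] * n for _ in range(n)]  # dp[i][j]: max pairs within seq[i..j]
    for span in range(min_loop + 1, n):
        for i in range(n - span):
            j = i + span
            best = dp[i][j - 1]  # case 1: position j stays unpaired
            for k in range(i, j - min_loop):  # case 2: j pairs with some k
                if (seq[k], seq[j]) in PAIRS:
                    left = dp[i][k - 1] if k > i else 0
                    best = max(best, left + 1 + dp[k + 1][j - 1])
            dp[i][j] = best

    # Traceback with an explicit stack to recover one optimal structure.
    pairs, stack = [], [(0, n - 1)]
    while stack:
        i, j = stack.pop()
        if j - i <= min_loop:
            continue
        if dp[i][j] == dp[i][j - 1]:
            stack.append((i, j - 1))
            continue
        for k in range(i, j - min_loop):
            if (seq[k], seq[j]) in PAIRS:
                left = dp[i][k - 1] if k > i else 0
                if left + 1 + dp[k + 1][j - 1] == dp[i][j]:
                    pairs.append((k, j))
                    if k > i:
                        stack.append((i, k - 1))
                    stack.append((k + 1, j - 1))
                    break
    return sorted(pairs)


def to_dot_bracket(n, pairs):
    """Render a list of pairs as a dot-bracket string of length n."""
    chars = ["."] * n
    for i, j in pairs:
        chars[i], chars[j] = "(", ")"
    return "".join(chars)
```

    For example, `nussinov("GGGAAAUCC")` yields the pairs (0, 8), (1, 7) and (2, 6), which `to_dot_bracket` renders as `(((...)))`. Because the recursion only considers nested pairs, pseudoknots cannot be represented in this formulation.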

  12. RNA-SSPT: RNA Secondary Structure Prediction Tools

    PubMed Central

    Ahmad, Freed; Mahboob, Shahid; Gulzar, Tahsin; din, Salah U; Hanif, Tanzeela; Ahmad, Hifza; Afzal, Muhammad

    2013-01-01

    The prediction of RNA structure is useful for understanding evolution for both in silico and in vitro studies. Physical methods such as NMR for determining RNA secondary structure are expensive and difficult. Computational RNA secondary structure prediction is easier. Comparative sequence analysis provides the best solution, but secondary structure prediction of a single RNA sequence is challenging. RNA-SSPT is a tool that computationally predicts the secondary structure of a single RNA sequence. Most RNA secondary structure prediction tools do not allow pseudoknots in the structure or are unable to locate them. The Nussinov dynamic programming algorithm has been implemented in RNA-SSPT. The current study shows that only the energetically most favorable secondary structure is required, and a modification of the algorithm is also available that produces base pairs so as to lower the total free energy of the secondary structure. For visualization of RNA secondary structure, NAVIEW, written in C, is used and was modified in C# to meet the tool's requirements. RNA-SSPT is built in C# using .NET 2.0 in Microsoft Visual Studio 2005 Professional edition. The accuracy of RNA-SSPT is tested in terms of sensitivity and positive predictive value. It is a tool which serves both secondary structure prediction and secondary structure visualization purposes. PMID:24250115
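
    The two accuracy measures mentioned at the end of this abstract are commonly computed over base pairs: sensitivity is the fraction of reference pairs that were predicted, and positive predictive value is the fraction of predicted pairs that appear in the reference. The sketch below assumes structures are given in dot-bracket notation without pseudoknots; the function names are hypothetical and the abstract does not say how RNA-SSPT itself computes these values.

```python
def dot_bracket_pairs(structure):
    """Parse a pseudoknot-free dot-bracket string into a set of (i, j) pairs."""
    stack, pairs = [], set()
    for pos, ch in enumerate(structure):
        if ch == "(":
            stack.append(pos)
        elif ch == ")":
            if not stack:
                raise ValueError("unbalanced structure: unmatched ')'")
            pairs.add((stack.pop(), pos))
    if stack:
        raise ValueError("unbalanced structure: unmatched '('")
    return pairs


def sensitivity_ppv(reference, predicted):
    """Return (sensitivity, PPV) of predicted base pairs against a reference."""
    ref = dot_bracket_pairs(reference)
    pred = dot_bracket_pairs(predicted)
    tp = len(ref & pred)   # pairs present in both structures
    fn = len(ref - pred)   # reference pairs that were missed
    fp = len(pred - ref)   # predicted pairs absent from the reference
    # Report 0.0 when a denominator would be zero (a convention of this sketch).
    sensitivity = tp / (tp + fn) if ref else 0.0
    ppv = tp / (tp + fp) if pred else 0.0
    return sensitivity, ppv
```

    With reference `(((...)))` and prediction `((.....))`, two of the three reference pairs are recovered and no wrong pair is predicted, so the sensitivity is 2/3 and the positive predictive value is 1.0.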

  13. T box transcription antitermination riboswitch: Influence of nucleotide sequence and orientation on tRNA binding by the antiterminator element

    PubMed Central

    Fauzi, Hamid; Agyeman, Akwasi; Hines, Jennifer V.

    2008-01-01

    Many bacteria utilize riboswitch transcription regulation to monitor and appropriately respond to cellular levels of important metabolites or effector molecules. The T box transcription antitermination riboswitch responds to cognate uncharged tRNA by specifically stabilizing an antiterminator element in the 5′-untranslated mRNA leader region and precluding formation of a thermodynamically more stable terminator element. Stabilization occurs when the tRNA acceptor end base pairs with the first four nucleotides in the seven nucleotide bulge of the highly conserved antiterminator element. The significance of the conservation of the antiterminator bulge nucleotides that do not base pair with the tRNA is unknown, but they are required for optimal function. In vitro selection was used to determine if the isolated antiterminator bulge context alone dictates the mode in which the tRNA acceptor end binds the bulge nucleotides. No sequence conservation beyond complementarity was observed and the location was not constrained to the first four bases of the bulge. The results indicate that formation of a structure that recognizes the tRNA acceptor end in isolation is not the determinant driving force for the high phylogenetic sequence conservation observed within the antiterminator bulge. Additional factors or T box leader features more likely influenced the phylogenetic sequence conservation. PMID:19152843

  14. A common class of transcripts with 5'-intron depletion, distinct early coding sequence features, and N1-methyladenosine modification.

    PubMed

    Cenik, Can; Chua, Hon Nian; Singh, Guramrit; Akef, Abdalla; Snyder, Michael P; Palazzo, Alexander F; Moore, Melissa J; Roth, Frederick P

    2017-03-01

    Introns are found in 5' untranslated regions (5'UTRs) for 35% of all human transcripts. These 5'UTR introns are not randomly distributed: Genes that encode secreted, membrane-bound and mitochondrial proteins are less likely to have them. Curiously, transcripts lacking 5'UTR introns tend to harbor specific RNA sequence elements in their early coding regions. To model and understand the connection between coding-region sequence and 5'UTR intron status, we developed a classifier that can predict 5'UTR intron status with >80% accuracy using only sequence features in the early coding region. Thus, the classifier identifies transcripts with 5' proximal-intron-minus-like-coding regions ("5IM" transcripts). Unexpectedly, we found that the early coding sequence features defining 5IM transcripts are widespread, appearing in 21% of all human RefSeq transcripts. The 5IM class of transcripts is enriched for non-AUG start codons, more extensive secondary structure both preceding the start codon and near the 5' cap, greater dependence on eIF4E for translation, and association with ER-proximal ribosomes. 5IM transcripts are bound by the exon junction complex (EJC) at noncanonical 5' proximal positions. Finally, N1-methyladenosines are specifically enriched in the early coding regions of 5IM transcripts. Taken together, our analyses point to the existence of a distinct 5IM class comprising ∼20% of human transcripts. This class is defined by depletion of 5' proximal introns, presence of specific RNA sequence features associated with low translation efficiency, N1-methyladenosines in the early coding region, and enrichment for noncanonical binding by the EJC. © 2017 Cenik et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  15. Targeted polymeric micelles for siRNA treatment of experimental cancer by intravenous injection.

    PubMed

    Christie, R James; Matsumoto, Yu; Miyata, Kanjiro; Nomoto, Takahiro; Fukushima, Shigeto; Osada, Kensuke; Halnaut, Julien; Pittella, Frederico; Kim, Hyun Jin; Nishiyama, Nobuhiro; Kataoka, Kazunori

    2012-06-26

    Small interfering ribonucleic acid (siRNA) cancer therapies administered by intravenous injection require a delivery system for transport from the bloodstream into the cytoplasm of diseased cells to perform the function of gene silencing. Here we describe nanosized polymeric micelles that deliver siRNA to solid tumors and elicit a therapeutic effect. Stable multifunctional micelle structures on the order of 45 nm in size formed by spontaneous self-assembly of block copolymers with siRNA. Block copolymers used for micelle formation were designed and synthesized to contain three main features: a siRNA binding segment containing thiols, a hydrophilic nonbinding segment, and a cell-surface binding peptide. Specifically, poly(ethylene glycol)-block-poly(L-lysine) (PEG-b-PLL) comprising lysine amines modified with 2-iminothiolane (2IT) and the cyclo-Arg-Gly-Asp (cRGD) peptide on the PEG terminus was used. Modification of PEG-b-PLL with 2IT led to improved control of micelle formation and also increased stability in the blood compartment, while installation of the cRGD peptide improved biological activity. Incorporation of siRNA into stable micelle structures containing the cRGD peptide resulted in increased gene silencing ability, improved cell uptake, and broader subcellular distribution in vitro and also improved accumulation in both the tumor mass and tumor-associated blood vessels following intravenous injection into mice. Furthermore, stable and targeted micelles inhibited the growth of subcutaneous HeLa tumor models and demonstrated gene silencing in the tumor mass following treatment with antiangiogenic siRNAs. This new micellar nanomedicine could potentially expand the utility of siRNA-based therapies for cancer treatments that require intravenous injection.

  16. Integrated Cox's model for predicting survival time of glioblastoma multiforme.

    PubMed

    Ai, Zhibing; Li, Longti; Fu, Rui; Lu, Jing-Min; He, Jing-Dong; Li, Sen

    2017-04-01

    Glioblastoma multiforme is the most common primary brain tumor and is highly lethal. This study aims to identify signatures for predicting the survival time of patients with glioblastoma multiforme. Clinical information, messenger RNA expression, microRNA expression, and single-nucleotide polymorphism array data of patients with glioblastoma multiforme were retrieved from The Cancer Genome Atlas. Patients were separated into two groups by using 1 year as a cutoff, and a logistic regression model was used to identify any variables that can predict whether the patient was able to live longer than 1 year. Furthermore, Cox's model was used to find features that were correlated with the survival time. Finally, a Cox model integrating the significant clinical variables, messenger RNA expression, microRNA expression, and single-nucleotide polymorphisms was built. Although the classification method failed, signatures of clinical features, messenger RNA expression levels, and microRNA expression levels were identified by using Cox's model. However, no single-nucleotide polymorphisms related to prognosis were found. The selected clinical features were age at initial diagnosis, Karnofsky score, and race, all of which had been suggested to correlate with survival time. Both significant microRNAs, microRNA-221 and microRNA-222, target the p27Kip1 protein, which implies an important role of p27Kip1 in the prognosis of glioblastoma multiforme patients. Our results suggested that survival modeling was more suitable than classification for identifying prognostic biomarkers for patients with glioblastoma multiforme. An integrated model containing clinical features, messenger RNA levels, and microRNA expression levels was built, which has the potential to be used in clinics and thus to improve the survival status of glioblastoma multiforme patients.

  17. Molecular Mechanism of Scanning and Start Codon Selection in Eukaryotes

    PubMed Central

    Hinnebusch, Alan G.

    2011-01-01

    Summary: The correct translation of mRNA depends critically on the ability to initiate at the right AUG codon. For most mRNAs in eukaryotic cells, this is accomplished by the scanning mechanism, wherein the small (40S) ribosomal subunit attaches to the 5′ end of the mRNA and then inspects the leader base by base for an AUG in a suitable context, using complementarity with the anticodon of methionyl initiator tRNA (Met-tRNAiMet) as the key means of identifying AUG. Over the past decade, a combination of yeast genetics, biochemical analysis in reconstituted systems, and structural biology has enabled great progress in deciphering the mechanism of ribosomal scanning. A robust molecular model now exists, describing the roles of initiation factors, notably eukaryotic initiation factor 1 (eIF1) and eIF1A, in stabilizing an “open” conformation of the 40S subunit with Met-tRNAiMet bound in a low-affinity state conducive to scanning and in triggering rearrangement into a “closed” conformation incompatible with scanning, which features Met-tRNAiMet more tightly bound to the “P” site and base paired with AUG. It has also emerged that multiple DEAD-box RNA helicases participate in producing a single-stranded “landing pad” for the 40S subunit and in removing the secondary structure to enable the mRNA to traverse the 40S mRNA-binding channel in the single-stranded form for base-by-base inspection in the P site. PMID:21885680
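
    The base-by-base inspection of the leader described above can be illustrated with a toy scan. This is a minimal sketch, not the molecular model from the review: the function name `find_start_codon` is hypothetical, and the "suitable context" test is an assumed simplification of the Kozak consensus (a purine at position -3 or a G at position +4, counting the A of AUG as +1).

```python
def find_start_codon(mrna, require_context=True):
    """Scan 5'->3' and return the 0-based index of the first accepted AUG, or None.

    Assumption: an AUG counts as being in a "suitable context" when it has
    a purine (A/G) at -3 or a G at +4. This is a simplified Kozak-style rule
    chosen for illustration only.
    """
    mrna = mrna.upper().replace("T", "U")
    for i in range(len(mrna) - 2):
        if mrna[i:i + 3] != "AUG":
            continue  # keep scanning, one base at a time
        if not require_context:
            return i
        purine_at_minus3 = i >= 3 and mrna[i - 3] in "AG"
        g_at_plus4 = i + 3 < len(mrna) and mrna[i + 3] == "G"
        if purine_at_minus3 or g_at_plus4:
            return i
    return None


# Hypothetical leader: the first AUG (index 6) has U at -3 and C at +4,
# so the scan passes over it and accepts the second AUG (index 15).
leader = "GGCUUCAUGCCCACCAUGGCG"
print(find_start_codon(leader))                         # 15
print(find_start_codon(leader, require_context=False))  # 6
```

    Real initiation is probabilistic (leaky scanning) and factor-dependent; the hard accept/reject rule here only shows the order in which candidate codons are inspected.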

  18. Mutant swarms of a totivirus-like entities are present in the red macroalga Chondrus crispus and have been partially transferred to the nuclear genome.

    PubMed

    Rousvoal, Sylvie; Bouyer, Betty; López-Cristoffanini, Camilo; Boyen, Catherine; Collén, Jonas

    2016-08-01

    Chondrus crispus Stackhouse (Gigartinales) is a red seaweed found on North Atlantic rocky shores. Electrophoresis of RNA extracts showed a prominent band with a size of around 6,000 bp. Sequencing of the band revealed several sequences with similarity to totiviruses, double-stranded RNA viruses that normally infect fungi. This virus-like entity was named C. crispus virus (CcV). It should probably be regarded as an extreme viral quasispecies or a mutant swarm since low identity (<65%) was found between sequences. Totiviruses typically code for two genes: one capsid gene (gag) and one RNA-dependent RNA polymerase gene (pol) with a pseudoknot structure between the genes. Both the genes and the intergenic structures were found in the CcV sequences. A nonidentical gag gene was also found in the nuclear genome of C. crispus, with associated expressed sequence tags (EST) and upstream regulatory features. The gene was presumably horizontally transferred from the virus to the alga. Similar dsRNA bands were seen in extracts from different life cycle stages of C. crispus and from all geographic locations tested. In addition, similar bands were also observed in RNA extractions from other red algae; however, the significance of this apparently widespread phenomenon is unknown. Neither phenotype caused by the infection nor any virus particles or capsid proteins were identified; thus, the presence of viral particles has not been validated. These findings increase the known host range of totiviruses to include marine red algae. © 2016 Phycological Society of America.

  19. Optimized guide RNA structure for genome editing via Cas9

    PubMed Central

    Xu, Jianyong; Lian, Wei; Jia, Yuning; Li, Lingyun; Huang, Zhong

    2017-01-01

    The genome editing tool Cas9-gRNA (guide RNA) has been successfully applied in different cell types and organisms with high efficiency. However, more efforts need to be made to enhance both efficiency and specificity. In the current study, we optimized the guide RNA structure of the Streptococcus pyogenes CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats)/Cas (CRISPR-associated) system to improve its genome editing efficiency. Compared with the original functional structure of guide RNA, which is composed of crRNA and tracrRNA, the widely used chimeric gRNA has shorter crRNA and tracrRNA sequences. The deleted RNA sequence could form an extra loop structure, which might enhance the stability of the guide RNA structure and subsequently the genome editing efficiency. Thus the genome editing efficiency of different forms of guide RNA was tested. We found that the chimeric structure of gRNA with the original full length of crRNA and tracrRNA showed higher genome editing efficiency than the conventional chimeric structure or other types of gRNA we tested. Therefore our data uncover a new type of gRNA structure with higher genome editing efficiency. PMID:29212218

  20. Fusion genes with ALK as recurrent partner in ependymoma-like gliomas: a new brain tumor entity?

    PubMed Central

    Olsen, Thale Kristin; Panagopoulos, Ioannis; Meling, Torstein R.; Micci, Francesca; Gorunova, Ludmila; Thorsen, Jim; Due-Tønnessen, Bernt; Scheie, David; Lund-Iversen, Marius; Krossnes, Bård; Saxhaug, Cathrine; Heim, Sverre; Brandal, Petter

    2015-01-01

    Background We have previously characterized 19 ependymal tumors using Giemsa banding and high-resolution comparative genomic hybridization. The aim of this study was to analyze these tumors searching for fusion genes. Methods RNA sequencing was performed in 12 samples. Potential fusion transcripts were assessed by seed count and structural chromosomal aberrations. Transcripts of interest were validated using fluorescence in situ hybridization and PCR followed by direct sequencing. Results RNA sequencing identified rearrangements of the anaplastic lymphoma kinase gene (ALK) in 2 samples. Both tumors harbored structural aberrations involving the ALK locus 2p23. Tumor 1 had an unbalanced t(2;14)(p23;q22) translocation which led to the fusion gene KTN1-ALK. Tumor 2 had an interstitial del(2)(p16p23) deletion causing the fusion of CCDC88A and ALK. In both samples, the breakpoint of ALK was located between exons 19 and 20. Both patients were infants and both tumors were supratentorial. The tumors were well demarcated from surrounding tissue and had both ependymal and astrocytic features but were diagnosed and treated as ependymomas. Conclusions By combining karyotyping and RNA sequencing, we identified the 2 first ever reported ALK rearrangements in CNS tumors. Such rearrangements may represent the hallmark of a new entity of pediatric glioma characterized by both ependymal and astrocytic features. Our findings are of particular importance because crizotinib, a selective ALK inhibitor, has demonstrated effect in patients with lung cancer harboring ALK rearrangements. Thus, ALK emerges as an interesting therapeutic target in patients with ependymal tumors carrying ALK fusions. PMID:25795305

  1. Whole mitochondrial genome sequence for an osteoarthritis model of Guinea pig (Caviidae; Cavia).

    PubMed

    Cui, Xin-Gang; Liu, Cheng-Yao; Wei, Bo; Zhao, Wen-Jian; Zhang, Wen-Feng

    2016-11-01

    Animal models play an important role in osteoarthritis studies. Here, the complete mitochondrial genome sequence of the Guinea pig is reported for the first time. The total length of the mitogenome was 16,797 bp. It contained the typical structure, including two ribosomal RNA genes, 13 protein-coding genes, 22 transfer RNA genes and one non-coding control region (D-loop region). The overall composition of the mitogenome was estimated to be 34.9% for A, 26.1% for T, 26.0% for C and 13.0% for G, showing an A-T-rich (61.0%) feature. This mitochondrial genome sequence will provide a new genetic resource for osteoarthritis research.
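
    The A-T figure follows directly from the reported base percentages (34.9% + 26.1% = 61.0%). A minimal sketch of that arithmetic, together with a composition calculation that could be run on any sequence; the helper names `base_composition` and `at_content` are illustrative and not taken from the paper.

```python
from collections import Counter


def base_composition(seq):
    """Return the percentage of A, C, G and T in a DNA sequence.

    Characters other than A/C/G/T (for example N) are ignored.
    """
    counts = Counter(b for b in seq.upper() if b in "ACGT")
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no A/C/G/T bases")
    return {b: 100.0 * counts[b] / total for b in "ACGT"}


def at_content(composition):
    """A+T percentage from a composition dictionary."""
    return composition["A"] + composition["T"]


# Percentages as reported in the abstract.
reported = {"A": 34.9, "T": 26.1, "C": 26.0, "G": 13.0}
print(round(at_content(reported), 1))     # 61.0
print(round(sum(reported.values()), 1))   # 100.0
```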

  2. PDZ Binding Domains, Structural Disorder and Phosphorylation: A Menage-a-trois Tailing Dcp2 mRNA Decapping Enzymes.

    PubMed

    Gunawardana, Dilantha

    2016-01-01

    Diverse cellular activities are mediated through the interaction of protein domains and their binding partners. One such protein domain widely distributed in the higher metazoan world is the PDZ domain, which facilitates abundant protein-protein interactions. The PDZ domain-PDZ binding domain interaction has been implicated in several pathologies including Alzheimer's disease, Parkinson's disease and Down syndrome. PDZ domains bind to C-terminal peptides/proteins which have one of the following combinations: S/T-X-hydrophobic-COOH for type I, hydrophobic-X-hydrophobic-COOH for type II, and D/E-X-hydrophobic-COOH for type III, although hydrophobicity at the terminus forms the key characteristic of the PDZ-binding domains. We identified and characterized a Dcp2-type mRNA decapping enzyme from Arabidopsis thaliana, a protein containing a putative PDZ-binding domain, using mutagenesis and protein biochemistry. Now we are using bioinformatics to study the C-terminal end of mRNA decapping enzymes from complex metazoans with the aim of (1) identifying putative PDZ-binding domains, (2) correlating structural disorder with PDZ-binding domains and (3) demonstrating the presence of phosphorylation sites in the C-terminal extremities of Dcp2-type mRNA decapping enzymes. It is proposed here that the trinity of PDZ-binding domains, structural disorder and phosphorylation-susceptible sites is a feature of the Dcp2 family of decapping enzymes and perhaps is a wider trick in protein evolution where scaffolding/tethering is a requirement for localization and function. It is critical, though, that laboratory-based supporting evidence is sought to back up this bioinformatics exploration into the tail regions of mRNA decapping enzymes.
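
    The three C-terminal motif classes listed above can be written as a small lookup on the last three residues. This is a sketch under stated assumptions: the function name `classify_pdz_motif`, the choice of which amino acids count as "hydrophobic", and the example peptides are all hypothetical and are not taken from the paper.

```python
# Assumption: residues treated as hydrophobic for this illustration.
# Different studies draw this boundary differently.
HYDROPHOBIC = set("AVLIMFWC")


def classify_pdz_motif(protein):
    """Classify the C-terminal tripeptide as PDZ-binding type I, II or III.

    Type I:   S/T - X - hydrophobic - COOH
    Type II:  hydrophobic - X - hydrophobic - COOH
    Type III: D/E - X - hydrophobic - COOH
    Returns None when the tail matches none of the three patterns.
    """
    tail = protein.upper()[-3:]
    if len(tail) < 3:
        return None
    p_minus2, _, p0 = tail  # position -2, any residue X, C-terminal residue
    if p0 not in HYDROPHOBIC:
        return None
    if p_minus2 in "ST":
        return "I"
    if p_minus2 in HYDROPHOBIC:
        return "II"
    if p_minus2 in "DE":
        return "III"
    return None


# Hypothetical peptide tails used only to exercise each branch.
for peptide in ("QETSV", "GSVKI", "KEDSL", "QETSK"):
    print(peptide, classify_pdz_motif(peptide))
```

    A match found this way only marks a candidate site; as the abstract itself stresses, laboratory evidence is still needed to confirm binding.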

  3. Conservation and variability in the structure and function of the Cas5d endoribonuclease in the CRISPR-mediated microbial immune system.

    PubMed

    Koo, Yoon; Ka, Donghyun; Kim, Eun-Jin; Suh, Nayoung; Bae, Euiyoung

    2013-10-23

    Clustered regularly interspaced short palindromic repeats (CRISPRs) and CRISPR-associated (Cas) proteins form an RNA-mediated microbial immune system against invading foreign genetic elements. Cas5 proteins constitute one of the most prevalent Cas protein families in CRISPR-Cas systems and are predicted to have RNA recognition motif (RRM) domains. Cas5d is a subtype I-C-specific Cas5 protein that can be divided into two distinct subgroups, one of which has extra C-terminal residues while the other contains a longer insertion in the middle of its N-terminal RRM domain. Here, we report crystal structures of Cas5d from Streptococcus pyogenes and Xanthomonas oryzae, which respectively represent the two Cas5d subgroups. Despite a common domain architecture consisting of an N-terminal RRM domain and a C-terminal β-sheet domain, the structural differences between the two Cas5d proteins are highlighted by the presence of a unique extended helical region protruding from the N-terminal RRM domain of X. oryzae Cas5d. We also demonstrate that Cas5d proteins possess not only specific endoribonuclease activity for CRISPR RNAs but also nonspecific double-stranded DNA binding affinity. These findings suggest that Cas5d may play multiple roles in CRISPR-mediated immunity. Furthermore, the specific RNA processing was also observed between S. pyogenes Cas5d protein and X. oryzae CRISPR RNA and vice versa. This cross-species activity of Cas5d provides a special opportunity for elucidating conserved features of the CRISPR RNA processing event. Copyright © 2013 Elsevier Ltd. All rights reserved.

  4. Identifying 5-methylcytosine sites in RNA sequence using composite encoding feature into Chou's PseKNC.

    PubMed

    Sabooh, M Fazli; Iqbal, Nadeem; Khan, Mukhtaj; Khan, Muslim; Maqbool, H F

    2018-05-01

    This study examines an accurate and efficient computational method for the identification of 5-methylcytosine sites in RNA. The occurrence of 5-methylcytosine (m5C) plays a vital role in a number of biological processes. For better comprehension of the biological functions and mechanism, it is necessary to recognize m5C sites in RNA precisely. Laboratory techniques and procedures are available to identify m5C sites in RNA, but these procedures require a lot of time and resources. This study develops a new computational method for extracting the features of an RNA sequence. In this method, the RNA sequence is first encoded via a composite feature vector; then, for the selection of discriminative features, the minimum-redundancy-maximum-relevance algorithm is used. Second, classification is performed with a support vector machine evaluated by the jackknife cross-validation test. The suggested method efficiently distinguishes m5C sites from non-m5C sites, and the accuracy of the suggested algorithm is 93.33%, with a sensitivity of 90.0% and a specificity of 96.66%, on benchmark datasets. The results show that the proposed algorithm achieves significantly better identification performance than the existing computational techniques. This study extends the knowledge about the occurrence sites of RNA modification, which paves the way for better comprehension of the biological uses and mechanism. Copyright © 2018 Elsevier Ltd. All rights reserved.
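
    The three reported figures are mutually consistent if the benchmark holds equal numbers of positive and negative sites, because overall accuracy is the class-size-weighted mean of sensitivity and specificity. A minimal sketch of that check; the equal class sizes are an assumption made here, not something stated in the abstract, and `accuracy` is an illustrative helper name.

```python
def accuracy(sensitivity, specificity, n_pos, n_neg):
    """Overall accuracy (%) from sensitivity and specificity (both in %).

    accuracy = (TP + TN) / (P + N)
             = (sensitivity * P + specificity * N) / (P + N)
    """
    return (sensitivity * n_pos + specificity * n_neg) / (n_pos + n_neg)


# Assumption: balanced benchmark (one negative site per positive site).
print(round(accuracy(90.0, 96.66, n_pos=1, n_neg=1), 2))  # 93.33
```

    With unbalanced classes the same sensitivity and specificity would give a different accuracy, which is why the class ratio matters when comparing reported results.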

  5. Analysis of Antisense Expression by Whole Genome Tiling Microarrays and siRNAs Suggests Mis-Annotation of Arabidopsis Orphan Protein-Coding Genes

    PubMed Central

    Richardson, Casey R.; Luo, Qing-Jun; Gontcharova, Viktoria; Jiang, Ying-Wen; Samanta, Manoj; Youn, Eunseog; Rock, Christopher D.

    2010-01-01

    Background MicroRNAs (miRNAs) and trans-acting small-interfering RNAs (tasi-RNAs) are small (20–22 nt long) RNAs (smRNAs) generated from hairpin secondary structures or antisense transcripts, respectively, that regulate gene expression by Watson-Crick pairing to a target mRNA and altering expression by mechanisms related to RNA interference. The high sequence homology of plant miRNAs to their targets has been the mainstay of miRNA prediction algorithms, which are limited in their predictive power for other kingdoms because miRNA complementarity is less conserved yet transitive processes (production of antisense smRNAs) are active in eukaryotes. We hypothesize that antisense transcription and associated smRNAs are biomarkers which can be computationally modeled for gene discovery. Principal Findings We explored rice (Oryza sativa) sense and antisense gene expression in publicly available whole genome tiling array transcriptome data and sequenced smRNA libraries (as well as C. elegans) and found evidence of transitivity of MIRNA genes similar to that found in Arabidopsis. Statistical analysis of antisense transcript abundances, presence of antisense ESTs, and association with smRNAs suggests several hundred Arabidopsis ‘orphan’ hypothetical genes are non-coding RNAs. Consistent with this hypothesis, we found novel Arabidopsis homologues of some MIRNA genes on the antisense strand of previously annotated protein-coding genes. A Support Vector Machine (SVM) was applied using thermodynamic energy of binding plus novel expression features of sense/antisense transcription topology and siRNA abundances to build a prediction model of miRNA targets. The SVM when trained on targets could predict the “ancient” (deeply conserved) class of validated Arabidopsis MIRNA genes with an accuracy of 84%, and 76% for “new” rapidly-evolving MIRNA genes. Conclusions Antisense and smRNA expression features and computational methods may identify novel MIRNA genes and other non-coding RNAs in plants and potentially other kingdoms, which can provide insight into antisense transcription, miRNA evolution, and post-transcriptional gene regulation. PMID:20520764

  6. RNA-based micelles: A novel platform for paclitaxel loading and delivery.

    PubMed

    Shu, Yi; Yin, Hongran; Rajabi, Mehdi; Li, Hui; Vieweger, Mario; Guo, Sijin; Shu, Dan; Guo, Peixuan

    2018-04-28

    RNA can serve as a powerful building block for bottom-up fabrication of nanostructures for biotechnological and biomedical applications. In addition to current self-assembly strategies utilizing base pairing, motif piling and tertiary interactions, we report for the first time the formation of an RNA-based micellar nanoconstruct with a cholesterol molecule conjugated onto one helical end of a branched pRNA three-way junction (3WJ) motif. The resulting amphiphilic RNA micelles consist of a hydrophilic RNA head and a covalently linked hydrophobic lipid tail that can spontaneously assemble in aqueous solution via hydrophobic interaction. Taking advantage of the pRNA 3WJ branched structure, the assembled RNA micelles are capable of escorting multiple functional modules. As a proof of concept for delivery of therapeutics, Paclitaxel was loaded into the RNA micelles with significantly improved water solubility. The successful construction of the drug-loaded RNA micelles was confirmed and characterized by agarose gel electrophoresis, atomic force microscopy (AFM), dynamic light scattering (DLS), and a fluorescence Nile Red encapsulation assay. The estimated critical micelle formation concentration ranges from 39 nM to 78 nM. The Paclitaxel-loaded RNA micelles can internalize into cancer cells and inhibit their proliferation. Further studies showed that the Paclitaxel-loaded RNA micelles induced cancer cell apoptosis in a Caspase-3-dependent manner, but RNA micelles alone exhibited low cytotoxicity. Finally, the Paclitaxel-loaded RNA micelles targeted tumors in vivo without accumulation in healthy tissues and organs. There is also no or very low induction of pro-inflammatory response. Therefore, multivalence and cancer cell permeability, combined with controllable assembly, low or no toxicity, and tumor targeting, are all promising features that make our pRNA micelles a suitable platform for potential drug delivery. Copyright © 2018 Elsevier B.V. All rights reserved.

  7. The 28S–18S rDNA intergenic spacer from Crithidia fasciculata: repeated sequences, length heterogeneity, putative processing sites and potential interactions between U3 small nucleolar RNA and the ribosomal RNA precursor

    PubMed Central

    Schnare, Murray N.; Collings, James C.; Spencer, David F.; Gray, Michael W.

    2000-01-01

    In Crithidia fasciculata, the ribosomal RNA (rRNA) gene repeats range in size from ∼11 to 12 kb. This length heterogeneity is localized to a region of the intergenic spacer (IGS) that contains tandemly repeated copies of a 19mer sequence. The IGS also contains four copies of an ∼55 nt repeat that has an internal inverted repeat and is also present in the IGS of Leishmania species. We have mapped the C. fasciculata transcription initiation site as well as two other reverse transcriptase stop sites that may be analogous to the A0 and A′ pre-rRNA processing sites within the 5′ external transcribed spacer (ETS) of other eukaryotes. Features that could influence processing at these sites include two stretches of conserved primary sequence and three secondary structure elements present in the 5′ ETS. We also characterized the C. fasciculata U3 snoRNA, which has the potential for base-pairing with pre-rRNA sequences. Finally, we demonstrate that biosynthesis of large subunit rRNA in both C. fasciculata and Trypanosoma brucei involves 3′-terminal addition of three A residues that are not present in the corresponding DNA sequences. PMID:10982863

  8. Phylogeny of the order Choreotrichida (Ciliophora, Spirotricha, Oligotrichea) as inferred from morphology, ultrastructure, ontogenesis, and SSrRNA gene sequences

    PubMed Central

    Agatha, Sabine; Strüder-Kypke, Michaela C.

    2010-01-01

    The phylogeny within the order Choreotrichida is reconstructed using (i) morphologic, ontogenetic, and ultrastructural evidence for the cladistic approach and (ii) the small subunit ribosomal RNA (SSrRNA) gene sequences, including the new sequence of Rimostrombidium lacustris. The morphologic cladograms and the gene trees converge rather well for the Choreotrichida, demonstrating that hyaline and agglutinated loricae do not characterize distinct lineages, i.e., both lorica types can be associated with the most highly developed ciliary pattern. The position of Rimostrombidium lacustris within the family Strobilidiidae is corroborated by the genealogical analyses. The diagnosis of the genus Tintinnidium is improved, adding cytological features, and the genus is divided into two subgenera based on the structure of the somatic kineties. The diagnosis of the family Lohmanniellidae and the genus Lohmanniella are improved, and Rimostrombidium glacicolum Petz, Song and Wilbert, 1995 is affiliated. PMID:17166704

  9. Probing the closed-loop model of mRNA translation in living cells

    PubMed Central

    Archer, Stuart K; Shirokikh, Nikolay E; Hallwirth, Claus V; Beilharz, Traude H; Preiss, Thomas

    2015-01-01

    The mRNA closed-loop, formed through interactions between the cap structure, poly(A) tail, eIF4E, eIF4G and PAB, features centrally in models of eukaryotic translation initiation, although direct support for its existence in vivo is not well established. Here, we investigated the closed-loop using a combination of mRNP isolation from rapidly cross-linked cells and high-throughput qPCR. Using the interaction between these factors and the opposing ends of mRNAs as a proxy for the closed-loop, we provide evidence that it is prevalent for eIF4E/4G-bound but unexpectedly sparse for PAB1-bound mRNAs, suggesting it primarily occurs during a distinct phase of polysome assembly. We observed mRNA-specific variation in the extent of closed-loop formation, consistent with a role for polysome topology in the control of gene expression. PMID:25826658

  10. The impact of age, biogenesis, and genomic clustering on Drosophila microRNA evolution

    PubMed Central

    Mohammed, Jaaved; Flynt, Alex S.; Siepel, Adam; Lai, Eric C.

    2013-01-01

    The molecular evolutionary signatures of miRNAs inform our understanding of their emergence, biogenesis, and function. The known signatures of miRNA evolution have derived mostly from the analysis of deeply conserved, canonical loci. In this study, we examine the impact of age, biogenesis pathway, and genomic arrangement on the evolutionary properties of Drosophila miRNAs. Crucial to the accuracy of our results was our curation of high-quality miRNA alignments, which included nearly 150 corrections to ortholog calls and nucleotide sequences of the global 12-way Drosophilid alignments currently available. Using these data, we studied primary sequence conservation, normalized free-energy values, and types of structure-preserving substitutions. We expand upon common miRNA evolutionary patterns that reflect fundamental features of miRNAs that are under functional selection. We observe that melanogaster-subgroup-specific miRNAs, although recently emerged and rapidly evolving, nonetheless exhibit evolutionary signatures that are similar to well-conserved miRNAs and distinct from other structured noncoding RNAs and bulk conserved non-miRNA hairpins. This provides evidence that even young miRNAs may be selected for regulatory activities. More strikingly, we observe that mirtrons and clustered miRNAs both exhibit distinct evolutionary properties relative to solo, well-conserved miRNAs, even after controlling for sequence depth. These studies highlight the previously unappreciated impact of biogenesis strategy and genomic location on the evolutionary dynamics of miRNAs, and affirm that miRNAs do not evolve as a unitary class. PMID:23882112

  11. Common features of microRNA target prediction tools

    PubMed Central

    Peterson, Sarah M.; Thompson, Jeffrey A.; Ufkin, Melanie L.; Sathyanarayana, Pradeep; Liaw, Lucy; Congdon, Clare Bates

    2014-01-01

    The human genome encodes for over 1800 microRNAs (miRNAs), which are short non-coding RNA molecules that function to regulate gene expression post-transcriptionally. Due to the potential for one miRNA to target multiple gene transcripts, miRNAs are recognized as a major mechanism to regulate gene expression and mRNA translation. Computational prediction of miRNA targets is a critical initial step in identifying miRNA:mRNA target interactions for experimental validation. The available tools for miRNA target prediction encompass a range of different computational approaches, from the modeling of physical interactions to the incorporation of machine learning. This review provides an overview of the major computational approaches to miRNA target prediction. Our discussion highlights three tools for their ease of use, reliance on relatively updated versions of miRBase, and range of capabilities, and these are DIANA-microT-CDS, miRanda-mirSVR, and TargetScan. In comparison across all miRNA target prediction tools, four main aspects of the miRNA:mRNA target interaction emerge as common features on which most target prediction is based: seed match, conservation, free energy, and site accessibility. This review explains these features and identifies how they are incorporated into currently available target prediction tools. MiRNA target prediction is a dynamic field with increasing attention on development of new analysis tools. This review attempts to provide a comprehensive assessment of these tools in a manner that is accessible across disciplines. Understanding the basis of these prediction methodologies will aid in user selection of the appropriate tools and interpretation of the tool output. PMID:24600468
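
    Of the four common features named above, the seed match is the simplest to show concretely. The sketch below looks for perfect matches to the reverse complement of miRNA nucleotides 2-8 in a target sequence. It is an illustration only and does not reproduce the scoring of DIANA-microT-CDS, miRanda-mirSVR or TargetScan; the function name `seed_sites` and the short UTR fragment are hypothetical, and conservation, free energy and site accessibility are not modeled.

```python
# Watson-Crick complement table for RNA bases.
COMPLEMENT = str.maketrans("ACGU", "UGCA")


def seed_sites(mirna, utr):
    """Return 0-based start positions of perfect seed matches in a UTR.

    The seed is taken as miRNA nucleotides 2-8 (1-based, from the 5' end);
    a site is the reverse complement of that 7-mer in the target.
    """
    mirna = mirna.upper().replace("T", "U")
    utr = utr.upper().replace("T", "U")
    seed = mirna[1:8]                        # nucleotides 2-8
    site = seed.translate(COMPLEMENT)[::-1]  # reverse complement
    width = len(site)
    return [i for i in range(len(utr) - width + 1) if utr[i:i + width] == site]


# let-7a used as example input; the UTR fragment is made up for illustration.
let7a = "UGAGGUAGUAGGUUGUAUAGUU"
utr = "AAACUACCUCAGG"
print(seed_sites(let7a, utr))  # [3]
```

    A seed hit alone yields many false positives, which is why the tools discussed in the review combine it with the other three features.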

  13. Inducamides A–C, Chlorinated Alkaloids from an RNA Polymerase Mutant Strain of Streptomyces sp.

    PubMed Central

    2015-01-01

    Inducamides A–C (1–3), three new chlorinated alkaloids featuring an amide skeleton generated by a tryptophan fragment and a 6-methylsalicylic acid unit, were isolated from a chemically induced mutant strain of Streptomyces sp. with the inducamides only being produced in the mutant strain. Their structures, including stereochemistry, were determined by spectroscopic analysis, Marfey’s method, and CD spectroscopy. PMID:25338006

  14. A semi-supervised learning approach for RNA secondary structure prediction.

    PubMed

    Yonemoto, Haruka; Asai, Kiyoshi; Hamada, Michiaki

    2015-08-01

    RNA secondary structure prediction is a key technology in RNA bioinformatics. Most algorithms for RNA secondary structure prediction use probabilistic models, in which the model parameters are trained with reliable RNA secondary structures. Because of the difficulty of determining RNA secondary structures by experimental procedures, such as NMR or X-ray crystal structural analyses, there are still many RNA sequences that could be useful for training whose secondary structures have not been experimentally determined. In this paper, we introduce a novel semi-supervised learning approach for training parameters in a probabilistic model of RNA secondary structures in which we employ not only RNA sequences with annotated secondary structures but also ones with unknown secondary structures. Our model is based on a hybrid of generative (stochastic context-free grammars) and discriminative models (conditional random fields) that has been successfully applied to natural language processing. Computational experiments indicate that the accuracy of secondary structure prediction is improved by incorporating RNA sequences with unknown secondary structures into training. To our knowledge, this is the first study of a semi-supervised learning approach for RNA secondary structure prediction. This technique will be useful when the number of reliable structures is limited. Copyright © 2015 Elsevier Ltd. All rights reserved.

  15. NS3 from Hepatitis C Virus Strain JFH-1 Is an Unusually Robust Helicase That Is Primed To Bind and Unwind Viral RNA

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zhou, Ting; Ren, Xiaoming; Adams, Rebecca L.

    Hepatitis C viruses (HCV) encode a helicase enzyme that is essential for viral replication and assembly (nonstructural protein 3 [NS3]). This helicase has become the focus of extensive basic research on the general helicase mechanism, and it is also of interest as a novel drug target. Despite the importance of this protein, mechanistic work on NS3 has been conducted almost exclusively on variants from HCV genotype 1. Our understanding of NS3 from the highly active HCV strains that are used to study HCV genetics and mechanism in cell culture (such as JFH-1) is lacking. We therefore set out to determine whether NS3 from the replicatively efficient genotype 2a strain JFH-1 displays novel functional or structural properties. Using biochemical assays for RNA binding and duplex unwinding, we show that JFH-1 NS3 binds RNA much more rapidly than the previously studied NS3 variants from genotype 1b. Unlike NS3 variants from other genotypes, JFH-1 NS3 binds RNA with high affinity in a functionally active form that is capable of immediately unwinding RNA duplexes without undergoing rate-limiting conformational changes that precede activation. Unlike other superfamily 2 (SF2) helicases, JFH-1 NS3 does not require long 3' overhangs, and it unwinds duplexes that are flanked by only a few nucleotides, as in the folded HCV genome. To understand the physical basis for this, we solved the crystal structure of JFH-1 NS3, revealing a novel conformation that contains an open, positively charged RNA binding cleft that is primed for productive interaction with RNA targets, potentially explaining robust replication by HCV JFH-1. IMPORTANCE: Genotypes of HCV are as divergent as different types of flavivirus, and yet mechanistic features of HCV variants are presumed to be held in common. One of the most well-studied components of the HCV replication complex is a helicase known as nonstructural protein 3 (NS3). We set out to determine whether this important mechanical component possesses biochemical and structural properties that differ between common strains such as those of genotype 1b and a strain of HCV that replicates with exceptional efficiency (JFH-1, classified as genotype 2a). Indeed, unlike the inefficient genotype 1b NS3, which has been well studied, JFH-1 NS3 is a superhelicase with strong RNA affinity and high unwinding efficiency on a broad range of targets. Crystallographic analysis reveals architectural features that promote enhanced biochemical activity of JFH-1 NS3. These findings show that even within a single family of viruses, drift in sequence can result in the acquisition of radically new functional properties that enhance viral fitness.

  16. Deriving Quantitative Dynamics Information for Proteins and RNAs using ROTDIF with a Graphical User Interface

    PubMed Central

    Berlin, Konstantin; Longhini, Andrew; Dayie, T. Kwaku; Fushman, David

    2013-01-01

    To facilitate rigorous analysis of molecular motions in proteins, DNA, and RNA, we present a new version of ROTDIF, a program for determining the overall rotational diffusion tensor from single- or multiple-field Nuclear Magnetic Resonance (NMR) relaxation data. We introduce four major features that expand the program’s versatility and usability. The first feature is the ability to analyze, separately or together, 13C and/or 15N relaxation data collected at a single or multiple fields. A significant improvement in the accuracy compared to direct analysis of R2/R1 ratios, especially critical for analysis of 13C relaxation data, is achieved by subtracting high-frequency contributions to relaxation rates. The second new feature is an improved method for computing the rotational diffusion tensor in the presence of biased errors, such as large conformational exchange contributions, that significantly enhances the accuracy of the computation. The third new feature is the integration of the domain alignment and docking module for relaxation-based structure determination of multi-domain systems. Finally, to improve accessibility to all the program features, we introduced a graphical user interface (GUI) that simplifies and speeds up the analysis of the data. Written in Java, the new ROTDIF can run on virtually any computer platform. In addition, the new ROTDIF achieves an order of magnitude speedup over the previous version by implementing a more efficient deterministic minimization algorithm. We not only demonstrate the improvement in accuracy and speed of the new algorithm for synthetic and experimental 13C and 15N relaxation data for several proteins and nucleic acids, but also show that the careful analysis required especially for characterizing RNA dynamics allowed us to uncover subtle conformational changes in RNA as a function of temperature that were opaque to previous analysis. PMID:24170368

  17. Structural imprints in vivo decode RNA regulatory mechanisms.

    PubMed

    Spitale, Robert C; Flynn, Ryan A; Zhang, Qiangfeng Cliff; Crisalli, Pete; Lee, Byron; Jung, Jong-Wha; Kuchelmeister, Hannes Y; Batista, Pedro J; Torre, Eduardo A; Kool, Eric T; Chang, Howard Y

    2015-03-26

    Visualizing the physical basis for molecular behaviour inside living cells is a great challenge for biology. RNAs are central to biological regulation, and the ability of RNA to adopt specific structures intimately controls every step of the gene expression program. However, our understanding of physiological RNA structures is limited; current in vivo RNA structure profiles include only two of the four nucleotides that make up RNA. Here we present a novel biochemical approach, in vivo click selective 2'-hydroxyl acylation and profiling experiment (icSHAPE), which enables the first global view, to our knowledge, of RNA secondary structures in living cells for all four bases. icSHAPE of the mouse embryonic stem cell transcriptome versus purified RNA folded in vitro shows that the structural dynamics of RNA in the cellular environment distinguish different classes of RNAs and regulatory elements. Structural signatures at translational start sites and ribosome pause sites are conserved from in vitro conditions, suggesting that these RNA elements are programmed by sequence. In contrast, focal structural rearrangements in vivo reveal precise interfaces of RNA with RNA-binding proteins or RNA-modification sites that are consistent with atomic-resolution structural data. Such dynamic structural footprints enable accurate prediction of RNA-protein interactions and N(6)-methyladenosine (m(6)A) modification genome wide. These results open the door for structural genomics of RNA in living cells and reveal key physiological structures controlling gene expression.

  18. RNApdbee 2.0: multifunctional tool for RNA structure annotation.

    PubMed

    Zok, Tomasz; Antczak, Maciej; Zurkowski, Michal; Popenda, Mariusz; Blazewicz, Jacek; Adamiak, Ryszard W; Szachniuk, Marta

    2018-04-30

    In the field of RNA structural biology and bioinformatics, access to correctly annotated RNA structures is of crucial importance, especially for secondary and 3D structure prediction. The RNApdbee webserver, introduced in 2014, primarily aimed to address the problem of RNA secondary structure extraction from PDB files. Its new version, RNApdbee 2.0, is a highly advanced multifunctional tool for RNA structure annotation, revealing the relationship between RNA secondary and 3D structure given in the PDB or PDBx/mmCIF format. The upgraded version incorporates new algorithms for recognition and classification of higher-order pseudoknots in large RNA structures. It allows analysis of the impact of isolated base pairs on RNA structure. It can visualize RNA secondary structures, including those of quadruplexes, with depiction of non-canonical interactions. It also annotates motifs to ease identification of stems, loops and single-stranded fragments in the input RNA structure. RNApdbee 2.0 is implemented as a publicly available webserver with an intuitive interface and can be freely accessed at http://rnapdbee.cs.put.poznan.pl/.

  19. Investigating DNA-, RNA-, and protein-based features as a means to discriminate pathogenic synonymous variants.

    PubMed

    Livingstone, Mark; Folkman, Lukas; Yang, Yuedong; Zhang, Ping; Mort, Matthew; Cooper, David N; Liu, Yunlong; Stantic, Bela; Zhou, Yaoqi

    2017-10-01

    Synonymous single-nucleotide variants (SNVs), although they do not alter the encoded protein sequences, have been implicated in many genetic diseases. Experimental studies indicate that synonymous SNVs can lead to changes in the secondary and tertiary structures of DNA and RNA, thereby affecting translational efficiency, cotranslational protein folding as well as the binding of DNA-/RNA-binding proteins. However, the importance of these various features in disease phenotypes is not clearly understood. Here, we have built a support vector machine (SVM) model (termed DDIG-SN) as a means to discriminate disease-causing synonymous variants. The model was trained and evaluated on nearly 900 disease-causing variants. The method achieves robust performance with the area under the receiver operating characteristic curve of 0.84 and 0.85 for protein-stratified 10-fold cross-validation and independent testing, respectively. We were able to show that the disease-causing effects in the immediate proximity to exon-intron junctions (1-3 bp) are driven by the loss of splicing motif strength, whereas the gain of splicing motif strength is the primary cause in regions further away from the splice site (4-69 bp). The method is available as a part of the DDIG server at http://sparks-lab.org/ddig. © 2017 Wiley Periodicals, Inc.

  20. LncRNA, a new component of expanding RNA-protein regulatory network important for animal sperm development.

    PubMed

    Zhang, Chenwang; Gao, Liuze; Xu, Eugene Yujun

    2016-11-01

    Spermatogenesis is one of the fundamental processes of sexual reproduction, present in almost all metazoan animals. Like many other reproductive traits, developmental features and traits of spermatogenesis are under strong selective pressure to change, both at morphological and underlying molecular levels. Yet evidence suggests that some fundamental features of spermatogenesis may be ancient and conserved among metazoan species. Identifying the underlying conserved molecular mechanisms could reveal core components of metazoan spermatogenic machinery and provide novel insight into causes of human infertility. Conserved RNA-binding proteins and their interacting RNA network emerge as a common theme important for animal sperm development. We review research on a recent addition to the RNA family, long non-coding RNA (lncRNA), and its roles in spermatogenesis in the context of the expanding RNA-protein network. Copyright © 2016 Elsevier Ltd. All rights reserved.

  1. Expansion of the aminoglycoside-resistance 16S rRNA (m(1)A1408) methyltransferase family: expression and functional characterization of four hypothetical enzymes of diverse bacterial origin.

    PubMed

    Witek, Marta A; Conn, Graeme L

    2014-09-01

    The global dissemination, potential activity in diverse species and broad resistance spectrum conferred by the aminoglycoside-resistance ribosomal RNA methyltransferases make them a significant potential new threat to the efficacy of aminoglycoside antibiotics in the treatment of serious bacterial infections. The N1 methylation of adenosine 1408 (m(1)A1408) confers resistance to structurally diverse aminoglycosides, including kanamycin, neomycin and apramycin. The limited analyses to date of the enzymes responsible have identified common features but also potential differences in their molecular details of action. Therefore, with the goal of expanding the known 16S rRNA (m(1)A1408) methyltransferase family as a platform for developing a more complete mechanistic understanding, we report here the cloning, expression and functional analyses of four hypothetical aminoglycoside-resistance rRNA methyltransferases from recent genome sequences of diverse bacterial species. Each of the genes produced a soluble, folded protein with a secondary structure, as determined from circular dichroism (CD) spectra, consistent with enzymes for which high-resolution structures are available. For each enzyme, antibiotic minimum inhibitory concentration (MIC) assays revealed a resistance spectrum characteristic of the known 16S rRNA (m(1)A1408) methyltransferases and the modified nucleotide was confirmed by reverse transcription as A1408. In common with other family members, higher binding affinity for the methylation reaction by-product S-adenosylhomocysteine (SAH) than the cosubstrate S-adenosyl-L-methionine (SAM) was observed for three methyltransferases, while one unexpectedly showed no measurable affinity for SAH. Collectively, these results confirm that each hypothetical enzyme is a functional 16S rRNA (m(1)A1408) methyltransferase but also point to further potential mechanistic variation within this enzyme family. Copyright © 2014 Elsevier B.V. All rights reserved.

  2. Recognition of RNA by amide modified backbone nucleic acids: molecular dynamics simulations of DNA-RNA hybrids in aqueous solution.

    PubMed

    Nina, Mafalda; Fonné-Pfister, Raymonde; Beaudegnies, Renaud; Chekatt, Habiba; Jung, Pierre M J; Murphy-Kessabi, Fiona; De Mesmaeker, Alain; Wendeborn, Sebastian

    2005-04-27

    Thermodynamic and structural properties of a chemically modified DNA-RNA hybrid in which a phosphodiester linkage is replaced by a neutral amide-3 linkage (3'-CH(2)-CONH-5') were investigated using UV melting experiments, molecular dynamics simulations in explicit water, and continuum solvent models. van't Hoff analysis of the experimental UV melting curves suggests that the significant increase of the thermodynamic stability of a 15-mer DNA-RNA with seven alternating amide-3 modifications (+11 °C) is mainly due to an increased binding enthalpy. To further evaluate the origin of the observed affinity differences, the electrostatic contribution to the binding free energy was calculated by solving the Poisson-Boltzmann equation numerically. The nonelectrostatic contribution was estimated as the product of a hydrophobic surface tension coefficient and the surface area that is buried upon double strand formation. Structures were taken from 10 ns molecular dynamics simulations computed in a consistent fashion using explicit solvent, counterions, and the particle-mesh Ewald procedure. The present preliminary thermodynamic study suggests that the favorable binding free energy of the amide-3 DNA single strand to the complementary RNA is equally driven by electrostatic and nonpolar contributions to the binding compared to their natural analogues. In addition, molecular dynamics simulations in explicit water were performed on an amide-3 DNA single strand and the corresponding natural DNA. Results from the conformational cluster analysis of the simulated amide-3 DNA single strand ensembles suggest that 25% of the population sampled within 10 ns has a pre-organized conformation where the sugar C3' endo pucker is favored at the 3'-flanking nucleotides. These structural and thermodynamic features contribute to the understanding of the observed increased affinities of the amide-3 DNA-RNA hybrids at the microscopic level.
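    The free-energy decomposition described in this abstract can be written compactly. The notation below is an assumption introduced for illustration (the abstract gives no symbols or numerical values): \gamma is the hydrophobic surface tension coefficient, A_buried > 0 is the surface area buried on duplex formation, and the minus sign reflects the convention that burying surface is favorable.

```latex
% Sketch of the continuum-solvent decomposition; all symbols are assumed notation.
\begin{align}
  \Delta G_{\mathrm{bind}} &\approx \Delta G_{\mathrm{elec}} + \Delta G_{\mathrm{np}}, \\
  \Delta G_{\mathrm{np}}   &= -\gamma \, A_{\mathrm{buried}}, \\
  A_{\mathrm{buried}}      &= A_{\mathrm{DNA}} + A_{\mathrm{RNA}} - A_{\mathrm{duplex}} .
\end{align}
% \Delta G_elec is the Poisson-Boltzmann electrostatic term; the A terms are
% solvent-accessible surface areas of the isolated strands and of the duplex.
```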

  3. RNA FRABASE 2.0: an advanced web-accessible database with the capacity to search the three-dimensional fragments within RNA structures

    PubMed Central

    2010-01-01

    Background: Recent discoveries concerning novel functions of RNA, such as RNA interference, have contributed towards the growing importance of the field. In this respect, a deeper knowledge of complex three-dimensional RNA structures is essential to understand their new biological functions. A number of bioinformatic tools have been proposed to explore two major structural databases (PDB, NDB) in order to analyze various aspects of RNA tertiary structures. One of these tools is RNA FRABASE 1.0, the first web-accessible database with an engine for automatic search of 3D fragments within PDB-derived RNA structures. This search is based upon the user-defined RNA secondary structure pattern. In this paper, we present and discuss RNA FRABASE 2.0. This second version of the system represents a major extension of this tool in terms of providing new data and a wide spectrum of novel functionalities. An intuitively operated web server platform enables very fast user-tailored search of three-dimensional RNA fragments, their multi-parameter conformational analysis and visualization. Description: RNA FRABASE 2.0 has stored information on 1565 PDB-deposited RNA structures, including all NMR models. The RNA FRABASE 2.0 search engine algorithms operate on the database of the RNA sequences and the new library of RNA secondary structures, coded in the dot-bracket format extended to hold multi-stranded structures and to cover residues whose coordinates are missing in the PDB files. The library of RNA secondary structures (and their graphics) is made available. A high level of efficiency of the 3D search has been achieved by introducing novel tools to formulate advanced searching patterns and to screen highly populated tertiary structure elements. RNA FRABASE 2.0 also stores data and conformational parameters in order to provide "on the spot" structural filters to explore the three-dimensional RNA structures. An instant visualization of the 3D RNA structures is provided. RNA FRABASE 2.0 is freely available at http://rnafrabase.cs.put.poznan.pl. Conclusions: RNA FRABASE 2.0 provides a novel database and powerful search engine which is equipped with new data and functionalities that are unavailable elsewhere. Our intention is that this advanced version of the RNA FRABASE will be of interest to all researchers working in the RNA field. PMID:20459631

  4. RNA FRABASE 2.0: an advanced web-accessible database with the capacity to search the three-dimensional fragments within RNA structures.

    PubMed

    Popenda, Mariusz; Szachniuk, Marta; Blazewicz, Marek; Wasik, Szymon; Burke, Edmund K; Blazewicz, Jacek; Adamiak, Ryszard W

    2010-05-06

    Recent discoveries concerning novel functions of RNA, such as RNA interference, have contributed towards the growing importance of the field. In this respect, a deeper knowledge of complex three-dimensional RNA structures is essential to understand their new biological functions. A number of bioinformatic tools have been proposed to explore two major structural databases (PDB, NDB) in order to analyze various aspects of RNA tertiary structures. One of these tools is RNA FRABASE 1.0, the first web-accessible database with an engine for automatic search of 3D fragments within PDB-derived RNA structures. This search is based upon the user-defined RNA secondary structure pattern. In this paper, we present and discuss RNA FRABASE 2.0. This second version of the system represents a major extension of this tool in terms of providing new data and a wide spectrum of novel functionalities. An intuitively operated web server platform enables very fast user-tailored search of three-dimensional RNA fragments, their multi-parameter conformational analysis and visualization. RNA FRABASE 2.0 has stored information on 1565 PDB-deposited RNA structures, including all NMR models. The RNA FRABASE 2.0 search engine algorithms operate on the database of the RNA sequences and the new library of RNA secondary structures, coded in the dot-bracket format extended to hold multi-stranded structures and to cover residues whose coordinates are missing in the PDB files. The library of RNA secondary structures (and their graphics) is made available. A high level of efficiency of the 3D search has been achieved by introducing novel tools to formulate advanced searching patterns and to screen highly populated tertiary structure elements. RNA FRABASE 2.0 also stores data and conformational parameters in order to provide "on the spot" structural filters to explore the three-dimensional RNA structures. An instant visualization of the 3D RNA structures is provided. RNA FRABASE 2.0 is freely available at http://rnafrabase.cs.put.poznan.pl. RNA FRABASE 2.0 provides a novel database and powerful search engine which is equipped with new data and functionalities that are unavailable elsewhere. Our intention is that this advanced version of the RNA FRABASE will be of interest to all researchers working in the RNA field.
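    For readers unfamiliar with the dot-bracket notation mentioned in this abstract, the sketch below shows how a plain dot-bracket string maps to base pairs. It covers only the basic single-strand form with '(' , ')' and '.'; the extensions RNA FRABASE 2.0 uses for multi-stranded structures and missing residues are not described in the abstract and are not modeled here. The function name is hypothetical.

```python
# Minimal parser for plain dot-bracket notation (assumed basic form only:
# '(' opens a pair, ')' closes the most recent open pair, '.' is unpaired).
def dot_bracket_pairs(structure):
    """Return a sorted list of 0-based (i, j) base-pair index tuples."""
    stack = []   # indices of '(' still waiting for a partner
    pairs = []
    for i, symbol in enumerate(structure):
        if symbol == "(":
            stack.append(i)
        elif symbol == ")":
            if not stack:
                raise ValueError(f"unmatched ')' at position {i}")
            pairs.append((stack.pop(), i))
        elif symbol != ".":
            raise ValueError(f"unsupported symbol {symbol!r} at position {i}")
    if stack:
        raise ValueError(f"unmatched '(' at position {stack[-1]}")
    return sorted(pairs)


if __name__ == "__main__":
    # A 3-bp stem closing a 3-nt hairpin loop, plus one unpaired 3' residue.
    print(dot_bracket_pairs("(((...)))."))
```

    A secondary-structure search pattern of the kind the abstract describes could then be compared against such pair lists, although the actual search engine is not specified at that level of detail.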

  5. Competing endogenous RNA regulatory network in papillary thyroid carcinoma.

    PubMed

    Chen, Shouhua; Fan, Xiaobin; Gu, He; Zhang, Lili; Zhao, Wenhua

    2018-05-11

    The present study aimed to screen all types of RNAs involved in the development of papillary thyroid carcinoma (PTC). RNA-sequencing data of PTC and normal samples were used for screening differentially expressed (DE) microRNAs (DE-miRNAs), long non-coding RNAs (DE-lncRNAs) and genes (DEGs). Subsequently, lncRNA-miRNA, miRNA-gene (that is, miRNA-mRNA) and gene-gene interaction pairs were extracted and used to construct regulatory networks. Feature genes in the miRNA-mRNA network were identified by topological analysis and recursive feature elimination analysis. A support vector machine (SVM) classifier was built using 15 feature genes, and its classification effect was validated using two microarray data sets that were downloaded from the Gene Expression Omnibus (GEO) database. In addition, Gene Ontology function and Kyoto Encyclopedia of Genes and Genomes pathway enrichment analyses were conducted for genes identified in the ceRNA network. A total of 506 samples, including 447 tumor samples and 59 normal samples, were obtained from The Cancer Genome Atlas (TCGA); 16 DE-lncRNAs, 917 DEGs and 30 DE-miRNAs were screened. The miRNA-mRNA regulatory network comprised 353 nodes and 577 interactions. From these data, 15 feature genes with high predictive precision (>95%) were extracted from the network and were used to form an SVM classifier with an accuracy of 96.05% (486/506) for PTC samples downloaded from TCGA, and accuracies of 96.81 and 98.46% for GEO downloaded data sets. The ceRNA regulatory network comprised 596 lines (or interactions) and 365 nodes. Genes in the ceRNA network were significantly enriched in 'neuron development', 'differentiation', 'neuroactive ligand-receptor interaction', 'metabolism of xenobiotics by cytochrome P450', 'drug metabolism' and 'cytokine-cytokine receptor interaction' pathways. Hox transcript antisense RNA, miRNA-206 and kallikrein-related peptidase 10 were nodes in the ceRNA regulatory network of the selected feature gene, and they may serve important roles in the development of PTC.

  6. Sequence determinants of improved CRISPR sgRNA design.

    PubMed

    Xu, Han; Xiao, Tengfei; Chen, Chen-Hao; Li, Wei; Meyer, Clifford A; Wu, Qiu; Wu, Di; Cong, Le; Zhang, Feng; Liu, Jun S; Brown, Myles; Liu, X Shirley

    2015-08-01

    The CRISPR/Cas9 system has revolutionized mammalian somatic cell genetics. Genome-wide functional screens using CRISPR/Cas9-mediated knockout or dCas9 fusion-mediated inhibition/activation (CRISPRi/a) are powerful techniques for discovering phenotype-associated gene function. We systematically assessed the DNA sequence features that contribute to single guide RNA (sgRNA) efficiency in CRISPR-based screens. Leveraging the information from multiple designs, we derived a new sequence model for predicting sgRNA efficiency in CRISPR/Cas9 knockout experiments. Our model confirmed known features and suggested new features including a preference for cytosine at the cleavage site. The model was experimentally validated for sgRNA-mediated mutation rate and protein knockout efficiency. Tested on independent data sets, the model achieved significant results in both positive and negative selection conditions and outperformed existing models. We also found that the sequence preference for CRISPRi/a is substantially different from that for CRISPR/Cas9 knockout and propose a new model for predicting sgRNA efficiency in CRISPRi/a experiments. These results facilitate the genome-wide design of improved sgRNA for both knockout and CRISPRi/a studies. © 2015 Xu et al.; Published by Cold Spring Harbor Laboratory Press.

  7. Structural imprints in vivo decode RNA regulatory mechanisms

    PubMed Central

    Spitale, Robert C.; Flynn, Ryan A.; Zhang, Qiangfeng Cliff; Crisalli, Pete; Lee, Byron; Jung, Jong-Wha; Kuchelmeister, Hannes Y.; Batista, Pedro J.; Torre, Eduardo A.; Kool, Eric T.; Chang, Howard Y.

    2015-01-01

    Visualizing the physical basis for molecular behavior inside living cells is a grand challenge in biology. RNAs are central to biological regulation, and RNA’s ability to adopt specific structures intimately controls every step of the gene expression program [1]. However, our understanding of physiological RNA structures is limited; current in vivo RNA structure profiles view only two of four nucleotides that make up RNA [2,3]. Here we present a novel biochemical approach, In Vivo Click SHAPE (icSHAPE), that enables the first global view of RNA secondary structures of all four bases in living cells. icSHAPE of mouse embryonic stem cell transcriptome versus purified RNA folded in vitro shows that the structural dynamics of RNA in the cellular environment distinguishes different classes of RNAs and regulatory elements. Structural signatures at translational start sites and ribosome pause sites are conserved from in vitro, suggesting that these RNA elements are programmed by sequence. In contrast, focal structural rearrangements in vivo reveal precise interfaces of RNA with RNA binding proteins or RNA modification sites that are consistent with atomic-resolution structural data. Such dynamic structural footprints enable accurate prediction of RNA-protein interactions and N6-methyladenosine (m6A) modification genome-wide. These results open the door for structural genomics of RNA in living cells and reveal key physiological structures controlling gene expression. PMID:25799993

  8. RNA polymerase pausing and nascent RNA structure formation are linked through clamp domain movement

    PubMed Central

    Hein, Pyae P.; Kolb, Kellie E.; Windgassen, Tricia; Bellecourt, Michael J.; Darst, Seth A.; Mooney, Rachel A.; Landick, Robert

    2014-01-01

    The rates of RNA synthesis and nascent RNA folding into biologically active structures are linked via pausing by RNA polymerase (RNAP). Structures that form within the RNA exit channel can increase pausing by interacting with bacterial RNAP or decrease pausing by preventing backtracking. Conversely, pausing is required for proper folding of some RNAs. Opening of the RNAP clamp domain is proposed to mediate some effects of nascent RNA structures. However, the connections among RNA structure formation, clamp movement, and catalytic activity remain uncertain. We assayed exit-channel structure formation in Escherichia coli RNAP together with disulfide crosslinks that favor closed or open clamp conformations and found that clamp position directly influences RNA structure formation and catalytic activity. We report that exit-channel RNA structures slow pause escape by favoring clamp opening and through interactions with the flap that slow translocation. PMID:25108353

  9. Novel Compound Heterozygous Mutations Expand the Recognized Phenotypes of FARS2-Linked Disease.

    PubMed

    Walker, Melissa A; Mohler, Kyle P; Hopkins, Kyle W; Oakley, Derek H; Sweetser, David A; Ibba, Michael; Frosch, Matthew P; Thibert, Ronald L

    2016-08-01

    Mutations in mitochondrial aminoacyl-tRNA synthetases are an increasingly recognized cause of human diseases, often arising in individuals with compound heterozygous mutations and presenting with system-specific phenotypes, frequently neurologic. FARS2 encodes mitochondrial phenylalanyl transfer ribonucleic acid (RNA) synthetase (mtPheRS), perturbations of which have been reported in 6 cases of an infantile, lethal disease with refractory epilepsy and progressive myoclonus. Here the authors report the case of juvenile onset refractory epilepsy and progressive myoclonus with compound heterozygous FARS2 mutations. The authors describe the clinical course over 6 years of care at their institution and diagnostic studies including electroencephalogram (EEG), brain magnetic resonance imaging (MRI), serum and cerebrospinal fluid analyses, skeletal muscle biopsy histology, and autopsy gross and histologic findings, which include features shared with Alpers-Huttenlocher syndrome, Leigh syndrome, and a previously published case of FARS2 mutation associated infantile onset disease. The authors also present structure-guided analysis of the relevant mutations based on published mitochondrial phenylalanyl transfer RNA synthetase and related protein crystal structures as well as biochemical analysis of the corresponding recombinant mutant proteins. © The Author(s) 2016.

  10. RNA2DMut: a web tool for the design and analysis of RNA structure mutations.

    PubMed

    Moss, Walter N

    2018-03-01

    With the widespread application of high-throughput sequencing, novel RNA sequences are being discovered at an astonishing rate. The analysis of function, however, lags behind. In both the cis- and trans-regulatory functions of RNA, secondary structure (2D base-pairing) plays essential regulatory roles. In order to test RNA function, it is essential to be able to design and analyze mutations that can affect structure. This was the motivation for the creation of the RNA2DMut web tool. With RNA2DMut, users can enter RNA sequences to analyze, constrain mutations to specific residues, or limit changes to purines/pyrimidines. The sequence is analyzed at each base to determine the effect of every possible point mutation on 2D structure. The metrics used in RNA2DMut rely on the calculation of the Boltzmann structure ensemble and do not require a robust 2D model of RNA structure for designing mutations. This tool can facilitate a wide array of uses involving RNA: for example, in designing and evaluating mutants for biological assays, interrogating RNA-protein interactions, identifying key regions to alter in SELEX experiments, and improving RNA folding and crystallization properties for structural biology. Additional tools are available to help users introduce other mutations (e.g., indels and substitutions) and evaluate their effects on RNA structure. Example calculations are shown for five RNAs that require 2D structure for their function: the MALAT1 mascRNA, an influenza virus splicing regulatory motif, the EBER2 viral noncoding RNA, the Xist lncRNA repA region, and human Y RNA 5. RNA2DMut can be accessed at https://rna2dmut.bb.iastate.edu/. © 2018 Moss; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
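    The enumeration step described in this abstract (every possible point mutation, optionally restricted to chosen residues or to changes within the purine or pyrimidine class) can be sketched as follows. This is an illustration only, not RNA2DMut's code: the function and parameter names are hypothetical, positions are 0-based by assumption, and the Boltzmann-ensemble scoring of each mutant is deliberately left out.

```python
# Sketch of point-mutant enumeration only; structure scoring is not included.
PURINES = {"A", "G"}


def point_mutants(seq, positions=None, keep_class=False):
    """Yield (index, ref, alt, mutant_sequence) for each single-base change.

    positions  -- optional set of 0-based indices allowed to mutate
    keep_class -- if True, allow only purine->purine and
                  pyrimidine->pyrimidine changes
    """
    seq = seq.upper().replace("T", "U")
    for i, ref in enumerate(seq):
        if positions is not None and i not in positions:
            continue
        for alt in "ACGU":
            if alt == ref:
                continue
            if keep_class and (ref in PURINES) != (alt in PURINES):
                continue
            yield i, ref, alt, seq[:i] + alt + seq[i + 1:]


if __name__ == "__main__":
    for index, ref, alt, mutant in point_mutants("GGAC", positions={0, 3}):
        print(f"{ref}{index}{alt}\t{mutant}")
```

    Each yielded sequence would then be passed to a folding program of the user's choice to evaluate its effect on the structure ensemble.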

  11. miRToolsGallery: a tag-based and rankable microRNA bioinformatics resources database portal

    PubMed Central

    Chen, Liang; Heikkinen, Liisa; Wang, ChangLiang; Yang, Yang; Knott, K Emily

    2018-01-01

    Hundreds of bioinformatics tools have been developed for MicroRNA (miRNA) investigations including those used for identification, target prediction, structure and expression profile analysis. However, finding the correct tool for a specific application requires the tedious and laborious process of locating, downloading, testing and validating the appropriate tool from a group of nearly a thousand. In order to facilitate this process, we developed a novel database portal named miRToolsGallery. We constructed the portal by manually curating > 950 miRNA analysis tools and resources. In the portal, a query to locate the appropriate tool is expedited by being searchable, filterable and rankable. The ranking feature is vital to quickly identify and prioritize the more useful from the obscure tools. Tools are ranked via different criteria including the PageRank algorithm, date of publication, number of citations, average of votes and number of publications. miRToolsGallery provides links and data for the comprehensive collection of currently available miRNA tools with a ranking function which can be adjusted using different criteria according to specific requirements. Database URL: http://www.mirtoolsgallery.org PMID:29688355

  12. Cytosolic Hsp70 and co-chaperones constitute a novel system for tRNA import into the nucleus

    PubMed Central

    Takano, Akira; Kajita, Takuya; Mochizuki, Makoto; Endo, Toshiya; Yoshihisa, Tohru

    2015-01-01

    tRNAs are unique among various RNAs in that they shuttle between the nucleus and the cytoplasm, and their localization is regulated by nutrient conditions. Although nuclear export of tRNAs has been well documented, the import machinery is poorly understood. Here, we identified Ssa2p, a major cytoplasmic Hsp70 in Saccharomyces cerevisiae, as a tRNA-binding protein whose deletion compromises nuclear accumulation of tRNAs upon nutrient starvation. Ssa2p recognizes several structural features of tRNAs through its nucleotide-binding domain, but prefers loosely-folded tRNAs, suggesting that Ssa2p has a chaperone-like activity for RNAs. Ssa2p also binds Nup116, one of the yeast nucleoporins. Sis1p and Ydj1p, cytoplasmic co-chaperones for Ssa proteins, were also found to contribute to the tRNA import. These results unveil a novel function of the Ssa2p system as a tRNA carrier for nuclear import by a novel mode of substrate recognition. Such Ssa2p-mediated tRNA import likely contributes to quality control of cytosolic tRNAs. DOI: http://dx.doi.org/10.7554/eLife.04659.001 PMID:25853343

  13. Transterm—extended search facilities and improved integration with other databases

    PubMed Central

    Jacobs, Grant H.; Stockwell, Peter A.; Tate, Warren P.; Brown, Chris M.

    2006-01-01

    Transterm has now been publicly available for >10 years. Major changes have been made since its last description in this database issue in 2002. The current database provides data for key regions of mRNA sequences, a curated database of mRNA motifs and tools to allow users to investigate their own motifs or mRNA sequences. The key mRNA regions database is derived computationally from Genbank. It contains 3′ and 5′ flanking regions, the initiation and termination signal context and coding sequence for annotated CDS features from Genbank and RefSeq. The database is non-redundant, enabling summary files and statistics to be prepared for each species. Advances include providing extended search facilities, the database may now be searched by BLAST in addition to regular expressions (patterns) allowing users to search for motifs such as known miRNA sequences, and the inclusion of RefSeq data. The database contains >40 motifs or structural patterns important for translational control. In this release, patterns from UTRsite and Rfam are also incorporated with cross-referencing. Users may search their sequence data with Transterm or user-defined patterns. The system is accessible at . PMID:16381889
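
    As a rough illustration of the regular-expression (pattern) searching mentioned above, the following Python sketch scans an mRNA sequence for a motif and reports every match, including overlapping ones. It is not Transterm's implementation: the function name `find_motif` is hypothetical, and the AAUAAA polyadenylation signal is used only as a familiar example pattern.

```python
import re


def find_motif(sequence, pattern):
    """Return (0-based start, matched text) for every occurrence of pattern.

    A lookahead wrapper is used so that overlapping matches are also reported.
    """
    seq = sequence.upper().replace("T", "U")  # accept DNA-style input
    regex = re.compile(f"(?=({pattern}))")
    return [(m.start(), m.group(1)) for m in regex.finditer(seq)]


if __name__ == "__main__":
    utr = "GGAAUAAAGGAAUAAA"
    for start, hit in find_motif(utr, "AAUAAA"):
        print(start, hit)
```

    Character classes such as `A[AU]UAAA` can be used in the same call to express degenerate positions.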

  14. The complete mitochondrial genome of Plodia interpunctella (Lepidoptera: Pyralidae) and comparison with other Pyraloidea insects.

    PubMed

    Liu, Qiu-Ning; Chai, Xin-Yue; Bian, Dan-Dan; Zhou, Chun-Lin; Tang, Bo-Ping

    2016-01-01

    The mitochondrial (mt) genome can provide important information for the understanding of phylogenetic relationships. The complete mt genome of Plodia interpunctella (Lepidoptera: Pyralidae) has been sequenced. The circular genome is 15 287 bp in size, encoding 13 protein-coding genes (PCGs), 2 rRNA genes, 22 tRNA genes, and a control region. The AT skew of this mt genome is slightly negative, and the nucleotide composition is biased toward A+T nucleotides (80.15%). All PCGs start with the typical ATN (ATA, ATC, ATG, and ATT) codons, except for the cox1 gene which may start with the CGA codon. Four of the 13 PCGs harbor the incomplete termination codon T or TA. All the tRNA genes are folded into the typical clover-leaf structure of mitochondrial tRNA, except for trnS1 (AGN) in which the DHU arm fails to form a stable stem-loop structure. The overlapping sequences are 35 bp in total and are found in seven different locations. A total of 240 bp of intergenic spacers are scattered in 16 regions. The control region of the mt genome is 327 bp in length and consisted of several features common to the sequenced lepidopteran insects. Phylogenetic analysis based on 13 PCGs using the Maximum Likelihood method shows that the placement of P. interpunctella was within the Pyralidae.
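
    The composition statistics quoted above follow standard definitions: AT skew = (A − T) / (A + T), and A+T content = (A + T) / (A + C + G + T). A minimal Python sketch of both is given below; the function name `at_stats` is hypothetical and the example sequence is a toy string, not the P. interpunctella genome.

```python
from collections import Counter


def at_stats(seq):
    """Return (AT skew, A+T fraction) for a DNA sequence.

    Characters other than A, C, G and T (e.g. N) are ignored.
    """
    counts = Counter(seq.upper())
    a, t = counts["A"], counts["T"]
    total = sum(counts[base] for base in "ACGT")
    if total == 0 or a + t == 0:
        raise ValueError("sequence contains no A or T bases")
    skew = (a - t) / (a + t)        # negative when T outnumbers A
    at_fraction = (a + t) / total   # multiply by 100 for a percentage
    return skew, at_fraction


if __name__ == "__main__":
    skew, at_fraction = at_stats("AATTTGC")
    print(f"AT skew = {skew:.3f}, A+T = {at_fraction:.2%}")
```

    A slightly negative skew, as reported for this genome, means thymine is marginally more frequent than adenine on the strand analyzed.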

  15. CRISPR RNA and anti-CRISPR protein binding to the Xanthomonas albilineans Csy1-Csy2 heterodimer in the type I-F CRISPR-Cas system.

    PubMed

    Hong, Suji; Ka, Donghyun; Yoon, Seo Jeong; Suh, Nayoung; Jeong, Migyeong; Suh, Jeong-Yong; Bae, Euiyoung

    2018-02-23

    Clustered regularly interspaced short palindromic repeats (CRISPRs) and CRISPR-associated (Cas) proteins provide microbial adaptive immunity against bacteriophages. In type I-F CRISPR-Cas systems, multiple Cas proteins (Csy1-4) compose a surveillance complex (Csy complex) with CRISPR RNA (crRNA) for target recognition. Here, we report the biochemical characterization of the Csy1-Csy2 subcomplex from Xanthomonas albilineans, including the analysis of its interaction with crRNA and AcrF2, an anti-CRISPR (Acr) protein from a phage that infects Pseudomonas aeruginosa. The X. albilineans Csy1 and Csy2 proteins (XaCsy1 and XaCsy2, respectively) formed a stable heterodimeric complex that specifically bound the 8-nucleotide (nt) 5'-handle of the crRNA. In contrast, the XaCsy1-XaCsy2 heterodimer exhibited reduced affinity for the 28-nt X. albilineans CRISPR repeat RNA containing the 5'-handle sequence. Chromatographic and calorimetric analyses revealed tight binding between the Acr protein from the P. aeruginosa phage and the heterodimeric subunit of the X. albilineans Csy complex, suggesting that AcrF2 recognizes conserved features of Csy1-Csy2 heterodimers. We found that neither XaCsy1 nor XaCsy2 alone forms a stable complex with AcrF2 and the 5'-handle RNA, indicating that XaCsy1-XaCsy2 heterodimerization is required for binding them. We also solved the crystal structure of AcrF2 to a resolution of 1.34 Å, enabling a more detailed structural analysis of the residues involved in the interactions with the Csy1-Csy2 heterodimer. Our results provide information about the order of events during the formation of the multisubunit crRNA-guided surveillance complex and suggest that the Acr protein inactivating type I-F CRISPR-Cas systems has broad specificity. © 2018 by The American Society for Biochemistry and Molecular Biology, Inc.

  16. NMR studies of protein-nucleic acid interactions.

    PubMed

    Varani, Gabriele; Chen, Yu; Leeper, Thomas C

    2004-01-01

    Protein-DNA and protein-RNA complexes play key functional roles in every living organism. Therefore, the elucidation of their structure and dynamics is an important goal of structural and molecular biology. Nuclear magnetic resonance (NMR) studies of protein and nucleic acid complexes have common features with studies of protein-protein complexes: the interaction surfaces between the molecules must be carefully delineated, the relative orientation of the two species needs to be accurately and precisely determined, and close intermolecular contacts defined by nuclear Overhauser effects (NOEs) must be obtained. However, differences in NMR properties (e.g., chemical shifts) and biosynthetic pathways for sample productions generate important differences. Chemical shift differences between the protein and nucleic acid resonances can aid the NMR structure determination process; however, the relatively limited dispersion of the RNA ribose resonances makes the process of assigning intermolecular NOEs more difficult. The analysis of the resulting structures requires computational tools unique to nucleic acid interactions. This chapter summarizes the most important elements of the structure determination by NMR of protein-nucleic acid complexes and their analysis. The main emphasis is on recent developments (e.g., residual dipolar couplings and new Web-based analysis tools) that have facilitated NMR studies of these complexes and expanded the type of biological problems to which NMR techniques of structural elucidation can now be applied.

  17. Comprehensive analysis of a long noncoding RNA-associated competing endogenous RNA network in colorectal cancer.

    PubMed

    Fan, Qiaowei; Liu, Bingrong

    2018-01-01

    This study aimed to develop a lncRNA-associated competing endogenous RNA (ceRNA) network to provide further understanding of the ceRNA regulatory mechanism and pathogenesis in colorectal cancer (CRC). Expression profiles of mRNAs, lncRNAs, and miRNAs, and clinical information for CRC patients were obtained from The Cancer Genome Atlas. The differentially expressed mRNAs, lncRNAs, and miRNAs (referred to as "DEmRNAs", "DElncRNAs", and "DEmiRNAs", respectively) were screened out between 539 CRC samples and 11 normal samples. The interactions between DElncRNAs and DEmiRNAs were predicted by miRcode. The DEmRNAs targeted by the DEmiRNAs were retrieved according to TargetScan, miRTarBase, and miRDB. The lncRNA-miRNA-mRNA ceRNA network was constructed based on the DEmiRNA-DElncRNA and DEmiRNA-DEmRNA interactions. Functional enrichment analysis revealed the biological processes and pathways of DEmRNAs involved in the development of CRC. Key lncRNAs were further analyzed for their associations with overall survival and clinical features of CRC patients. A total of 1,767 DEmRNAs, 608 DElncRNAs, and 283 DEmiRNAs were identified as CRC-specific RNAs. Three hundred eighty-two DEmiRNA-DElncRNA interactions and 68 DEmiRNA-DEmRNA interactions were recognized according to the relevant databases. The lncRNA-miRNA-mRNA ceRNA network was constructed using 25 DEmiRNAs, 52 DEmRNAs, and 64 DElncRNAs. Two DElncRNAs, five DEmiRNAs, and six DEmRNAs were demonstrated to be related to the prognosis of CRC patients. Four DElncRNAs were found to be associated with clinical features. Twenty-eight Gene Ontology terms and 10 Kyoto Encyclopedia of Genes and Genomes pathways were found to be significantly enriched by the DEmRNAs in the ceRNA network. Our results showed cancer-specific mRNA, lncRNA, and miRNA expression patterns and enabled us to construct an lncRNA-associated ceRNA network that provided new insights into the molecular mechanisms of CRC. Key RNA transcripts related to the overall survival and clinical features were also found with promising potential as biomarkers for diagnosis, survival prediction, and classification of CRC.

  18. NMR structure and Mg2+ binding of an RNA segment that underlies the L7/L12 stalk in the E.coli 50S ribosomal subunit

    PubMed Central

    Zhao, Qin; Nagaswamy, Uma; Lee, Hunjoong; Xia, Youlin; Huang, Hung-Chung; Gao, Xiaolian; Fox, George E.

    2005-01-01

    Helix 42 of Domain II of Escherichia coli 23S ribosomal RNA underlies the L7/L12 stalk in the ribosome and may be significant in positioning this feature relative to the rest of the 50S ribosomal subunit. Unlike the Haloarcula marismortui and Deinococcus radiodurans examples, the lower portion of helix 42 in E.coli contains two consecutive G•A oppositions with both adenines on the same side of the stem. Herein, the structure of an analog of positions 1037–1043 and 1112–1118 in the helix 42 region is reported. NMR spectra and structure calculations support a cis Watson–Crick/Watson–Crick (cis W.C.) G•A conformation for the tandem (G•A)2 in the analog and a minimally perturbed helical duplex stem. Mg2+ titration studies imply that the cis W.C. geometry of the tandem (G•A)2 probably allows O6 of G20 and N1 of A4 to coordinate with a Mg2+ ion as indicated by the largest chemical shift changes associated with the imino group of G20 and the H8 of G20 and A4. A cross-strand bridging Mg2+ coordination has also been found in a different sequence context in the crystal structure of H.marismortui 23S rRNA, and therefore it may be a rare but general motif in Mg2+ coordination. PMID:15939932

  19. Novel Mechanisms in the Regulation of G Protein-coupled Receptor Trafficking to the Plasma Membrane

    PubMed Central

    Tholanikunnel, Baby G.; Joseph, Kusumam; Kandasamy, Karthikeyan; Baldys, Aleksander; Raymond, John R.; Luttrell, Louis M.; McDermott, Paul J.; Fernandes, Daniel J.

    2010-01-01

    β2-Adrenergic receptors (β2-AR) are low abundance, integral membrane proteins that mediate the effects of catecholamines at the cell surface. Whereas the processes governing desensitization of activated β2-ARs and their subsequent removal from the cell surface have been characterized in considerable detail, little is known about the mechanisms controlling trafficking of neo-synthesized receptors to the cell surface. Since the discovery of the signal peptide, the targeting of the integral membrane proteins to plasma membrane has been thought to be determined by structural features of the amino acid sequence alone. Here we report that localization of translationally silenced β2-AR mRNA to the peripheral cytoplasmic regions is critical for receptor localization to the plasma membrane. β2-AR mRNA is recognized by the nucleocytoplasmic shuttling RNA-binding protein HuR, which silences translational initiation while chaperoning the mRNA-protein complex to the cell periphery. When HuR expression is down-regulated, β2-AR mRNA translation is initiated prematurely in perinuclear polyribosomes, leading to overproduction of receptors but defective trafficking to the plasma membrane. Our results underscore the importance of the spatiotemporal relationship between β2-AR mRNA localization, translation, and trafficking to the plasma membrane, and establish a novel mechanism whereby G protein-coupled receptor (GPCR) responsiveness is regulated by RNA-based signals. PMID:20739277

  20. Analysis of the interaction with the hepatitis C virus mRNA reveals an alternative mode of RNA recognition by the human La protein.

    PubMed

    Martino, Luigi; Pennell, Simon; Kelly, Geoff; Bui, Tam T T; Kotik-Kogan, Olga; Smerdon, Stephen J; Drake, Alex F; Curry, Stephen; Conte, Maria R

    2012-02-01

    Human La protein is an essential factor in the biology of both coding and non-coding RNAs. In the nucleus, La binds primarily to 3' oligoU containing RNAs, while in the cytoplasm La interacts with an array of different mRNAs lacking a 3' UUU(OH) trailer. An example of the latter is the binding of La to the IRES domain IV of the hepatitis C virus (HCV) RNA, which is associated with viral translation stimulation. By systematic biophysical investigations, we have found that La binds to domain IV using an RNA recognition that is quite distinct from its mode of binding to RNAs with a 3' UUU(OH) trailer: although the La motif and first RNA recognition motif (RRM1) are sufficient for high-affinity binding to 3' oligoU, recognition of HCV domain IV requires the La motif and RRM1 to work in concert with the atypical RRM2 which has not previously been shown to have a significant role in RNA binding. This new mode of binding does not appear sequence specific, but recognizes structural features of the RNA, in particular a double-stranded stem flanked by single-stranded extensions. These findings pave the way for a better understanding of the role of La in viral translation initiation.

  1. Quantifying the relationship between sequence and three-dimensional structure conservation in RNA

    PubMed Central

    2010-01-01

    Background: In recent years, the number of available RNA structures has rapidly grown reflecting the increased interest in RNA biology. Similarly to the studies carried out two decades ago for proteins, which gave the fundamental grounds for developing comparative protein structure prediction methods, we are now able to quantify the relationship between sequence and structure conservation in RNA. Results: Here we introduce an all-against-all sequence- and three-dimensional (3D) structure-based comparison of a representative set of RNA structures, which has allowed us to quantitatively confirm that: (i) there is a measurable relationship between sequence and structure conservation that weakens for alignments resulting in below 60% sequence identity, (ii) evolution tends to conserve more RNA structure than sequence, and (iii) there is a twilight zone for RNA homology detection. Discussion: The computational analysis presented here quantitatively describes the relationship between sequence and structure for RNA molecules and defines a twilight zone region for detecting RNA homology. Our work could represent the theoretical basis and limitations for future developments in comparative RNA 3D structure prediction. PMID:20550657

  2. Discrimination against RNA Backbones by a ssDNA Binding Protein.

    PubMed

    Lloyd, Neil R; Wuttke, Deborah S

    2018-05-01

    Pot1 is the shelterin component responsible for the protection of the single-stranded DNA (ssDNA) overhang at telomeres in nearly all eukaryotic organisms. The C-terminal domain of the DNA-binding domain, Pot1pC, exhibits non-specific ssDNA recognition, achieved through thermodynamically equivalent alternative binding conformations. Given this flexibility, it is unclear how specificity for ssDNA over RNA, an activity required for biological function, is achieved. Examination of the ribose-position specificity of Pot1pC shows that ssDNA specificity is additive but not uniformly distributed across the ligand. High-resolution structures of several Pot1pC complexes with RNA-DNA chimeric ligands reveal Pot1pC discriminates against RNA by utilizing non-compensatory binding modes that feature significant rearrangement of the binding interface. These alternative conformations, accessed through both ligand and protein flexibility, recover much, but not all, of the binding energy, leading to the observed reduction in affinities. These findings suggest that intermolecular interfaces are remarkably sophisticated in their tuning of specificity toward flexible ligands. Copyright © 2018 Elsevier Ltd. All rights reserved.

  3. HomoTarget: a new algorithm for prediction of microRNA targets in Homo sapiens.

    PubMed

    Ahmadi, Hamed; Ahmadi, Ali; Azimzadeh-Jamalkandi, Sadegh; Shoorehdeli, Mahdi Aliyari; Salehzadeh-Yazdi, Ali; Bidkhori, Gholamreza; Masoudi-Nejad, Ali

    2013-02-01

    MiRNAs play an essential role in the networks of gene regulation by inhibiting the translation of target mRNAs. Several computational approaches have been proposed for the prediction of miRNA target-genes. Reports reveal a large fraction of under-predicted or falsely predicted target genes. Thus, there is an imperative need to develop a computational method by which the target mRNAs of existing miRNAs can be correctly identified. In this study, a combined pattern recognition neural network (PRNN) and principal component analysis (PCA) architecture has been proposed in order to model the complicated relationship between miRNAs and their target mRNAs in humans. The results of several types of intelligent classifiers and our proposed model were compared, showing that our algorithm outperformed them with higher sensitivity and specificity. Using the recent release of the miRBase database to find potential targets of miRNAs, this model incorporated twelve structural, thermodynamic and positional features of miRNA:mRNA binding sites to select target candidates. Copyright © 2012 Elsevier Inc. All rights reserved.

  4. lncRScan-SVM: A Tool for Predicting Long Non-Coding RNAs Using Support Vector Machine.

    PubMed

    Sun, Lei; Liu, Hui; Zhang, Lin; Meng, Jia

    2015-01-01

    Functional long non-coding RNAs (lncRNAs) have brought novel insight into biological study; however, it is still not trivial to accurately distinguish the lncRNA transcripts (LNCTs) from the protein coding ones (PCTs). As various information and data about lncRNAs are preserved by previous studies, it is appealing to develop novel methods to identify the lncRNAs more accurately. Our method lncRScan-SVM aims at classifying PCTs and LNCTs using support vector machine (SVM). The gold-standard datasets for lncRScan-SVM model training, lncRNA prediction and method comparison were constructed according to the GENCODE gene annotations of human and mouse respectively. By integrating features derived from gene structure, transcript sequence, potential codon sequence and conservation, lncRScan-SVM outperforms other approaches, as evaluated by several criteria such as sensitivity, specificity, accuracy, Matthews correlation coefficient (MCC) and area under curve (AUC). In addition, several known human lncRNA datasets were assessed using lncRScan-SVM. LncRScan-SVM is an efficient tool for predicting the lncRNAs, and it is quite useful for current lncRNA study.
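
    For readers unfamiliar with the evaluation criteria named above, the sketch below computes sensitivity, specificity, accuracy and the Matthews correlation coefficient from a binary confusion matrix using their standard definitions. It is only an illustration: the function name `binary_metrics` and the counts in the example are made up, and AUC is omitted because it needs ranked scores rather than four counts.

```python
import math


def binary_metrics(tp, fp, tn, fn):
    """Return standard binary-classification metrics from confusion-matrix counts.

    Ratios with a zero denominator are reported as 0.0 (a convention chosen
    for this sketch, not taken from the lncRScan-SVM paper).
    """
    def ratio(num, den):
        return num / den if den else 0.0

    mcc_den = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return {
        "sensitivity": ratio(tp, tp + fn),  # true positive rate
        "specificity": ratio(tn, tn + fp),  # true negative rate
        "accuracy": ratio(tp + tn, tp + fp + tn + fn),
        "mcc": ratio(tp * tn - fp * fn, mcc_den),
    }


if __name__ == "__main__":
    # Hypothetical counts: lncRNAs as the positive class.
    for name, value in binary_metrics(tp=90, fp=10, tn=80, fn=20).items():
        print(f"{name}: {value:.3f}")
```

    MCC is often preferred when the two classes are imbalanced, because it uses all four cells of the confusion matrix.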

  5. A Glu-urea-Lys Ligand-conjugated Lipid Nanoparticle/siRNA System Inhibits Androgen Receptor Expression In Vivo

    PubMed Central

    Lee, Justin B; Zhang, Kaixin; Tam, Yuen Yi C; Quick, Joslyn; Tam, Ying K; Lin, Paulo JC; Chen, Sam; Liu, Yan; Nair, Jayaprakash K; Zlatev, Ivan; Rajeev, Kallanthottathil G; Manoharan, Muthiah; Rennie, Paul S; Cullis, Pieter R

    2016-01-01

    The androgen receptor plays a critical role in the progression of prostate cancer. Here, we describe targeting the prostate-specific membrane antigen using a lipid nanoparticle formulation containing small interfering RNA designed to silence expression of the messenger RNA encoding the androgen receptor. Specifically, a Glu-urea-Lys PSMA-targeting ligand was incorporated into the lipid nanoparticle system formulated with a long alkyl chain polyethylene glycol-lipid to enhance accumulation at tumor sites and facilitate intracellular uptake into tumor cells following systemic administration. Through these features, and by using a structurally refined cationic lipid and an optimized small interfering RNA payload, a lipid nanoparticle system with improved potency and significant therapeutic potential against prostate cancer and potentially other solid tumors was developed. Decreases in serum prostate-specific antigen, tumor cellular proliferation, and androgen receptor levels were observed in a mouse xenograft model following intravenous injection. These results support the potential clinical utility of a prostate-specific membrane antigen–targeted lipid nanoparticle system to silence the androgen receptor in advanced prostate cancer. PMID:28131285

  6. Characterization of hMTr1, a Human Cap1 2′-O-Ribose Methyltransferase

    PubMed Central

    Bélanger, François; Stepinski, Janusz; Darzynkiewicz, Edward; Pelletier, Jerry

    2010-01-01

    Cellular eukaryotic mRNAs are capped at their 5′ ends with a 7-methylguanosine nucleotide, a structural feature that has been shown to be important for conferring mRNA stability, stimulating mRNA biogenesis (splicing, poly(A) addition, nucleocytoplasmic transport), and increasing translational efficiency. Whereas yeast mRNAs have no additional modifications to the cap, called cap0, higher eukaryotes are methylated at the 2′-O-ribose of the first or the first and second transcribed nucleotides, called cap1 and cap2, respectively. In the present study, we identify the methyltransferase responsible for cap1 formation in human cells, which we call hMTr1 (also known as FTSJD2 and ISG95). We show in vitro that hMTr1 catalyzes specific methylation of the 2′-O-ribose of the first nucleotide of a capped RNA transcript. Using siRNA-mediated knockdown of hMTr1 in HeLa cells, we demonstrate that hMTr1 is responsible for cap1 formation in vivo. PMID:20713356

  7. Defining the mRNA recognition signature of a bacterial toxin protein

    DOE PAGES

    Schureck, Marc A.; Dunkle, Jack A.; Maehigashi, Tatsuya; ...

    2015-10-27

    Bacteria contain multiple type II toxins that selectively degrade mRNAs bound to the ribosome to regulate translation and growth and facilitate survival during the stringent response. Ribosome-dependent toxins recognize a variety of three-nucleotide codons within the aminoacyl (A) site, but how these endonucleases achieve substrate specificity remains poorly understood. In this paper, we identify the critical features for how the host inhibition of growth B (HigB) toxin recognizes each of the three A-site nucleotides for cleavage. X-ray crystal structures of HigB bound to two different codons on the ribosome illustrate how HigB uses a microbial RNase-like nucleotide recognition loop to recognize either cytosine or adenosine at the second A-site position. Strikingly, a single HigB residue and 16S rRNA residue C1054 form an adenosine-specific pocket at the third A-site nucleotide, in contrast to how tRNAs decode mRNA. Finally, our results demonstrate that the most important determinant for mRNA cleavage by ribosome-dependent toxins is interaction with the third A-site nucleotide.

  8. Defining the mRNA recognition signature of a bacterial toxin protein

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Schureck, Marc A.; Dunkle, Jack A.; Maehigashi, Tatsuya

    Bacteria contain multiple type II toxins that selectively degrade mRNAs bound to the ribosome to regulate translation and growth and facilitate survival during the stringent response. Ribosome-dependent toxins recognize a variety of three-nucleotide codons within the aminoacyl (A) site, but how these endonucleases achieve substrate specificity remains poorly understood. In this paper, we identify the critical features for how the host inhibition of growth B (HigB) toxin recognizes each of the three A-site nucleotides for cleavage. X-ray crystal structures of HigB bound to two different codons on the ribosome illustrate how HigB uses a microbial RNase-like nucleotide recognition loop to recognize either cytosine or adenosine at the second A-site position. Strikingly, a single HigB residue and 16S rRNA residue C1054 form an adenosine-specific pocket at the third A-site nucleotide, in contrast to how tRNAs decode mRNA. Finally, our results demonstrate that the most important determinant for mRNA cleavage by ribosome-dependent toxins is interaction with the third A-site nucleotide.

  9. Complete fold annotation of the human proteome using a novel structural feature space

    DOE PAGES

    Middleton, Sarah A.; Illuminati, Joseph; Kim, Junhyong

    2017-04-13

    Recognition of protein structural fold is the starting point for many structure prediction tools and protein function inference. Fold prediction is computationally demanding and recognizing novel folds is difficult such that the majority of proteins have not been annotated for fold classification. Here we describe a new machine learning approach using a novel feature space that can be used for accurate recognition of all 1,221 currently known folds and inference of unknown novel folds. We show that our method achieves better than 94% accuracy even when many folds have only one training example. We demonstrate the utility of this method by predicting the folds of 34,330 human protein domains and showing that these predictions can yield useful insights into potential biological function, such as prediction of RNA-binding ability. Finally, our method can be applied to de novo fold prediction of entire proteomes and identify candidate novel fold families.

  10. An efficient algorithm for planar drawing of RNA structures with pseudoknots of any type.

    PubMed

    Byun, Yanga; Han, Kyungsook

    2016-06-01

    An RNA pseudoknot is a tertiary structural element in which bases of a loop pair with complementary bases outside the loop. A drawing of RNA secondary structures is a tree, but a drawing of RNA pseudoknots is a graph that has an inner cycle within a pseudoknot and possibly outer cycles formed between the pseudoknot and other structural elements. Visualizing a large-scale RNA structure with pseudoknots as a planar drawing is challenging because a planar drawing of an RNA structure requires both pseudoknots and an entire structure enclosing the pseudoknots to be embedded into a plane without overlapping or crossing. This paper presents an efficient heuristic algorithm for visualizing a pseudoknotted RNA structure as a planar drawing. The algorithm consists of several parts for finding crossing stems and page mapping the stems, for the layout of stem-loops and pseudoknots, and for overlap detection between structural elements and resolving it. Unlike previous algorithms, our algorithm generates a planar drawing for a large RNA structure with pseudoknots of any type and provides a bracket view of the structure. It generates a compact and aesthetic structure graph for a large pseudoknotted RNA structure in O([Formula: see text]) time, where n is the number of stems of the RNA structure.
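
    The notion of crossing base pairs that underlies pseudoknots can be made concrete with a short Python sketch. It parses an extended bracket string (assumed here to use `()`, `[]`, `{}` and `<>` for pairs and any other character for unpaired bases) and reports pairs (i, j) and (k, l) with i < k < j < l. This is a plain quadratic check written for illustration, not the paper's layout algorithm, and both function names are hypothetical.

```python
def parse_brackets(structure):
    """Return a sorted list of 0-based (open, close) base-pair indices."""
    openers = {"(": ")", "[": "]", "{": "}", "<": ">"}
    closers = {close: open_ for open_, close in openers.items()}
    stacks = {open_: [] for open_ in openers}  # one stack per bracket type
    pairs = []
    for idx, ch in enumerate(structure):
        if ch in openers:
            stacks[ch].append(idx)
        elif ch in closers:
            stack = stacks[closers[ch]]
            if not stack:
                raise ValueError(f"unmatched '{ch}' at position {idx}")
            pairs.append((stack.pop(), idx))
        # any other character is treated as an unpaired base
    if any(stacks.values()):
        raise ValueError("unmatched opening bracket")
    return sorted(pairs)


def crossing_pairs(pairs):
    """Return every couple of base pairs that cross (i < k < j < l)."""
    ordered = sorted(pairs)
    crossings = []
    for a, (i, j) in enumerate(ordered):
        for k, l in ordered[a + 1:]:  # k > i because the list is sorted
            if k < j < l:
                crossings.append(((i, j), (k, l)))
    return crossings


if __name__ == "__main__":
    pairs = parse_brackets("((..[[..))..]]")
    print(crossing_pairs(pairs))
```

    A structure with no crossings is a nested secondary structure that can be drawn as a tree; any crossing indicates a pseudoknot that needs the graph-style layout the abstract describes.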

  11. Computational strategies for the automated design of RNA nanoscale structures from building blocks using NanoTiler.

    PubMed

    Bindewald, Eckart; Grunewald, Calvin; Boyle, Brett; O'Connor, Mary; Shapiro, Bruce A

    2008-10-01

    One approach to designing RNA nanoscale structures is to use known RNA structural motifs such as junctions, kissing loops or bulges and to construct a molecular model by connecting these building blocks with helical struts. We previously developed an algorithm for detecting internal loops, junctions and kissing loops in RNA structures. Here we present algorithms for automating or assisting many of the steps that are involved in creating RNA structures from building blocks: (1) assembling building blocks into nanostructures using either a combinatorial search or constraint satisfaction; (2) optimizing RNA 3D ring structures to improve ring closure; (3) sequence optimisation; (4) creating a unique non-degenerate RNA topology descriptor. This effectively creates a computational pipeline for generating molecular models of RNA nanostructures and more specifically RNA ring structures with optimized sequences from RNA building blocks. We show several examples of how the algorithms can be utilized to generate RNA tecto-shapes.

  12. Computational strategies for the automated design of RNA nanoscale structures from building blocks using NanoTiler

    PubMed Central

    Bindewald, Eckart; Grunewald, Calvin; Boyle, Brett; O’Connor, Mary; Shapiro, Bruce A.

    2013-01-01

    One approach to designing RNA nanoscale structures is to use known RNA structural motifs such as junctions, kissing loops or bulges and to construct a molecular model by connecting these building blocks with helical struts. We previously developed an algorithm for detecting internal loops, junctions and kissing loops in RNA structures. Here we present algorithms for automating or assisting many of the steps that are involved in creating RNA structures from building blocks: (1) assembling building blocks into nanostructures using either a combinatorial search or constraint satisfaction; (2) optimizing RNA 3D ring structures to improve ring closure; (3) sequence optimisation; (4) creating a unique non-degenerate RNA topology descriptor. This effectively creates a computational pipeline for generating molecular models of RNA nanostructures and more specifically RNA ring structures with optimized sequences from RNA building blocks. We show several examples of how the algorithms can be utilized to generate RNA tecto-shapes. PMID:18838281

  13. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Osipiuk, J.; Gornicki, P.; Maj, L.

    The structure of the YlxR protein of unknown function from Streptococcus pneumonia was determined to 1.35 Angstroms. YlxR is expressed from the nusA/infB operon in bacteria and belongs to a small protein family (COG2740) that shares a conserved sequence motif GRGA(Y/W). The family shows no significant amino-acid sequence similarity with other proteins. Three-wavelength diffraction MAD data were collected to 1.7 Angstroms from orthorhombic crystals using synchrotron radiation and the structure was determined using a semi-automated approach. The YlxR structure resembles a two-layer alpha/beta sandwich with the overall shape of a cylinder and shows no structural homology to proteins of known structure. Structural analysis revealed that the YlxR structure represents a new protein fold that belongs to the alpha-beta plait superfamily. The distribution of the electrostatic surface potential shows a large positively charged patch on one side of the protein, a feature often found in nucleic acid-binding proteins. Three sulfate ions bind to this positively charged surface. Analysis of potential binding sites uncovered several substantial clefts, with the largest spanning 3/4 of the protein. A similar distribution of binding sites and a large sharply bent cleft are observed in RNA-binding proteins that are unrelated in sequence and structure. It is proposed that YlxR is an RNA-binding protein.

  14. Streptococcus pneumonia YlxR at 1.35 A shows a putative new fold.

    PubMed

    Osipiuk, J; Górnicki, P; Maj, L; Dementieva, I; Laskowski, R; Joachimiak, A

    2001-11-01

    The structure of the YlxR protein of unknown function from Streptococcus pneumonia was determined to 1.35 A. YlxR is expressed from the nusA/infB operon in bacteria and belongs to a small protein family (COG2740) that shares a conserved sequence motif GRGA(Y/W). The family shows no significant amino-acid sequence similarity with other proteins. Three-wavelength diffraction MAD data were collected to 1.7 A from orthorhombic crystals using synchrotron radiation and the structure was determined using a semi-automated approach. The YlxR structure resembles a two-layer alpha/beta sandwich with the overall shape of a cylinder and shows no structural homology to proteins of known structure. Structural analysis revealed that the YlxR structure represents a new protein fold that belongs to the alpha-beta plait superfamily. The distribution of the electrostatic surface potential shows a large positively charged patch on one side of the protein, a feature often found in nucleic acid-binding proteins. Three sulfate ions bind to this positively charged surface. Analysis of potential binding sites uncovered several substantial clefts, with the largest spanning 3/4 of the protein. A similar distribution of binding sites and a large sharply bent cleft are observed in RNA-binding proteins that are unrelated in sequence and structure. It is proposed that YlxR is an RNA-binding protein.

  15. Structure-seq2: sensitive and accurate genome-wide profiling of RNA structure in vivo

    PubMed Central

    Ritchey, Laura E.; Su, Zhao; Tang, Yin; Tack, David C.

    2017-01-01

    RNA serves many functions in biology such as splicing, temperature sensing, and innate immunity. These functions are often determined by the structure of RNA. There is thus a pressing need to understand RNA structure and how it changes during diverse biological processes both in vivo and genome-wide. Here, we present Structure-seq2, which provides nucleotide-resolution RNA structural information in vivo and genome-wide. This optimized version of our original Structure-seq method increases sensitivity by at least 4-fold and improves data quality by minimizing formation of a deleterious by-product, reducing ligation bias, and improving read coverage. We also present a variation of Structure-seq2 in which a biotinylated nucleotide is incorporated during reverse transcription, which greatly facilitates the protocol by eliminating two PAGE purification steps. We benchmark Structure-seq2 on both mRNA and rRNA structure in rice (Oryza sativa). We demonstrate that Structure-seq2 can lead to new biological insights. Our Structure-seq2 datasets uncover hidden breaks in chloroplast rRNA and identify a previously unreported N1-methyladenosine (m1A) in a nuclear-encoded Oryza sativa rRNA. Overall, Structure-seq2 is a rapid, sensitive, and unbiased method to probe RNA in vivo and genome-wide that facilitates new insights into RNA biology. PMID:28637286

  16. Sequence-structure relationships in RNA loops: establishing the basis for loop homology modeling.

    PubMed

    Schudoma, Christian; May, Patrick; Nikiforova, Viktoria; Walther, Dirk

    2010-01-01

    The specific function of RNA molecules frequently resides in their seemingly unstructured loop regions. We performed a systematic analysis of RNA loops extracted from experimentally determined three-dimensional structures of RNA molecules. A comprehensive loop-structure data set was created and organized into distinct clusters based on structural and sequence similarity. We detected clear evidence of the hallmark of homology present in the sequence-structure relationships in loops. Loops differing by <25% in sequence identity fold into very similar structures. Thus, our results support the application of homology modeling for RNA loop model building. We established a threshold that may guide the sequence divergence-based selection of template structures for RNA loop homology modeling. Of all possible sequences that are, under the assumption of isosteric relationships, theoretically compatible with actual sequences observed in RNA structures, only a small fraction is contained in the Rfam database of RNA sequences and classes implying that the actual RNA loop space may consist of a limited number of unique loop structures and conserved sequences. The loop-structure data sets are made available via an online database, RLooM. RLooM also offers functionalities for the modeling of RNA loop structures in support of RNA engineering and design efforts.

  17. Visualization of RNA structure models within the Integrative Genomics Viewer.

    PubMed

    Busan, Steven; Weeks, Kevin M

    2017-07-01

    Analyses of the interrelationships between RNA structure and function are increasingly important components of genomic studies. The SHAPE-MaP strategy enables accurate RNA structure probing and realistic structure modeling of kilobase-length noncoding RNAs and mRNAs. Existing tools for visualizing RNA structure models are not suitable for efficient analysis of long, structurally heterogeneous RNAs. In addition, structure models are often advantageously interpreted in the context of other experimental data and gene annotation information, for which few tools currently exist. We have developed a module within the widely used and well supported open-source Integrative Genomics Viewer (IGV) that allows visualization of SHAPE and other chemical probing data, including raw reactivities, data-driven structural entropies, and data-constrained base-pair secondary structure models, in context with linear genomic data tracks. We illustrate the usefulness of visualizing RNA structure in the IGV by exploring structure models for a large viral RNA genome, comparing bacterial mRNA structure in cells with its structure under cell- and protein-free conditions, and comparing a noncoding RNA structure modeled using SHAPE data with a base-pairing model inferred through sequence covariation analysis. © 2017 Busan and Weeks; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  18. Dawn of the in vivo RNA structurome and interactome.

    PubMed

    Kwok, Chun Kit

    2016-10-15

    RNA is one of the most fascinating biomolecules in living systems given its structural versatility to fold into elaborate architectures for important biological functions such as gene regulation, catalysis, and information storage. Knowledge of RNA structures and interactions can provide deep insights into their functional roles in vivo. For decades, RNA structural studies have been conducted on a transcript-by-transcript basis. The advent of next-generation sequencing (NGS) has enabled the development of transcriptome-wide structural probing methods to profile the global landscape of RNA structures and interactions, also known as the RNA structurome and interactome, which transformed our understanding of the RNA structure-function relationship on a transcriptomic scale. In this review, molecular tools and NGS methods used for RNA structure probing are presented, novel insights uncovered by RNA structurome and interactome studies are highlighted, and perspectives on current challenges and potential future directions are discussed. A more complete understanding of the RNA structures and interactions in vivo will help illuminate the novel roles of RNA in gene regulation, development, and diseases. © 2016 The Author(s); published by Portland Press Limited on behalf of the Biochemical Society.

  19. Alternative Polyadenylation and Nonsense-Mediated Decay Coordinately Regulate the Human HFE mRNA Levels

    PubMed Central

    Martins, Rute; Proença, Daniela; Silva, Bruno; Barbosa, Cristina; Silva, Ana Luísa; Faustino, Paula; Romão, Luísa

    2012-01-01

    Nonsense-mediated decay (NMD) is an mRNA surveillance pathway that selectively recognizes and degrades defective mRNAs carrying premature translation-termination codons. However, several studies have shown that NMD also targets physiological transcripts that encode full-length proteins, modulating their expression. Indeed, some features of physiological mRNAs can render them NMD-sensitive. Human HFE is a MHC class I protein mainly expressed in the liver that, when mutated, can cause hereditary hemochromatosis, a common genetic disorder of iron metabolism. The HFE gene structure comprises seven exons; although the sixth exon is 1056 base pairs (bp) long, only the first 41 bp encode for amino acids. Thus, the remaining downstream 1015 bp sequence corresponds to the HFE 3′ untranslated region (UTR), along with exon seven. Therefore, this 3′ UTR encompasses an exon/exon junction, a feature that can make the corresponding physiological transcript NMD-sensitive. Here, we demonstrate that in UPF1-depleted or in cycloheximide-treated HeLa and HepG2 cells the HFE transcripts are clearly upregulated, meaning that the physiological HFE mRNA is in fact an NMD-target. This role of NMD in controlling the HFE expression levels was further confirmed in HeLa cells transiently expressing the HFE human gene. Besides, we show, by 3′-RACE analysis in several human tissues that HFE mRNA expression results from alternative cleavage and polyadenylation at four different sites – two were previously described and two are novel polyadenylation sites: one located at exon six, which confers NMD-resistance to the corresponding transcripts, and another located at exon seven. In addition, we show that the amount of HFE mRNA isoforms resulting from cleavage and polyadenylation at exon seven, although present in both cell lines, is higher in HepG2 cells. 
    These results reveal that NMD and alternative polyadenylation may act coordinately to control HFE mRNA levels, possibly varying its protein expression according to the physiological cellular requirements. PMID:22530027

  20. Novel features of the XRN-family in Arabidopsis: Evidence that AtXRN4, one of several orthologs of nuclear Xrn2p/Rat1p, functions in the cytoplasm

    PubMed Central

    Kastenmayer, J. P.; Green, P. J.

    2000-01-01

    The 5′-3′ exoribonucleases Xrn1p and Xrn2p/Rat1p function in the degradation and processing of several classes of RNA in Saccharomyces cerevisiae. Xrn1p is the main enzyme catalyzing cytoplasmic mRNA degradation in multiple decay pathways, whereas Xrn2p/Rat1p functions in the processing of rRNAs and small nucleolar RNAs (snoRNAs) in the nucleus. Much less is known about the XRN-like proteins of multicellular eukaryotes; however, differences in their activities could explain differences in mRNA degradation between multicellular and unicellular eukaryotes. One such difference is the lack in plants and animals of mRNA decay intermediates like those generated in yeast when Xrn1p is blocked by poly(G) tracts that are inserted within mRNAs. We investigated the XRN-family in Arabidopsis thaliana and found it to have several novel features. First, the Arabidopsis genome contains three XRN-like genes (AtXRNs) that are structurally similar to Xrn2p/Rat1p, a characteristic unique to plants. Furthermore, our experimental results and sequence database searches indicate that Xrn1p orthologs may be absent from higher plants. Second, the lack of poly(G) mRNA decay intermediates in plants cannot be explained by the activity of the AtXRNs, because they are blocked by poly(G) tracts. Finally, complementation of yeast mutants and localization studies indicate that two of the AtXRNs likely function in the nucleus, whereas the third acts in the cytoplasm. Thus, the XRN-family in plants is more complex than in other eukaryotes, and, if an XRN-like enzyme plays a role in mRNA decay in plants, the likely participant is a cytoplasmic Xrn2p/Rat1p ortholog, rather than an Xrn1p ortholog. PMID:11106401

  1. RNA 3D Structural Motifs: Definition, Identification, Annotation, and Database Searching

    NASA Astrophysics Data System (ADS)

    Nasalean, Lorena; Stombaugh, Jesse; Zirbel, Craig L.; Leontis, Neocles B.

    Structured RNA molecules resemble proteins in the hierarchical organization of their global structures, folding and broad range of functions. Structured RNAs are composed of recurrent modular motifs that play specific functional roles. Some motifs direct the folding of the RNA or stabilize the folded structure through tertiary interactions. Others bind ligands or proteins or catalyze chemical reactions. Therefore, it is desirable, starting from the RNA sequence, to be able to predict the locations of recurrent motifs in RNA molecules. Conversely, the potential occurrence of one or more known 3D RNA motifs may indicate that a genomic sequence codes for a structured RNA molecule. To identify known RNA structural motifs in new RNA sequences, precise structure-based definitions are needed that specify the core nucleotides of each motif and their conserved interactions. By comparing instances of each recurrent motif and applying base pair isostericity relations, one can identify neutral mutations that preserve its structure and function in the contexts in which it occurs.

  2. Crystal Structure of the Human Pol α B Subunit in Complex with the C-terminal Domain of the Catalytic Subunit

    PubMed Central

    Suwa, Yoshiaki; Gu, Jianyou; Baranovskiy, Andrey G.; Babayeva, Nigar D.; Pavlov, Youri I.; Tahirov, Tahir H.

    2015-01-01

    In eukaryotic DNA replication, short RNA-DNA hybrid primers synthesized by primase-DNA polymerase α (Prim-Pol α) are needed to start DNA replication by the replicative DNA polymerases, Pol δ and Pol ϵ. The C terminus of the Pol α catalytic subunit (p180C) in complex with the B subunit (p70) regulates the RNA priming and DNA polymerizing activities of Prim-Pol α. It tethers Pol α and primase, facilitating RNA primer handover from primase to Pol α. To understand these regulatory mechanisms and to reveal the details of human Pol α organization, we determined the crystal structure of p70 in complex with p180C. The structured portion of p70 includes a phosphodiesterase (PDE) domain and an oligonucleotide/oligosaccharide binding (OB) domain. The N-terminal domain and the linker connecting it to the PDE domain are disordered in the reported crystal structure. The p180C adopts an elongated asymmetric saddle shape, with a three-helix bundle in the middle and zinc-binding modules (Zn1 and Zn2) on each side. The extensive p180C-p70 interactions involve 20 hydrogen bonds and a number of hydrophobic interactions resulting in an extended buried surface of 4080 Å². Importantly, in the structure of the p180C-p70 complex with full-length p70, the residues from the N-terminal to the OB domain contribute to interactions with p180C. The comparative structural analysis revealed both the conserved features and the differences between the human and yeast Pol α complexes. PMID:25847248

  3. A comparative study of sequence- and structure-based features of small RNAs and other RNAs of bacteria.

    PubMed

    Barik, Amita; Das, Santasabuj

    2018-01-02

    Small RNAs (sRNAs) in bacteria have emerged as key players in transcriptional and post-transcriptional regulation of gene expression. Here, we present a statistical analysis of different sequence- and structure-related features of bacterial sRNAs to identify the descriptors that could discriminate sRNAs from other bacterial RNAs. We investigated a comprehensive and heterogeneous collection of 816 sRNAs, identified by northern blotting across 33 bacterial species and compared their various features with other classes of bacterial RNAs, such as tRNAs, rRNAs and mRNAs. We observed that sRNAs differed significantly from the rest with respect to G+C composition, normalized minimum free energy of folding, motif frequency and several RNA-folding parameters like base-pairing propensity, Shannon entropy and base-pair distance. Based on the selected features, we developed a predictive model using Random Forests (RF) method to classify the above four classes of RNAs. Our model displayed an overall predictive accuracy of 89.5%. These findings would help to differentiate bacterial sRNAs from other RNAs and further promote prediction of novel sRNAs in different bacterial species.
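
    As a rough illustration of the simplest sequence-derived descriptors named in this abstract (G+C composition and motif frequency), the sketch below computes both for a single sequence. It is not the authors' pipeline: the function names and the example sequence are hypothetical, the "motif frequency" is reduced to overlapping k-mer frequencies, and the folding-based features (minimum free energy, base-pairing propensity, base-pair distance) and the Random Forests classifier are not reproduced.

```python
from collections import Counter


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a sequence (0.0 for empty input)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def kmer_frequencies(seq: str, k: int = 3) -> dict[str, float]:
    """Relative frequency of each overlapping k-mer (a crude motif-frequency feature)."""
    seq = seq.upper()
    total = len(seq) - k + 1
    if total <= 0:
        return {}
    counts = Counter(seq[i:i + k] for i in range(total))
    return {kmer: n / total for kmer, n in counts.items()}


# Hypothetical 10-nt sequence, used only to exercise the functions.
example = "GGCAUCGGCA"
print(gc_content(example))             # 7 of 10 bases are G or C
print(kmer_frequencies(example, k=2))  # dinucleotide frequencies
```

    Features of this kind would typically be assembled into one vector per RNA before being passed to a classifier.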

  4. The interaction between the yeast telomerase RNA and the Est1 protein requires three structural elements.

    PubMed

    Lubin, Johnathan W; Tucey, Timothy M; Lundblad, Victoria

    2012-09-01

    In the budding yeast Saccharomyces cerevisiae, the telomerase enzyme is composed of a 1.3-kb TLC1 RNA that forms a complex with Est2 (the catalytic subunit) and two regulatory proteins, Est1 and Est3. Previous work has identified a conserved 5-nt bulge, present in a long helical arm of TLC1, which mediates binding of Est1 to TLC1. However, increased expression of Est1 can bypass the consequences of removal of this RNA bulge, indicating that there are additional binding site(s) for Est1 on TLC1. We report here that a conserved single-stranded internal loop immediately adjacent to the bulge is also required for the Est1-RNA interaction; furthermore, a TLC1 variant that lacks this internal loop but retains the bulge cannot be suppressed by Est1 overexpression, arguing that the internal loop may be a more critical element for Est1 binding. An additional structural feature consisting of a single-stranded region at the base of the helix containing the bulge and internal loop also contributes to recognition of TLC1 by Est1, potentially by providing flexibility to this helical arm. Association of Est1 with each of these TLC1 motifs was assessed using a highly sensitive biochemical assay that simultaneously monitors the relative levels of the Est1 and Est2 proteins in the telomerase complex. The identification of three elements of TLC1 that are required for Est1 association provides a detailed view of this particular protein-RNA interaction.

  5. Role of the terminator hairpin in the biogenesis of functional Hfq-binding sRNAs.

    PubMed

    Morita, Teppei; Nishino, Ryo; Aiba, Hiroji

    2017-09-01

    Rho-independent transcription terminators of the genes encoding bacterial Hfq-binding sRNAs possess a set of seven or more T residues at the 3' end, as noted in previous studies. Here, we have studied the role of the terminator hairpin in the biogenesis of sRNAs focusing on SgrS and RyhB in Escherichia coli. We constructed variant sRNA genes in which the GC-rich inverted repeat sequences are extended to stabilize the terminator hairpins. We demonstrate that the extension of the hairpin stem leads to generation of heterogeneous transcripts in which the poly(U) tail is shortened. The transcripts with shortened poly(U) tails no longer bind to Hfq and lose the ability to repress the target mRNAs. The shortened transcripts are generated in an in vitro transcription system with purified RNA polymerase, indicating that the generation of shortened transcripts is caused by premature transcription termination. We conclude that the terminator structure of sRNA genes is optimized to generate functional sRNAs. Thus, the Rho-independent terminators of sRNA genes possess two common features: a long T residue stretch that is a prerequisite for generation of functional sRNAs and a moderate strength of hairpin structure that ensures the termination at the seventh or longer position within the consecutive T stretch. The modulation of the termination position at the Rho-independent terminators is critical for biosynthesis of functional sRNAs. © 2017 Morita et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  6. Human hnRNP protein A1 gene expression. Structural and functional characterization of the promoter.

    PubMed

    Biamonti, G; Bassi, M T; Cartegni, L; Mechta, F; Buvoli, M; Cobianchi, F; Riva, S

    1993-03-05

    hnRNP protein A1 (34 kDa, pI 9.5) is a prominent member of the family of proteins (hnRNP proteins) that associate with the nascent transcripts of RNA polymerase II and that accompany the hnRNA through the maturation process and the export to the cytoplasm. New evidence suggests an active and specific role for some of these proteins, including protein A1, in splicing and transport. Contrary to the other hnRNP proteins, the intracellular level of protein A1 was reported to change as a function of proliferation state and cell type. In this work we analyse the A1 gene expression in different cells under different growth and differentiation conditions. Proliferation dependent expression was observed in lymphocytes and fibroblasts while purified neurons express high A1 mRNA levels both in the proliferative (before birth) and in the quiescent (after birth) state. Transformed cell lines exhibit very high (proliferation independent) A1 mRNA levels compared to differentiated tissues. A structural and functional characterization of the A1 gene promoter was carried out by means of DNase I footprinting and CAT assays. The observed promoter features can account for both elevated and regulated mRNA transcription. At least 12 control elements are contained in the 734 nucleotides upstream of the transcription start site. Assays with the deleted and/or mutated promoter indicate a co-operation of multiple transcriptional elements, distributed over the entire promoter, in determining the overall activity and the response to proliferative stimuli (serum).

  7. Probing Xist RNA Structure in Cells Using Targeted Structure-Seq

    PubMed Central

    Rutenberg-Schoenberg, Michael; Simon, Matthew D.

    2015-01-01

    The long non-coding RNA (lncRNA) Xist is a master regulator of X-chromosome inactivation in mammalian cells. Models for how Xist and other lncRNAs function depend on thermodynamically stable secondary and higher-order structures that RNAs can form in the context of a cell. Probing accessible RNA bases can provide data to build models of RNA conformation that provide insight into RNA function, molecular evolution, and modularity. To study the structure of Xist in cells, we built upon recent advances in RNA secondary structure mapping and modeling to develop Targeted Structure-Seq, which combines chemical probing of RNA structure in cells with target-specific massively parallel sequencing. By enriching for signals from the RNA of interest, Targeted Structure-Seq achieves high coverage of the target RNA with relatively few sequencing reads, thus providing a targeted and scalable approach to analyze RNA conformation in cells. We use this approach to probe the full-length Xist lncRNA to develop new models for functional elements within Xist, including the repeat A element in the 5’-end of Xist. This analysis also identified new structural elements in Xist that are evolutionarily conserved, including a new element proximal to the C repeats that is important for Xist function. PMID:26646615

  8. De novo design of RNA-binding proteins with a prion-like domain related to ALS/FTD proteinopathies.

    PubMed

    Mitsuhashi, Kana; Ito, Daisuke; Mashima, Kyoko; Oyama, Munenori; Takahashi, Shinichi; Suzuki, Norihiro

    2017-12-04

    Aberrant RNA-binding proteins form the core of the neurodegeneration cascade in spectrums of disease, such as amyotrophic lateral sclerosis (ALS)/frontotemporal dementia (FTD). Six ALS-related molecules, TDP-43, FUS, TAF15, EWSR1, heterogeneous nuclear (hn)RNPA1 and hnRNPA2 are RNA-binding proteins containing candidate mutations identified in ALS patients, and they share several common features, including harboring an aggregation-prone prion-like domain (PrLD) containing a glycine/serine-tyrosine-glycine/serine (G/S-Y-G/S)-motif-enriched low-complexity sequence and rich in glutamine and/or asparagine. Additionally, these six molecules are components of RNA granules involved in RNA quality control and become mislocated from the nucleus to form cytoplasmic inclusion bodies (IBs) in the ALS/FTD-affected brain. To reveal the essential mechanisms involved in ALS/FTD-related cytotoxicity associated with RNA-binding proteins containing PrLDs, we designed artificial RNA-binding proteins harboring G/S-Y-G/S-motif repeats with and without enriched glutamine residues and nuclear-import/export-signal sequences and examined their cytotoxicity in vitro. These proteins recapitulated features of ALS-linked molecules, including insoluble aggregation, formation of cytoplasmic IBs and components of RNA granules, and cytotoxicity instigation. These findings indicated that these artificial RNA-binding proteins mimicked features of ALS-linked molecules and allowed the study of mechanisms associated with gain of toxic functions related to ALS/FTD pathogenesis.

  9. psRNATarget: a plant small RNA target analysis server

    PubMed Central

    Dai, Xinbin; Zhao, Patrick Xuechun

    2011-01-01

    Plant endogenous non-coding short small RNAs (20–24 nt), including microRNAs (miRNAs) and a subset of small interfering RNAs (ta-siRNAs), play important roles in gene expression regulatory networks (GRNs). For example, many transcription factors and development-related genes have been reported as targets of these regulatory small RNAs. Although a number of miRNA target prediction algorithms and programs have been developed, most of them were designed for animal miRNAs which are significantly different from plant miRNAs in the target recognition process. These differences demand the development of separate plant miRNA (and ta-siRNA) target analysis tool(s). We present psRNATarget, a plant small RNA target analysis server, which features two important analysis functions: (i) reverse complementary matching between small RNA and target transcript using a proven scoring schema, and (ii) target-site accessibility evaluation by calculating unpaired energy (UPE) required to ‘open’ secondary structure around small RNA’s target site on mRNA. The psRNATarget incorporates recent discoveries in plant miRNA target recognition, e.g. it distinguishes translational and post-transcriptional inhibition, and it reports the number of small RNA/target site pairs that may affect small RNA binding activity to target transcript. The psRNATarget server is designed for high-throughput analysis of next-generation data with an efficient distributed computing back-end pipeline that runs on a Linux cluster. The server front-end integrates three simplified user-friendly interfaces to accept user-submitted or preloaded small RNAs and transcript sequences; and outputs a comprehensive list of small RNA/target pairs along with the online tools for batch downloading, key word searching and results sorting. The psRNATarget server is freely available at http://plantgrn.noble.org/psRNATarget/. PMID:21622958
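
    The first analysis function described above, reverse complementary matching, can be sketched in a few lines. This is only a toy under stated assumptions: it counts plain mismatches against the reverse complement and is not psRNATarget's scoring schema (which is not given in the abstract), it does not compute UPE, and the function names, the `max_mismatches` parameter and both sequences are hypothetical.

```python
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement(rna: str) -> str:
    """Reverse complement of an RNA sequence (T is read as U)."""
    rna = rna.upper().replace("T", "U")
    return "".join(COMPLEMENT[base] for base in reversed(rna))


def find_target_sites(small_rna: str, transcript: str, max_mismatches: int = 1):
    """Return (start, mismatches) for each transcript window that pairs with the
    small RNA with at most `max_mismatches` mismatched positions.

    Assumption: simple mismatch counting stands in for a real scoring schema.
    """
    site = reverse_complement(small_rna)
    transcript = transcript.upper().replace("T", "U")
    hits = []
    for start in range(len(transcript) - len(site) + 1):
        window = transcript[start:start + len(site)]
        mismatches = sum(a != b for a, b in zip(site, window))
        if mismatches <= max_mismatches:
            hits.append((start, mismatches))
    return hits


# Made-up sequences: a perfect site at offset 2 and a one-mismatch site at offset 9.
print(find_target_sites("AUGGC", "AAGCCAUAAGCGAUAA"))
```

    Scanning every window is quadratic in the worst case, which is adequate for a sketch but not for the high-throughput setting the abstract describes.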

  10. Expression characteristics of long noncoding RNA uc.322 and its effects on pancreatic islet function.

    PubMed

    Zhao, Xiaoqin; Rong, Can; Pan, Fenghui; Xiang, Lizhi; Wang, Xinlei; Hu, Yun

    2018-06-28

    Increasing evidence indicates that long noncoding RNAs (lncRNAs) perform special biological functions by regulating gene expression through multiple pathways and molecular mechanisms. The aim of this study was to explore the expression characteristics of lncRNA uc.322 in pancreatic islet cells and its effects on the secretion function of islet cells. Bioinformatics analysis was used to detect the lncRNA uc.322 sequence, location, and structural features. Expression of lncRNA uc.322 in different tissues was detected by quantitative polymerase chain reaction analyses. Quantitative polymerase chain reaction, Western blot analysis, adenosine triphosphate determination, glucose-stimulated insulin secretion, and enzyme-linked immunosorbent assay were used to evaluate the effects of lncRNA uc.322 on insulin secretion. The results showed that the full length of lncRNA uc.322 is 224 bp and that it is highly conserved in various species. Bioinformatics analysis revealed that lncRNA uc.322 is located on chr7:122893196-122893419 (GRCh37/hg19) within the SRY-related HMG-box 6 gene exon region. Compared with other tissues, lncRNA uc.322 is highly expressed in pancreatic tissue. Upregulation of lncRNA uc.322 expression increases the insulin transcription factors pancreatic and duodenal homeobox 1 and Forkhead box O1 expression, promotes insulin secretion in the extracellular fluid of Min6 cells, and increases the adenosine triphosphate concentration. On the other hand, knockdown of lncRNA uc.322 has opposite effects on Min6 cells. Overall, this study showed that upregulation of lncRNA uc.322 in islet β-cells can increase the expression of insulin transcription factors and promote insulin secretion, and it may be a new therapeutic target for diabetes. © 2018 Wiley Periodicals, Inc.
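
    The reported length and genomic position can be checked against each other. The sketch below assumes the position string is 1-based and inclusive at both ends, as is usual for genome-browser position strings; the function name is hypothetical. Under that assumption the interval spans 122893419 - 122893196 + 1 = 224 bases, which agrees with the 224 bp stated in the abstract.

```python
def region_length(region: str) -> int:
    """Length in bases of a 'chrom:start-end' position string.

    Assumption: coordinates are 1-based and inclusive at both ends.
    """
    _chrom, span = region.split(":")
    start_text, end_text = span.split("-")
    start = int(start_text.replace(",", ""))
    end = int(end_text.replace(",", ""))
    if end < start:
        raise ValueError(f"end precedes start in {region!r}")
    return end - start + 1


print(region_length("chr7:122893196-122893419"))  # 224
```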

  11. Towards Long-Range RNA Structure Prediction in Eukaryotic Genes.

    PubMed

    Pervouchine, Dmitri D

    2018-06-15

    The ability to form an intramolecular structure plays a fundamental role in eukaryotic RNA biogenesis. Proximate regions in the primary transcripts fold into a local secondary structure, which is then hierarchically assembled into a tertiary structure that is stabilized by RNA-binding proteins and long-range intramolecular base pairings. While the local RNA structure can be predicted reasonably well for short sequences, long-range structure at the scale of eukaryotic genes remains problematic from the computational standpoint. The aim of this review is to list functional examples of long-range RNA structures, to summarize current comparative methods of structure prediction, and to highlight their advances and limitations in the context of long-range RNA structures. Most comparative methods implement the “first-align-then-fold” principle, i.e., they operate on multiple sequence alignments, while functional RNA structures often reside in non-conserved parts of the primary transcripts. The opposite “first-fold-then-align” approach is currently explored to a much lesser extent. Developing novel methods in both directions will improve the performance of comparative RNA structure analysis and help discover novel long-range structures, their higher-order organization, and RNA-RNA interactions across the transcriptome.

  12. Ultrasensitive Electrochemical Detection of mRNA Using Branched DNA Amplifiers

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Mao, Xun; Liu, Guodong; Wang, Shengfu

    2008-11-01

    We describe here an ultrasensitive electrochemical protocol for detecting mRNA without RNA purification or PCR amplification. The new mRNA electrical detection capability is coupled to the amplification feature of branched DNA (bDNA) technology and to a magnetic bead-based electrochemical bioassay.

  13. SAM-VI RNAs selectively bind S-adenosylmethionine and exhibit similarities to SAM-III riboswitches.

    PubMed

    Mirihana Arachchilage, Gayan; Sherlock, Madeline E; Weinberg, Zasha; Breaker, Ronald R

    2018-03-04

    Five distinct riboswitch classes that regulate gene expression in response to the cofactor S-adenosylmethionine (SAM) or its metabolic breakdown product S-adenosylhomocysteine (SAH) have been reported previously. Collectively, these SAM- or SAH-sensing RNAs constitute the most abundant collection of riboswitches, and are found in nearly every major bacterial lineage. Here, we report a potential sixth member of this pervasive riboswitch family, called SAM-VI, which is predominantly found in Bifidobacterium species. SAM-VI aptamers selectively bind the cofactor SAM and strongly discriminate against SAH. The consensus sequence and structural model for SAM-VI share some features with the consensus model for the SAM-III riboswitch class, whose members are mainly found in lactic acid bacteria. However, there are sufficient differences between the two classes such that current bioinformatics methods separately cluster representatives of the two motifs. These findings highlight the abundance of RNA structures that can form to selectively recognize SAM, and showcase the ability of RNA to utilize diverse strategies to perform similar biological functions.

  14. Compactness of viral genomes: effect of disperse and localized random mutations

    NASA Astrophysics Data System (ADS)

    Lošdorfer Božič, Anže; Micheletti, Cristian; Podgornik, Rudolf; Tubiana, Luca

    2018-02-01

    Genomes of single-stranded RNA viruses have evolved to optimize several concurrent properties. One of them is the architecture of their genomic folds, which must not only feature precise structural elements at specific positions, but also allow for overall spatial compactness. The latter was shown to be disrupted by random synonymous mutations, a disruption which can consequently negatively affect genome encapsidation. In this study, we use three mutation schemes with different degrees of locality to mutate the genomes of phage MS2 and Brome Mosaic virus in order to understand the observed sensitivity of the global compactness of their folds. We find that mutating local stretches of their genomes’ sequence or structure is less disruptive to their compactness compared to inducing randomly-distributed mutations. Our findings are indicative of a mechanism for the conservation of compactness acting on a global scale of the genomes, and have several implications for understanding the interplay between local and global architecture of viral RNA genomes.

  15. microRNA in Cerebral Spinal Fluid as Biomarkers of Alzheimer’s Disease Risk After Brain Injury

    DTIC Science & Technology

    2016-08-01

    protein processing is a key feature of AD. MiRNAs are small non-coding RNA that regulate mRNA transcription, and may be a significant cause of protein dysregulation. Our investigative team has generated

  16. 3D RNA and functional interactions from evolutionary couplings

    PubMed Central

    Weinreb, Caleb; Riesselman, Adam; Ingraham, John B.; Gross, Torsten; Sander, Chris; Marks, Debora S.

    2016-01-01

    Summary Non-coding RNAs are ubiquitous, but the discovery of new RNA gene sequences far outpaces research on their structure and functional interactions. We mine the evolutionary sequence record to derive precise information about function and structure of RNAs and RNA-protein complexes. As in protein structure prediction, we use maximum entropy global probability models of sequence co-variation to infer evolutionarily constrained nucleotide-nucleotide interactions within RNA molecules, and nucleotide-amino acid interactions in RNA-protein complexes. The predicted contacts allow all-atom blinded 3D structure prediction at good accuracy for several known RNA structures and RNA-protein complexes. For unknown structures, we predict contacts in 160 non-coding RNA families. Beyond 3D structure prediction, evolutionary couplings help identify important functional interactions, e.g., at switch points in riboswitches and at a complex nucleation site in HIV. Aided by accelerating sequence accumulation, evolutionary coupling analysis can accelerate the discovery of functional interactions and 3D structures involving RNA. PMID:27087444

  17. Rclick: a web server for comparison of RNA 3D structures.

    PubMed

    Nguyen, Minh N; Verma, Chandra

    2015-03-15

    RNA molecules play important roles in key biological processes in the cell and are becoming attractive for developing therapeutic applications. Since the function of RNA depends on its structure and dynamics, comparing and classifying the RNA 3D structures is of crucial importance to molecular biology. In this study, we have developed Rclick, a web server that is capable of superimposing RNA 3D structures by using clique matching and 3D least-squares fitting. Our server Rclick has been benchmarked and compared with other popular servers and methods for RNA structural alignments. In most cases, Rclick alignments were better in terms of structure overlap. Our server also recognizes conformational changes between structures. For this purpose, the server produces complementary alignments to maximize the extent of detectable similarity. Various examples showcase the utility of our web server for comparison of RNA, RNA-protein complexes and RNA-ligand structures. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  18. Translation initiation events on structured eukaryotic mRNAs generate gene expression noise

    PubMed Central

    Dacheux, Estelle; Malys, Naglis; Meng, Xiang; Ramachandran, Vinoy; Mendes, Pedro

    2017-01-01

    Abstract Gene expression stochasticity plays a major role in biology, creating non-genetic cellular individuality and influencing multiple processes, including differentiation and stress responses. We have addressed the lack of knowledge about posttranscriptional contributions to noise by determining cell-to-cell variations in the abundance of mRNA and reporter protein in yeast. Two types of structural element, a stem–loop and a poly(G) motif, not only inhibit translation initiation when inserted into an mRNA 5′ untranslated region, but also generate noise. The noise-enhancing effect of the stem–loop structure also remains operational when combined with an upstream open reading frame. This has broad significance, since these elements are known to modulate the expression of a diversity of eukaryotic genes. Our findings suggest a mechanism for posttranscriptional noise generation that will contribute to understanding of the generally poor correlation between protein-level stochasticity and transcriptional bursting. We propose that posttranscriptional stochasticity can be linked to cycles of folding/unfolding of a stem–loop structure, or to interconversion between higher-order structural conformations of a G-rich motif, and have created a correspondingly configured computational model that generates fits to the experimental data. Stochastic events occurring during the ribosomal scanning process can therefore feature alongside transcriptional bursting as a source of noise. PMID:28521011

  19. Cryo-EM structure of the spinach chloroplast ribosome reveals the location of plastid-specific ribosomal proteins and extensions

    PubMed Central

    Graf, Michael; Arenz, Stefan; Huter, Paul; Dönhöfer, Alexandra; Nováček, Jiří

    2017-01-01

    Abstract Ribosomes are the protein synthesizing machines of the cell. Recent advances in cryo-EM have led to the determination of structures from a variety of species, including bacterial 70S and eukaryotic 80S ribosomes as well as mitoribosomes from eukaryotic mitochondria; however, to date, high-resolution structures of plastid 70S ribosomes have been lacking. Here we present a cryo-EM structure of the spinach chloroplast 70S ribosome, with an average resolution of 5.4 Å for the small 30S subunit and 3.6 Å for the large 50S ribosomal subunit. The structure reveals the location of the plastid-specific ribosomal proteins (RPs) PSRP1, PSRP4, PSRP5 and PSRP6 as well as the numerous plastid-specific extensions of the RPs. We discover many features by which the plastid-specific extensions stabilize the ribosome via establishing additional interactions with surrounding ribosomal RNA and RPs. Moreover, we identify a large conglomerate of plastid-specific protein mass adjacent to the tunnel exit site that could facilitate interaction of the chloroplast ribosome with the thylakoid membrane and the protein-targeting machinery. Comparing the Escherichia coli 70S ribosome with that of the spinach chloroplast ribosome provides detailed insight into the co-evolution of RP and rRNA. PMID:27986857

  20. RNA structures as mediators of neurological diseases and as drug targets

    PubMed Central

    Bernat, Viachaslau; Disney, Matthew D.

    2015-01-01

    RNAs adopt diverse folded structures that are essential for function and thus play critical roles in cellular biology. A striking example of this is the ribosome, a complex, three-dimensionally folded macromolecular machine that orchestrates protein synthesis. Advances in RNA biochemistry, structural and molecular biology, and bioinformatics have revealed other non-coding RNAs whose functions are dictated by their structure. It is not surprising that aberrantly folded RNA structures contribute to disease. In this review, we provide a brief introduction into RNA structural biology and then describe how RNA structures function in cells and cause or contribute to neurological disease. Finally, we highlight successful applications of rational design principles to provide chemical probes and lead compounds targeting structured RNAs. Based on several examples of well-characterized RNA-driven neurological disorders, we demonstrate how designed small molecules can facilitate study of RNA dysfunction, elucidating previously unknown roles for RNA in disease, and provide lead therapeutics. PMID:26139368

  1. The conservation and function of RNA secondary structure in plants

    PubMed Central

    Vandivier, Lee E.; Anderson, Stephen J.; Foley, Shawn W.; Gregory, Brian D.

    2016-01-01

    RNA transcripts fold into secondary structures via intricate patterns of base pairing. These secondary structures impart catalytic, ligand binding, and scaffolding functions to a wide array of RNAs, forming a critical node of biological regulation. Among their many functions, RNA structural elements modulate epigenetic marks, alter mRNA stability and translation, regulate alternative splicing, transduce signals, and scaffold large macromolecular complexes. Thus, the study of RNA secondary structure is critical to understanding the function and regulation of RNA transcripts. Here, we review the origins, form, and function of RNA secondary structure, focusing on plants. We then provide an overview of methods for probing secondary structure, from physical methods such as X-ray crystallography and nuclear magnetic resonance imaging (NMR) to chemical and nuclease probing methods. Marriage with high-throughput sequencing has enabled these latter methods to scale across whole transcriptomes, yielding tremendous new insights into the form and function of RNA secondary structure. PMID:26865341

  2. 2009 Epigenetics Gordon Research Conference (August 9 - 14, 2009)

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Jeanie Lee

    Epigenetics refers to the study of heritable changes in genome function that occur without a change in primary DNA sequence. The 2009 Gordon Conference in Epigenetics will feature discussion of various epigenetic phenomena, emerging understanding of their underlying mechanisms, and the growing appreciation that human, animal, and plant health all depend on proper epigenetic control. Special emphasis will be placed on genome-environment interactions particularly as they relate to human disease. Towards improving knowledge of molecular mechanisms, the conference will feature international leaders studying the roles of higher order chromatin structure, noncoding RNA, repeat elements, nuclear organization, and morphogenic evolution. Traditional and new model organisms are selected from plants, fungi, and metazoans.

  3. ModeRNA server: an online tool for modeling RNA 3D structures.

    PubMed

    Rother, Magdalena; Milanowska, Kaja; Puton, Tomasz; Jeleniewicz, Jaroslaw; Rother, Kristian; Bujnicki, Janusz M

    2011-09-01

    The diverse functional roles of non-coding RNA molecules are determined by their underlying structure. ModeRNA server is an online tool for RNA 3D structure modeling by the comparative approach, based on a template RNA structure and a user-defined target-template sequence alignment. It offers an option to search for potential templates, given the target sequence. The server also provides tools for analyzing, editing and formatting of RNA structure files. It facilitates the use of the ModeRNA software and offers new options in comparison to the standalone program. ModeRNA server was implemented using the Python language and the Django web framework. It is freely available at http://iimcb.genesilico.pl/modernaserver. iamb@genesilico.pl.

  4. SNBRFinder: A Sequence-Based Hybrid Algorithm for Enhanced Prediction of Nucleic Acid-Binding Residues.

    PubMed

    Yang, Xiaoxia; Wang, Jia; Sun, Jun; Liu, Rong

    2015-01-01

    Protein-nucleic acid interactions are central to various fundamental biological processes. Automated methods capable of reliably identifying DNA- and RNA-binding residues in protein sequence are assuming ever-increasing importance. The majority of current algorithms rely on feature-based prediction, but their accuracy remains to be further improved. Here we propose a sequence-based hybrid algorithm SNBRFinder (Sequence-based Nucleic acid-Binding Residue Finder) by merging a feature predictor SNBRFinderF and a template predictor SNBRFinderT. SNBRFinderF was established using the support vector machine whose inputs include sequence profile and other complementary sequence descriptors, while SNBRFinderT was implemented with the sequence alignment algorithm based on profile hidden Markov models to capture the weakly homologous template of query sequence. Experimental results show that SNBRFinderF was clearly superior to the commonly used sequence profile-based predictor and SNBRFinderT can achieve comparable performance to the structure-based template methods. Leveraging the complementary relationship between these two predictors, SNBRFinder reasonably improved the performance of both DNA- and RNA-binding residue predictions. More importantly, the sequence-based hybrid prediction reached competitive performance relative to our previous structure-based counterpart. Our extensive and stringent comparisons show that SNBRFinder has obvious advantages over the existing sequence-based prediction algorithms. The value of our algorithm is highlighted by establishing an easy-to-use web server that is freely accessible at http://ibi.hzau.edu.cn/SNBRFinder.

  5. Comparative analysis of the 5S rRNA and its associated proteins reveals unique primitive rather than parasitic features in Giardia lamblia.

    PubMed

    Feng, Jin-Mei; Sun, Jun; Xin, De-Dong; Wen, Jian-Fan

    2012-01-01

    5S rRNA is a highly conserved ribosomal component. Eukaryotic 5S rRNA and its associated proteins (5S rRNA system) have become very well understood. Giardia lamblia was thought by some researchers to be the most primitive extant eukaryote while others considered it a highly evolved parasite. Previous reports have indicated that some aspects of its 5S rRNA system are simpler than that of common eukaryotes. We here explore whether this is true of its entire system, and whether this simplicity is a primitive or parasitic feature. By collecting and confirming pre-existing data and identifying new data, we obtained almost complete datasets of the system of three isolates of G. lamblia, two other parasitic excavates (Trichomonas vaginalis, Trypanosoma cruzi), and one free-living one (Naegleria gruberi). After comprehensively comparing each aspect of the system among these excavates and also with those of archaea and common eukaryotes, we found all three Giardia isolates to harbor the same simplified 5S rRNA system, which is not only much simpler than that of common eukaryotes but also the simplest among those of these excavates, and is surprisingly very similar to that of archaea. We also found that, among these excavates, the system in parasitic species is not necessarily simpler than that in free-living species; conversely, the system of free-living species is even simpler in some respects than those of parasitic ones. The simplicity of the Giardia 5S rRNA system should be considered a primitive rather than parasitically-degenerated feature. Therefore, the Giardia 5S rRNA system might be a primitive system that is intermediate between that of archaea and the common eukaryotic model system, and it may reflect the evolutionary history of the eukaryotic 5S rRNA system from the archaeal form. Our results also imply G. lamblia might be a primitive eukaryote with secondary parasitically-degenerated features.

  6. Genome-Wide Comparative In Silico Analysis of the RNA Helicase Gene Family in Zea mays and Glycine max: A Comparison with Arabidopsis and Oryza sativa

    PubMed Central

    Huang, Jinguang; Zheng, Chengchao

    2013-01-01

    RNA helicases are enzymes that are thought to unwind double-stranded RNA molecules in an energy-dependent fashion through the hydrolysis of NTP. RNA helicases are associated with all processes involving RNA molecules, including nuclear transcription, editing, splicing, ribosome biogenesis, RNA export, and organelle gene expression. The involvement of RNA helicase in response to stress and in plant growth and development has been reported previously. While their importance in Arabidopsis and Oryza sativa has been partially studied, the function of RNA helicase proteins is poorly understood in Zea mays and Glycine max. In this study, we identified a total of RNA helicase genes in Arabidopsis and other crop species genome by genome-wide comparative in silico analysis. We classified the RNA helicase genes into three subfamilies according to the structural features of the motif II region, such as DEAD-box, DEAH-box and DExD/H-box, and different species showed different patterns of alternative splicing. Secondly, chromosome location analysis showed that the RNA helicase protein genes were distributed across all chromosomes with different densities in the four species. Thirdly, phylogenetic tree analyses identified the relevant homologs of DEAD-box, DEAH-box and DExD/H-box RNA helicase proteins in each of the four species. Fourthly, microarray expression data showed that many of these predicted RNA helicase genes were expressed in different developmental stages and different tissues under normal growth conditions. Finally, real-time quantitative PCR analysis showed that the expression levels of 10 genes in Arabidopsis and 13 genes in Zea mays were in close agreement with the microarray expression data. To our knowledge, this is the first report of a comparative genome-wide analysis of the RNA helicase gene family in Arabidopsis, Oryza sativa, Zea mays and Glycine max. 
    This study provides valuable information for understanding the classification and putative functions of the RNA helicase gene family in crop growth and development. PMID:24265739

  7. Genome-wide comparative in silico analysis of the RNA helicase gene family in Zea mays and Glycine max: a comparison with Arabidopsis and Oryza sativa.

    PubMed

    Xu, Ruirui; Zhang, Shizhong; Huang, Jinguang; Zheng, Chengchao

    2013-01-01

    RNA helicases are enzymes that are thought to unwind double-stranded RNA molecules in an energy-dependent fashion through the hydrolysis of NTP. RNA helicases are associated with all processes involving RNA molecules, including nuclear transcription, editing, splicing, ribosome biogenesis, RNA export, and organelle gene expression. The involvement of RNA helicase in response to stress and in plant growth and development has been reported previously. While their importance in Arabidopsis and Oryza sativa has been partially studied, the function of RNA helicase proteins is poorly understood in Zea mays and Glycine max. In this study, we identified a total of RNA helicase genes in Arabidopsis and other crop species genome by genome-wide comparative in silico analysis. We classified the RNA helicase genes into three subfamilies according to the structural features of the motif II region, such as DEAD-box, DEAH-box and DExD/H-box, and different species showed different patterns of alternative splicing. Secondly, chromosome location analysis showed that the RNA helicase protein genes were distributed across all chromosomes with different densities in the four species. Thirdly, phylogenetic tree analyses identified the relevant homologs of DEAD-box, DEAH-box and DExD/H-box RNA helicase proteins in each of the four species. Fourthly, microarray expression data showed that many of these predicted RNA helicase genes were expressed in different developmental stages and different tissues under normal growth conditions. Finally, real-time quantitative PCR analysis showed that the expression levels of 10 genes in Arabidopsis and 13 genes in Zea mays were in close agreement with the microarray expression data. To our knowledge, this is the first report of a comparative genome-wide analysis of the RNA helicase gene family in Arabidopsis, Oryza sativa, Zea mays and Glycine max. 
    This study provides valuable information for understanding the classification and putative functions of the RNA helicase gene family in crop growth and development.

  8. Freiburg RNA tools: a central online resource for RNA-focused research and teaching.

    PubMed

    Raden, Martin; Ali, Syed M; Alkhnbashi, Omer S; Busch, Anke; Costa, Fabrizio; Davis, Jason A; Eggenhofer, Florian; Gelhausen, Rick; Georg, Jens; Heyne, Steffen; Hiller, Michael; Kundu, Kousik; Kleinkauf, Robert; Lott, Steffen C; Mohamed, Mostafa M; Mattheis, Alexander; Miladi, Milad; Richter, Andreas S; Will, Sebastian; Wolff, Joachim; Wright, Patrick R; Backofen, Rolf

    2018-05-21

    The Freiburg RNA tools webserver is a well established online resource for RNA-focused research. It provides a unified user interface and comprehensive result visualization for efficient command line tools. The webserver includes RNA-RNA interaction prediction (IntaRNA, CopraRNA, metaMIR), sRNA homology search (GLASSgo), sequence-structure alignments (LocARNA, MARNA, CARNA, ExpaRNA), CRISPR repeat classification (CRISPRmap), sequence design (antaRNA, INFO-RNA, SECISDesign), structure aberration evaluation of point mutations (RaSE), and RNA/protein-family models visualization (CMV), and other methods. Open education resources offer interactive visualizations of RNA structure and RNA-RNA interaction prediction as well as basic and advanced sequence alignment algorithms. The services are freely available at http://rna.informatik.uni-freiburg.de.

  9. RNA secondary structure prediction with pseudoknots: Contribution of algorithm versus energy model.

    PubMed

    Jabbari, Hosna; Wark, Ian; Montemagno, Carlo

    2018-01-01

    RNA is a biopolymer with various applications inside the cell and in biotechnology. The structure of an RNA molecule largely determines its function and is essential for guiding nanostructure design. Since experimental structure determination is time-consuming and expensive, accurate computational prediction of RNA structure is of great importance. Prediction of RNA secondary structure is simpler than prediction of its tertiary structure and provides information about the tertiary structure; RNA secondary structure prediction has therefore received attention in the past decades. Numerous methods with different folding approaches have been developed for RNA secondary structure prediction. While methods for prediction of RNA pseudoknot-free structure (structures with no crossing base pairs) have greatly improved in terms of their accuracy, methods for prediction of RNA pseudoknotted secondary structure (structures with crossing base pairs) still have room for improvement. A long-standing question for improving the prediction accuracy of RNA pseudoknotted secondary structure is whether to focus on the prediction algorithm or the underlying energy model, as there is a trade-off between the computational cost of the prediction algorithm and the generality of the method. The aim of this work is to argue that, when comparing different methods for RNA pseudoknotted structure prediction, the combination of algorithm and energy model should be considered, and a method should not be considered superior or inferior to others if they do not use the same scoring model. We demonstrate that while the folding approach is important in structure prediction, it is not the only important factor in the prediction accuracy of a given method, as the underlying energy model is of equally great value. Therefore we encourage researchers to pay particular attention when comparing methods with different energy models.
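
    The distinction drawn in this abstract between pseudoknot-free and pseudoknotted structures (no crossing versus crossing base pairs) can be made concrete with a small check. The sketch below is only an illustration and is not the authors' method: it assumes a structure is given as a list of (i, j) index pairs, and the function name and the two toy structures are hypothetical.

```python
from itertools import combinations


def has_pseudoknot(pairs):
    """Return True if any two base pairs cross, i.e. i < k < j < l."""
    # Normalize each pair so the smaller index comes first, then sort
    # so that for any two pairs taken in order the first opens no later.
    normalized = sorted((min(i, j), max(i, j)) for i, j in pairs)
    for (i, j), (k, l) in combinations(normalized, 2):
        # The second pair opens inside the first but closes outside it.
        if i < k < j < l:
            return True
    return False


nested = [(0, 9), (1, 8), (2, 7)]   # a plain stem: pairs are nested
crossing = [(0, 5), (3, 8)]         # second pair straddles the first

print(has_pseudoknot(nested))
print(has_pseudoknot(crossing))
```

    The pairwise comparison is quadratic in the number of base pairs, which is adequate for illustration; the nested toy structure is reported as pseudoknot-free and the crossing one as pseudoknotted.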

  10. PmiRExAt: plant miRNA expression atlas database and web applications

    PubMed Central

    Gurjar, Anoop Kishor Singh; Panwar, Abhijeet Singh; Gupta, Rajinder; Mantri, Shrikant S.

    2016-01-01

    High-throughput small RNA (sRNA) sequencing technology enables an entirely new perspective for plant microRNA (miRNA) research and has immense potential to unravel regulatory networks. Novel insights gained through data mining in the rich, publicly available resource of sRNA data will help in designing biotechnology-based approaches for crop improvement to enhance plant yield and nutritional value. Bioinformatics resources enabling meta-analysis of miRNA expression across multiple plant species are still evolving. Here, we report PmiRExAt, a new online database resource that serves as a plant miRNA expression atlas. The web-based repository comprises miRNA expression profiles and a query tool for 1859 wheat, 2330 rice and 283 maize miRNAs. The database interface offers open and easy access to miRNA expression profiles and helps in identifying tissue-preferential, differential and constitutively expressed miRNAs. A feature enabling expression study of conserved miRNAs across multiple species is also implemented. A custom expression analysis feature enables expression analysis of novel miRNAs in a total of 117 datasets. New sRNA datasets can also be uploaded for analysing miRNA expression profiles for 73 plant species. The PmiRExAt application program interface, a simple object access protocol web service, allows other programmers to remotely invoke the methods written for programmatic search operations on the PmiRExAt database. Database URL: http://pmirexat.nabi.res.in. PMID:27081157

  11. RNACompress: Grammar-based compression and informational complexity measurement of RNA secondary structure.

    PubMed

    Liu, Qi; Yang, Yu; Chen, Chun; Bu, Jiajun; Zhang, Yin; Ye, Xiuzi

    2008-03-31

    With the rapid emergence of RNA databases and newly identified non-coding RNAs, an efficient compression algorithm for RNA sequence and structural information is needed for the storage and analysis of such data. Although several algorithms for compressing DNA sequences have been proposed, none of them are suitable for the compression of RNA sequences with their secondary structures simultaneously. This kind of compression not only facilitates the maintenance of RNA data, but also supplies a novel way to measure the informational complexity of RNA structural data, raising the possibility of studying the relationship between the functional activities of RNA structures and their complexities, as well as various structural properties of RNA based on compression. RNACompress employs an efficient grammar-based model to compress RNA sequences and their secondary structures. The main goals of this algorithm are two fold: (1) present a robust and effective way for RNA structural data compression; (2) design a suitable model to represent RNA secondary structure as well as derive the informational complexity of the structural data based on compression. Our extensive tests have shown that RNACompress achieves a universally better compression ratio compared with other sequence-specific or common text-specific compression algorithms, such as Gencompress, winrar and gzip. Moreover, a test of the activities of distinct GTP-binding RNAs (aptamers) compared with their structural complexity shows that our defined informational complexity can be used to describe how complexity varies with activity. These results lead to an objective means of comparing the functional properties of heteropolymers from the information perspective. A universal algorithm for the compression of RNA secondary structure as well as the evaluation of its informational complexity is discussed in this paper. We have developed RNACompress, as a useful tool for academic users. 
    Extensive tests have shown that RNACompress is a universally efficient algorithm for the compression of RNA sequences with their secondary structures. RNACompress also serves as a good measurement of the informational complexity of RNA secondary structure, which can be used to study the functional activities of RNA molecules.

  12. RNACompress: Grammar-based compression and informational complexity measurement of RNA secondary structure

    PubMed Central

    Liu, Qi; Yang, Yu; Chen, Chun; Bu, Jiajun; Zhang, Yin; Ye, Xiuzi

    2008-01-01

    Background: With the rapid emergence of RNA databases and newly identified non-coding RNAs, an efficient compression algorithm for RNA sequence and structural information is needed for the storage and analysis of such data. Although several algorithms for compressing DNA sequences have been proposed, none of them is suitable for compressing RNA sequences together with their secondary structures. This kind of compression not only facilitates the maintenance of RNA data, but also supplies a novel way to measure the informational complexity of RNA structural data, raising the possibility of studying the relationship between the functional activities of RNA structures and their complexities, as well as various structural properties of RNA based on compression. Results: RNACompress employs an efficient grammar-based model to compress RNA sequences and their secondary structures. The main goals of this algorithm are twofold: (1) to present a robust and effective way to compress RNA structural data; (2) to design a suitable model to represent RNA secondary structure and to derive the informational complexity of the structural data based on compression. Our extensive tests have shown that RNACompress achieves a universally better compression ratio compared with other sequence-specific or common text-specific compression algorithms, such as Gencompress, winrar and gzip. Moreover, a test of the activities of distinct GTP-binding RNAs (aptamers) compared with their structural complexity shows that our defined informational complexity can be used to describe how complexity varies with activity. These results lead to an objective means of comparing the functional properties of heteropolymers from the information perspective. Conclusion: A universal algorithm for the compression of RNA secondary structure as well as the evaluation of its informational complexity is discussed in this paper. We have developed RNACompress as a useful tool for academic users. Extensive tests have shown that RNACompress is a universally efficient algorithm for the compression of RNA sequences with their secondary structures. RNACompress also serves as a good measurement of the informational complexity of RNA secondary structure, which can be used to study the functional activities of RNA molecules. PMID:18373878
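    The idea of using compressed size as a proxy for informational complexity can be illustrated with a minimal sketch. This is not RNACompress and does not use its grammar-based model; it substitutes the general-purpose zlib compressor, and the function name compression_ratio and the dot-bracket structure encoding are assumptions made for illustration only.

```python
import zlib


def compression_ratio(sequence: str, structure: str) -> float:
    """Return compressed size / raw size for a sequence plus its structure.

    Assumption: the secondary structure is given in dot-bracket notation
    and has the same length as the sequence. zlib is a generic stand-in
    for a structure-aware compressor.
    """
    if len(sequence) != len(structure):
        raise ValueError("sequence and structure must have equal length")
    raw = (sequence + "\n" + structure).encode("ascii")
    # Level 9 asks zlib for its strongest (slowest) compression.
    return len(zlib.compress(raw, 9)) / len(raw)
```

    A lower ratio indicates more redundancy in the combined record. A highly regular hairpin should therefore score lower than an irregular, unstructured sequence of the same length.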

  13. High-throughput determination of RNA structure by proximity ligation.

    PubMed

    Ramani, Vijay; Qiu, Ruolan; Shendure, Jay

    2015-09-01

    We present an unbiased method to globally resolve RNA structures through pairwise contact measurements between interacting regions. RNA proximity ligation (RPL) uses proximity ligation of native RNA followed by deep sequencing to yield chimeric reads with ligation junctions in the vicinity of structurally proximate bases. We apply RPL in both baker's yeast (Saccharomyces cerevisiae) and human cells and generate contact probability maps for ribosomal and other abundant RNAs, including yeast snoRNAs, the RNA subunit of the signal recognition particle and the yeast U2 spliceosomal RNA homolog. RPL measurements correlate with established secondary structures for these RNA molecules, including stem-loop structures and long-range pseudoknots. We anticipate that RPL will complement the current repertoire of computational and experimental approaches in enabling the high-throughput determination of secondary and tertiary RNA structures.

  14. Accurate Classification of RNA Structures Using Topological Fingerprints

    PubMed Central

    Li, Kejie; Gribskov, Michael

    2016-01-01

    While RNAs are well known to possess complex structures, functionally similar RNAs often have little sequence similarity. While the exact size and spacing of base-paired regions vary, functionally similar RNAs have pronounced similarity in the arrangement, or topology, of base-paired stems. Furthermore, predicted RNA structures often lack pseudoknots (a crucial aspect of biological activity), and are only partially correct, or incomplete. A topological approach addresses all of these difficulties. In this work we describe each RNA structure as a graph that can be converted to a topological spectrum (RNA fingerprint). The set of subgraphs in an RNA structure, its RNA fingerprint, can be compared with the fingerprints of other RNA structures to identify and correctly classify functionally related RNAs. Topologically similar RNAs can be identified even when a large fraction, up to 30%, of the stems are omitted, indicating that highly accurate structures are not necessary. We investigate the performance of the RNA fingerprint approach on a set of eight highly curated RNA families, with diverse sizes and functions, containing pseudoknots, and with little sequence similarity–an especially difficult test set. In spite of the difficult test set, the RNA fingerprint approach is very successful (ROC AUC > 0.95). Due to the inclusion of pseudoknots, the RNA fingerprint approach both covers a wider range of possible structures than methods based only on secondary structure, and its tolerance for incomplete structures suggests that it can be applied even to predicted structures. Source code is freely available at https://github.rcac.purdue.edu/mgribsko/XIOS_RNA_fingerprint. PMID:27755571

  15. SimRNA: a coarse-grained method for RNA folding simulations and 3D structure prediction.

    PubMed

    Boniecki, Michal J; Lach, Grzegorz; Dawson, Wayne K; Tomala, Konrad; Lukasz, Pawel; Soltysinski, Tomasz; Rother, Kristian M; Bujnicki, Janusz M

    2016-04-20

    RNA molecules play fundamental roles in cellular processes. Their function and interactions with other biomolecules are dependent on the ability to form complex three-dimensional (3D) structures. However, experimental determination of RNA 3D structures is laborious and challenging, and therefore, the majority of known RNAs remain structurally uncharacterized. Here, we present SimRNA: a new method for computational RNA 3D structure prediction, which uses a coarse-grained representation, relies on the Monte Carlo method for sampling the conformational space, and employs a statistical potential to approximate the energy and identify conformations that correspond to biologically relevant structures. SimRNA can fold RNA molecules using only sequence information, and, on established test sequences, it recapitulates secondary structure with high accuracy, including correct prediction of pseudoknots. For modeling of complex 3D structures, it can use additional restraints, derived from experimental or computational analyses, including information about secondary structure and/or long-range contacts. SimRNA also can be used to analyze conformational landscapes and identify potential alternative structures. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  16. RNA Characterization by Solid-State NMR Spectroscopy.

    PubMed

    Yang, Yufei; Wang, Shenlin

    2018-06-21

    The structures of RNAs, which play critical roles in various biological processes, provide important clues and insights into the biological functions of these molecules. However, RNA structure determination remains a challenging topic. In recent years, magic-angle-spinning solid-state NMR (MAS SSNMR) has emerged as an alternative technique for structural and dynamic characterization of RNA. MAS SSNMR has been successfully applied to provide atomic-level structural information about several RNA molecules and RNA-protein complexes. In this Minireview, we give an overview of recent progress in the field of MAS SSNMR based RNA structural characterization, and introduce sample preparation strategies and SSNMR spectroscopic techniques that have been incorporated to identify RNA structural elements. We also highlight a few impressive examples of RNAs that have been investigated extensively by SSNMR. Finally, we briefly discuss future technical trends in the use of MAS SSNMR to facilitate RNA structure determination. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

  17. Yeast ribonuclease III uses a network of multiple hydrogen bonds for RNA binding and cleavage.

    PubMed

    Lavoie, Mathieu; Abou Elela, Sherif

    2008-08-19

    Members of the bacterial RNase III family recognize a variety of short structured RNAs with few common features. It is not clear how this group of enzymes supports high cleavage fidelity while maintaining a broad base of substrates. Here we show that the yeast orthologue of RNase III (Rnt1p) uses a network of 2'-OH-dependent interactions to recognize substrates with different structures. We designed a series of bipartite substrates permitting the distinction between binding and cleavage defects. Each substrate was engineered to carry a single or multiple 2'-O-methyl or 2'-fluoro ribonucleotide substitutions to prevent the formation of hydrogen bonds with a specific nucleotide or group of nucleotides. Interestingly, introduction of 2'-O-methyl ribonucleotides near the cleavage site increased the rate of catalysis, indicating that 2'-OH groups are not required for cleavage. Substitution of nucleotides in the known Rnt1p binding site with 2'-O-methyl ribonucleotides inhibited cleavage, while single 2'-fluoro ribonucleotide substitutions did not. This indicates that while no single 2'-OH is essential for Rnt1p cleavage, small changes in the substrate structure are not tolerated. Strikingly, several nucleotide substitutions greatly increased the substrate dissociation constant with little or no effect on the Michaelis-Menten constant or rate of catalysis. Together, the results indicate that Rnt1p uses a network of nucleotide interactions to identify its substrate and support two distinct modes of binding. One mode is primarily mediated by the dsRNA binding domain and leads to the formation of a stable RNA/protein complex, while the other requires the presence of the nuclease and N-terminal domains and leads to RNA cleavage.

  18. Metal cofactor modulated folding and target recognition of HIV-1 NCp7.

    PubMed

    Ren, Weitong; Ji, Dongqing; Xu, Xiulian

    2018-01-01

    The HIV-1 nucleocapsid 7 (NCp7) plays crucial roles in multiple stages of the HIV-1 life cycle, and its biological functions rely on the binding of zinc ions. Understanding the molecular mechanism of how the zinc ions modulate the conformational dynamics and functions of the NCp7 is essential for drug development and HIV-1 treatment. In this work, using a structure-based coarse-grained model, we studied the effects of zinc cofactors on the folding and target RNA (SL3) recognition of the NCp7 by molecular dynamics simulations. After reproducing some key properties of the zinc binding and folding of the NCp7 observed in previous experiments, our simulations revealed several interesting features in the metal ion modulated folding and target recognition. Firstly, we showed that the zinc binding makes the folding transition states of the two zinc fingers less structured, which is in line with the Hammond effect observed typically in mutation, temperature or denaturant induced perturbations to protein structure and stability. Secondly, we showed that there exists mutual interplay between the zinc ion binding and NCp7-target recognition. Binding of zinc ions enhances the affinity between the NCp7 and the target RNA, whereas the formation of the NCp7-RNA complex reshapes the intrinsic energy landscape of the NCp7 and increases the stability and zinc affinity of the two zinc fingers. Thirdly, by characterizing the effects of salt concentrations on the target RNA recognition, we showed that the NCp7 achieves optimal balance between the affinity and binding kinetics near the physiologically relevant salt concentrations. In addition, the effects of zinc binding on the inter-domain conformational flexibility and folding cooperativity of the NCp7 were also discussed.

  19. Critical chemical features in trans-acting-responsive RNA are required for interaction with human immunodeficiency virus type 1 Tat protein.

    PubMed Central

    Sumner-Smith, M; Roy, S; Barnett, R; Reid, L S; Kuperman, R; Delling, U; Sonenberg, N

    1991-01-01

    The human immunodeficiency virus type 1 Tat protein binds to an RNA stem-loop structure called TAR which is present at the 5' end of all human immunodeficiency virus type 1 transcripts. This binding is centered on a bulge within the stem of TAR and is an essential step in the trans-activation process which results in a dramatic increase in viral gene expression. By analysis of a series of TAR derivatives produced by transcription or direct chemical synthesis, we determined the structural and chemical requirements for Tat binding. Tat binds well to structures which have a bulge of two to at least five unpaired bases bounded on both sides by a double-stranded RNA stem. This apparent flexibility in bulge size is in contrast to an absolute requirement for an unpaired uridine (U) in the 5'-most position of the bulge (+23). Substitution of the U with either natural bases or chemical analogs demonstrated that the imido group at the N-3 position and, possibly, the carbonyl group at the C-4 position of U are critical for Tat binding. Cytosine (C), which differs from U at only these positions, is not an acceptable substitute. Furthermore, methylation at N-3 abolishes binding. While methylation of U at the C-5 position has little effect on binding, fluorination reduces it, possibly because of its effects on relative tautomer stability at the N-3 and C-4 positions. Thus, we have identified key moieties in the U residue that are of importance for the binding of Tat to TAR RNA. We hypothesize that the invariant U is involved in hydrogen bond interactions with either another part of TAR or the TAR-binding domain in Tat. PMID:1895380

  20. Translational control of small heat shock genes in mesophilic and thermophilic cyanobacteria by RNA thermometers

    PubMed Central

    Cimdins, Annika; Klinkert, Birgit; Aschke-Sonnenborn, Ursula; Kaiser, Friederike M; Kortmann, Jens; Narberhaus, Franz

    2014-01-01

    Cyanobacteria constitute a heterogeneous phylum of oxygen-producing, photosynthetic prokaryotes. They are susceptible to various stress conditions like heat, salt, or light stress, all inducing the cyanobacterial heat shock response (HSR). Cyanobacterial small heat shock proteins (sHsps) are known to preserve thylakoid membrane integrity under stress conditions, thereby protecting the photosynthesis machinery. In Synechocystis sp PCC 6803, synthesis of the sHsp Hsp17 is regulated by an RNA thermometer (RNAT) in the 5′-untranslated region (5′-UTR) of the hsp17 mRNA. RNATs are direct temperature sensors that control expression of many bacterial heat shock and virulence genes. They hinder translation at low temperatures by base pairing, thus blocking ribosome access to the mRNA.   To explore the temperature range in which RNATs act, we studied various RNAT candidates upstream of sHsp genes from mesophilic and thermophilic cyanobacteria. The mesophilic cyanobacteria Anabaena variabilis and Nostoc sp chromosomally encode two sHsps each. Reporter gene studies suggested RNAT-mediated post-transcriptional regulation of shsp expression in both organisms. Detailed structural analysis of the two A. variabilis candidates revealed two novel RNAT types. The first, avashort, regulates translation primarily by masking of the AUG translational start codon. The second, featuring an extended initial hairpin, thus named avalong, presumably makes use of complex tertiary interaction. The 5′-UTR of the small heat shock gene hspA in the thermophile Thermosynechococcus elongatus is predicted to adopt an extended secondary structure. Structure probing revealed that the ribosome binding site was blocked at temperatures below 55 °C. The results of this study demonstrate that cyanobacteria commonly use RNATs to control expression of their small heat shock genes. PMID:24755616

  1. SRD: a Staphylococcus regulatory RNA database.

    PubMed

    Sassi, Mohamed; Augagneur, Yoann; Mauro, Tony; Ivain, Lorraine; Chabelskaya, Svetlana; Hallier, Marc; Sallou, Olivier; Felden, Brice

    2015-05-01

    An overflow of regulatory RNAs (sRNAs) was identified in a wide range of bacteria. We designed and implemented a new resource for the hundreds of sRNAs identified in Staphylococci, with primary focus on the human pathogen Staphylococcus aureus. The "Staphylococcal Regulatory RNA Database" (SRD, http://srd.genouest.org/) compiled all published data in a single interface including genetic locations, sequences and other features. SRD proposes novel and simplified identifiers for Staphylococcal regulatory RNAs (srn) based on the sRNA's genetic location in S. aureus strain N315 which served as a reference. From a set of 894 sequences and after an in-depth cleaning, SRD provides a list of 575 srn exempt of redundant sequences. For each sRNA, their experimental support(s) is provided, allowing the user to individually assess their validity and significance. RNA-seq analysis performed on strains N315, NCTC8325, and Newman allowed us to provide further details, upgrade the initial annotation, and identified 159 RNA-seq independent transcribed sRNAs. The lists of 575 and 159 sRNAs sequences were used to predict the number and location of srns in 18 S. aureus strains and 10 other Staphylococci. A comparison of the srn contents within 32 Staphylococcal genomes revealed a poor conservation between species. In addition, sRNA structure predictions obtained with MFold are accessible. A BLAST server and the intaRNA program, which is dedicated to target prediction, were implemented. SRD is the first sRNA database centered on a genus; it is a user-friendly and scalable device with the possibility to submit new sequences that should spread in the literature. © 2015 Sassi et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  2. Nearly complete rRNA genes assembled from across the metazoan animals: effects of more taxa, a structure-based alignment, and paired-sites evolutionary models on phylogeny reconstruction.

    PubMed

    Mallatt, Jon; Craig, Catherine Waggoner; Yoder, Matthew J

    2010-04-01

    This study (1) uses nearly complete rRNA-gene sequences from across Metazoa (197 taxa) to reconstruct animal phylogeny; (2) presents a highly annotated, manual alignment of these sequences with special reference to rRNA features including paired sites (http://purl.oclc.org/NET/rRNA/Metazoan_alignment) and (3) tests, after eliminating as few disruptive, rogue sequences as possible, if a likelihood framework can recover the main metazoan clades. We found that systematic elimination of approximately 6% of the sequences, including the divergent or unstably placed sequences of cephalopods, arrowworm, symphylan and pauropod myriapods, and of myzostomid and nemertodermatid worms, led to a tree that supported Ecdysozoa, Lophotrochozoa, Protostomia, and Bilateria. Deuterostomia, however, was never recovered, because the rRNA of urochordates goes (nonsignificantly) near the base of the Bilateria. Counterintuitively, when we modeled the evolution of the paired sites, phylogenetic resolution was not increased over traditional tree-building models that assume all sites in rRNA evolve independently. The rRNA genes of non-bilaterians contain a higher % AT than do those of most bilaterians. The rRNA genes of Acoela and Myzostomida were found to be secondarily shortened, AT-enriched, and highly modified, throwing some doubt on the location of these worms at the base of Bilateria in the rRNA tree--especially myzostomids, which other evidence suggests are annelids instead. Other findings are marsupial-with-placental mammals, arrowworms in Ecdysozoa (well supported here but contradicted by morphology), and Placozoa as sister to Cnidaria. Finally, despite the difficulties, the rRNA-gene trees are in strong concordance with trees derived from multiple protein-coding genes in supporting the new animal phylogeny. (c) 2009 Elsevier Inc. All rights reserved.

  3. A Dual Interaction Between the 5'- and 3'-Ends of the Melon Necrotic Spot Virus (MNSV) RNA Genome Is Required for Efficient Cap-Independent Translation.

    PubMed

    Miras, Manuel; Rodríguez-Hernández, Ana M; Romero-López, Cristina; Berzal-Herranz, Alfredo; Colchero, Jaime; Aranda, Miguel A; Truniger, Verónica

    2018-01-01

    In eukaryotes, the formation of a 5'-cap and 3'-poly(A) dependent protein-protein bridge is required for translation of mRNAs. In contrast, several plant virus RNA genomes lack both of these mRNA features, but instead have a 3'-CITE (for cap-independent translation enhancer), an RNA element present in their 3'-untranslated region that recruits translation initiation factors and is able to control their cap-independent translation. For several 3'-CITEs, direct RNA-RNA long-distance interactions based on sequence complementarity between the 5'- and 3'-ends are required for efficient translation, as they bring the translation initiation factors bound to the 3'-CITE to the 5'-end. For the carmovirus melon necrotic spot virus (MNSV), a 3'-CITE has been identified, and the presence of its 5'-end in cis has been shown to be required for its activity. Here, we analyze the secondary structure of the 5'-end of the MNSV RNA genome and identify two highly conserved nucleotide sequence stretches that are complementary to the apical loop of its 3'-CITE. In in vivo cap-independent translation assays with mutant constructs, by disrupting and restoring sequence complementarity, we show that the interaction between the 3'-CITE and at least one complementary sequence in the 5'-end is essential for virus RNA translation, although efficient virus translation and multiplication require both connections. The complementary sequence stretches are invariant in all MNSV isolates, suggesting that the dual 5'-3' RNA:RNA interactions are required for optimal MNSV cap-independent translation and multiplication.

  4. Stem-Loop RNA Hairpins in Giant Viruses: Invading rRNA-Like Repeats and a Template Free RNA

    PubMed Central

    Seligmann, Hervé; Raoult, Didier

    2018-01-01

    We examine the hypothesis that de novo template-free RNAs still form spontaneously, as they did at the origins of life, invade modern genomes, contribute new genetic material. Previously, analyses of RNA secondary structures suggested that some RNAs resembling ancestral (t)RNAs formed recently de novo, other parasitic sequences cluster with rRNAs. Here positive control analyses of additional RNA secondary structures confirm ancestral and de novo statuses of RNA grouped according to secondary structure. Viroids with branched stems resemble de novo RNAs, rod-shaped viroids resemble rRNA secondary structures, independently of GC contents. 5′ UTR leading regions of West Nile and Dengue flavivirid viruses resemble de novo and rRNA structures, respectively. An RNA homologous with Megavirus, Dengue and West Nile genomes, copperhead snake microsatellites and levant cotton repeats, not templated by Mimivirus' genome, persists throughout Mimivirus' infection. Its secondary structure clusters with candidate de novo RNAs. The saltatory phyletic distribution and secondary structure of Mimivirus' peculiar RNA suggest occasional template-free polymerization of this sequence, rather than noncanonical transcriptions (swinger polymerization, posttranscriptional editing). PMID:29449833

  5. Vfold: a web server for RNA structure and folding thermodynamics prediction.

    PubMed

    Xu, Xiaojun; Zhao, Peinan; Chen, Shi-Jie

    2014-01-01

    The ever increasing discovery of non-coding RNAs leads to unprecedented demand for the accurate modeling of RNA folding, including the predictions of two-dimensional (base pair) and three-dimensional all-atom structures and folding stabilities. Accurate modeling of RNA structure and stability has far-reaching impact on our understanding of RNA functions in human health and our ability to design RNA-based therapeutic strategies. The Vfold server offers a web interface to predict (a) RNA two-dimensional structure from the nucleotide sequence, (b) three-dimensional structure from the two-dimensional structure and the sequence, and (c) folding thermodynamics (heat capacity melting curve) from the sequence. To predict the two-dimensional structure (base pairs), the server generates an ensemble of structures, including loop structures with the different intra-loop mismatches, and evaluates the free energies using the experimental parameters for the base stacks and the loop entropy parameters given by a coarse-grained RNA folding model (the Vfold model) for the loops. To predict the three-dimensional structure, the server assembles the motif scaffolds using structure templates extracted from the known PDB structures and refines the structure using all-atom energy minimization. The Vfold-based web server provides a user friendly tool for the prediction of RNA structure and stability. The web server and the source codes are freely accessible for public use at "http://rna.physics.missouri.edu".

  6. A machine learning approach for predicting CRISPR-Cas9 cleavage efficiencies and patterns underlying its mechanism of action.

    PubMed

    Abadi, Shiran; Yan, Winston X; Amar, David; Mayrose, Itay

    2017-10-01

    The adaptation of the CRISPR-Cas9 system as a genome editing technique has generated much excitement in recent years owing to its ability to manipulate targeted genes and genomic regions that are complementary to a programmed single guide RNA (sgRNA). However, the efficacy of a specific sgRNA is not uniquely defined by exact sequence homology to the target site, thus unintended off-targets might additionally be cleaved. Current methods for sgRNA design are mainly concerned with predicting off-targets for a given sgRNA using basic sequence features and employ elementary rules for ranking possible sgRNAs. Here, we introduce CRISTA (CRISPR Target Assessment), a novel algorithm within the machine learning framework that determines the propensity of a genomic site to be cleaved by a given sgRNA. We show that the predictions made with CRISTA are more accurate than other available methodologies. We further demonstrate that the occurrence of bulges is not a rare phenomenon and should be accounted for in the prediction process. Beyond predicting cleavage efficiencies, the learning process provides inferences regarding patterns that underlie the mechanism of action of the CRISPR-Cas9 system. We discover that attributes that describe the spatial structure and rigidity of the entire genomic site as well as those surrounding the PAM region are a major component of the prediction capabilities.

  7. SPAR: small RNA-seq portal for analysis of sequencing experiments.

    PubMed

    Kuksa, Pavel P; Amlie-Wolf, Alexandre; Katanic, Živadin; Valladares, Otto; Wang, Li-San; Leung, Yuk Yee

    2018-05-04

    The introduction of new high-throughput small RNA sequencing protocols that generate large-scale genomics datasets along with increasing evidence of the significant regulatory roles of small non-coding RNAs (sncRNAs) have highlighted the urgent need for tools to analyze and interpret large amounts of small RNA sequencing data. However, it remains challenging to systematically and comprehensively discover and characterize sncRNA genes and specifically-processed sncRNA products from these datasets. To fill this gap, we present Small RNA-seq Portal for Analysis of sequencing expeRiments (SPAR), a user-friendly web server for interactive processing, analysis, annotation and visualization of small RNA sequencing data. SPAR supports sequencing data generated from various experimental protocols, including smRNA-seq, short total RNA sequencing, microRNA-seq, and single-cell small RNA-seq. Additionally, SPAR includes publicly available reference sncRNA datasets from our DASHR database and from ENCODE across 185 human tissues and cell types to produce highly informative small RNA annotations across all major small RNA types and other features such as co-localization with various genomic features, precursor transcript cleavage patterns, and conservation. SPAR allows the user to compare the input experiment against reference ENCODE/DASHR datasets. SPAR currently supports analyses of human (hg19, hg38) and mouse (mm10) sequencing data. SPAR is freely available at https://www.lisanwanglab.org/SPAR.

  8. Solution structure and thermodynamics of 2',5' RNA intercalation.

    PubMed

    Horowitz, Eric D; Lilavivat, Seth; Holladay, Benjamin W; Germann, Markus W; Hud, Nicholas V

    2009-04-29

    As a means to explore the influence of the nucleic acid backbone on the intercalative binding of ligands to DNA and RNA, we have determined the solution structure of a proflavine-bound 2',5'-linked octamer duplex with the sequence GCCGCGGC. This structure represents the first NMR structure of an intercalated RNA duplex, of either backbone structural isomer. By comparison with X-ray crystal structures, we have identified similarities and differences between intercalated 3',5' and 2',5'-linked RNA duplexes. First, the two forms of RNA have different sugar pucker geometries at the intercalated nucleotide steps, yet have the same interphosphate distances. Second, as in intercalated 3',5' RNA, the phosphate backbone angle zeta at the 2',5' RNA intercalation site prefers to be in the trans conformation, whereas unintercalated 2',5' and 3',5' RNA prefer the -gauche conformation. These observations provide new insights regarding the transitions required for intercalation of a phosphodiester-ribose backbone and suggest a possible contribution of the backbone to the origin of the nearest-neighbor exclusion principle. Thermodynamic studies presented for intercalation of both structural RNA isomers also reveal a surprising sensitivity of intercalator binding enthalpy and entropy to the details of RNA backbone structure.

  9. RNAstructure: software for RNA secondary structure prediction and analysis.

    PubMed

    Reuter, Jessica S; Mathews, David H

    2010-03-15

    To understand an RNA sequence's mechanism of action, the structure must be known. Furthermore, target RNA structure is an important consideration in the design of small interfering RNAs and antisense DNA oligonucleotides. RNA secondary structure prediction, using thermodynamics, can be used to develop hypotheses about the structure of an RNA sequence. RNAstructure is a software package for RNA secondary structure prediction and analysis. It uses thermodynamics and utilizes the most recent set of nearest neighbor parameters from the Turner group. It includes methods for secondary structure prediction (using several algorithms), prediction of base pair probabilities, bimolecular structure prediction, and prediction of a structure common to two sequences. This contribution describes new extensions to the package, including a library of C++ classes for incorporation into other programs, a user-friendly graphical user interface written in JAVA, and new Unix-style text interfaces. The original graphical user interface for Microsoft Windows is still maintained. The extensions to RNAstructure serve to make RNA secondary structure prediction user-friendly. The package is available for download from the Mathews lab homepage at http://rna.urmc.rochester.edu/RNAstructure.html.
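    As a rough illustration of what dynamic-programming secondary structure prediction involves, the sketch below implements the classic Nussinov base-pair maximization algorithm. This is a deliberately simplified stand-in: it is not the RNAstructure package, it does not use nearest neighbor thermodynamic parameters, and the function name nussinov_pairs and the default minimum loop length of 3 are assumptions chosen for illustration.

```python
def nussinov_pairs(seq: str, min_loop: int = 3) -> int:
    """Return the maximum number of nested base pairs in seq.

    Simplified Nussinov recursion: counts Watson-Crick and G-U wobble
    pairs, requires at least min_loop unpaired bases inside a hairpin,
    and ignores energies and pseudoknots.
    """
    allowed = {("A", "U"), ("U", "A"), ("G", "C"),
               ("C", "G"), ("G", "U"), ("U", "G")}
    n = len(seq)
    if n == 0:
        return 0
    # best[i][j] = max number of pairs within seq[i..j] (inclusive)
    best = [[0] * n for _ in range(n)]
    for span in range(min_loop + 1, n):
        for i in range(n - span):
            j = i + span
            score = best[i][j - 1]  # case: position j is unpaired
            # case: position j pairs with some k, leaving > min_loop bases between
            for k in range(i, j - min_loop):
                if (seq[k], seq[j]) in allowed:
                    left = best[i][k - 1] if k > i else 0
                    score = max(score, left + 1 + best[k + 1][j - 1])
            best[i][j] = score
    return best[0][n - 1]
```

    For example, nussinov_pairs("GGGAAAUCCC") finds the three-pair stem closing a hairpin loop. Thermodynamic methods replace the pair count with free-energy terms but keep a similar recursive decomposition.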

  10. Computational Characterization of Exogenous MicroRNAs that Can Be Transferred into Human Circulation

    PubMed Central

    Shu, Jiang; Chiang, Kevin; Zempleni, Janos; Cui, Juan

    2015-01-01

    MicroRNAs were long considered to be synthesized endogenously, until very recent discoveries showed that humans can absorb dietary microRNAs of animal and plant origin, although the mechanism remains unknown. Compelling evidence of microRNAs from rice, milk, and honeysuckle being transported to human blood and tissues has created great interest in the fundamental questions of which exogenous microRNAs can be transferred into human circulation, how, and whether they exert functions in humans. Here we present an integrated genomics and computational analysis to study the potential deciding features of transportable microRNAs. Specifically, we analyzed all publicly available microRNAs, a total of 34,612 from 194 species, with 1,102 features derived from the microRNA sequence and structure. Through in-depth bioinformatics analysis, 8 groups of discriminative features have been used to characterize human circulating microRNAs and infer the likelihood that a microRNA will get transferred into human circulation. For example, 345 dietary microRNAs have been predicted as highly transportable candidates, of which 117 have sequences identical to their human homologs and 73 are known to be associated with exosomes. Through a milk feeding experiment, we have validated 9 cow-milk microRNAs in human plasma using microRNA-sequencing analysis, including the top ranked microRNAs such as bta-miR-487b, miR-181b, and miR-421. The implications in health-related processes have been illustrated in the functional analysis. This work demonstrates that data-driven computational analysis is highly promising for studying novel molecular characteristics of transportable microRNAs while bypassing the complex mechanistic details. PMID:26528912

  12. The genome of Pelobacter carbinolicus reveals surprising metabolic capabilities and physiological features

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Aklujkar, Muktak; Haveman, Shelley; DiDonato Jr, Raymond

    2012-01-01

    Background: The bacterium Pelobacter carbinolicus is able to grow by fermentation, syntrophic hydrogen/formate transfer, or electron transfer to sulfur from short-chain alcohols, hydrogen or formate; it does not oxidize acetate and is not known to ferment any sugars or grow autotrophically. The genome of P. carbinolicus was sequenced in order to understand its metabolic capabilities and physiological features in comparison with its relatives, acetate-oxidizing Geobacter species. Results: Pathways were predicted for catabolism of known substrates: 2,3-butanediol, acetoin, glycerol, 1,2-ethanediol, ethanolamine, choline and ethanol. Multiple isozymes of 2,3-butanediol dehydrogenase, ATP synthase and [FeFe]-hydrogenase were differentiated and assigned roles according to their structural properties and genomic contexts. The absence of asparagine synthetase and the presence of a mutant tRNA for asparagine encoded among RNA-active enzymes suggest that P. carbinolicus may make asparaginyl-tRNA in a novel way. Catabolic glutamate dehydrogenases were discovered, implying that the tricarboxylic acid (TCA) cycle can function catabolically. A phosphotransferase system for uptake of sugars was discovered, along with enzymes that function in 2,3-butanediol production. Pyruvate:ferredoxin/flavodoxin oxidoreductase was identified as a potential bottleneck in both the supply of oxaloacetate for oxidation of acetate by the TCA cycle and the connection of glycolysis to production of ethanol. The P. carbinolicus genome was found to encode autotransporters and various appendages, including three proteins with similarity to the geopilin of electroconductive nanowires. Conclusions: Several surprising metabolic capabilities and physiological features were predicted from the genome of P. carbinolicus, suggesting that it is more versatile than anticipated.

  13. Cytochemical features common to nucleoli and cytoplasmic nucleoloids of Olea europaea meiocytes: detection of rRNA by in situ hybridization.

    PubMed

    Alché, J D; Fernández, M C; Rodríguez-García, M I

    1994-02-01

    We used light and electron microscopic techniques to study the composition of cytoplasmic nucleoloids during meiotic division in Olea europaea. Nucleoloids were found in two clearly distinguishable morphological varieties: one similar in morphology to the nucleolus, and composed mainly of dense fibrillar component, and another surrounded by many ribosome-like particles. Cytochemical and immunocytochemical techniques showed similar reactivities in nucleoloids and the nucleolus: both are ribonucleoproteic in nature, and possess argyrophilic, argentaffinic and highly phosphorylated proteins. Immunohistochemical techniques failed to detect DNA in either structure. In situ hybridization to an 18S rRNA probe demonstrated the presence of ribosomal transcripts in both the nucleolus and nucleoloids. These similarities in morphology and composition may reflect similar functionalities.

  14. Secondary structure of the 3'-noncoding region of flavivirus genomes: comparative analysis of base pairing probabilities.

    PubMed

    Rauscher, S; Flamm, C; Mandl, C W; Heinz, F X; Stadler, P F

    1997-07-01

    The prediction of the complete matrix of base pairing probabilities was applied to the 3' noncoding region (NCR) of flavivirus genomes. This approach identifies not only well-defined secondary structure elements, but also regions of high structural flexibility. Flaviviruses, many of which are important human pathogens, have a common genomic organization, but exhibit a significant degree of RNA sequence diversity in the functionally important 3'-NCR. We demonstrate the presence of secondary structures shared by all flaviviruses, as well as structural features that are characteristic for groups of viruses within the genus reflecting the established classification scheme. The significance of most of the predicted structures is corroborated by compensatory mutations. The availability of infectious clones for several flaviviruses will allow the assessment of these structural elements in processes of the viral life cycle, such as replication and assembly.

  15. A Method for WD40 Repeat Detection and Secondary Structure Prediction

    PubMed Central

    Wang, Yang; Jiang, Fan; Zhuo, Zhu; Wu, Xian-Hui; Wu, Yun-Dong

    2013-01-01

    WD40-repeat proteins (WD40s), as one of the largest protein families in eukaryotes, play vital roles in assembling protein-protein/DNA/RNA complexes. WD40s fold into similar β-propeller structures despite diversified sequences. A program, WDSP (WD40 repeat protein Structure Predictor), has been developed to accurately identify WD40 repeats and predict their secondary structures. The method is designed specifically for WD40 proteins by incorporating both local residue information and non-local family-specific structural features. It overcomes the problem of highly diversified protein sequences and variable loops. In addition, WDSP achieves a better prediction in identifying multiple WD40-domain proteins by taking the global combination of repeats into consideration. In secondary structure prediction, the average Q3 accuracy of WDSP in a jack-knife test reaches 93.7%. The disease-related protein LRRK2 was used as a representative example to demonstrate the structure prediction. PMID:23776530

  16. Role of the terminator hairpin in the biogenesis of functional Hfq-binding sRNAs

    PubMed Central

    Morita, Teppei; Nishino, Ryo; Aiba, Hiroji

    2017-01-01

    Rho-independent transcription terminators of the genes encoding bacterial Hfq-binding sRNAs possess a set of seven or more T residues at the 3′ end, as noted in previous studies. Here, we have studied the role of the terminator hairpin in the biogenesis of sRNAs focusing on SgrS and RyhB in Escherichia coli. We constructed variant sRNA genes in which the GC-rich inverted repeat sequences are extended to stabilize the terminator hairpins. We demonstrate that the extension of the hairpin stem leads to generation of heterogeneous transcripts in which the poly(U) tail is shortened. The transcripts with shortened poly(U) tails no longer bind to Hfq and lose the ability to repress the target mRNAs. The shortened transcripts are generated in an in vitro transcription system with purified RNA polymerase, indicating that the generation of shortened transcripts is caused by premature transcription termination. We conclude that the terminator structure of sRNA genes is optimized to generate functional sRNAs. Thus, the Rho-independent terminators of sRNA genes possess two common features: a long T residue stretch that is a prerequisite for generation of functional sRNAs and a moderate strength of hairpin structure that ensures the termination at the seventh or longer position within the consecutive T stretch. The modulation of the termination position at the Rho-independent terminators is critical for biosynthesis of functional sRNAs. PMID:28606943

  17. Structural and Functional Analysis of the Interaction Between the Nucleoporin Nup98 and the mRNA Export Factor Rae1

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Y Ren; H Seo; G Blobel

    The export of mRNAs is a multistep process, involving the packaging of mRNAs into messenger ribonucleoprotein particles (mRNPs), their transport through nuclear pore complexes, and mRNP remodeling events prior to translation. Ribonucleic acid export 1 (Rae1) and Nup98 are evolutionarily conserved mRNA export factors that are targeted by the vesicular stomatitis virus matrix protein to inhibit host cell nuclear export. Here, we present the crystal structure of human Rae1 in complex with the Gle2-binding sequence (GLEBS) of Nup98 at 1.65 Å resolution. Rae1 forms a seven-bladed β-propeller with several extensive surface loops. The Nup98 GLEBS motif forms an ~50-Å-long hairpin that binds with its C-terminal arm to an essentially invariant hydrophobic surface that extends over the entire top face of the Rae1 β-propeller. The C-terminal arm of the GLEBS hairpin is necessary and sufficient for Rae1 binding, and we identify a tandem glutamate element in this arm as critical for complex formation. The Rae1·Nup98^GLEBS surface features an additional conserved patch with a positive electrostatic potential, and we demonstrate that the complex possesses single-stranded RNA-binding capability. Together, these data suggest that the Rae1·Nup98 complex directly binds to the mRNP at several stages of the mRNA export pathway.

  18. Structural insight and flexible features of NS5 proteins from all four serotypes of Dengue virus in solution

    PubMed Central

    Saw, Wuan Geok; Tria, Giancarlo; Grüber, Ardina; Subramanian Manimekalai, Malathy Sony; Zhao, Yongqian; Chandramohan, Arun; Srinivasan Anand, Ganesh; Matsui, Tsutomu; Weiss, Thomas M.; Vasudevan, Subhash G.; Grüber, Gerhard

    2015-01-01

    Infection by the four serotypes of Dengue virus (DENV-1 to DENV-4) causes an important arthropod-borne viral disease in humans. The multifunctional DENV nonstructural protein 5 (NS5) is essential for capping and replication of the viral RNA and harbours a methyltransferase (MTase) domain and an RNA-dependent RNA polymerase (RdRp) domain. In this study, insights into the overall structure and flexibility of the entire NS5 of all four Dengue virus serotypes in solution are presented for the first time. The solution models derived revealed an arrangement of the full-length NS5 (NS5FL) proteins with the MTase domain positioned at the top of the RdRP domain. The DENV-1 to DENV-4 NS5 forms are elongated and flexible in solution, with DENV-4 NS5 being more compact relative to NS5 from DENV-1, DENV-2 and DENV-3. Solution studies of the individual MTase and RdRp domains show the compactness of the RdRp domain as well as the contribution of the MTase domain and the ten-residue linker region to the flexibility of the entire NS5. Swapping the ten-residue linker between DENV-4 NS5FL and DENV-3 NS5FL demonstrated its importance in MTase–RdRp communication and in concerted interaction with viral and host proteins, as probed by amide hydrogen/deuterium mass spectrometry. Conformational alterations owing to RNA binding are presented. PMID:26527147

  19. Why double-stranded RNA resists condensation

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tolokh, Igor S.; Pabit, Suzette; Katz, Andrea M.

    2014-09-15

    The addition of small amounts of multivalent cations to solutions containing double-stranded DNA leads to attraction between the negatively charged helices and eventually to condensation. Surprisingly, this effect is suppressed in double-stranded RNA, which carries the same charge as the DNA, but assumes a different double helical form. However, additional characterization of short (25 base-pairs) nucleic acid (NA) duplex structures by circular dichroism shows that measured differences in condensation are not solely determined by duplex helical geometry. Here we combine experiment, theory, and atomistic simulations to propose a mechanism that connects the observed variations in condensation of short NA duplexes with the spatial variation of cobalt hexammine (CoHex) binding at the NA duplex surface. The atomistic picture that emerged showed that CoHex distributions around the NA reveal two major NA-CoHex binding modes, internal and external, distinguished by the proximity of bound CoHex to the helical axis. Decreasing trends in experimentally observed condensation propensity of the four studied NA duplexes (from B-like form of homopolymeric DNA, to mixed sequence DNA, to DNA:RNA hybrid, to A-like RNA) are explained by the progressive decrease of a single quantity: the fraction of CoHex ions in the external binding mode. Thus, while NA condensation depends on a complex interplay between various structural and sequence features, our coupled experimental and theoretical results suggest a new model in which a single parameter connects the NA condensation propensity with geometry and sequence dependence of CoHex binding.

  20. Structural insight and flexible features of NS5 proteins from all four serotypes of Dengue virus in solution

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Saw, Wuan Geok; Tria, Giancarlo; Grüber, Ardina

    Infection by the four serotypes of Dengue virus (DENV-1 to DENV-4) causes an important arthropod-borne viral disease in humans. The multifunctional DENV nonstructural protein 5 (NS5) is essential for capping and replication of the viral RNA and harbours a methyltransferase (MTase) domain and an RNA-dependent RNA polymerase (RdRp) domain. In this study, insights into the overall structure and flexibility of the entire NS5 of all four Dengue virus serotypes in solution are presented for the first time. The solution models derived revealed an arrangement of the full-length NS5 (NS5FL) proteins with the MTase domain positioned at the top of the RdRP domain. The DENV-1 to DENV-4 NS5 forms are elongated and flexible in solution, with DENV-4 NS5 being more compact relative to NS5 from DENV-1, DENV-2 and DENV-3. Solution studies of the individual MTase and RdRp domains show the compactness of the RdRp domain as well as the contribution of the MTase domain and the ten-residue linker region to the flexibility of the entire NS5. Swapping the ten-residue linker between DENV-4 NS5FL and DENV-3 NS5FL demonstrated its importance in MTase–RdRp communication and in concerted interaction with viral and host proteins, as probed by amide hydrogen/deuterium mass spectrometry. Conformational alterations owing to RNA binding are presented.
