Sample records for rna structure analysis

  1. Structural landscape of base pairs containing post-transcriptional modifications in RNA

    PubMed Central

    Seelam, Preethi P.; Sharma, Purshotam

    2017-01-01

    Base pairs involving post-transcriptionally modified nucleobases are believed to play important roles in a wide variety of functional RNAs. Here we present our attempts toward understanding the structural and functional role of naturally occurring modified base pairs using a combination of X-ray crystal structure database analysis, sequence analysis, and advanced quantum chemical methods. Our bioinformatics analysis reveals that despite their presence in all major secondary structural elements, modified base pairs are most prevalent in tRNA crystal structures and most commonly involve guanine or uridine modifications. Further, analysis of tRNA sequences reveals additional examples of modified base pairs at structurally conserved tRNA regions and highlights the conservation patterns of these base pairs in three domains of life. Comparison of structures and binding energies of modified base pairs with their unmodified counterparts, using quantum chemical methods, allowed us to classify the base modifications in terms of the nature of their electronic structure effects on base-pairing. Analysis of specific structural contexts of modified base pairs in RNA crystal structures revealed several interesting scenarios, including those at the tRNA:rRNA interface, antibiotic-binding sites on the ribosome, and the three-way junctions within tRNA. These scenarios, when analyzed in the context of available experimental data, allowed us to correlate the occurrence and strength of modified base pairs with their specific functional roles. Overall, our study highlights the structural importance of modified base pairs in RNA and points toward the need for greater appreciation of the role of modified bases and their interactions, in the context of many biological processes involving RNA. PMID:28341704

  2. RNA-TVcurve: a Web server for RNA secondary structure comparison based on a multi-scale similarity of its triple vector curve representation.

    PubMed

    Li, Ying; Shi, Xiaohu; Liang, Yanchun; Xie, Juan; Zhang, Yu; Ma, Qin

    2017-01-21

    RNAs have been found to carry diverse functionalities in nature. Inferring the similarity between two given RNAs is a fundamental step to understand and interpret their functional relationship. The majority of functional RNAs show conserved secondary structures, rather than sequence conservation. Those algorithms relying on sequence-based features usually have limitations in their prediction performance. Hence, integrating RNA structure features is very critical for RNA analysis. Existing algorithms mainly fall into two categories: alignment-based and alignment-free. The alignment-free algorithms of RNA comparison usually have lower time complexity than alignment-based algorithms. An alignment-free RNA comparison algorithm was proposed, in which novel numerical representations RNA-TVcurve (triple vector curve representation) of RNA sequence and corresponding secondary structure features are provided. Then a multi-scale similarity score of two given RNAs was designed based on wavelet decomposition of their numerical representation. In support of RNA mutation and phylogenetic analysis, a web server (RNA-TVcurve) was designed based on this alignment-free RNA comparison algorithm. It provides three functional modules: 1) visualization of numerical representation of RNA secondary structure; 2) detection of single-point mutation based on secondary structure; and 3) comparison of pairwise and multiple RNA secondary structures. The inputs of the web server require RNA primary sequences, while corresponding secondary structures are optional. For the primary sequences alone, the web server can compute the secondary structures using free energy minimization algorithm in terms of RNAfold tool from Vienna RNA package. RNA-TVcurve is the first integrated web server, based on an alignment-free method, to deliver a suite of RNA analysis functions, including visualization, mutation analysis and multiple RNAs structure comparison. The comparison results with two popular RNA comparison tools, RNApdist and RNAdistance, showcased that RNA-TVcurve can efficiently capture subtle relationships among RNAs for mutation detection and non-coding RNA classification. All the relevant results were shown in an intuitive graphical manner, and can be freely downloaded from this server. RNA-TVcurve, along with test examples and detailed documents, are available at: http://ml.jlu.edu.cn/tvcurve/ .

  3. bpRNA: large-scale automated annotation and analysis of RNA secondary structure.

    PubMed

    Danaee, Padideh; Rouches, Mason; Wiley, Michelle; Deng, Dezhong; Huang, Liang; Hendrix, David

    2018-05-09

    While RNA secondary structure prediction from sequence data has made remarkable progress, there is a need for improved strategies for annotating the features of RNA secondary structures. Here, we present bpRNA, a novel annotation tool capable of parsing RNA structures, including complex pseudoknot-containing RNAs, to yield an objective, precise, compact, unambiguous, easily-interpretable description of all loops, stems, and pseudoknots, along with the positions, sequence, and flanking base pairs of each such structural feature. We also introduce several new informative representations of RNA structure types to improve structure visualization and interpretation. We have further used bpRNA to generate a web-accessible meta-database, 'bpRNA-1m', of over 100 000 single-molecule, known secondary structures; this is both more fully and accurately annotated and over 20-times larger than existing databases. We use a subset of the database with highly similar (≥90% identical) sequences filtered out to report on statistical trends in sequence, flanking base pairs, and length. Both the bpRNA method and the bpRNA-1m database will be valuable resources both for specific analysis of individual RNA molecules and large-scale analyses such as are useful for updating RNA energy parameters for computational thermodynamic predictions, improving machine learning models for structure prediction, and for benchmarking structure-prediction algorithms.

  4. Visualization of RNA structure models within the Integrative Genomics Viewer.

    PubMed

    Busan, Steven; Weeks, Kevin M

    2017-07-01

    Analyses of the interrelationships between RNA structure and function are increasingly important components of genomic studies. The SHAPE-MaP strategy enables accurate RNA structure probing and realistic structure modeling of kilobase-length noncoding RNAs and mRNAs. Existing tools for visualizing RNA structure models are not suitable for efficient analysis of long, structurally heterogeneous RNAs. In addition, structure models are often advantageously interpreted in the context of other experimental data and gene annotation information, for which few tools currently exist. We have developed a module within the widely used and well supported open-source Integrative Genomics Viewer (IGV) that allows visualization of SHAPE and other chemical probing data, including raw reactivities, data-driven structural entropies, and data-constrained base-pair secondary structure models, in context with linear genomic data tracks. We illustrate the usefulness of visualizing RNA structure in the IGV by exploring structure models for a large viral RNA genome, comparing bacterial mRNA structure in cells with its structure under cell- and protein-free conditions, and comparing a noncoding RNA structure modeled using SHAPE data with a base-pairing model inferred through sequence covariation analysis. © 2017 Busan and Weeks; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  5. Analysis of RNA structure using small-angle X-ray scattering

    PubMed Central

    Cantara, William A.; Olson, Erik D.; Musier-Forsyth, Karin

    2016-01-01

    In addition to their role in correctly attaching specific amino acids to cognate tRNAs, aminoacyl-tRNA synthetases (aaRS) have been found to possess many alternative functions and often bind to and act on other nucleic acids. In contrast to the well-defined 3D structure of tRNA, the structures of many of the other RNAs recognized by aaRSs have not been solved. Despite advances in the use of X-ray crystallography (XRC), nuclear magnetic resonance (NMR) spectroscopy and cryo-electron microscopy (cryo-EM) for structural characterization of biomolecules, significant challenges to solving RNA structures still exist. Recently, small-angle X-ray scattering (SAXS) has been increasingly employed to characterize the 3D structures of RNAs and RNA-protein complexes. SAXS is capable of providing low-resolution tertiary structure information under physiological conditions and with less intensive sample preparation and data analysis requirements than XRC, NMR and cryo-EM. In this article, we describe best practices involved in the process of RNA and RNA-protein sample preparation, SAXS data collection, data analysis, and structural model building. PMID:27777026

  6. RNAstructure: software for RNA secondary structure prediction and analysis.

    PubMed

    Reuter, Jessica S; Mathews, David H

    2010-03-15

    To understand an RNA sequence's mechanism of action, the structure must be known. Furthermore, target RNA structure is an important consideration in the design of small interfering RNAs and antisense DNA oligonucleotides. RNA secondary structure prediction, using thermodynamics, can be used to develop hypotheses about the structure of an RNA sequence. RNAstructure is a software package for RNA secondary structure prediction and analysis. It uses thermodynamics and utilizes the most recent set of nearest neighbor parameters from the Turner group. It includes methods for secondary structure prediction (using several algorithms), prediction of base pair probabilities, bimolecular structure prediction, and prediction of a structure common to two sequences. This contribution describes new extensions to the package, including a library of C++ classes for incorporation into other programs, a user-friendly graphical user interface written in JAVA, and new Unix-style text interfaces. The original graphical user interface for Microsoft Windows is still maintained. The extensions to RNAstructure serve to make RNA secondary structure prediction user-friendly. The package is available for download from the Mathews lab homepage at http://rna.urmc.rochester.edu/RNAstructure.html.

  7. Probing the transition state for nucleic acid hybridization using phi-value analysis.

    PubMed

    Kim, Jandi; Shin, Jong-Shik

    2010-04-27

    Genetic regulation by noncoding RNA elements such as microRNA and small interfering RNA (siRNA) involves hybridization of a short single-stranded RNA with a complementary segment in a target mRNA. The physical basis of the hybridization process between the structured nucleic acids is not well understood primarily because of the lack of information about the transition-state structure. Here we use transition-state theory, inspired by phi-value analysis in protein folding studies, to provide quantitative analysis of the relationship between changes in the secondary structure stability and the activation free energy. Time course monitoring of the hybridization reaction was performed under pseudo-steady-state conditions using a single fluorophore. The phi-value analysis indicates that the native secondary structure remains intact in the transition state. The nativelike transition state was confirmed via examination of the salt dependence of the hybridization kinetics, indicating that the number of sodium ions associated with the transition state was not substantially affected by changes in the native secondary structure. These results propose that hybridization between structured nucleic acids undergoes a transition state leading to formation of a nucleation complex and then is followed by sequential displacement of preexisting base pairings involving successive small energy barriers. The proposed mechanism might provide new insight into physical processes during small RNA-mediated gene silencing, which is essential to selection of a target mRNA segment for siRNA design.

  8. RNA2DMut: a web tool for the design and analysis of RNA structure mutations.

    PubMed

    Moss, Walter N

    2018-03-01

    With the widespread application of high-throughput sequencing, novel RNA sequences are being discovered at an astonishing rate. The analysis of function, however, lags behind. In both the cis - and trans -regulatory functions of RNA, secondary structure (2D base-pairing) plays essential regulatory roles. In order to test RNA function, it is essential to be able to design and analyze mutations that can affect structure. This was the motivation for the creation of the RNA2DMut web tool. With RNA2DMut, users can enter in RNA sequences to analyze, constrain mutations to specific residues, or limit changes to purines/pyrimidines. The sequence is analyzed at each base to determine the effect of every possible point mutation on 2D structure. The metrics used in RNA2DMut rely on the calculation of the Boltzmann structure ensemble and do not require a robust 2D model of RNA structure for designing mutations. This tool can facilitate a wide array of uses involving RNA: for example, in designing and evaluating mutants for biological assays, interrogating RNA-protein interactions, identifying key regions to alter in SELEX experiments, and improving RNA folding and crystallization properties for structural biology. Additional tools are available to help users introduce other mutations (e.g., indels and substitutions) and evaluate their effects on RNA structure. Example calculations are shown for five RNAs that require 2D structure for their function: the MALAT1 mascRNA, an influenza virus splicing regulatory motif, the EBER2 viral noncoding RNA, the Xist lncRNA repA region, and human Y RNA 5. RNA2DMut can be accessed at https://rna2dmut.bb.iastate.edu/. © 2018 Moss; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  9. Kinetic analysis of the effects of target structure on siRNA efficiency

    NASA Astrophysics Data System (ADS)

    Chen, Jiawen; Zhang, Wenbing

    2012-12-01

    RNAi efficiency for target cleavage and protein expression is related to the target structure. Considering the RNA-induced silencing complex (RISC) as a multiple turnover enzyme, we investigated the effect of target mRNA structure on siRNA efficiency with kinetic analysis. The 4-step model was used to study the target cleavage kinetic process: hybridization nucleation at an accessible target site, RISC-mRNA hybrid elongation along with mRNA target structure melting, target cleavage, and enzyme reactivation. At this model, the terms accounting for the target accessibility, stability, and the seed and the nucleation site effects are all included. The results are in good agreement with that of experiments which show different arguments about the structure effects on siRNA efficiency. It shows that the siRNA efficiency is influenced by the integrated factors of target's accessibility, stability, and the seed effects. To study the off-target effects, a simple model of one siRNA binding to two mRNA targets was designed. By using this model, the possibility for diminishing the off-target effects by the concentration of siRNA was discussed.

  10. Structure, recognition and adaptive binding in RNA aptamer complexes.

    PubMed

    Patel, D J; Suri, A K; Jiang, F; Jiang, L; Fan, P; Kumar, R A; Nonin, S

    1997-10-10

    Novel features of RNA structure, recognition and discrimination have been recently elucidated through the solution structural characterization of RNA aptamers that bind cofactors, aminoglycoside antibiotics, amino acids and peptides with high affinity and specificity. This review presents the solution structures of RNA aptamer complexes with adenosine monophosphate, flavin mononucleotide, arginine/citrulline and tobramycin together with an example of hydrogen exchange measurements of the base-pair kinetics for the AMP-RNA aptamer complex. A comparative analysis of the structures of these RNA aptamer complexes yields the principles, patterns and diversity associated with RNA architecture, molecular recognition and adaptive binding associated with complex formation.

  11. Evaluating and learning from RNA pseudotorsional space: quantitative validation of a reduced representation for RNA structure.

    PubMed

    Wadley, Leven M; Keating, Kevin S; Duarte, Carlos M; Pyle, Anna Marie

    2007-09-28

    Quantitatively describing RNA structure and conformational elements remains a formidable problem. Seven standard torsion angles and the sugar pucker are necessary to characterize the conformation of an RNA nucleotide completely. Progress has been made toward understanding the discrete nature of RNA structure, but classifying simple and ubiquitous structural elements such as helices and motifs remains a difficult task. One approach for describing RNA structure in a simple, mathematically consistent, and computationally accessible manner involves the invocation of two pseudotorsions, eta (C4'(n-1), P(n), C4'(n), P(n+1)) and theta (P(n), C4'(n), P(n+1), C4'(n+1)), which can be used to describe RNA conformation in much the same way that varphi and psi are used to describe backbone configuration of proteins. Here, we conduct an exploration and statistical evaluation of pseudotorsional space and of the Ramachandran-like eta-theta plot. We show that, through the rigorous quantitative analysis of the eta-theta plot, the pseudotorsional descriptors eta and theta, together with sugar pucker, are sufficient to describe RNA backbone conformation fully in most cases. These descriptors are also shown to contain considerable information about nucleotide base conformation, revealing a previously uncharacterized interplay between backbone and base orientation. A window function analysis is used to discern statistically relevant regions of density in the eta-theta scatter plot and then nucleotides in colocalized clusters in the eta-theta plane are shown to have similar 3-D structures through RMSD analysis of the RNA structural constituents. We find that major clusters in the eta-theta plot are few, underscoring the discrete nature of RNA backbone conformation. Like the Ramachandran plot, the eta-theta plot is a valuable system for conceptualizing biomolecular conformation, it is a useful tool for analyzing RNA tertiary structures, and it is a vital component of new approaches for solving the 3-D structures of large RNA molecules and RNA assemblies.

  12. Temperature-responsive in vitro RNA structurome of Yersinia pseudotuberculosis.

    PubMed

    Righetti, Francesco; Nuss, Aaron M; Twittenhoff, Christian; Beele, Sascha; Urban, Kristina; Will, Sebastian; Bernhart, Stephan H; Stadler, Peter F; Dersch, Petra; Narberhaus, Franz

    2016-06-28

    RNA structures are fundamentally important for RNA function. Dynamic, condition-dependent structural changes are able to modulate gene expression as shown for riboswitches and RNA thermometers. By parallel analysis of RNA structures, we mapped the RNA structurome of Yersinia pseudotuberculosis at three different temperatures. This human pathogen is exquisitely responsive to host body temperature (37 °C), which induces a major metabolic transition. Our analysis profiles the structure of more than 1,750 RNAs at 25 °C, 37 °C, and 42 °C. Average mRNAs tend to be unstructured around the ribosome binding site. We searched for 5'-UTRs that are folded at low temperature and identified novel thermoresponsive RNA structures from diverse gene categories. The regulatory potential of 16 candidates was validated. In summary, we present a dynamic bacterial RNA structurome and find that the expression of virulence-relevant functions in Y. pseudotuberculosis and reprogramming of its metabolism in response to temperature is associated with a restructuring of numerous mRNAs.

  13. RNApdbee 2.0: multifunctional tool for RNA structure annotation.

    PubMed

    Zok, Tomasz; Antczak, Maciej; Zurkowski, Michal; Popenda, Mariusz; Blazewicz, Jacek; Adamiak, Ryszard W; Szachniuk, Marta

    2018-04-30

    In the field of RNA structural biology and bioinformatics, an access to correctly annotated RNA structure is of crucial importance, especially in the secondary and 3D structure predictions. RNApdbee webserver, introduced in 2014, primarily aimed to address the problem of RNA secondary structure extraction from the PDB files. Its new version, RNApdbee 2.0, is a highly advanced multifunctional tool for RNA structure annotation, revealing the relationship between RNA secondary and 3D structure given in the PDB or PDBx/mmCIF format. The upgraded version incorporates new algorithms for recognition and classification of high-ordered pseudoknots in large RNA structures. It allows analysis of isolated base pairs impact on RNA structure. It can visualize RNA secondary structures-including that of quadruplexes-with depiction of non-canonical interactions. It also annotates motifs to ease identification of stems, loops and single-stranded fragments in the input RNA structure. RNApdbee 2.0 is implemented as a publicly available webserver with an intuitive interface and can be freely accessed at http://rnapdbee.cs.put.poznan.pl/.

  14. RNA-SSPT: RNA Secondary Structure Prediction Tools.

    PubMed

    Ahmad, Freed; Mahboob, Shahid; Gulzar, Tahsin; Din, Salah U; Hanif, Tanzeela; Ahmad, Hifza; Afzal, Muhammad

    2013-01-01

    The prediction of RNA structure is useful for understanding evolution for both in silico and in vitro studies. Physical methods like NMR studies to predict RNA secondary structure are expensive and difficult. Computational RNA secondary structure prediction is easier. Comparative sequence analysis provides the best solution. But secondary structure prediction of a single RNA sequence is challenging. RNA-SSPT is a tool that computationally predicts secondary structure of a single RNA sequence. Most of the RNA secondary structure prediction tools do not allow pseudoknots in the structure or are unable to locate them. Nussinov dynamic programming algorithm has been implemented in RNA-SSPT. The current studies shows only energetically most favorable secondary structure is required and the algorithm modification is also available that produces base pairs to lower the total free energy of the secondary structure. For visualization of RNA secondary structure, NAVIEW in C language is used and modified in C# for tool requirement. RNA-SSPT is built in C# using Dot Net 2.0 in Microsoft Visual Studio 2005 Professional edition. The accuracy of RNA-SSPT is tested in terms of Sensitivity and Positive Predicted Value. It is a tool which serves both secondary structure prediction and secondary structure visualization purposes.

  15. RNA-SSPT: RNA Secondary Structure Prediction Tools

    PubMed Central

    Ahmad, Freed; Mahboob, Shahid; Gulzar, Tahsin; din, Salah U; Hanif, Tanzeela; Ahmad, Hifza; Afzal, Muhammad

    2013-01-01

    The prediction of RNA structure is useful for understanding evolution for both in silico and in vitro studies. Physical methods like NMR studies to predict RNA secondary structure are expensive and difficult. Computational RNA secondary structure prediction is easier. Comparative sequence analysis provides the best solution. But secondary structure prediction of a single RNA sequence is challenging. RNA-SSPT is a tool that computationally predicts secondary structure of a single RNA sequence. Most of the RNA secondary structure prediction tools do not allow pseudoknots in the structure or are unable to locate them. Nussinov dynamic programming algorithm has been implemented in RNA-SSPT. The current studies shows only energetically most favorable secondary structure is required and the algorithm modification is also available that produces base pairs to lower the total free energy of the secondary structure. For visualization of RNA secondary structure, NAVIEW in C language is used and modified in C# for tool requirement. RNA-SSPT is built in C# using Dot Net 2.0 in Microsoft Visual Studio 2005 Professional edition. The accuracy of RNA-SSPT is tested in terms of Sensitivity and Positive Predicted Value. It is a tool which serves both secondary structure prediction and secondary structure visualization purposes. PMID:24250115

  16. RNA Framework: an all-in-one toolkit for the analysis of RNA structures and post-transcriptional modifications.

    PubMed

    Incarnato, Danny; Morandi, Edoardo; Simon, Lisa Marie; Oliviero, Salvatore

    2018-06-09

    RNA is emerging as a key regulator of a plethora of biological processes. While its study has remained elusive for decades, the recent advent of high-throughput sequencing technologies provided the unique opportunity to develop novel techniques for the study of RNA structure and post-transcriptional modifications. Nonetheless, most of the required downstream bioinformatics analyses steps are not easily reproducible, thus making the application of these techniques a prerogative of few laboratories. Here we introduce RNA Framework, an all-in-one toolkit for the analysis of most NGS-based RNA structure probing and post-transcriptional modification mapping experiments. To prove the extreme versatility of RNA Framework, we applied it to both an in-house generated DMS-MaPseq dataset, and to a series of literature available experiments. Notably, when starting from publicly available datasets, our software easily allows replicating authors' findings. Collectively, RNA Framework provides the most complete and versatile toolkit to date for a rapid and streamlined analysis of the RNA epistructurome. RNA Framework is available for download at: http://www.rnaframework.com.

  17. Direct Duplex Detection: An Emerging Tool in the RNA Structure Analysis Toolbox.

    PubMed

    Weidmann, Chase A; Mustoe, Anthony M; Weeks, Kevin M

    2016-09-01

    While a variety of powerful tools exists for analyzing RNA structure, identifying long-range and intermolecular base-pairing interactions has remained challenging. Recently, three groups introduced a high-throughput strategy that uses psoralen-mediated crosslinking to directly identify RNA-RNA duplexes in cells. Initial application of these methods highlights the preponderance of long-range structures within and between RNA molecules and their widespread structural dynamics. Copyright © 2016 Elsevier Ltd. All rights reserved.

  18. Hepatitis Delta Antigen Requires a Flexible Quasi-Double-Stranded RNA Structure To Bind and Condense Hepatitis Delta Virus RNA in a Ribonucleoprotein Complex

    PubMed Central

    Griffin, Brittany L.; Chasovskikh, Sergey; Dritschilo, Anatoly

    2014-01-01

    ABSTRACT The circular genome and antigenome RNAs of hepatitis delta virus (HDV) form characteristic unbranched, quasi-double-stranded RNA secondary structures in which short double-stranded helical segments are interspersed with internal loops and bulges. The ribonucleoprotein complexes (RNPs) formed by these RNAs with the virus-encoded protein hepatitis delta antigen (HDAg) perform essential roles in the viral life cycle, including viral replication and virion formation. Little is understood about the formation and structure of these complexes and how they function in these key processes. Here, the specific RNA features required for HDAg binding and the topology of the complexes formed were investigated. Selective 2′OH acylation analyzed by primer extension (SHAPE) applied to free and HDAg-bound HDV RNAs indicated that the characteristic secondary structure of the RNA is preserved when bound to HDAg. Notably, the analysis indicated that predicted unpaired positions in the RNA remained dynamic in the RNP. Analysis of the in vitro binding activity of RNAs in which internal loops and bulges were mutated and of synthetically designed RNAs demonstrated that the distinctive secondary structure, not the primary RNA sequence, is the major determinant of HDAg RNA binding specificity. Atomic force microscopy analysis of RNPs formed in vitro revealed complexes in which the HDV RNA is substantially condensed by bending or wrapping. Our results support a model in which the internal loops and bulges in HDV RNA contribute flexibility to the quasi-double-stranded structure that allows RNA bending and condensing by HDAg. IMPORTANCE RNA-protein complexes (RNPs) formed by the hepatitis delta virus RNAs and protein, HDAg, perform critical roles in virus replication. Neither the structures of these RNPs nor the RNA features required to form them have been characterized. HDV RNA is unusual in that it forms an unbranched quasi-double-stranded structure in which short base-paired segments are interspersed with internal loops and bulges. We analyzed the role of the HDV RNA sequence and secondary structure in the formation of a minimal RNP and visualized the structure of this RNP using atomic force microscopy. Our results indicate that HDAg does not recognize the primary sequence of the RNA; rather, the principle contribution of unpaired bases in HDV RNA to HDAg binding is to allow flexibility in the unbranched quasi-double-stranded RNA structure. Visualization of RNPs by atomic force microscopy indicated that the RNA is significantly bent or condensed in the complex. PMID:24741096

  19. Tertiary structure-based analysis of microRNA–target interactions

    PubMed Central

    Gan, Hin Hark; Gunsalus, Kristin C.

    2013-01-01

    Current computational analysis of microRNA interactions is based largely on primary and secondary structure analysis. Computationally efficient tertiary structure-based methods are needed to enable more realistic modeling of the molecular interactions underlying miRNA-mediated translational repression. We incorporate algorithms for predicting duplex RNA structures, ionic strength effects, duplex entropy and free energy, and docking of duplex–Argonaute protein complexes into a pipeline to model and predict miRNA–target duplex binding energies. To ensure modeling accuracy and computational efficiency, we use an all-atom description of RNA and a continuum description of ionic interactions using the Poisson–Boltzmann equation. Our method predicts the conformations of two constructs of Caenorhabditis elegans let-7 miRNA–target duplexes to an accuracy of ∼3.8 Å root mean square distance of their NMR structures. We also show that the computed duplex formation enthalpies, entropies, and free energies for eight miRNA–target duplexes agree with titration calorimetry data. Analysis of duplex–Argonaute docking shows that structural distortions arising from single-base-pair mismatches in the seed region influence the activity of the complex by destabilizing both duplex hybridization and its association with Argonaute. Collectively, these results demonstrate that tertiary structure-based modeling of miRNA interactions can reveal structural mechanisms not accessible with current secondary structure-based methods. PMID:23417009

  20. Evaluation of sequence alignments and oligonucleotide probes with respect to three-dimensional structure of ribosomal RNA using ARB software package

    PubMed Central

    Kumar, Yadhu; Westram, Ralf; Kipfer, Peter; Meier, Harald; Ludwig, Wolfgang

    2006-01-01

    Background Availability of high-resolution RNA crystal structures for the 30S and 50S ribosomal subunits and the subsequent validation of comparative secondary structure models have prompted the biologists to use three-dimensional structure of ribosomal RNA (rRNA) for evaluating sequence alignments of rRNA genes. Furthermore, the secondary and tertiary structural features of rRNA are highly useful and successfully employed in designing rRNA targeted oligonucleotide probes intended for in situ hybridization experiments. RNA3D, a program to combine sequence alignment information with three-dimensional structure of rRNA was developed. Integration into ARB software package, which is used extensively by the scientific community for phylogenetic analysis and molecular probe designing, has substantially extended the functionality of ARB software suite with 3D environment. Results Three-dimensional structure of rRNA is visualized in OpenGL 3D environment with the abilities to change the display and overlay information onto the molecule, dynamically. Phylogenetic information derived from the multiple sequence alignments can be overlaid onto the molecule structure in a real time. Superimposition of both statistical and non-statistical sequence associated information onto the rRNA 3D structure can be done using customizable color scheme, which is also applied to a textual sequence alignment for reference. Oligonucleotide probes designed by ARB probe design tools can be mapped onto the 3D structure along with the probe accessibility models for evaluation with respect to secondary and tertiary structural conformations of rRNA. Conclusion Visualization of three-dimensional structure of rRNA in an intuitive display provides the biologists with the greater possibilities to carry out structure based phylogenetic analysis. Coupled with secondary structure models of rRNA, RNA3D program aids in validating the sequence alignments of rRNA genes and evaluating probe target sites. Superimposition of the information derived from the multiple sequence alignment onto the molecule dynamically allows the researchers to observe any sequence inherited characteristics (phylogenetic information) in real-time environment. The extended ARB software package is made freely available for the scientific community via . PMID:16672074

  1. Kinetic analysis of pre-ribosome structure in vivo

    PubMed Central

    Swiatkowska, Agata; Wlotzka, Wiebke; Tuck, Alex; Barrass, J. David; Beggs, Jean D.; Tollervey, David

    2012-01-01

    Pre-ribosomal particles undergo numerous structural changes during maturation, but their high complexity and short lifetimes make these changes very difficult to follow in vivo. In consequence, pre-ribosome structure and composition have largely been inferred from purified particles and analyzed in vitro. Here we describe techniques for kinetic analyses of the changes in pre-ribosome structure in living cells of Saccharomyces cerevisiae. To allow this, in vivo structure probing by DMS modification was combined with affinity purification of newly synthesized 20S pre-rRNA over a time course of metabolic labeling with 4-thiouracil. To demonstrate that this approach is generally applicable, we initially analyzed the accessibility of the region surrounding cleavage site D site at the 3′ end of the mature 18S rRNA region of the pre-rRNA. This revealed a remarkably flexible structure throughout 40S subunit biogenesis, with little stable RNA–protein interaction apparent. Analysis of folding in the region of the 18S central pseudoknot was consistent with previous data showing U3 snoRNA–18S rRNA interactions. Dynamic changes in the structure of the hinge between helix 28 (H28) and H44 of pre-18S rRNA were consistent with recently reported interactions with the 3′ guide region of U3 snoRNA. Finally, analysis of the H18 region indicates that the RNA structure matures early, but additional protection appears subsequently, presumably reflecting protein binding. The structural analyses described here were performed on total, affinity-purified, newly synthesized RNA, so many classes of RNA and RNA–protein complex are potentially amenable to this approach. PMID:23093724

  2. DMS-MaPseq for genome-wide or targeted RNA structure probing in vivo.

    PubMed

    Zubradt, Meghan; Gupta, Paromita; Persad, Sitara; Lambowitz, Alan M; Weissman, Jonathan S; Rouskin, Silvi

    2017-01-01

    Coupling of structure-specific in vivo chemical modification to next-generation sequencing is transforming RNA secondary structure studies in living cells. The dominant strategy for detecting in vivo chemical modifications uses reverse transcriptase truncation products, which introduce biases and necessitate population-average assessments of RNA structure. Here we present dimethyl sulfate (DMS) mutational profiling with sequencing (DMS-MaPseq), which encodes DMS modifications as mismatches using a thermostable group II intron reverse transcriptase. DMS-MaPseq yields a high signal-to-noise ratio, can report multiple structural features per molecule, and allows both genome-wide studies and focused in vivo investigations of even low-abundance RNAs. We apply DMS-MaPseq for the first analysis of RNA structure within an animal tissue and to identify a functional structure involved in noncanonical translation initiation. Additionally, we use DMS-MaPseq to compare the in vivo structure of pre-mRNAs with their mature isoforms. These applications illustrate DMS-MaPseq's capacity to dramatically expand in vivo analysis of RNA structure.

  3. Mapping RNA Structure In Vitro with SHAPE Chemistry and Next-Generation Sequencing (SHAPE-Seq).

    PubMed

    Watters, Kyle E; Lucks, Julius B

    2016-01-01

    Mapping RNA structure with selective 2'-hydroxyl acylation analyzed by primer extension (SHAPE) chemistry has proven to be a versatile method for characterizing RNA structure in a variety of contexts. SHAPE reagents covalently modify RNAs in a structure-dependent manner to create adducts at the 2'-OH group of the ribose backbone at nucleotides that are structurally flexible. The positions of these adducts are detected using reverse transcriptase (RT) primer extension, which stops one nucleotide before the modification, to create a pool of cDNAs whose lengths reflect the location of SHAPE modification. Quantification of the cDNA pools is used to estimate the "reactivity" of each nucleotide in an RNA molecule to the SHAPE reagent. High reactivities indicate nucleotides that are structurally flexible, while low reactivities indicate nucleotides that are inflexible. These SHAPE reactivities can then be used to infer RNA structures by restraining RNA structure prediction algorithms. Here, we provide a state-of-the-art protocol describing how to perform in vitro RNA structure probing with SHAPE chemistry using next-generation sequencing to quantify cDNA pools and estimate reactivities (SHAPE-Seq). The use of next-generation sequencing allows for higher throughput, more consistent data analysis, and multiplexing capabilities. The technique described herein, SHAPE-Seq v2.0, uses a universal reverse transcription priming site that is ligated to the RNA after SHAPE modification. The introduced priming site allows for the structural analysis of an RNA independent of its sequence.

  4. Quantifying the relationship between sequence and three-dimensional structure conservation in RNA

    PubMed Central

    2010-01-01

    Background In recent years, the number of available RNA structures has rapidly grown reflecting the increased interest on RNA biology. Similarly to the studies carried out two decades ago for proteins, which gave the fundamental grounds for developing comparative protein structure prediction methods, we are now able to quantify the relationship between sequence and structure conservation in RNA. Results Here we introduce an all-against-all sequence- and three-dimensional (3D) structure-based comparison of a representative set of RNA structures, which have allowed us to quantitatively confirm that: (i) there is a measurable relationship between sequence and structure conservation that weakens for alignments resulting in below 60% sequence identity, (ii) evolution tends to conserve more RNA structure than sequence, and (iii) there is a twilight zone for RNA homology detection. Discussion The computational analysis here presented quantitatively describes the relationship between sequence and structure for RNA molecules and defines a twilight zone region for detecting RNA homology. Our work could represent the theoretical basis and limitations for future developments in comparative RNA 3D structure prediction. PMID:20550657

  5. RNA-Puzzles: A CASP-like evaluation of RNA three-dimensional structure prediction

    PubMed Central

    Cruz, José Almeida; Blanchet, Marc-Frédérick; Boniecki, Michal; Bujnicki, Janusz M.; Chen, Shi-Jie; Cao, Song; Das, Rhiju; Ding, Feng; Dokholyan, Nikolay V.; Flores, Samuel Coulbourn; Huang, Lili; Lavender, Christopher A.; Lisi, Véronique; Major, François; Mikolajczak, Katarzyna; Patel, Dinshaw J.; Philips, Anna; Puton, Tomasz; Santalucia, John; Sijenyi, Fredrick; Hermann, Thomas; Rother, Kristian; Rother, Magdalena; Serganov, Alexander; Skorupski, Marcin; Soltysinski, Tomasz; Sripakdeevong, Parin; Tuszynska, Irina; Weeks, Kevin M.; Waldsich, Christina; Wildauer, Michael; Leontis, Neocles B.; Westhof, Eric

    2012-01-01

    We report the results of a first, collective, blind experiment in RNA three-dimensional (3D) structure prediction, encompassing three prediction puzzles. The goals are to assess the leading edge of RNA structure prediction techniques; compare existing methods and tools; and evaluate their relative strengths, weaknesses, and limitations in terms of sequence length and structural complexity. The results should give potential users insight into the suitability of available methods for different applications and facilitate efforts in the RNA structure prediction community in ongoing efforts to improve prediction tools. We also report the creation of an automated evaluation pipeline to facilitate the analysis of future RNA structure prediction exercises. PMID:22361291

  6. 3D RNA and functional interactions from evolutionary couplings

    PubMed Central

    Weinreb, Caleb; Riesselman, Adam; Ingraham, John B.; Gross, Torsten; Sander, Chris; Marks, Debora S.

    2016-01-01

    Summary Non-coding RNAs are ubiquitous, but the discovery of new RNA gene sequences far outpaces research on their structure and functional interactions. We mine the evolutionary sequence record to derive precise information about function and structure of RNAs and RNA-protein complexes. As in protein structure prediction, we use maximum entropy global probability models of sequence co-variation to infer evolutionarily constrained nucleotide-nucleotide interactions within RNA molecules, and nucleotide-amino acid interactions in RNA-protein complexes. The predicted contacts allow all-atom blinded 3D structure prediction at good accuracy for several known RNA structures and RNA-protein complexes. For unknown structures, we predict contacts in 160 non-coding RNA families. Beyond 3D structure prediction, evolutionary couplings help identify important functional interactions, e.g., at switch points in riboswitches and at a complex nucleation site in HIV. Aided by accelerating sequence accumulation, evolutionary coupling analysis can accelerate the discovery of functional interactions and 3D structures involving RNA. PMID:27087444

  7. Impact of target mRNA structure on siRNA silencing efficiency: A large-scale study.

    PubMed

    Gredell, Joseph A; Berger, Angela K; Walton, S Patrick

    2008-07-01

    The selection of active siRNAs is generally based on identifying siRNAs with certain sequence and structural properties. However, the efficiency of RNA interference has also been shown to depend on the structure of the target mRNA, primarily through studies using exogenous transcripts with well-defined secondary structures in the vicinity of the target sequence. While these studies provide a means for examining the impact of target sequence and structure independently, the predicted secondary structures for these transcripts are often not reflective of structures that form in full-length, native mRNAs where interactions can occur between relatively remote segments of the mRNAs. Here, using a combination of experimental results and analysis of a large dataset, we demonstrate that the accessibility of certain local target structures on the mRNA is an important determinant in the gene silencing ability of siRNAs. siRNAs targeting the enhanced green fluorescent protein were chosen using a minimal siRNA selection algorithm followed by classification based on the predicted minimum free energy structures of the target transcripts. Transfection into HeLa and HepG2 cells revealed that siRNAs targeting regions of the mRNA predicted to have unpaired 5'- and 3'-ends resulted in greater gene silencing than regions predicted to have other types of secondary structure. These results were confirmed by analysis of gene silencing data from previously published siRNAs, which showed that mRNA target regions unpaired at either the 5'-end or 3'-end were silenced, on average, approximately 10% more strongly than target regions unpaired in the center or primarily paired throughout. We found this effect to be independent of the structure of the siRNA guide strand. Taken together, these results suggest minimal requirements for nucleation of hybridization between the siRNA guide strand and mRNA and that both mRNA and guide strand structure should be considered when choosing candidate siRNAs. (c) 2008 Wiley Periodicals, Inc.

  8. Impact of target mRNA structure on siRNA silencing efficiency: a large-scale study

    PubMed Central

    Gredell, Joseph A.; Berger, Angela K.; Walton, S. Patrick

    2009-01-01

    The selection of active siRNAs is generally based on identifying siRNAs with certain sequence and structural properties. However, the efficiency of RNA interference has also been shown to depend on the structure of the target mRNA, primarily through studies using exogenous transcripts with well-defined secondary structures in the vicinity of the target sequence. While these studies provide a means for examining the impact of target sequence and structure independently, the predicted secondary structures for these transcripts are often not reflective of structures that form in full-length, native mRNAs where interactions can occur between relatively remote segments of the mRNAs. Here, using a combination of experimental results and analysis of a large dataset, we demonstrate that the accessibility of certain local target structures on the mRNA is an important determinant in the gene silencing ability of siRNAs. siRNAs targeting the enhanced green fluorescent protein were chosen using a minimal siRNA selection algorithm followed by classification based on the predicted minimum free energy structures of the target transcripts. Transfection into HeLa and HepG2 cells revealed that siRNAs targeting regions of the mRNA predicted to have unpaired 5’- and 3’-ends resulted in greater gene silencing than regions predicted to have other types of secondary structure. These results were confirmed by analysis of gene silencing data from previously published siRNAs, which showed that mRNA target regions unpaired at either the 5’-end or 3’-end were silenced, on average, ~10% more strongly than target regions unpaired in the center or primarily paired throughout. We found this effect to be independent of the structure of the siRNA guide strand. Taken together, these results suggest minimal requirements for nucleation of hybridization between the siRNA guide strand and mRNA and that both mRNA and guide strand structure should be considered when choosing candidate siRNAs. PMID:18306428

  9. Functional RNA structures throughout the Hepatitis C Virus genome.

    PubMed

    Adams, Rebecca L; Pirakitikulr, Nathan; Pyle, Anna Marie

    2017-06-01

    The single-stranded Hepatitis C Virus (HCV) genome adopts a set of elaborate RNA structures that are involved in every stage of the viral lifecycle. Recent advances in chemical probing, sequencing, and structural biology have facilitated analysis of RNA folding on a genome-wide scale, revealing novel structures and networks of interactions. These studies have underscored the active role played by RNA in every function of HCV and they open the door to new types of RNA-targeted therapeutics. Copyright © 2017 Elsevier B.V. All rights reserved.

  10. Rose spring dwarf-associated virus has RNA structural and gene-expression features like those of Barley yellow dwarf virus

    PubMed Central

    Salem, Nida’ M.; Miller, W. Allen; Rowhani, Adib; Golino, Deborah A.; Moyne, Anne-Laure; Falk, Bryce W.

    2015-01-01

    We determined the complete nucleotide sequence of the Rose spring dwarf-associated virus (RSDaV) genomic RNA (GenBank accession no. EU024678) and compared its predicted RNA structural characteristics affecting gene expression. A cDNA library was derived from RSDaV double-stranded RNAs (dsRNAs) purified from infected tissue. Nucleotide sequence analysis of the cloned cDNAs, plus for clones generated by 5′- and 3′-RACE showed the RSDaV genomic RNA to be 5,808 nucleotides. The genomic RNA contains five major open reading frames (ORFs), and three small ORFs in the 3′-terminal 800 nucleotides, typical for viruses of genus Luteovirus in the family Luteoviridae. Northern blot hybridization analysis revealed the genomic RNA and two prominent subgenomic RNAs of approximately 3 kb and 1 kb. Putative 5′ ends of the sgRNAs were predicted by identification of conserved sequences and secondary structures which resembled the Barley yellow dwarf virus (BYDV) genomic RNA 5′ end and subgenomic RNA promoter sequences. Secondary structures of the BYDV-like ribosomal frameshift elements and cap-independent translation elements, including long-distance base pairing spanning four kb were identified. These contain similarities but also informative differences with the BYDV structures, including a strikingly different structure predicted for the 3′ cap-independent translation element. These analyses of the RSDaV genomic RNA show more complexity for the RNA structural elements for members of the Luteoviridae. PMID:18329064

  11. Rose spring dwarf-associated virus has RNA structural and gene-expression features like those of Barley yellow dwarf virus.

    PubMed

    Salem, Nida' M; Miller, W Allen; Rowhani, Adib; Golino, Deborah A; Moyne, Anne-Laure; Falk, Bryce W

    2008-06-05

    We determined the complete nucleotide sequence of the Rose spring dwarf-associated virus (RSDaV) genomic RNA (GenBank accession no. EU024678) and compared its predicted RNA structural characteristics affecting gene expression. A cDNA library was derived from RSDaV double-stranded RNAs (dsRNAs) purified from infected tissue. Nucleotide sequence analysis of the cloned cDNAs, plus for clones generated by 5'- and 3'-RACE showed the RSDaV genomic RNA to be 5808 nucleotides. The genomic RNA contains five major open reading frames (ORFs), and three small ORFs in the 3'-terminal 800 nucleotides, typical for viruses of genus Luteovirus in the family Luteoviridae. Northern blot hybridization analysis revealed the genomic RNA and two prominent subgenomic RNAs of approximately 3 kb and 1 kb. Putative 5' ends of the sgRNAs were predicted by identification of conserved sequences and secondary structures which resembled the Barley yellow dwarf virus (BYDV) genomic RNA 5' end and subgenomic RNA promoter sequences. Secondary structures of the BYDV-like ribosomal frameshift elements and cap-independent translation elements, including long-distance base pairing spanning four kb were identified. These contain similarities but also informative differences with the BYDV structures, including a strikingly different structure predicted for the 3' cap-independent translation element. These analyses of the RSDaV genomic RNA show more complexity for the RNA structural elements for members of the Luteoviridae.

  12. Analysis of the tertiary structure of the ribonuclease P ribozyme-substrate complex by site-specific photoaffinity crosslinking.

    PubMed Central

    Harris, M E; Kazantsev, A V; Chen, J L; Pace, N R

    1997-01-01

    Bacterial ribonuclease P (RNase P), an endonuclease involved in tRNA maturation, is a ribonucleoprotein containing a catalytic RNA. The secondary structure of this ribozyme is well-established, and a low-resolution model of the three-dimensional structure of the ribozyme-substrate complex has been proposed based on site-specific crosslinking and phylogenetic comparative data [Harris ME et al., 1994 EMBO J 13:3953-3963]. However, several substructures of that model were poorly constrained by the available data. In the present analysis, additional constraints between elements within the Escherichia coli RNase P RNA-pre-tRNA complex were determined by intra- and intermolecular crosslinking experiments. Circularly permuted RNase P RNAs were used to position an azidophenacyl photoactive crosslinking agent specifically at strategic sites within the ribozyme-substrate complex. Crosslink sites were mapped by primer extension and confirmed by analysis of the mobility of the crosslinked RNA lariats on denaturing acrylamide gels relative to circular and linear RNA standards. Crosslinked species generally retained significant catalytic activity, indicating that the results reflect the native ribozyme structure. The crosslinking results support the general configuration of the structure model and predicate new positions and orientations for helices that were previously poorly constrained by the data set. The expanded library of crosslinking constraints was used, together with secondary and tertiary structure identified by phylogenetic sequence comparisons, to refine significantly the model of RNase P RNA with bound substrate pre-tRNA. The crosslinking results and data from chemical-modification and mutational studies are discussed in the context of the current structural perspective on this ribozyme. PMID:9174092

  13. Expression, crystallization and preliminary crystallographic analysis of RNA-binding protein Hfq (YmaH) from Bacillus subtilis in complex with an RNA aptamer.

    PubMed

    Baba, Seiki; Someya, Tatsuhiko; Kawai, Gota; Nakamura, Kouji; Kumasaka, Takashi

    2010-05-01

    The Hfq protein is a hexameric RNA-binding protein which regulates gene expression by binding to RNA under the influence of diverse environmental stresses. Its ring structure binds various types of RNA, including mRNA and sRNA. RNA-bound structures of Hfq from Escherichia coli and Staphylococcus aureus have been revealed to have poly(A) RNA at the distal site and U-rich RNA at the proximal site, respectively. Here, crystals of a complex of the Bacillus subtilis Hfq protein with an A/G-repeat 7-mer RNA (Hfq-RNA) that were prepared using the hanging-drop vapour-diffusion technique are reported. The type 1 Hfq-RNA crystals belonged to space group I422, with unit-cell parameters a = b = 123.70, c = 119.13 A, while the type 2 Hfq-RNA crystals belonged to space group F222, with unit-cell parameters a = 91.92, b = 92.50, c = 114.92 A. Diffraction data were collected to a resolution of 2.20 A from both crystal forms. The hexameric structure of the Hfq protein was clearly shown by self-rotation analysis.

  14. RSRE: RNA structural robustness evaluator

    PubMed Central

    Shu, Wenjie; Zheng, Zhiqiang; Wang, Shengqi

    2007-01-01

    Biological robustness, defined as the ability to maintain stable functioning in the face of various perturbations, is an important and fundamental topic in current biology, and has become a focus of numerous studies in recent years. Although structural robustness has been explored in several types of RNA molecules, the origins of robustness are still controversial. Computational analysis results are needed to make up for the lack of evidence of robustness in natural biological systems. The RNA structural robustness evaluator (RSRE) web server presented here provides a freely available online tool to quantitatively evaluate the structural robustness of RNA based on the widely accepted definition of neutrality. Several classical structure comparison methods are employed; five randomization methods are implemented to generate control sequences; sub-optimal predicted structures can be optionally utilized to mitigate the uncertainty of secondary structure prediction. With a user-friendly interface, the web application is easy to use. Intuitive illustrations are provided along with the original computational results to facilitate analysis. The RSRE will be helpful in the wide exploration of RNA structural robustness and will catalyze our understanding of RNA evolution. The RSRE web server is freely available at http://biosrv1.bmi.ac.cn/RSRE/ or http://biotech.bmi.ac.cn/RSRE/. PMID:17567615

  15. DMS-MaPseq for genome-wide or targeted RNA structure probing in vivo

    PubMed Central

    Zubradt, Meghan; Gupta, Paromita; Persad, Sitara; Lambowitz, Alan M.; Weissman, Jonathan S.; Rouskin, Silvi

    2017-01-01

    Coupling structure-specific in vivo chemical modification to next-generation sequencing is transforming RNA secondary structural studies in living cells. The dominant strategy for detecting in vivo chemical modifications uses reverse transcriptase truncation products, which introduces biases and necessitates population-average assessments of RNA structure. Here we present dimethyl sulfate mutational profiling with sequencing (DMS-MaPseq), which encodes DMS modifications as mismatches using a thermostable group II intron reverse transcriptase (TGIRT). DMS-MaPseq yields a high signal-to-noise ratio, can report multiple structural features per molecule, and allows both genome-wide studies and focused in vivo investigations of even low abundance RNAs. We apply DMS-MaPseq for the first analysis of RNA structure within an animal tissue and to identify a functional structure involved in non-canonical translation initiation. Additionally, we use DMS-MaPseq to compare the in vivo structure of pre-mRNAs to their mature isoforms. These applications illustrate DMS-MaPseq’s capacity to dramatically expand in vivo analysis of RNA structure. PMID:27819661

  16. Trans-acting RNAs as molecular probes for monitoring time-dependent structural change of an RNA complex adapting two structures.

    PubMed

    Maeda, Yuri; Furuta, Hiroyuki; Ikawa, Yoshiya

    2011-03-01

    As dynamic structural changes are pivotal for the functions of some classes of RNA molecule, it is important to develop methods to monitor structural changes in RNA in a time-dependent manner without chemical modification. Based on previous reports that trans-acting RNAs can be used as probes for analysis and control of 3D structures of target RNAs, we applied this method to monitor time-dependent structural changes in RNA. We designed and performed a proof-of-principle study using a simple model RNA complex that adopts two different structures as a target. The time-dependent structural changes in the target RNA were successfully monitored using two trans-acting RNAs, which stably form a ternary complex with the bimolecular target RNA and act as a catalyst to join two RNA fragments of the target complex, respectively. Copyright © 2010 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.

  17. Sequence-structure relationships in RNA loops: establishing the basis for loop homology modeling.

    PubMed

    Schudoma, Christian; May, Patrick; Nikiforova, Viktoria; Walther, Dirk

    2010-01-01

    The specific function of RNA molecules frequently resides in their seemingly unstructured loop regions. We performed a systematic analysis of RNA loops extracted from experimentally determined three-dimensional structures of RNA molecules. A comprehensive loop-structure data set was created and organized into distinct clusters based on structural and sequence similarity. We detected clear evidence of the hallmark of homology present in the sequence-structure relationships in loops. Loops differing by <25% in sequence identity fold into very similar structures. Thus, our results support the application of homology modeling for RNA loop model building. We established a threshold that may guide the sequence divergence-based selection of template structures for RNA loop homology modeling. Of all possible sequences that are, under the assumption of isosteric relationships, theoretically compatible with actual sequences observed in RNA structures, only a small fraction is contained in the Rfam database of RNA sequences and classes implying that the actual RNA loop space may consist of a limited number of unique loop structures and conserved sequences. The loop-structure data sets are made available via an online database, RLooM. RLooM also offers functionalities for the modeling of RNA loop structures in support of RNA engineering and design efforts.

  18. Superposition of two tRNA{sup Ser} acceptor stem crystal structures: Comparison of structure, ligands and hydration

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Eichert, Andre; Fuerste, Jens P.; Ulrich, Alexander

    2010-05-07

    We solved the X-ray structures of two Escherichia coli tRNA{sup Ser} acceptor stem microhelices. As both tRNAs are aminoacylated by the same seryl-tRNA-synthetase, we performed a comparative structure analysis of both duplexes to investigate the helical conformation, the hydration patterns and magnesium binding sites. It is well accepted, that the hydration of RNA plays an important role in RNA-protein interactions and that the extensive solvent content of the minor groove has a special function in RNA. The detailed comparison of both tRNA{sup Ser} microhelices provides insights into the structural arrangement of the isoacceptor tRNA aminoacyl stems with respect to themore » surrounding water molecules and may eventually help us to understand their biological function at atomic resolution.« less

  19. Towards Long-Range RNA Structure Prediction in Eukaryotic Genes.

    PubMed

    Pervouchine, Dmitri D

    2018-06-15

    The ability to form an intramolecular structure plays a fundamental role in eukaryotic RNA biogenesis. Proximate regions in the primary transcripts fold into a local secondary structure, which is then hierarchically assembled into a tertiary structure that is stabilized by RNA-binding proteins and long-range intramolecular base pairings. While the local RNA structure can be predicted reasonably well for short sequences, long-range structure at the scale of eukaryotic genes remains problematic from the computational standpoint. The aim of this review is to list functional examples of long-range RNA structures, to summarize current comparative methods of structure prediction, and to highlight their advances and limitations in the context of long-range RNA structures. Most comparative methods implement the “first-align-then-fold” principle, i.e., they operate on multiple sequence alignments, while functional RNA structures often reside in non-conserved parts of the primary transcripts. The opposite “first-fold-then-align” approach is currently explored to a much lesser extent. Developing novel methods in both directions will improve the performance of comparative RNA structure analysis and help discover novel long-range structures, their higher-order organization, and RNA⁻RNA interactions across the transcriptome.

  20. 5S rRNA-derived and tRNA-derived SINEs in fruit bats.

    PubMed

    Gogolevsky, Konstantin P; Vassetzky, Nikita S; Kramerov, Dmitri A

    2009-05-01

    Most short retroposons (SINEs) descend from cellular tRNA of 7SL RNA. Here, four new SINEs were found in megabats (Megachiroptera) but neither in microbats nor in other mammals. Two of them, MEG-RS and MEG-RL, descend from another cellular RNA, 5S rRNA; one (MEG-T2) is a tRNA-derived SINE; and MEG-TR is a hybrid tRNA/5S rRNA SINE. Insertion locus analysis suggests that these SINEs were active in the recent fruit bat evolution. Analysis of MEG-RS and MEG-RL in comparison with other few 5S rRNA-derived SINEs demonstrates that the internal RNA polymerase III promoter is their most invariant region, while the secondary structure is more variable. The mechanisms underlying the modular structure of these and other SINEs as well as their variation are discussed. The scenario of evolution of MEG SINEs is proposed.

  1. Structural and functional analysis of 5S rRNA in Saccharomyces cerevisiae

    PubMed Central

    Kiparisov, S.; Sergiev, P. V.; Dontsova, O. A.; Petrov, A.; Meskauskas, A.; Dinman, J. D.

    2005-01-01

    5S rRNA extends from the central protuberance of the large ribosomal subunit, through the A-site finger, and down to the GTPase-associated center. Here, we present a structure-function analysis of seven 5S rRNA alleles which are sufficient for viability in the yeast Saccharomyces cerevisiae when expressed in the absence of wild-type 5S rRNAs, and extend this analysis using a large bank of mutant alleles that show semidominant phenotypes in the presence of wild-type 5S rRNA. This analysis supports the hypothesis that 5S rRNA serves to link together several different functional centers of the ribosome. Data are also presented which suggest that in eukaryotic genomes selection has favored the maintenance of multiple alleles of 5S rRNA, and that these may provide cells with a mechanism to post-transcriptionally regulate gene expression. PMID:16047201

  2. rCAD: A Novel Database Schema for the Comparative Analysis of RNA.

    PubMed

    Ozer, Stuart; Doshi, Kishore J; Xu, Weijia; Gutell, Robin R

    2011-12-31

    Beyond its direct involvement in protein synthesis with mRNA, tRNA, and rRNA, RNA is now being appreciated for its significance in the overall metabolism and regulation of the cell. Comparative analysis has been very effective in the identification and characterization of RNA molecules, including the accurate prediction of their secondary structure. We are developing an integrative scalable data management and analysis system, the RNA Comparative Analysis Database (rCAD), implemented with SQL Server to support RNA comparative analysis. The platformagnostic database schema of rCAD captures the essential relationships between the different dimensions of information for RNA comparative analysis datasets. The rCAD implementation enables a variety of comparative analysis manipulations with multiple integrated data dimensions for advanced RNA comparative analysis workflows. In this paper, we describe details of the rCAD schema design and illustrate its usefulness with two usage scenarios.

  3. rCAD: A Novel Database Schema for the Comparative Analysis of RNA

    PubMed Central

    Ozer, Stuart; Doshi, Kishore J.; Xu, Weijia; Gutell, Robin R.

    2013-01-01

    Beyond its direct involvement in protein synthesis with mRNA, tRNA, and rRNA, RNA is now being appreciated for its significance in the overall metabolism and regulation of the cell. Comparative analysis has been very effective in the identification and characterization of RNA molecules, including the accurate prediction of their secondary structure. We are developing an integrative scalable data management and analysis system, the RNA Comparative Analysis Database (rCAD), implemented with SQL Server to support RNA comparative analysis. The platformagnostic database schema of rCAD captures the essential relationships between the different dimensions of information for RNA comparative analysis datasets. The rCAD implementation enables a variety of comparative analysis manipulations with multiple integrated data dimensions for advanced RNA comparative analysis workflows. In this paper, we describe details of the rCAD schema design and illustrate its usefulness with two usage scenarios. PMID:24772454

  4. Structure of human IFIT1 with capped RNA reveals adaptable mRNA binding and mechanisms for sensing N1 and N2 ribose 2′-O methylations

    PubMed Central

    Laudenbach, Beatrice Theres; Martínez-Montero, Saúl; Cencic, Regina; Habjan, Matthias; Pichlmair, Andreas; Damha, Masad J.; Pelletier, Jerry; Nagar, Bhushan

    2017-01-01

    IFIT1 (IFN-induced protein with tetratricopeptide repeats-1) is an effector of the host innate immune antiviral response that prevents propagation of virus infection by selectively inhibiting translation of viral mRNA. It relies on its ability to compete with the translation initiation factor eIF4F to specifically recognize foreign capped mRNAs, while remaining inactive against host mRNAs marked by ribose 2′-O methylation at the first cap-proximal nucleotide (N1). We report here several crystal structures of RNA-bound human IFIT1, including a 1.6-Å complex with capped RNA. IFIT1 forms a water-filled, positively charged RNA-binding tunnel with a separate hydrophobic extension that unexpectedly engages the cap in multiple conformations (syn and anti) giving rise to a relatively plastic and nonspecific mode of binding, in stark contrast to eIF4E. Cap-proximal nucleotides encircled by the tunnel provide affinity to compete with eIF4F while allowing IFIT1 to select against N1 methylated mRNA. Gel-shift binding assays confirm that N1 methylation interferes with IFIT1 binding, but in an RNA-dependent manner, whereas translation assays reveal that N1 methylation alone is not sufficient to prevent mRNA recognition at high IFIT1 concentrations. Structural and functional analysis show that 2′-O methylation at N2, another abundant mRNA modification, is also detrimental for RNA binding, thus revealing a potentially synergistic role for it in self- versus nonself-mRNA discernment. Finally, structure-guided mutational analysis confirms the importance of RNA binding for IFIT1 restriction of a human coronavirus mutant lacking viral N1 methylation. Our structural and biochemical analysis sheds new light on the molecular basis for IFIT1 translational inhibition of capped viral RNA. PMID:28251928

  5. Probing Xist RNA Structure in Cells Using Targeted Structure-Seq

    PubMed Central

    Rutenberg-Schoenberg, Michael; Simon, Matthew D.

    2015-01-01

    The long non-coding RNA (lncRNA) Xist is a master regulator of X-chromosome inactivation in mammalian cells. Models for how Xist and other lncRNAs function depend on thermodynamically stable secondary and higher-order structures that RNAs can form in the context of a cell. Probing accessible RNA bases can provide data to build models of RNA conformation that provide insight into RNA function, molecular evolution, and modularity. To study the structure of Xist in cells, we built upon recent advances in RNA secondary structure mapping and modeling to develop Targeted Structure-Seq, which combines chemical probing of RNA structure in cells with target-specific massively parallel sequencing. By enriching for signals from the RNA of interest, Targeted Structure-Seq achieves high coverage of the target RNA with relatively few sequencing reads, thus providing a targeted and scalable approach to analyze RNA conformation in cells. We use this approach to probe the full-length Xist lncRNA to develop new models for functional elements within Xist, including the repeat A element in the 5’-end of Xist. This analysis also identified new structural elements in Xist that are evolutionarily conserved, including a new element proximal to the C repeats that is important for Xist function. PMID:26646615

  6. Structural features of microRNA (miRNA) precursors and their relevance to miRNA biogenesis and small interfering RNA/short hairpin RNA design.

    PubMed

    Krol, Jacek; Sobczak, Krzysztof; Wilczynska, Urszula; Drath, Maria; Jasinska, Anna; Kaczynska, Danuta; Krzyzosiak, Wlodzimierz J

    2004-10-01

    We have established the structures of 10 human microRNA (miRNA) precursors using biochemical methods. Eight of these structures turned out to be different from those that were computer-predicted. The differences localized in the terminal loop region and at the opposite side of the precursor hairpin stem. We have analyzed the features of these structures from the perspectives of miRNA biogenesis and active strand selection. We demonstrated the different thermodynamic stability profiles for pre-miRNA hairpins harboring miRNAs at their 5'- and 3'-sides and discussed their functional implications. Our results showed that miRNA prediction based on predicted precursor structures may give ambiguous results, and the success rate is significantly higher for the experimentally determined structures. On the other hand, the differences between the predicted and experimentally determined structures did not affect the stability of termini produced through "conceptual dicing." This result confirms the value of thermodynamic analysis based on mfold as a predictor of strand section by RNAi-induced silencing complex (RISC).

  7. Structural architecture of the human long non-coding RNA, steroid receptor RNA activator

    PubMed Central

    Novikova, Irina V.; Hennelly, Scott P.; Sanbonmatsu, Karissa Y.

    2012-01-01

    While functional roles of several long non-coding RNAs (lncRNAs) have been determined, the molecular mechanisms are not well understood. Here, we report the first experimentally derived secondary structure of a human lncRNA, the steroid receptor RNA activator (SRA), 0.87 kB in size. The SRA RNA is a non-coding RNA that coactivates several human sex hormone receptors and is strongly associated with breast cancer. Coding isoforms of SRA are also expressed to produce proteins, making the SRA gene a unique bifunctional system. Our experimental findings (SHAPE, in-line, DMS and RNase V1 probing) reveal that this lncRNA has a complex structural organization, consisting of four domains, with a variety of secondary structure elements. We examine the coevolution of the SRA gene at the RNA structure and protein structure levels using comparative sequence analysis across vertebrates. Rapid evolutionary stabilization of RNA structure, combined with frame-disrupting mutations in conserved regions, suggests that evolutionary pressure preserves the RNA structural core rather than its translational product. We perform similar experiments on alternatively spliced SRA isoforms to assess their structural features. PMID:22362738

  8. Analysis of sequencing data for probing RNA secondary structures and protein-RNA binding in studying posttranscriptional regulations.

    PubMed

    Hu, Xihao; Wu, Yang; Lu, Zhi John; Yip, Kevin Y

    2016-11-01

    High-throughput sequencing has been used to study posttranscriptional regulations, where the identification of protein-RNA binding is a major and fast-developing sub-area, which is in turn benefited by the sequencing methods for whole-transcriptome probing of RNA secondary structures. In the study of RNA secondary structures using high-throughput sequencing, bases are modified or cleaved according to their structural features, which alter the resulting composition of sequencing reads. In the study of protein-RNA binding, methods have been proposed to immuno-precipitate (IP) protein-bound RNA transcripts in vitro or in vivo By sequencing these transcripts, the protein-RNA interactions and the binding locations can be identified. For both types of data, read counts are affected by a combination of confounding factors, including expression levels of transcripts, sequence biases, mapping errors and the probing or IP efficiency of the experimental protocols. Careful processing of the sequencing data and proper extraction of important features are fundamentally important to a successful analysis. Here we review and compare different experimental methods for probing RNA secondary structures and binding sites of RNA-binding proteins (RBPs), and the computational methods proposed for analyzing the corresponding sequencing data. We suggest how these two types of data should be integrated to study the structural properties of RBP binding sites as a systematic way to better understand posttranscriptional regulations. © The Author 2015. Published by Oxford University Press. For Permissions, please email: journals.permissions@oup.com.

  9. Consistent global structures of complex RNA states through multidimensional chemical mapping

    PubMed Central

    Cheng, Clarence Yu; Chou, Fang-Chieh; Kladwang, Wipapat; Tian, Siqi; Cordero, Pablo; Das, Rhiju

    2015-01-01

    Accelerating discoveries of non-coding RNA (ncRNA) in myriad biological processes pose major challenges to structural and functional analysis. Despite progress in secondary structure modeling, high-throughput methods have generally failed to determine ncRNA tertiary structures, even at the 1-nm resolution that enables visualization of how helices and functional motifs are positioned in three dimensions. We report that integrating a new method called MOHCA-seq (Multiplexed •OH Cleavage Analysis with paired-end sequencing) with mutate-and-map secondary structure inference guides Rosetta 3D modeling to consistent 1-nm accuracy for intricately folded ncRNAs with lengths up to 188 nucleotides, including a blind RNA-puzzle challenge, the lariat-capping ribozyme. This multidimensional chemical mapping (MCM) pipeline resolves unexpected tertiary proximities for cyclic-di-GMP, glycine, and adenosylcobalamin riboswitch aptamers without their ligands and a loose structure for the recently discovered human HoxA9D internal ribosome entry site regulon. MCM offers a sequencing-based route to uncovering ncRNA 3D structure, applicable to functionally important but potentially heterogeneous states. DOI: http://dx.doi.org/10.7554/eLife.07600.001 PMID:26035425

  10. ClaRNA: a classifier of contacts in RNA 3D structures based on a comparative analysis of various classification schemes

    PubMed Central

    Waleń, Tomasz; Chojnowski, Grzegorz; Gierski, Przemysław; Bujnicki, Janusz M.

    2014-01-01

    The understanding of folding and function of RNA molecules depends on the identification and classification of interactions between ribonucleotide residues. We developed a new method named ClaRNA for computational classification of contacts in RNA 3D structures. Unique features of the program are the ability to identify imperfect contacts and to process coarse-grained models. Each doublet of spatially close ribonucleotide residues in a query structure is compared to clusters of reference doublets obtained by analysis of a large number of experimentally determined RNA structures, and assigned a score that describes its similarity to one or more known types of contacts, including pairing, stacking, base–phosphate and base–ribose interactions. The accuracy of ClaRNA is 0.997 for canonical base pairs, 0.983 for non-canonical pairs and 0.961 for stacking interactions. The generalized squared correlation coefficient (GC2) for ClaRNA is 0.969 for canonical base pairs, 0.638 for non-canonical pairs and 0.824 for stacking interactions. The classifier can be easily extended to include new types of spatial relationships between pairs or larger assemblies of nucleotide residues. ClaRNA is freely available via a web server that includes an extensive set of tools for processing and visualizing structural information about RNA molecules. PMID:25159614

  11. Correlation of RNA secondary structure statistics with thermodynamic stability and applications to folding.

    PubMed

    Wu, Johnny C; Gardner, David P; Ozer, Stuart; Gutell, Robin R; Ren, Pengyu

    2009-08-28

    The accurate prediction of the secondary and tertiary structure of an RNA with different folding algorithms is dependent on several factors, including the energy functions. However, an RNA higher-order structure cannot be predicted accurately from its sequence based on a limited set of energy parameters. The inter- and intramolecular forces between this RNA and other small molecules and macromolecules, in addition to other factors in the cell such as pH, ionic strength, and temperature, influence the complex dynamics associated with transition of a single stranded RNA to its secondary and tertiary structure. Since all of the factors that affect the formation of an RNAs 3D structure cannot be determined experimentally, statistically derived potential energy has been used in the prediction of protein structure. In the current work, we evaluate the statistical free energy of various secondary structure motifs, including base-pair stacks, hairpin loops, and internal loops, using their statistical frequency obtained from the comparative analysis of more than 50,000 RNA sequences stored in the RNA Comparative Analysis Database (rCAD) at the Comparative RNA Web (CRW) Site. Statistical energy was computed from the structural statistics for several datasets. While the statistical energy for a base-pair stack correlates with experimentally derived free energy values, suggesting a Boltzmann-like distribution, variation is observed between different molecules and their location on the phylogenetic tree of life. Our statistical energy values calculated for several structural elements were utilized in the Mfold RNA-folding algorithm. The combined statistical energy values for base-pair stacks, hairpins and internal loop flanks result in a significant improvement in the accuracy of secondary structure prediction; the hairpin flanks contribute the most.

  12. Hydration sites of unpaired RNA bases: a statistical analysis of the PDB structures.

    PubMed

    Kirillova, Svetlana; Carugo, Oliviero

    2011-10-19

    Hydration is crucial for RNA structure and function. X-ray crystallography is the most commonly used method to determine RNA structures and hydration and, therefore, statistical surveys are based on crystallographic results, the number of which is quickly increasing. A statistical analysis of the water molecule distribution in high-resolution X-ray structures of unpaired RNA nucleotides showed that: different bases have the same penchant to be surrounded by water molecules; clusters of water molecules indicate possible hydration sites, which, in some cases, match those of the major and minor grooves of RNA and DNA double helices; complex hydrogen bond networks characterize the solvation of the nucleotides, resulting in a significant rigidity of the base and its surrounding water molecules. Interestingly, the hydration sites around unpaired RNA bases do not match, in general, the positions that are occupied by the second nucleotide when the base-pair is formed. The hydration sites around unpaired RNA bases were found. They do not replicate the atom positions of complementary bases in the Watson-Crick pairs.

  13. Hydration sites of unpaired RNA bases: a statistical analysis of the PDB structures

    PubMed Central

    2011-01-01

    Background Hydration is crucial for RNA structure and function. X-ray crystallography is the most commonly used method to determine RNA structures and hydration and, therefore, statistical surveys are based on crystallographic results, the number of which is quickly increasing. Results A statistical analysis of the water molecule distribution in high-resolution X-ray structures of unpaired RNA nucleotides showed that: different bases have the same penchant to be surrounded by water molecules; clusters of water molecules indicate possible hydration sites, which, in some cases, match those of the major and minor grooves of RNA and DNA double helices; complex hydrogen bond networks characterize the solvation of the nucleotides, resulting in a significant rigidity of the base and its surrounding water molecules. Interestingly, the hydration sites around unpaired RNA bases do not match, in general, the positions that are occupied by the second nucleotide when the base-pair is formed. Conclusions The hydration sites around unpaired RNA bases were found. They do not replicate the atom positions of complementary bases in the Watson-Crick pairs. PMID:22011380

  14. Expression, crystallization and preliminary crystallographic analysis of RNA-binding protein Hfq (YmaH) from Bacillus subtilis in complex with an RNA aptamer

    PubMed Central

    Baba, Seiki; Someya, Tatsuhiko; Kawai, Gota; Nakamura, Kouji; Kumasaka, Takashi

    2010-01-01

    The Hfq protein is a hexameric RNA-binding protein which regulates gene expression by binding to RNA under the influence of diverse environmental stresses. Its ring structure binds various types of RNA, including mRNA and sRNA. RNA-bound structures of Hfq from Escherichia coli and Staphylococcus aureus have been revealed to have poly(A) RNA at the distal site and U-rich RNA at the proximal site, respectively. Here, crystals of a complex of the Bacillus subtilis Hfq protein with an A/G-repeat 7-mer RNA (Hfq–RNA) that were prepared using the hanging-drop vapour-diffusion technique are reported. The type 1 Hfq–RNA crystals belonged to space group I422, with unit-cell parameters a = b = 123.70, c = 119.13 Å, while the type 2 Hfq–RNA crystals belonged to space group F222, with unit-cell parameters a = 91.92, b = 92.50, c = 114.92 Å. Diffraction data were collected to a resolution of 2.20 Å from both crystal forms. The hexameric structure of the Hfq protein was clearly shown by self-rotation analysis. PMID:20445260

  15. Lessons from an evolving rRNA: 16S and 23S rRNA structures from a comparative perspective

    NASA Technical Reports Server (NTRS)

    Gutell, R. R.; Larsen, N.; Woese, C. R.

    1994-01-01

    The 16S and 23S rRNA higher-order structures inferred from comparative analysis are now quite refined. The models presented here differ from their immediate predecessors only in minor detail. Thus, it is safe to assert that all of the standard secondary-structure elements in (prokaryotic) rRNAs have been identified, with approximately 90% of the individual base pairs in each molecule having independent comparative support, and that at least some of the tertiary interactions have been revealed. It is interesting to compare the rRNAs in this respect with tRNA, whose higher-order structure is known in detail from its crystal structure (36) (Table 2). It can be seen that rRNAs have as great a fraction of their sequence in established secondary-structure elements as does tRNA. However, the fact that the former show a much lower fraction of identified tertiary interactions and a greater fraction of unpaired nucleotides than the latter implies that many of the rRNA tertiary interactions remain to be located. (Alternatively, the ribosome might involve protein-rRNA rather than intramolecular rRNA interactions to stabilize three-dimensional structure.) Experimental studies on rRNA are consistent to a first approximation with the structures proposed here, confirming the basic assumption of comparative analysis, i.e., that bases whose compositions strictly covary are physically interacting. In the exhaustive study of Moazed et al. (45) on protection of the bases in the small-subunit rRNA against chemical modification, the vast majority of bases inferred to pair by covariation are found to be protected from chemical modification, both in isolated small-subunit rRNA and in the 30S subunit. The majority of the tertiary interactions are reflected in the chemical protection data as well (45). On the other hand, many of the bases not shown as paired in Fig. 1 are accessible to chemical attack (45). However, in this case a sizeable fraction of them are also protected against chemical modification (in the isolated rRNA), which suggests that considerable higher-order structure remains to be found (although all of it may not involve base-base interactions and so may not be detectable by comparative analysis). The agreement between the higher-order structure of the small-subunit rRNA and protection against chemical modification is not perfect, however; some bases shown to covary canonically are accessible to chemical modification (45).(ABSTRACT TRUNCATED AT 400 WORDS).

  16. Comprehensive comparative analysis and identification of RNA-binding protein domains: multi-class classification and feature selection.

    PubMed

    Jahandideh, Samad; Srinivasasainagendra, Vinodh; Zhi, Degui

    2012-11-07

    RNA-protein interaction plays an important role in various cellular processes, such as protein synthesis, gene regulation, post-transcriptional gene regulation, alternative splicing, and infections by RNA viruses. In this study, using Gene Ontology Annotated (GOA) and Structural Classification of Proteins (SCOP) databases an automatic procedure was designed to capture structurally solved RNA-binding protein domains in different subclasses. Subsequently, we applied tuned multi-class SVM (TMCSVM), Random Forest (RF), and multi-class ℓ1/ℓq-regularized logistic regression (MCRLR) for analysis and classifying RNA-binding protein domains based on a comprehensive set of sequence and structural features. In this study, we compared prediction accuracy of three different state-of-the-art predictor methods. From our results, TMCSVM outperforms the other methods and suggests the potential of TMCSVM as a useful tool for facilitating the multi-class prediction of RNA-binding protein domains. On the other hand, MCRLR by elucidating importance of features for their contribution in predictive accuracy of RNA-binding protein domains subclasses, helps us to provide some biological insights into the roles of sequences and structures in protein-RNA interactions.

  17. Differential accumulation of nif structural gene mRNA in Azotobacter vinelandii.

    PubMed

    Hamilton, Trinity L; Jacobson, Marty; Ludwig, Marcus; Boyd, Eric S; Bryant, Donald A; Dean, Dennis R; Peters, John W

    2011-09-01

    Northern analysis was employed to investigate mRNA produced by mutant strains of Azotobacter vinelandii with defined deletions in the nif structural genes and in the intergenic noncoding regions. The results indicate that intergenic RNA secondary structures effect the differential accumulation of transcripts, supporting the high Fe protein-to-MoFe protein ratio required for optimal diazotrophic growth.

  18. Computational analysis of conserved RNA secondary structure in transcriptomes and genomes.

    PubMed

    Eddy, Sean R

    2014-01-01

    Transcriptomics experiments and computational predictions both enable systematic discovery of new functional RNAs. However, many putative noncoding transcripts arise instead from artifacts and biological noise, and current computational prediction methods have high false positive rates. I discuss prospects for improving computational methods for analyzing and identifying functional RNAs, with a focus on detecting signatures of conserved RNA secondary structure. An interesting new front is the application of chemical and enzymatic experiments that probe RNA structure on a transcriptome-wide scale. I review several proposed approaches for incorporating structure probing data into the computational prediction of RNA secondary structure. Using probabilistic inference formalisms, I show how all these approaches can be unified in a well-principled framework, which in turn allows RNA probing data to be easily integrated into a wide range of analyses that depend on RNA secondary structure inference. Such analyses include homology search and genome-wide detection of new structural RNAs.

  19. On the combinatorics of sparsification.

    PubMed

    Huang, Fenix Wd; Reidys, Christian M

    2012-10-22

    We study the sparsification of dynamic programming based on folding algorithms of RNA structures. Sparsification is a method that improves significantly the computation of minimum free energy (mfe) RNA structures. We provide a quantitative analysis of the sparsification of a particular decomposition rule, Λ∗. This rule splits an interval of RNA secondary and pseudoknot structures of fixed topological genus. Key for quantifying sparsifications is the size of the so called candidate sets. Here we assume mfe-structures to be specifically distributed (see Assumption 1) within arbitrary and irreducible RNA secondary and pseudoknot structures of fixed topological genus. We then present a combinatorial framework which allows by means of probabilities of irreducible sub-structures to obtain the expectation of the Λ∗-candidate set w.r.t. a uniformly random input sequence. We compute these expectations for arc-based energy models via energy-filtered generating functions (GF) in case of RNA secondary structures as well as RNA pseudoknot structures. Furthermore, for RNA secondary structures we also analyze a simplified loop-based energy model. Our combinatorial analysis is then compared to the expected number of Λ∗-candidates obtained from the folding mfe-structures. In case of the mfe-folding of RNA secondary structures with a simplified loop-based energy model our results imply that sparsification provides a significant, constant improvement of 91% (theory) to be compared to an 96% (experimental, simplified arc-based model) reduction. However, we do not observe a linear factor improvement. Finally, in case of the "full" loop-energy model we can report a reduction of 98% (experiment). Sparsification was initially attributed a linear factor improvement. This conclusion was based on the so called polymer-zeta property, which stems from interpreting polymer chains as self-avoiding walks. Subsequent findings however reveal that the O(n) improvement is not correct. The combinatorial analysis presented here shows that, assuming a specific distribution (see Assumption 1), of mfe-structures within irreducible and arbitrary structures, the expected number of Λ∗-candidates is Θ(n2). However, the constant reduction is quite significant, being in the range of 96%. We furthermore show an analogous result for the sparsification of the Λ∗-decomposition rule for RNA pseudoknotted structures of genus one. Finally we observe that the effect of sparsification is sensitive to the employed energy model.

  20. Structural insights into RISC assembly facilitated by dsRNA-binding domains of human RNA helicase A (DHX9)

    PubMed Central

    Fu, Qinqin; Yuan, Y. Adam

    2013-01-01

    Intensive research interest has focused on small RNA-processing machinery and the RNA-induced silencing complex (RISC), key cellular machines in RNAi pathways. However, the structural mechanism regarding RISC assembly, the primary step linking small RNA processing and RNA-mediated gene silencing, is largely unknown. Human RNA helicase A (DHX9) was reported to function as an RISC-loading factor, and such function is mediated mainly by its dsRNA-binding domains (dsRBDs). Here, we report the crystal structures of human RNA helicase A (RHA) dsRBD1 and dsRBD2 domains in complex with dsRNAs, respectively. Structural analysis not only reveals higher siRNA duplex-binding affinity displayed by dsRBD1, but also identifies a crystallographic dsRBD1 pair of physiological significance in cooperatively recognizing dsRNAs. Structural observations are further validated by isothermal titration calorimetric (ITC) assay. Moreover, co-immunoprecipitation (co-IP) assay coupled with mutagenesis demonstrated that both dsRBDs are required for RISC association, and such association is mediated by dsRNA. Hence, our structural and functional efforts have revealed a potential working model for siRNA recognition by RHA tandem dsRBDs, and together they provide direct structural insights into RISC assembly facilitated by RHA. PMID:23361462

  1. Structural insights into RISC assembly facilitated by dsRNA-binding domains of human RNA helicase A (DHX9).

    PubMed

    Fu, Qinqin; Yuan, Y Adam

    2013-03-01

    Intensive research interest has focused on small RNA-processing machinery and the RNA-induced silencing complex (RISC), key cellular machines in RNAi pathways. However, the structural mechanism regarding RISC assembly, the primary step linking small RNA processing and RNA-mediated gene silencing, is largely unknown. Human RNA helicase A (DHX9) was reported to function as an RISC-loading factor, and such function is mediated mainly by its dsRNA-binding domains (dsRBDs). Here, we report the crystal structures of human RNA helicase A (RHA) dsRBD1 and dsRBD2 domains in complex with dsRNAs, respectively. Structural analysis not only reveals higher siRNA duplex-binding affinity displayed by dsRBD1, but also identifies a crystallographic dsRBD1 pair of physiological significance in cooperatively recognizing dsRNAs. Structural observations are further validated by isothermal titration calorimetric (ITC) assay. Moreover, co-immunoprecipitation (co-IP) assay coupled with mutagenesis demonstrated that both dsRBDs are required for RISC association, and such association is mediated by dsRNA. Hence, our structural and functional efforts have revealed a potential working model for siRNA recognition by RHA tandem dsRBDs, and together they provide direct structural insights into RISC assembly facilitated by RHA.

  2. Structural Analysis of Single-Point Mutations Given an RNA Sequence: A Case Study with RNAMute

    NASA Astrophysics Data System (ADS)

    Churkin, Alexander; Barash, Danny

    2006-12-01

    We introduce here for the first time the RNAMute package, a pattern-recognition-based utility to perform mutational analysis and detect vulnerable spots within an RNA sequence that affect structure. Mutations in these spots may lead to a structural change that directly relates to a change in functionality. Previously, the concept was tried on RNA genetic control elements called "riboswitches" and other known RNA switches, without an organized utility that analyzes all single-point mutations and can be further expanded. The RNAMute package allows a comprehensive categorization, given an RNA sequence that has functional relevance, by exploring the patterns of all single-point mutants. For illustration, we apply the RNAMute package on an RNA transcript for which individual point mutations were shown experimentally to inactivate spectinomycin resistance in Escherichia coli. Functional analysis of mutations on this case study was performed experimentally by creating a library of point mutations using PCR and screening to locate those mutations. With the availability of RNAMute, preanalysis can be performed computationally before conducting an experiment.

  3. Domain motions of Argonaute, the catalytic engine of RNA interference

    PubMed Central

    Ming, Dengming; Wall, Michael E; Sanbonmatsu, Kevin Y

    2007-01-01

    Background The Argonaute protein is the core component of the RNA-induced silencing complex, playing the central role of cleaving the mRNA target. Visual inspection of static crystal structures already has enabled researchers to suggest conformational changes of Argonaute that might occur during RNA interference. We have taken the next step by performing an all-atom normal mode analysis of the Pyrococcus furiosus and Aquifex aeolicus Argonaute crystal structures, allowing us to quantitatively assess the feasibility of these conformational changes. To perform the analysis, we begin with the energy-minimized X-ray structures. Normal modes are then calculated using an all-atom molecular mechanics force field. Results The analysis reveals low-frequency vibrations that facilitate the accommodation of RNA duplexes – an essential step in target recognition. The Pyrococcus furiosus and Aquifex aeolicus Argonaute proteins both exhibit low-frequency torsion and hinge motions; however, differences in the overall architecture of the proteins cause the detailed dynamics to be significantly different. Conclusion Overall, low-frequency vibrations of Argonaute are consistent with mechanisms within the current reaction cycle model for RNA interference. PMID:18053142

  4. Domain motions of Argonaute, the catalytic engine of RNA interference.

    PubMed

    Ming, Dengming; Wall, Michael E; Sanbonmatsu, Kevin Y

    2007-11-30

    The Argonaute protein is the core component of the RNA-induced silencing complex, playing the central role of cleaving the mRNA target. Visual inspection of static crystal structures already has enabled researchers to suggest conformational changes of Argonaute that might occur during RNA interference. We have taken the next step by performing an all-atom normal mode analysis of the Pyrococcus furiosus and Aquifex aeolicus Argonaute crystal structures, allowing us to quantitatively assess the feasibility of these conformational changes. To perform the analysis, we begin with the energy-minimized X-ray structures. Normal modes are then calculated using an all-atom molecular mechanics force field. The analysis reveals low-frequency vibrations that facilitate the accommodation of RNA duplexes - an essential step in target recognition. The Pyrococcus furiosus and Aquifex aeolicus Argonaute proteins both exhibit low-frequency torsion and hinge motions; however, differences in the overall architecture of the proteins cause the detailed dynamics to be significantly different. Overall, low-frequency vibrations of Argonaute are consistent with mechanisms within the current reaction cycle model for RNA interference.

  5. RNA-Seq-Based Transcript Structure Analysis with TrBorderExt.

    PubMed

    Wang, Yejun; Sun, Ming-An; White, Aaron P

    2018-01-01

    RNA-Seq has become a routine strategy for genome-wide gene expression comparisons in bacteria. Despite lower resolution in transcript border parsing compared with dRNA-Seq, TSS-EMOTE, Cappable-seq, Term-seq, and others, directional RNA-Seq still illustrates its advantages: low cost, quantification and transcript border analysis with a medium resolution (±10-20 nt). To facilitate mining of directional RNA-Seq datasets especially with respect to transcript structure analysis, we developed a tool, TrBorderExt, which can parse transcript start sites and termination sites accurately in bacteria. A detailed protocol is described in this chapter for how to use the software package step by step to identify bacterial transcript borders from raw RNA-Seq data. The package was developed with Perl and R programming languages, and is accessible freely through the website: http://www.szu-bioinf.org/TrBorderExt .

  6. Empirical analysis of RNA robustness and evolution using high-throughput sequencing of ribozyme reactions.

    PubMed

    Hayden, Eric J

    2016-08-15

    RNA molecules provide a realistic but tractable model of a genotype to phenotype relationship. This relationship has been extensively investigated computationally using secondary structure prediction algorithms. Enzymatic RNA molecules, or ribozymes, offer access to genotypic and phenotypic information in the laboratory. Advancements in high-throughput sequencing technologies have enabled the analysis of sequences in the lab that now rivals what can be accomplished computationally. This has motivated a resurgence of in vitro selection experiments and opened new doors for the analysis of the distribution of RNA functions in genotype space. A body of computational experiments has investigated the persistence of specific RNA structures despite changes in the primary sequence, and how this mutational robustness can promote adaptations. This article summarizes recent approaches that were designed to investigate the role of mutational robustness during the evolution of RNA molecules in the laboratory, and presents theoretical motivations, experimental methods and approaches to data analysis. Copyright © 2016 Elsevier Inc. All rights reserved.

  7. Structural RNAs of known and unknown function identified in malaria parasites by comparative genomics and RNA analysis

    PubMed Central

    Chakrabarti, Kausik; Pearson, Michael; Grate, Leslie; Sterne-Weiler, Timothy; Deans, Jonathan; Donohue, John Paul; Ares, Manuel

    2007-01-01

    As the genomes of more eukaryotic pathogens are sequenced, understanding how molecular differences between parasite and host might be exploited to provide new therapies has become a major focus. Central to cell function are RNA-containing complexes involved in gene expression, such as the ribosome, the spliceosome, snoRNAs, RNase P, and telomerase, among others. In this article we identify by comparative genomics and validate by RNA analysis numerous previously unknown structural RNAs encoded by the Plasmodium falciparum genome, including the telomerase RNA, U3, 31 snoRNAs, as well as previously predicted spliceosomal snRNAs, SRP RNA, MRP RNA, and RNAse P RNA. Furthermore, we identify six new RNA coding genes of unknown function. To investigate the relationships of the RNA coding genes to other genomic features in related parasites, we developed a genome browser for P. falciparum (http://areslab.ucsc.edu/cgi-bin/hgGateway). Additional experiments provide evidence supporting the prediction that snoRNAs guide methylation of a specific position on U4 snRNA, as well as predicting an snRNA promoter element particular to Plasmodium sp. These findings should allow detailed structural comparisons between the RNA components of the gene expression machinery of the parasite and its vertebrate hosts. PMID:17901154

  8. Free energy minimization to predict RNA secondary structures and computational RNA design.

    PubMed

    Churkin, Alexander; Weinbrand, Lina; Barash, Danny

    2015-01-01

    Determining the RNA secondary structure from sequence data by computational predictions is a long-standing problem. Its solution has been approached in two distinctive ways. If a multiple sequence alignment of a collection of homologous sequences is available, the comparative method uses phylogeny to determine conserved base pairs that are more likely to form as a result of billions of years of evolution than by chance. In the case of single sequences, recursive algorithms that compute free energy structures by using empirically derived energy parameters have been developed. This latter approach of RNA folding prediction by energy minimization is widely used to predict RNA secondary structure from sequence. For a significant number of RNA molecules, the secondary structure of the RNA molecule is indicative of its function and its computational prediction by minimizing its free energy is important for its functional analysis. A general method for free energy minimization to predict RNA secondary structures is dynamic programming, although other optimization methods have been developed as well along with empirically derived energy parameters. In this chapter, we introduce and illustrate by examples the approach of free energy minimization to predict RNA secondary structures.

  9. De novo discovery of structural motifs in RNA 3D structures through clustering.

    PubMed

    Ge, Ping; Islam, Shahidul; Zhong, Cuncong; Zhang, Shaojie

    2018-05-18

    As functional components in three-dimensional (3D) conformation of an RNA, the RNA structural motifs provide an easy way to associate the molecular architectures with their biological mechanisms. In the past years, many computational tools have been developed to search motif instances by using the existing knowledge of well-studied families. Recently, with the rapidly increasing number of resolved RNA 3D structures, there is an urgent need to discover novel motifs with the newly presented information. In this work, we classify all the loops in non-redundant RNA 3D structures to detect plausible RNA structural motif families by using a clustering pipeline. Compared with other clustering approaches, our method has two benefits: first, the underlying alignment algorithm is tolerant to the variations in 3D structures. Second, sophisticated downstream analysis has been performed to ensure the clusters are valid and easily applied to further research. The final clustering results contain many interesting new variants of known motif families, such as GNAA tetraloop, kink-turn, sarcin-ricin and T-loop. We have also discovered potential novel functional motifs conserved in ribosomal RNA, sgRNA, SRP RNA, riboswitch and ribozyme.

  10. RNA FRABASE 2.0: an advanced web-accessible database with the capacity to search the three-dimensional fragments within RNA structures

    PubMed Central

    2010-01-01

    Background Recent discoveries concerning novel functions of RNA, such as RNA interference, have contributed towards the growing importance of the field. In this respect, a deeper knowledge of complex three-dimensional RNA structures is essential to understand their new biological functions. A number of bioinformatic tools have been proposed to explore two major structural databases (PDB, NDB) in order to analyze various aspects of RNA tertiary structures. One of these tools is RNA FRABASE 1.0, the first web-accessible database with an engine for automatic search of 3D fragments within PDB-derived RNA structures. This search is based upon the user-defined RNA secondary structure pattern. In this paper, we present and discuss RNA FRABASE 2.0. This second version of the system represents a major extension of this tool in terms of providing new data and a wide spectrum of novel functionalities. An intuitionally operated web server platform enables very fast user-tailored search of three-dimensional RNA fragments, their multi-parameter conformational analysis and visualization. Description RNA FRABASE 2.0 has stored information on 1565 PDB-deposited RNA structures, including all NMR models. The RNA FRABASE 2.0 search engine algorithms operate on the database of the RNA sequences and the new library of RNA secondary structures, coded in the dot-bracket format extended to hold multi-stranded structures and to cover residues whose coordinates are missing in the PDB files. The library of RNA secondary structures (and their graphics) is made available. A high level of efficiency of the 3D search has been achieved by introducing novel tools to formulate advanced searching patterns and to screen highly populated tertiary structure elements. RNA FRABASE 2.0 also stores data and conformational parameters in order to provide "on the spot" structural filters to explore the three-dimensional RNA structures. An instant visualization of the 3D RNA structures is provided. RNA FRABASE 2.0 is freely available at http://rnafrabase.cs.put.poznan.pl. Conclusions RNA FRABASE 2.0 provides a novel database and powerful search engine which is equipped with new data and functionalities that are unavailable elsewhere. Our intention is that this advanced version of the RNA FRABASE will be of interest to all researchers working in the RNA field. PMID:20459631

  11. RNA FRABASE 2.0: an advanced web-accessible database with the capacity to search the three-dimensional fragments within RNA structures.

    PubMed

    Popenda, Mariusz; Szachniuk, Marta; Blazewicz, Marek; Wasik, Szymon; Burke, Edmund K; Blazewicz, Jacek; Adamiak, Ryszard W

    2010-05-06

    Recent discoveries concerning novel functions of RNA, such as RNA interference, have contributed towards the growing importance of the field. In this respect, a deeper knowledge of complex three-dimensional RNA structures is essential to understand their new biological functions. A number of bioinformatic tools have been proposed to explore two major structural databases (PDB, NDB) in order to analyze various aspects of RNA tertiary structures. One of these tools is RNA FRABASE 1.0, the first web-accessible database with an engine for automatic search of 3D fragments within PDB-derived RNA structures. This search is based upon the user-defined RNA secondary structure pattern. In this paper, we present and discuss RNA FRABASE 2.0. This second version of the system represents a major extension of this tool in terms of providing new data and a wide spectrum of novel functionalities. An intuitionally operated web server platform enables very fast user-tailored search of three-dimensional RNA fragments, their multi-parameter conformational analysis and visualization. RNA FRABASE 2.0 has stored information on 1565 PDB-deposited RNA structures, including all NMR models. The RNA FRABASE 2.0 search engine algorithms operate on the database of the RNA sequences and the new library of RNA secondary structures, coded in the dot-bracket format extended to hold multi-stranded structures and to cover residues whose coordinates are missing in the PDB files. The library of RNA secondary structures (and their graphics) is made available. A high level of efficiency of the 3D search has been achieved by introducing novel tools to formulate advanced searching patterns and to screen highly populated tertiary structure elements. RNA FRABASE 2.0 also stores data and conformational parameters in order to provide "on the spot" structural filters to explore the three-dimensional RNA structures. An instant visualization of the 3D RNA structures is provided. RNA FRABASE 2.0 is freely available at http://rnafrabase.cs.put.poznan.pl. RNA FRABASE 2.0 provides a novel database and powerful search engine which is equipped with new data and functionalities that are unavailable elsewhere. Our intention is that this advanced version of the RNA FRABASE will be of interest to all researchers working in the RNA field.

  12. The application of cluster analysis in the intercomparison of loop structures in RNA.

    PubMed

    Huang, Hung-Chung; Nagaswamy, Uma; Fox, George E

    2005-04-01

    We have developed a computational approach for the comparison and classification of RNA loop structures. Hairpin or interior loops identified in atomic resolution RNA structures were intercompared by conformational matching. The root-mean-square deviation (RMSD) values between all pairs of RNA fragments of interest, even if from different molecules, are calculated. Subsequently, cluster analysis is performed on the resulting matrix of RMSD distances using the unweighted pair group method with arithmetic mean (UPGMA). The cluster analysis objectively reveals groups of folds that resemble one another. To demonstrate the utility of the approach, a comprehensive analysis of all the terminal hairpin tetraloops that have been observed in 15 RNA structures that have been determined by X-ray crystallography was undertaken. The method found major clusters corresponding to the well-known GNRA and UNCG types. In addition, two tetraloops with the unusual primary sequence UMAC (M is A or C) were successfully assigned to the GNRA cluster. Larger loop structures were also examined and the clustering results confirmed the occurrence of variations of the GNRA and UNCG tetraloops in these loops and provided a systematic means for locating them. Nineteen examples of larger loops that closely resemble either the GNRA or UNCG tetraloop were found in the large ribosomal RNAs. When the clustering approach was extended to include all structures in the SCOR database, novel relationships were detected including one between the ANYA motif and a less common folding of the GAAA tetraloop sequence.

  13. The application of cluster analysis in the intercomparison of loop structures in RNA

    PubMed Central

    HUANG, HUNG-CHUNG; NAGASWAMY, UMA; FOX, GEORGE E.

    2005-01-01

    We have developed a computational approach for the comparison and classification of RNA loop structures. Hairpin or interior loops identified in atomic resolution RNA structures were intercompared by conformational matching. The root-mean-square deviation (RMSD) values between all pairs of RNA fragments of interest, even if from different molecules, are calculated. Subsequently, cluster analysis is performed on the resulting matrix of RMSD distances using the unweighted pair group method with arithmetic mean (UPGMA). The cluster analysis objectively reveals groups of folds that resemble one another. To demonstrate the utility of the approach, a comprehensive analysis of all the terminal hairpin tetraloops that have been observed in 15 RNA structures that have been determined by X-ray crystallography was undertaken. The method found major clusters corresponding to the well-known GNRA and UNCG types. In addition, two tetraloops with the unusual primary sequence UMAC (M is A or C) were successfully assigned to the GNRA cluster. Larger loop structures were also examined and the clustering results confirmed the occurrence of variations of the GNRA and UNCG tetraloops in these loops and provided a systematic means for locating them. Nineteen examples of larger loops that closely resemble either the GNRA or UNCG tetraloop were found in the large ribosomal RNAs. When the clustering approach was extended to include all structures in the SCOR database, novel relationships were detected including one between the ANYA motif and a less common folding of the GAAA tetraloop sequence. PMID:15769871

  14. DSSR-enhanced visualization of nucleic acid structures in Jmol

    PubMed Central

    Hanson, Robert M.

    2017-01-01

    Abstract Sophisticated and interactive visualizations are essential for making sense of the intricate 3D structures of macromolecules. For proteins, secondary structural components are routinely featured in molecular graphics visualizations. However, the field of RNA structural bioinformatics is still lagging behind; for example, current molecular graphics tools lack built-in support even for base pairs, double helices, or hairpin loops. DSSR (Dissecting the Spatial Structure of RNA) is an integrated and automated command-line tool for the analysis and annotation of RNA tertiary structures. It calculates a comprehensive and unique set of features for characterizing RNA, as well as DNA structures. Jmol is a widely used, open-source Java viewer for 3D structures, with a powerful scripting language. JSmol, its reincarnation based on native JavaScript, has a predominant position in the post Java-applet era for web-based visualization of molecular structures. The DSSR-Jmol integration presented here makes salient features of DSSR readily accessible, either via the Java-based Jmol application itself, or its HTML5-based equivalent, JSmol. The DSSR web service accepts 3D coordinate files (in mmCIF or PDB format) initiated from a Jmol or JSmol session and returns DSSR-derived structural features in JSON format. This seamless combination of DSSR and Jmol/JSmol brings the molecular graphics of 3D RNA structures to a similar level as that for proteins, and enables a much deeper analysis of structural characteristics. It fills a gap in RNA structural bioinformatics, and is freely accessible (via the Jmol application or the JSmol-based website http://jmol.x3dna.org). PMID:28472503

  15. A complex structure in the mRNA of Tf1 is recognized and cleaved to generate the primer of reverse transcription.

    PubMed

    Lin, J H; Levin, H L

    1997-01-15

    All retroviruses and LTR-containing retrotransposons are thought to require specific tRNA molecules to serve as primers of reverse transcription. An exception is the LTR-containing retrotransposon Tf1, isolated from Schizosaccharomyces pombe. Instead of requiring a tRNA, the reverse transcriptase of Tf1 uses the first 11 bases of the Tf1 transcript as the primer for reverse transcription. The primer is generated by a cleavage that occurs between bases 11 and 12 of the Tf1 mRNA. Sequence analysis of the 5' untranslated region of the Tf1 mRNA resulted in the identification of a region with the potential to form an RNA structure of 89 bases that included the primer binding site and the first 11 bases of the Tf1 mRNA. Systematic mutagenesis of this region revealed 34 single-point mutants in the structure that resulted in reduced transposition activity. The defects in transposition correlated with reduced level of Tf1 reverse transcripts as determined by DNA blot analysis. Evidence that the RNA structure did form in vivo included the result that strains with second site mutations that restored complementarity resulted in increased levels of reverse transcripts and Tf1 transposition. The majority of the mutants defective for reverse transcription were unable to cleave the Tf1 mRNA between bases 11 and 12. These data indicate that formation of an extensive RNA structure was required for the cleavage reaction that generated the primer for Tf1 reverse transcription.

  16. The impact of CRISPR repeat sequence on structures of a Cas6 protein-RNA complex

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wang, Ruiying; Zheng, Han; Preamplume, Gan

    The repeat-associated mysterious proteins (RAMPs) comprise the most abundant family of proteins involved in prokaryotic immunity against invading genetic elements conferred by the clustered regularly interspaced short palindromic repeat (CRISPR) system. Cas6 is one of the first characterized RAMP proteins and is a key enzyme required for CRISPR RNA maturation. Despite a strong structural homology with other RAMP proteins that bind hairpin RNA, Cas6 distinctly recognizes single-stranded RNA. Previous structural and biochemical studies show that Cas6 captures the 5' end while cleaving the 3' end of the CRISPR RNA. Here, we describe three structures and complementary biochemical analysis of amore » noncatalytic Cas6 homolog from Pyrococcus horikoshii bound to CRISPR repeat RNA of different sequences. Our study confirms the specificity of the Cas6 protein for single-stranded RNA and further reveals the importance of the bases at Positions 5-7 in Cas6-RNA interactions. Substitutions of these bases result in structural changes in the protein-RNA complex including its oligomerization state.« less

  17. In-silico analysis for RNA-interference mechanism of α-synuclein to treat Parkinson's disease.

    PubMed

    Seema, S; Seenivasagam, R; Hemavathi, K

    2013-01-01

    Parkinson's Disease (PD) causing mutations in α-synuclein gene are ALA30PRO, GLU46LYS and ALA53THR. The conformational changes in proteins with respect to all the three mutations were analysed. These were used to predict the structures of Short Interfering RNA (siRNA) antisense strand and siRNA region. The siRNA binds with the argonaute protein forming RNA Induced Silencing Complex (RISC). Then, siRNA antisense-strand was attached to RISC. The structure of dicer (RNase-III-enzyme) cleaves double-stranded RNA (dsRNA) into two siRNA-strands. Incorporation of single siRNA-strand into RISC guides to pair with the complementary α-synuclein target-messenger RNA (mRNA) thereby enabling it to cleave the target.

  18. Quantitative Understanding of SHAPE Mechanism from RNA Structure and Dynamics Analysis.

    PubMed

    Hurst, Travis; Xu, Xiaojun; Zhao, Peinan; Chen, Shi-Jie

    2018-05-10

    The selective 2'-hydroxyl acylation analyzed by primer extension (SHAPE) method probes RNA local structural and dynamic information at single nucleotide resolution. To gain quantitative insights into the relationship between nucleotide flexibility, RNA 3D structure, and SHAPE reactivity, we develop a 3D Structure-SHAPE Relationship model (3DSSR) to rebuild SHAPE profiles from 3D structures. The model starts from RNA structures and combines nucleotide interaction strength and conformational propensity, ligand (SHAPE reagent) accessibility, and base-pairing pattern through a composite function to quantify the correlation between SHAPE reactivity and nucleotide conformational stability. The 3DSSR model shows the relationship between SHAPE reactivity and RNA structure and energetics. Comparisons between the 3DSSR-predicted SHAPE profile and the experimental SHAPE data show correlation, suggesting that the extracted analytical function may have captured the key factors that determine the SHAPE reactivity profile. Furthermore, the theory offers an effective method to sieve RNA 3D models and exclude models that are incompatible with experimental SHAPE data.

  19. Footprinting analysis of interactions between the largest eukaryotic RNase P/MRP protein Pop1 and RNase P/MRP RNA components

    PubMed Central

    Fagerlund, Robert D.; Perederina, Anna; Berezin, Igor; Krasilnikov, Andrey S.

    2015-01-01

    Ribonuclease (RNase) P and RNase MRP are closely related catalytic ribonucleoproteins involved in the metabolism of a wide range of RNA molecules, including tRNA, rRNA, and some mRNAs. The catalytic RNA component of eukaryotic RNase P retains the core elements of the bacterial RNase P ribozyme; however, the peripheral RNA elements responsible for the stabilization of the global architecture are largely absent in the eukaryotic enzyme. At the same time, the protein makeup of eukaryotic RNase P is considerably more complex than that of the bacterial RNase P. RNase MRP, an essential and ubiquitous eukaryotic enzyme, has a structural organization resembling that of eukaryotic RNase P, and the two enzymes share most of their protein components. Here, we present the results of the analysis of interactions between the largest protein component of yeast RNases P/MRP, Pop1, and the RNA moieties of the enzymes, discuss structural implications of the results, and suggest that Pop1 plays the role of a scaffold for the stabilization of the global architecture of eukaryotic RNase P RNA, substituting for the network of RNA–RNA tertiary interactions that maintain the global RNA structure in bacterial RNase P. PMID:26135751

  20. Footprinting analysis of interactions between the largest eukaryotic RNase P/MRP protein Pop1 and RNase P/MRP RNA components.

    PubMed

    Fagerlund, Robert D; Perederina, Anna; Berezin, Igor; Krasilnikov, Andrey S

    2015-09-01

    Ribonuclease (RNase) P and RNase MRP are closely related catalytic ribonucleoproteins involved in the metabolism of a wide range of RNA molecules, including tRNA, rRNA, and some mRNAs. The catalytic RNA component of eukaryotic RNase P retains the core elements of the bacterial RNase P ribozyme; however, the peripheral RNA elements responsible for the stabilization of the global architecture are largely absent in the eukaryotic enzyme. At the same time, the protein makeup of eukaryotic RNase P is considerably more complex than that of the bacterial RNase P. RNase MRP, an essential and ubiquitous eukaryotic enzyme, has a structural organization resembling that of eukaryotic RNase P, and the two enzymes share most of their protein components. Here, we present the results of the analysis of interactions between the largest protein component of yeast RNases P/MRP, Pop1, and the RNA moieties of the enzymes, discuss structural implications of the results, and suggest that Pop1 plays the role of a scaffold for the stabilization of the global architecture of eukaryotic RNase P RNA, substituting for the network of RNA-RNA tertiary interactions that maintain the global RNA structure in bacterial RNase P. © 2015 Fagerlund et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  1. Customized workflow development and data modularization concepts for RNA-Sequencing and metatranscriptome experiments.

    PubMed

    Lott, Steffen C; Wolfien, Markus; Riege, Konstantin; Bagnacani, Andrea; Wolkenhauer, Olaf; Hoffmann, Steve; Hess, Wolfgang R

    2017-11-10

    RNA-Sequencing (RNA-Seq) has become a widely used approach to study quantitative and qualitative aspects of transcriptome data. The variety of RNA-Seq protocols, experimental study designs and the characteristic properties of the organisms under investigation greatly affect downstream and comparative analyses. In this review, we aim to explain the impact of structured pre-selection, classification and integration of best-performing tools within modularized data analysis workflows and ready-to-use computing infrastructures towards experimental data analyses. We highlight examples for workflows and use cases that are presented for pro-, eukaryotic and mixed dual RNA-Seq (meta-transcriptomics) experiments. In addition, we are summarizing the expertise of the laboratories participating in the project consortium "Structured Analysis and Integration of RNA-Seq experiments" (de.STAIR) and its integration with the Galaxy-workbench of the RNA Bioinformatics Center (RBC). Copyright © 2017 The Authors. Published by Elsevier B.V. All rights reserved.

  2. New families of human regulatory RNA structures identified by comparative analysis of vertebrate genomes.

    PubMed

    Parker, Brian J; Moltke, Ida; Roth, Adam; Washietl, Stefan; Wen, Jiayu; Kellis, Manolis; Breaker, Ronald; Pedersen, Jakob Skou

    2011-11-01

    Regulatory RNA structures are often members of families with multiple paralogous instances across the genome. Family members share functional and structural properties, which allow them to be studied as a whole, facilitating both bioinformatic and experimental characterization. We have developed a comparative method, EvoFam, for genome-wide identification of families of regulatory RNA structures, based on primary sequence and secondary structure similarity. We apply EvoFam to a 41-way genomic vertebrate alignment. Genome-wide, we identify 220 human, high-confidence families outside protein-coding regions comprising 725 individual structures, including 48 families with known structural RNA elements. Known families identified include both noncoding RNAs, e.g., miRNAs and the recently identified MALAT1/MEN β lincRNA family; and cis-regulatory structures, e.g., iron-responsive elements. We also identify tens of new families supported by strong evolutionary evidence and other statistical evidence, such as GO term enrichments. For some of these, detailed analysis has led to the formulation of specific functional hypotheses. Examples include two hypothesized auto-regulatory feedback mechanisms: one involving six long hairpins in the 3'-UTR of MAT2A, a key metabolic gene that produces the primary human methyl donor S-adenosylmethionine; the other involving a tRNA-like structure in the intron of the tRNA maturation gene POP1. We experimentally validate the predicted MAT2A structures. Finally, we identify potential new regulatory networks, including large families of short hairpins enriched in immunity-related genes, e.g., TNF, FOS, and CTLA4, which include known transcript destabilizing elements. Our findings exemplify the diversity of post-transcriptional regulation and provide a resource for further characterization of new regulatory mechanisms and families of noncoding RNAs.

  3. Comparative structural analysis of human DEAD-box RNA helicases.

    PubMed

    Schütz, Patrick; Karlberg, Tobias; van den Berg, Susanne; Collins, Ruairi; Lehtiö, Lari; Högbom, Martin; Holmberg-Schiavone, Lovisa; Tempel, Wolfram; Park, Hee-Won; Hammarström, Martin; Moche, Martin; Thorsell, Ann-Gerd; Schüler, Herwig

    2010-09-30

    DEAD-box RNA helicases play various, often critical, roles in all processes where RNAs are involved. Members of this family of proteins are linked to human disease, including cancer and viral infections. DEAD-box proteins contain two conserved domains that both contribute to RNA and ATP binding. Despite recent advances the molecular details of how these enzymes convert chemical energy into RNA remodeling is unknown. We present crystal structures of the isolated DEAD-domains of human DDX2A/eIF4A1, DDX2B/eIF4A2, DDX5, DDX10/DBP4, DDX18/myc-regulated DEAD-box protein, DDX20, DDX47, DDX52/ROK1, and DDX53/CAGE, and of the helicase domains of DDX25 and DDX41. Together with prior knowledge this enables a family-wide comparative structural analysis. We propose a general mechanism for opening of the RNA binding site. This analysis also provides insights into the diversity of DExD/H- proteins, with implications for understanding the functions of individual family members.

  4. Comparative Structural Analysis of Human DEAD-Box RNA Helicases

    PubMed Central

    Schütz, Patrick; Karlberg, Tobias; van den Berg, Susanne; Collins, Ruairi; Lehtiö, Lari; Högbom, Martin; Holmberg-Schiavone, Lovisa; Tempel, Wolfram; Park, Hee-Won; Hammarström, Martin; Moche, Martin; Thorsell, Ann-Gerd; Schüler, Herwig

    2010-01-01

    DEAD-box RNA helicases play various, often critical, roles in all processes where RNAs are involved. Members of this family of proteins are linked to human disease, including cancer and viral infections. DEAD-box proteins contain two conserved domains that both contribute to RNA and ATP binding. Despite recent advances the molecular details of how these enzymes convert chemical energy into RNA remodeling is unknown. We present crystal structures of the isolated DEAD-domains of human DDX2A/eIF4A1, DDX2B/eIF4A2, DDX5, DDX10/DBP4, DDX18/myc-regulated DEAD-box protein, DDX20, DDX47, DDX52/ROK1, and DDX53/CAGE, and of the helicase domains of DDX25 and DDX41. Together with prior knowledge this enables a family-wide comparative structural analysis. We propose a general mechanism for opening of the RNA binding site. This analysis also provides insights into the diversity of DExD/H- proteins, with implications for understanding the functions of individual family members. PMID:20941364

  5. Structural insights into RNA processing by the human RISC-loading complex.

    PubMed

    Wang, Hong-Wei; Noland, Cameron; Siridechadilok, Bunpote; Taylor, David W; Ma, Enbo; Felderer, Karin; Doudna, Jennifer A; Nogales, Eva

    2009-11-01

    Targeted gene silencing by RNA interference (RNAi) requires loading of a short guide RNA (small interfering RNA (siRNA) or microRNA (miRNA)) onto an Argonaute protein to form the functional center of an RNA-induced silencing complex (RISC). In humans, Argonaute2 (AGO2) assembles with the guide RNA-generating enzyme Dicer and the RNA-binding protein TRBP to form a RISC-loading complex (RLC), which is necessary for efficient transfer of nascent siRNAs and miRNAs from Dicer to AGO2. Here, using single-particle EM analysis, we show that human Dicer has an L-shaped structure. The RLC Dicer's N-terminal DExH/D domain, located in a short 'base branch', interacts with TRBP, whereas its C-terminal catalytic domains in the main body are proximal to AGO2. A model generated by docking the available atomic structures of Dicer and Argonaute homologs into the RLC reconstruction suggests a mechanism for siRNA transfer from Dicer to AGO2.

  6. An analytical platform for mass spectrometry-based identification and chemical analysis of RNA in ribonucleoprotein complexes.

    PubMed

    Taoka, Masato; Yamauchi, Yoshio; Nobe, Yuko; Masaki, Shunpei; Nakayama, Hiroshi; Ishikawa, Hideaki; Takahashi, Nobuhiro; Isobe, Toshiaki

    2009-11-01

    We describe here a mass spectrometry (MS)-based analytical platform of RNA, which combines direct nano-flow reversed-phase liquid chromatography (RPLC) on a spray tip column and a high-resolution LTQ-Orbitrap mass spectrometer. Operating RPLC under a very low flow rate with volatile solvents and MS in the negative mode, we could estimate highly accurate mass values sufficient to predict the nucleotide composition of a approximately 21-nucleotide small interfering RNA, detect post-transcriptional modifications in yeast tRNA, and perform collision-induced dissociation/tandem MS-based structural analysis of nucleolytic fragments of RNA at a sub-femtomole level. Importantly, the method allowed the identification and chemical analysis of small RNAs in ribonucleoprotein (RNP) complex, such as the pre-spliceosomal RNP complex, which was pulled down from cultured cells with a tagged protein cofactor as bait. We have recently developed a unique genome-oriented database search engine, Ariadne, which allows tandem MS-based identification of RNAs in biological samples. Thus, the method presented here has broad potential for automated analysis of RNA; it complements conventional molecular biology-based techniques and is particularly suited for simultaneous analysis of the composition, structure, interaction, and dynamics of RNA and protein components in various cellular RNP complexes.

  7. Structure and variation of the mitochondrial genome of fishes.

    PubMed

    Satoh, Takashi P; Miya, Masaki; Mabuchi, Kohji; Nishida, Mutsumi

    2016-09-07

    The mitochondrial (mt) genome has been used as an effective tool for phylogenetic and population genetic analyses in vertebrates. However, the structure and variability of the vertebrate mt genome are not well understood. A potential strategy for improving our understanding is to conduct a comprehensive comparative study of large mt genome data. The aim of this study was to characterize the structure and variability of the fish mt genome through comparative analysis of large datasets. An analysis of the secondary structure of proteins for 250 fish species (248 ray-finned and 2 cartilaginous fishes) illustrated that cytochrome c oxidase subunits (COI, COII, and COIII) and a cytochrome bc1 complex subunit (Cyt b) had substantial amino acid conservation. Among the four proteins, COI was the most conserved, as more than half of all amino acid sites were invariable among the 250 species. Our models identified 43 and 58 stems within 12S rRNA and 16S rRNA, respectively, with larger numbers than proposed previously for vertebrates. The models also identified 149 and 319 invariable sites in 12S rRNA and 16S rRNA, respectively, in all fishes. In particular, the present result verified that a region corresponding to the peptidyl transferase center in prokaryotic 23S rRNA, which is homologous to mt 16S rRNA, is also conserved in fish mt 16S rRNA. Concerning the gene order, we found 35 variations (in 32 families) that deviated from the common gene order in vertebrates. These gene rearrangements were mostly observed in the area spanning the ND5 gene to the control region as well as two tRNA gene cluster regions (IQM and WANCY regions). Although many of such gene rearrangements were unique to a specific taxon, some were shared polyphyletically between distantly related species. Through a large-scale comparative analysis of 250 fish species mt genomes, we elucidated various structural aspects of the fish mt genome and the encoded genes. The present results will be important for understanding functions of the mt genome and developing programs for nucleotide sequence analysis. This study demonstrated the significance of extensive comparisons for understanding the structure of the mt genome.

  8. CryoEM structure of yeast cytoplasmic exosome complex.

    PubMed

    Liu, Jun-Jie; Niu, Chu-Ya; Wu, Yao; Tan, Dan; Wang, Yang; Ye, Ming-Da; Liu, Yang; Zhao, Wenwei; Zhou, Ke; Liu, Quan-Sheng; Dai, Junbiao; Yang, Xuerui; Dong, Meng-Qiu; Huang, Niu; Wang, Hong-Wei

    2016-07-01

    The eukaryotic multi-subunit RNA exosome complex plays crucial roles in 3'-to-5' RNA processing and decay. Rrp6 and Ski7 are the major cofactors for the nuclear and cytoplasmic exosomes, respectively. In the cytoplasm, Ski7 helps the exosome to target mRNAs for degradation and turnover via a through-core pathway. However, the interaction between Ski7 and the exosome complex has remained unclear. The transaction of RNA substrates within the exosome is also elusive. In this work, we used single-particle cryo-electron microscopy to solve the structures of the Ski7-exosome complex in RNA-free and RNA-bound forms at resolutions of 4.2 Å and 5.8 Å, respectively. These structures reveal that the N-terminal domain of Ski7 adopts a structural arrangement and interacts with the exosome in a similar fashion to the C-terminal domain of nuclear Rrp6. Further structural analysis of exosomes with RNA substrates harboring 3' overhangs of different length suggests a switch mechanism of RNA-induced exosome activation in the through-core pathway of RNA processing.

  9. Comparative genome analysis of Alkhumra hemorrhagic fever virus with Kyasanur forest disease and tick-borne encephalitis viruses by the in silico approach.

    PubMed

    Palanisamy, Navaneethan; Akaberi, Dario; Lennerstrand, Johan; Lundkvist, Åke

    2018-05-10

    Alkhumra hemorrhagic fever virus (AHFV), a relatively new member of the Flaviviruses, was discovered in Saudi Arabia 23 years ago. AHFV is classified in the tick-borne encephalitis virus serocomplex, along with the Kyasanur forest disease virus (KFDV) and tick-borne encephalitis virus (TBEV). Currently, very little is known about the pathologies of AHFV. In this study, using the available genome information of AHFV, KFDV and TBEV, we have predicted and compared the following aspects of these viruses: evolution, nucleotide and protein compositions, recombination, codon frequency, substitution rate, N- and O-glycosylation sites, signal peptide and cleavage site, transmembrane region, secondary structure of 5' and 3' UTRs and RNA-RNA interactions. Additionally, we have modeled the 3D protease and RNA-dependent RNA polymerase structures for AHFV, KFDV and TBEV. Recombination analysis showed no evidence of recombination in the AHFV genome with that of either KFDV or TBEV, although single break point analysis showed that nucleotide position 7399 (in the NS4B) is a breakpoint location. AHFV, KFDV and TBEV are very similar in terms of codon frequency, the number of transmembrane regions, properties of the polyprotein, RNA-RNA interaction sequences, NS3 protease and NS5 polymerase structures and 5' UTR structure. Using genome sequences, we showed the similarities between these closely- related viruses on several different areas.

  10. Structure of the Nucleoprotein Binding Domain of Mokola Virus Phosphoprotein▿

    PubMed Central

    Assenberg, René; Delmas, Olivier; Ren, Jingshan; Vidalain, Pierre-Olivier; Verma, Anil; Larrous, Florence; Graham, Stephen C.; Tangy, Frédéric; Grimes, Jonathan M.; Bourhy, Hervé

    2010-01-01

    Mokola virus (MOKV) is a nonsegmented, negative-sense RNA virus that belongs to the Lyssavirus genus and Rhabdoviridae family. MOKV phosphoprotein P is an essential component of the replication and transcription complex and acts as a cofactor for the viral RNA-dependent RNA polymerase. P recruits the viral polymerase to the nucleoprotein-bound viral RNA (N-RNA) via an interaction between its C-terminal domain and the N-RNA complex. Here we present a structure for this domain of MOKV P, obtained by expression of full-length P in Escherichia coli, which was subsequently truncated during crystallization. The structure has a high degree of homology with P of rabies virus, another member of Lyssavirus genus, and to a lesser degree with P of vesicular stomatitis virus (VSV), a member of the related Vesiculovirus genus. In addition, analysis of the crystal packing of this domain reveals a potential binding site for the nucleoprotein N. Using both site-directed mutagenesis and yeast two-hybrid experiments to measure P-N interaction, we have determined the relative roles of key amino acids involved in this interaction to map the region of P that binds N. This analysis also reveals a structural relationship between the N-RNA binding domain of the P proteins of the Rhabdoviridae and the Paramyxoviridae. PMID:19906936

  11. DSSR-enhanced visualization of nucleic acid structures in Jmol.

    PubMed

    Hanson, Robert M; Lu, Xiang-Jun

    2017-07-03

    Sophisticated and interactive visualizations are essential for making sense of the intricate 3D structures of macromolecules. For proteins, secondary structural components are routinely featured in molecular graphics visualizations. However, the field of RNA structural bioinformatics is still lagging behind; for example, current molecular graphics tools lack built-in support even for base pairs, double helices, or hairpin loops. DSSR (Dissecting the Spatial Structure of RNA) is an integrated and automated command-line tool for the analysis and annotation of RNA tertiary structures. It calculates a comprehensive and unique set of features for characterizing RNA, as well as DNA structures. Jmol is a widely used, open-source Java viewer for 3D structures, with a powerful scripting language. JSmol, its reincarnation based on native JavaScript, has a predominant position in the post Java-applet era for web-based visualization of molecular structures. The DSSR-Jmol integration presented here makes salient features of DSSR readily accessible, either via the Java-based Jmol application itself, or its HTML5-based equivalent, JSmol. The DSSR web service accepts 3D coordinate files (in mmCIF or PDB format) initiated from a Jmol or JSmol session and returns DSSR-derived structural features in JSON format. This seamless combination of DSSR and Jmol/JSmol brings the molecular graphics of 3D RNA structures to a similar level as that for proteins, and enables a much deeper analysis of structural characteristics. It fills a gap in RNA structural bioinformatics, and is freely accessible (via the Jmol application or the JSmol-based website http://jmol.x3dna.org). © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  12. RNACompress: Grammar-based compression and informational complexity measurement of RNA secondary structure.

    PubMed

    Liu, Qi; Yang, Yu; Chen, Chun; Bu, Jiajun; Zhang, Yin; Ye, Xiuzi

    2008-03-31

    With the rapid emergence of RNA databases and newly identified non-coding RNAs, an efficient compression algorithm for RNA sequence and structural information is needed for the storage and analysis of such data. Although several algorithms for compressing DNA sequences have been proposed, none of them are suitable for the compression of RNA sequences with their secondary structures simultaneously. This kind of compression not only facilitates the maintenance of RNA data, but also supplies a novel way to measure the informational complexity of RNA structural data, raising the possibility of studying the relationship between the functional activities of RNA structures and their complexities, as well as various structural properties of RNA based on compression. RNACompress employs an efficient grammar-based model to compress RNA sequences and their secondary structures. The main goals of this algorithm are two fold: (1) present a robust and effective way for RNA structural data compression; (2) design a suitable model to represent RNA secondary structure as well as derive the informational complexity of the structural data based on compression. Our extensive tests have shown that RNACompress achieves a universally better compression ratio compared with other sequence-specific or common text-specific compression algorithms, such as Gencompress, winrar and gzip. Moreover, a test of the activities of distinct GTP-binding RNAs (aptamers) compared with their structural complexity shows that our defined informational complexity can be used to describe how complexity varies with activity. These results lead to an objective means of comparing the functional properties of heteropolymers from the information perspective. A universal algorithm for the compression of RNA secondary structure as well as the evaluation of its informational complexity is discussed in this paper. We have developed RNACompress, as a useful tool for academic users. Extensive tests have shown that RNACompress is a universally efficient algorithm for the compression of RNA sequences with their secondary structures. RNACompress also serves as a good measurement of the informational complexity of RNA secondary structure, which can be used to study the functional activities of RNA molecules.

  13. RNACompress: Grammar-based compression and informational complexity measurement of RNA secondary structure

    PubMed Central

    Liu, Qi; Yang, Yu; Chen, Chun; Bu, Jiajun; Zhang, Yin; Ye, Xiuzi

    2008-01-01

    Background With the rapid emergence of RNA databases and newly identified non-coding RNAs, an efficient compression algorithm for RNA sequence and structural information is needed for the storage and analysis of such data. Although several algorithms for compressing DNA sequences have been proposed, none of them are suitable for the compression of RNA sequences with their secondary structures simultaneously. This kind of compression not only facilitates the maintenance of RNA data, but also supplies a novel way to measure the informational complexity of RNA structural data, raising the possibility of studying the relationship between the functional activities of RNA structures and their complexities, as well as various structural properties of RNA based on compression. Results RNACompress employs an efficient grammar-based model to compress RNA sequences and their secondary structures. The main goals of this algorithm are two fold: (1) present a robust and effective way for RNA structural data compression; (2) design a suitable model to represent RNA secondary structure as well as derive the informational complexity of the structural data based on compression. Our extensive tests have shown that RNACompress achieves a universally better compression ratio compared with other sequence-specific or common text-specific compression algorithms, such as Gencompress, winrar and gzip. Moreover, a test of the activities of distinct GTP-binding RNAs (aptamers) compared with their structural complexity shows that our defined informational complexity can be used to describe how complexity varies with activity. These results lead to an objective means of comparing the functional properties of heteropolymers from the information perspective. Conclusion A universal algorithm for the compression of RNA secondary structure as well as the evaluation of its informational complexity is discussed in this paper. We have developed RNACompress, as a useful tool for academic users. Extensive tests have shown that RNACompress is a universally efficient algorithm for the compression of RNA sequences with their secondary structures. RNACompress also serves as a good measurement of the informational complexity of RNA secondary structure, which can be used to study the functional activities of RNA molecules. PMID:18373878

  14. The 5S rRNA loop E: chemical probing and phylogenetic data versus crystal structure.

    PubMed

    Leontis, N B; Westhof, E

    1998-09-01

    A significant fraction of the bases in a folded, structured RNA molecule participate in noncanonical base pairing interactions, often in the context of internal loops or multi-helix junction loops. The appearance of each new high-resolution RNA structure provides welcome data to guide efforts to understand and predict RNA 3D structure, especially when the RNA in question is a functionally conserved molecule. The recent publication of the crystal structure of the "Loop E" region of bacterial 5S ribosomal RNA is such an event [Correll CC, Freeborn B, Moore PB, Steitz TA, 1997, Cell 91:705-712]. In addition to providing more examples of already established noncanonical base pairs, such as purine-purine sheared pairings, trans-Hoogsteen UA, and GU wobble pairs, the structure provides the first high-resolution views of two new purine-purine pairings and a new GU pairing. The goal of the present analysis is to expand the capabilities of both chemical probing and phylogenetic analysis to predict with greater accuracy the structures of RNA molecules. First, in light of existing chemical probing data, we investigate what lessons could be learned regarding the interpretation of this widely used method of RNA structure probing. Then we analyze the 3D structure with reference to molecular phylogeny data (assuming conservation of function) to discover what alternative base pairings are geometrically compatible with the structure. The comparisons between previous modeling efforts and crystal structures show that the intricate involvements of ions and water molecules in the maintenance of non-Watson-Crick pairs render the process of correctly identifying the interacting sites in such pairs treacherous, except in cases of trans-Hoogsteen A/U or sheared A/G pairs for the adenine N1 site. The phylogenetic analysis identifies A/A, A/C, A/U and C/A, C/C, and C/U pairings isosteric with sheared A/G, as well as A/A and A/C pairings isosteric with both G/U and G/G bifurcated pairings. Thus, each non-Watson-Crick pair could be characterized by a phylogenetic signature of variations between isosteric-like pairings. In addition to the conservative changes, which form a dictionary of pairings isosterically compatible with those observed in the crystal structure, concerted changes involving several base pairs also occur. The latter covariations may indicate transitions between related but distinctive motifs within the loop E of 5S ribosomal RNA.

  15. The structural analysis of the mitochondrial SSUrRNA implies a close phylogenetic relationship between mitochondria from plants and from the heterotrophic alga Prototheca wickerhamii.

    PubMed

    Wolff, G; Kück, U

    1990-04-01

    The gene for the mitochondrial small subunit rRNA (SSUrRNA) from the heterotrophic alga Prototheca wickerhamii has been isolated from a gene library of extranuclear DNA. Sequence and structural analyses allow the determination of a secondary structure model for this rRNA. In addition, several sequence motifs are present which are typically found in SSUrRNAs of various mitochondrial origins. Unexpectedly, the Prototheca RNA sequence has more features in common with mitochondrial SSUrRNAs from plants than with that from the green alga Chlamydomonas reinhardtii. The phylogenetic relationship between mitochondria from plants and algae is discussed.

  16. Domain structure of the ribozyme from eubacterial ribonuclease P.

    PubMed Central

    Loria, A; Pan, T

    1996-01-01

    Large RNAs can be composed of discrete domains that fold independently. One such "folding domain" has been identified previously in the ribozyme from Bacillus subtilis ribonuclease P (denoted P RNA). This domain contains roughly one-third of all residues. Folding of an RNA construct consisting of the remaining two-thirds of B. subtilis P RNA was examined by Fe(II)-EDTA hydroxyl radical protection. This molecule folds into the proper higher-order structure under identical conditions as the full-length P RNA, suggesting the presence of a second folding domain in B. subtilis P RNA. Folding analysis of the Escherichia coli P RNA by hydroxyl radical protection shows that this P RNA is completely folded at 5-6 mM Mg2+. In order to analyze the structural organization of folding domains in E. coli P RNA, constructs were designed based on the domain structure of B. subtilis P RNA. Fe(II)-EDTA protection indicates that E. coli P RNA also contains two folding domains. Despite the significant differences at the secondary structure level, both P RNAs appear to converge structurally at the folding domain level. The pre-tRNA substrate, localized in previous studies, may bind across the folding domains with the acceptor stem/3'CCA contacting the domain including the active site and the T stem-loop contacting the other. Because all eubacterial P RNAs share considerable homology in secondary structure to either B. subtilis or E. coli P RNA, these results suggest that this domain structure may be applicable for most, if not all, eubacterial P RNAs. Identification of folding domains should be valuable in dissecting structure-function relationship of large RNAs. PMID:8718684

  17. TRAPR: R Package for Statistical Analysis and Visualization of RNA-Seq Data.

    PubMed

    Lim, Jae Hyun; Lee, Soo Youn; Kim, Ju Han

    2017-03-01

    High-throughput transcriptome sequencing, also known as RNA sequencing (RNA-Seq), is a standard technology for measuring gene expression with unprecedented accuracy. Numerous bioconductor packages have been developed for the statistical analysis of RNA-Seq data. However, these tools focus on specific aspects of the data analysis pipeline, and are difficult to appropriately integrate with one another due to their disparate data structures and processing methods. They also lack visualization methods to confirm the integrity of the data and the process. In this paper, we propose an R-based RNA-Seq analysis pipeline called TRAPR, an integrated tool that facilitates the statistical analysis and visualization of RNA-Seq expression data. TRAPR provides various functions for data management, the filtering of low-quality data, normalization, transformation, statistical analysis, data visualization, and result visualization that allow researchers to build customized analysis pipelines.

  18. Analysis of RNA binding by the dengue virus NS5 RNA capping enzyme.

    PubMed

    Henderson, Brittney R; Saeedi, Bejan J; Campagnola, Grace; Geiss, Brian J

    2011-01-01

    Flaviviruses are small, capped positive sense RNA viruses that replicate in the cytoplasm of infected cells. Dengue virus and other related flaviviruses have evolved RNA capping enzymes to form the viral RNA cap structure that protects the viral genome and directs efficient viral polyprotein translation. The N-terminal domain of NS5 possesses the methyltransferase and guanylyltransferase activities necessary for forming mature RNA cap structures. The mechanism for flavivirus guanylyltransferase activity is currently unknown, and how the capping enzyme binds its diphosphorylated RNA substrate is important for deciphering how the flavivirus guanylyltransferase functions. In this report we examine how flavivirus NS5 N-terminal capping enzymes bind to the 5' end of the viral RNA using a fluorescence polarization-based RNA binding assay. We observed that the K(D) for RNA binding is approximately 200 nM Dengue, Yellow Fever, and West Nile virus capping enzymes. Removal of one or both of the 5' phosphates reduces binding affinity, indicating that the terminal phosphates contribute significantly to binding. RNA binding affinity is negatively affected by the presence of GTP or ATP and positively affected by S-adensyl methoninine (SAM). Structural superpositioning of the dengue virus capping enzyme with the Vaccinia virus VP39 protein bound to RNA suggests how the flavivirus capping enzyme may bind RNA, and mutagenesis analysis of residues in the putative RNA binding site demonstrate that several basic residues are critical for RNA binding. Several mutants show differential binding to 5' di-, mono-, and un-phosphorylated RNAs. The mode of RNA binding appears similar to that found with other methyltransferase enzymes, and a discussion of diphosphorylated RNA binding is presented.

  19. The ins and outs of lncRNA structure: How, why and what comes next?

    PubMed

    Blythe, Amanda J; Fox, Archa H; Bond, Charles S

    2016-01-01

    The field of structural biology has the unique advantage of being able to provide a comprehensive picture of biological mechanisms at the molecular and atomic level. Long noncoding RNAs (lncRNAs) represent the new frontier in the molecular biology of complex organisms yet remain the least characterised of all the classes of RNA. Thousands of new lncRNAs are being reported each year yet very little structural data exists for this rapidly expanding field. The length of lncRNAs ranges from 200 nt to over 100 kb in length and they generally exhibit low cellular abundance. Therefore, obtaining sufficient quantities of lncRNA to use for structural analysis is challenging. However, as technologies develop structures of lncRNAs are starting to emerge providing important information regarding their mechanism of action. Here we review the current methods used to determine the structure of lncRNA and lncRNA:protein complexes and describe the significant contribution structural biology has and will make to the field of lncRNA research. This article is part of a Special Issue entitled: Clues to long noncoding RNA taxonomy1, edited by Dr. Tetsuro Hirose and Dr. Shinichi Nakagawa. Copyright © 2015 Elsevier B.V. All rights reserved.

  20. Mining for recurrent long-range interactions in RNA structures reveals embedded hierarchies in network families.

    PubMed

    Reinharz, Vladimir; Soulé, Antoine; Westhof, Eric; Waldispühl, Jérôme; Denise, Alain

    2018-05-04

    The wealth of the combinatorics of nucleotide base pairs enables RNA molecules to assemble into sophisticated interaction networks, which are used to create complex 3D substructures. These interaction networks are essential to shape the 3D architecture of the molecule, and also to provide the key elements to carry molecular functions such as protein or ligand binding. They are made of organised sets of long-range tertiary interactions which connect distinct secondary structure elements in 3D structures. Here, we present a de novo data-driven approach to extract automatically from large data sets of full RNA 3D structures the recurrent interaction networks (RINs). Our methodology enables us for the first time to detect the interaction networks connecting distinct components of the RNA structure, highlighting their diversity and conservation through non-related functional RNAs. We use a graphical model to perform pairwise comparisons of all RNA structures available and to extract RINs and modules. Our analysis yields a complete catalog of RNA 3D structures available in the Protein Data Bank and reveals the intricate hierarchical organization of the RNA interaction networks and modules. We assembled our results in an online database (http://carnaval.lri.fr) which will be regularly updated. Within the site, a tool allows users with a novel RNA structure to detect automatically whether the novel structure contains previously observed RINs.

  1. Integrated structural biology to unravel molecular mechanisms of protein-RNA recognition.

    PubMed

    Schlundt, Andreas; Tants, Jan-Niklas; Sattler, Michael

    2017-04-15

    Recent advances in RNA sequencing technologies have greatly expanded our knowledge of the RNA landscape in cells, often with spatiotemporal resolution. These techniques identified many new (often non-coding) RNA molecules. Large-scale studies have also discovered novel RNA binding proteins (RBPs), which exhibit single or multiple RNA binding domains (RBDs) for recognition of specific sequence or structured motifs in RNA. Starting from these large-scale approaches it is crucial to unravel the molecular principles of protein-RNA recognition in ribonucleoprotein complexes (RNPs) to understand the underlying mechanisms of gene regulation. Structural biology and biophysical studies at highest possible resolution are key to elucidate molecular mechanisms of RNA recognition by RBPs and how conformational dynamics, weak interactions and cooperative binding contribute to the formation of specific, context-dependent RNPs. While large compact RNPs can be well studied by X-ray crystallography and cryo-EM, analysis of dynamics and weak interaction necessitates the use of solution methods to capture these properties. Here, we illustrate methods to study the structure and conformational dynamics of protein-RNA complexes in solution starting from the identification of interaction partners in a given RNP. Biophysical and biochemical techniques support the characterization of a protein-RNA complex and identify regions relevant in structural analysis. Nuclear magnetic resonance (NMR) is a powerful tool to gain information on folding, stability and dynamics of RNAs and characterize RNPs in solution. It provides crucial information that is complementary to the static pictures derived from other techniques. NMR can be readily combined with other solution techniques, such as small angle X-ray and/or neutron scattering (SAXS/SANS), electron paramagnetic resonance (EPR), and Förster resonance energy transfer (FRET), which provide information about overall shapes, internal domain arrangements and dynamics. Principles of protein-RNA recognition and current approaches are reviewed and illustrated with recent studies. Copyright © 2017 Elsevier Inc. All rights reserved.

  2. The importance of mRNA structure in determining the pathogenicity of synonymous and non-synonymous mutations in haemophilia

    PubMed Central

    Hamasaki-Katagiri, Nobuko; Lin, Brian C.; Simon, Jonathan; Hunt, Ryan C.; Schiller, Tal; Russek-Cohen, Estelle; Komar, Anton A.; Bar, Haim; Kimchi-Sarfaty, Chava

    2016-01-01

    Introduction Mutational analysis is commonly used to support the diagnosis and management of haemophilia. This has allowed for the generation of large mutation databases which provide unparalleled insight into genotype-phenotype relationships. Haemophilia is associated with inversions, deletions, insertions, nonsense and missense mutations. Both synonymous and non-synonymous mutations influence the base pairing of messenger RNA (mRNA), which can alter mRNA structure, cellular half-life and ribosome processivity/elongation. However, the role of mRNA structure in determining the pathogenicity of point mutations in haemophilia has not been evaluated. Aim To evaluate mRNA thermodynamic stability and associated RNA prediction software as a means to distinguish between neutral and disease-associated mutations in haemophilia. Methods Five mRNA structure prediction software programs were used to assess the thermodynamic stability of mRNA fragments carrying neutral vs. disease-associated and synonymous vs. non-synonymous point mutations in F8, F9 and a third X-linked gene, DMD (dystrophin). Results In F8 and DMD, disease-associated mutations tend to occur in more structurally stable mRNA regions, represented by lower MFE (minimum free energy) levels. In comparing multiple software packages for mRNA structure prediction, a 101–151 nucleotide fragment length appears to be a feasible range for structuring future studies. Conclusion mRNA thermodynamic stability is one predictive characteristic, which when combined with other RNA and protein features, may offer significant insight when screening sequencing data for novel disease-associated mutations. Our results also suggest potential utility in evaluating the mRNA thermodynamic stability profile of a gene when determining the viability of interchanging codons for biological and therapeutic applications. PMID:27933712

  3. Structure of Escherichia coli Arginyl-tRNA Synthetase in Complex with tRNAArg: Pivotal Role of the D-loop.

    PubMed

    Stephen, Preyesh; Ye, Sheng; Zhou, Ming; Song, Jian; Zhang, Rongguang; Wang, En-Duo; Giegé, Richard; Lin, Sheng-Xiang

    2018-05-25

    Aminoacyl-tRNA synthetases are essential components in protein biosynthesis. Arginyl-tRNA synthetase (ArgRS) belongs to the small group of aminoacyl-tRNA synthetases requiring cognate tRNA for amino acid activation. The crystal structure of Escherichia coli (Eco) ArgRS has been solved in complex with tRNA Arg at 3.0-Å resolution. With this first bacterial tRNA complex, we are attempting to bridge the gap existing in structure-function understanding in prokaryotic tRNA Arg recognition. The structure shows a tight binding of tRNA on the synthetase through the identity determinant A20 from the D-loop, a tRNA recognition snapshot never elucidated structurally. This interaction of A20 involves 5 amino acids from the synthetase. Additional contacts via U20a and U16 from the D-loop reinforce the interaction. The importance of D-loop recognition in EcoArgRS functioning is supported by a mutagenesis analysis of critical amino acids that anchor tRNA Arg on the synthetase; in particular, mutations at amino acids interacting with A20 affect binding affinity to the tRNA and specificity of arginylation. Altogether the structural and functional data indicate that the unprecedented ArgRS crystal structure represents a snapshot during functioning and suggest that the recognition of the D-loop by ArgRS is an important trigger that anchors tRNA Arg on the synthetase. In this process, A20 plays a major role, together with prominent conformational changes in several ArgRS domains that may eventually lead to the mature ArgRS:tRNA complex and the arginine activation. Functional implications that could be idiosyncratic to the arginine identity of bacterial ArgRSs are discussed. Copyright © 2018 Elsevier Ltd. All rights reserved.

  4. Automated classification of RNA 3D motifs and the RNA 3D Motif Atlas

    PubMed Central

    Petrov, Anton I.; Zirbel, Craig L.; Leontis, Neocles B.

    2013-01-01

    The analysis of atomic-resolution RNA three-dimensional (3D) structures reveals that many internal and hairpin loops are modular, recurrent, and structured by conserved non-Watson–Crick base pairs. Structurally similar loops define RNA 3D motifs that are conserved in homologous RNA molecules, but can also occur at nonhomologous sites in diverse RNAs, and which often vary in sequence. To further our understanding of RNA motif structure and sequence variability and to provide a useful resource for structure modeling and prediction, we present a new method for automated classification of internal and hairpin loop RNA 3D motifs and a new online database called the RNA 3D Motif Atlas. To classify the motif instances, a representative set of internal and hairpin loops is automatically extracted from a nonredundant list of RNA-containing PDB files. Their structures are compared geometrically, all-against-all, using the FR3D program suite. The loops are clustered into motif groups, taking into account geometric similarity and structural annotations and making allowance for a variable number of bulged bases. The automated procedure that we have implemented identifies all hairpin and internal loop motifs previously described in the literature. All motif instances and motif groups are assigned unique and stable identifiers and are made available in the RNA 3D Motif Atlas (http://rna.bgsu.edu/motifs), which is automatically updated every four weeks. The RNA 3D Motif Atlas provides an interactive user interface for exploring motif diversity and tools for programmatic data access. PMID:23970545

  5. The bifurcated stem loop 4 (SL4) is crucial for efficient packaging of mouse mammary tumor virus (MMTV) genomic RNA.

    PubMed

    Mustafa, Farah; Vivet-Boudou, Valérie; Jabeen, Ayesha; Ali, Lizna M; Kalloush, Rawan M; Marquet, Roland; Rizvi, Tahir A

    2018-06-21

    Packaging the mouse mammary tumor virus (MMTV) genomic RNA (gRNA) requires the entire 5' untranslated region (UTR) in conjunction with the first 120 nucleotides of the gag gene. This region includes several palindromic (pal) sequence(s) and stable stem loops (SLs). Among these, stem loop 4 (SL4) adopts a bifurcated structure consisting of three stems, two apical loops, and an internal loop. Pal II, located in one of the apical loops, mediates gRNA dimerization, a process intricately linked to packaging. We thus hypothesized that the bifurcated SL4 structure could constitute the major gRNA packaging determinant. To test this hypothesis, the two apical loops and the flanking sequences forming the bifurcated SL4 were individually mutated. These mutations all had deleterious effects on gRNA packaging and propagation. Next, single and compensatory mutants were designed to destabilize then recreate the bifurcated SL4 structure. A structure-function analysis using bioinformatics predictions and RNA chemical probing revealed that mutations that led to the loss of the SL4 bifurcated structure abrogated RNA packaging and propagation, while compensatory mutations that recreated the native SL4 structure restored RNA packaging and propagation to wild type levels. Altogether, our results demonstrate that SL4 constitutes the principal packaging determinant of MMTV gRNA. Our findings further suggest that SL4 acts as a structural switch that can not only differentiate between RNA for translation versus packaging/dimerization, but its location also allows differentiation between spliced and unspliced RNAs during gRNA encapsidation.

  6. RNA 3D Modules in Genome-Wide Predictions of RNA 2D Structure

    PubMed Central

    Theis, Corinna; Zirbel, Craig L.; zu Siederdissen, Christian Höner; Anthon, Christian; Hofacker, Ivo L.; Nielsen, Henrik; Gorodkin, Jan

    2015-01-01

    Recent experimental and computational progress has revealed a large potential for RNA structure in the genome. This has been driven by computational strategies that exploit multiple genomes of related organisms to identify common sequences and secondary structures. However, these computational approaches have two main challenges: they are computationally expensive and they have a relatively high false discovery rate (FDR). Simultaneously, RNA 3D structure analysis has revealed modules composed of non-canonical base pairs which occur in non-homologous positions, apparently by independent evolution. These modules can, for example, occur inside structural elements which in RNA 2D predictions appear as internal loops. Hence one question is if the use of such RNA 3D information can improve the prediction accuracy of RNA secondary structure at a genome-wide level. Here, we use RNAz in combination with 3D module prediction tools and apply them on a 13-way vertebrate sequence-based alignment. We find that RNA 3D modules predicted by metaRNAmodules and JAR3D are significantly enriched in the screened windows compared to their shuffled counterparts. The initially estimated FDR of 47.0% is lowered to below 25% when certain 3D module predictions are present in the window of the 2D prediction. We discuss the implications and prospects for further development of computational strategies for detection of RNA 2D structure in genomic sequence. PMID:26509713

  7. RNA Structural Analysis by Evolving SHAPE Chemistry

    PubMed Central

    Spitale, Robert C.; Flynn, Ryan A.; Torre, Eduardo A.; Kool, Eric T.; Chang, Howard Y.

    2017-01-01

    RNA is central to the flow of biological information. From transcription to splicing, RNA localization, translation, and decay, RNA is intimately involved in regulating every step of the gene expression program, and is thus essential for health and understanding disease. RNA has the unique ability to base-pair with itself and other nucleic acids to form complex structures. Hence the information content in RNA is not simply its linear sequence of bases, but is also encoded in complex folding of RNA molecules. A general chemical functionality that all RNAs have is a 2’-hydroxyl group in the ribose ring, and the reactivity of the 2'-hydroxyl in RNA is gated by local nucleotide flexibility. In other words, the 2'-hydroxyl is reactive at single-stranded and conformationally flexible positions but is unreactive at nucleotides constrained by base pairing. Recent efforts have been focused on developing reagents that modify RNA as a function of RNA 2’ hydroxyl group flexibility. Such RNA structure probing techniques can be read out by primer extension in experiments termed RNA SHAPE (Selective 2’ Hydroxyl Acylation and Primer Extension). Herein we describe the efforts devoted to the design and utilization of SHAPE probes for characterizing RNA structure. We also describe current technological advances that are being used to utilize SHAPE chemistry with deep sequencing to probe many RNAs in parallel. The merger of chemistry with genomics is sure to open the door to genome-wide exploration of RNA structure and function. PMID:25132067

  8. Definition of a high-affinity Gag recognition structure mediating packaging of a retroviral RNA genome

    PubMed Central

    Gherghe, Cristina; Lombo, Tania; Leonard, Christopher W.; Datta, Siddhartha A. K.; Bess, Julian W.; Gorelick, Robert J.; Rein, Alan; Weeks, Kevin M.

    2010-01-01

    All retroviral genomic RNAs contain a cis-acting packaging signal by which dimeric genomes are selectively packaged into nascent virions. However, it is not understood how Gag (the viral structural protein) interacts with these signals to package the genome with high selectivity. We probed the structure of murine leukemia virus RNA inside virus particles using SHAPE, a high-throughput RNA structure analysis technology. These experiments showed that NC (the nucleic acid binding domain derived from Gag) binds within the virus to the sequence UCUG-UR-UCUG. Recombinant Gag and NC proteins bound to this same RNA sequence in dimeric RNA in vitro; in all cases, interactions were strongest with the first U and final G in each UCUG element. The RNA structural context is critical: High-affinity binding requires base-paired regions flanking this motif, and two UCUG-UR-UCUG motifs are specifically exposed in the viral RNA dimer. Mutating the guanosine residues in these two motifs—only four nucleotides per genomic RNA—reduced packaging 100-fold, comparable to the level of nonspecific packaging. These results thus explain the selective packaging of dimeric RNA. This paradigm has implications for RNA recognition in general, illustrating how local context and RNA structure can create information-rich recognition signals from simple single-stranded sequence elements in large RNAs. PMID:20974908

  9. A model for the study of ligand binding to the ribosomal RNA helix h44

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Dibrov, Sergey M.; Parsons, Jerod; Hermann, Thomas

    2010-09-02

    Oligonucleotide models of ribosomal RNA domains are powerful tools to study the binding and molecular recognition of antibiotics that interfere with bacterial translation. Techniques such as selective chemical modification, fluorescence labeling and mutations are cumbersome for the whole ribosome but readily applicable to model RNAs, which are readily crystallized and often give rise to higher resolution crystal structures suitable for detailed analysis of ligand-RNA interactions. Here, we have investigated the HX RNA construct which contains two adjacent ligand binding regions of helix h44 in 16S ribosomal RNA. High-resolution crystal structure analysis confirmed that the HX RNA is a faithful structuralmore » model of the ribosomal target. Solution studies showed that HX RNA carrying a fluorescent 2-aminopurine modification provides a model system that can be used to monitor ligand binding to both the ribosomal decoding site and, through an indirect effect, the hygromycin B interaction region.« less

  10. Detection of bacterial 16S rRNA using a molecular beacon-based X sensor

    PubMed Central

    Gerasimova, Yulia V.; Kolpashchikov, Dmitry M.

    2012-01-01

    We demonstrate how a long structurally constrained RNA can be analyzed in homogeneous solution at ambient temperatures with high specificity using a sophisticated biosensor. The sensor consists of a molecular beacon probe as a signal reporter and two DNA adaptor strands, which have fragments complementary to the reporter and to the analyzed RNA. One adaptor strand uses its long RNA-binding arm to unwind the RNA secondary structure. Second adaptor strand with a short RNA-binding arm hybridizes only to a fully complementary site, thus providing high recognition specificity. Overall the three-component sensor and the target RNA form a four-stranded DNA crossover (X) structure. Using this sensor, E.coli 16S rRNA was detected in real time with the detection limit of ~ 0.17 nM. The high specificity of the analysis was proven by differentiating B.subtilus from E.coli 16S rRNA sequences. The sensor responds to the presence of the analyte within seconds. PMID:23021850

  11. Template-Based Modeling of Protein-RNA Interactions.

    PubMed

    Zheng, Jinfang; Kundrotas, Petras J; Vakser, Ilya A; Liu, Shiyong

    2016-09-01

    Protein-RNA complexes formed by specific recognition between RNA and RNA-binding proteins play an important role in biological processes. More than a thousand of such proteins in human are curated and many novel RNA-binding proteins are to be discovered. Due to limitations of experimental approaches, computational techniques are needed for characterization of protein-RNA interactions. Although much progress has been made, adequate methodologies reliably providing atomic resolution structural details are still lacking. Although protein-RNA free docking approaches proved to be useful, in general, the template-based approaches provide higher quality of predictions. Templates are key to building a high quality model. Sequence/structure relationships were studied based on a representative set of binary protein-RNA complexes from PDB. Several approaches were tested for pairwise target/template alignment. The analysis revealed a transition point between random and correct binding modes. The results showed that structural alignment is better than sequence alignment in identifying good templates, suitable for generating protein-RNA complexes close to the native structure, and outperforms free docking, successfully predicting complexes where the free docking fails, including cases of significant conformational change upon binding. A template-based protein-RNA interaction modeling protocol PRIME was developed and benchmarked on a representative set of complexes.

  12. Assembly and analysis of eukaryotic Argonaute–RNA complexes in microRNA-target recognition

    PubMed Central

    Gan, Hin Hark; Gunsalus, Kristin C.

    2015-01-01

    Experimental studies have uncovered a variety of microRNA (miRNA)–target duplex structures that include perfect, imperfect and seedless duplexes. However, non-canonical binding modes from imperfect/seedless duplexes are not well predicted by computational approaches, which rely primarily on sequence and secondary structural features, nor have their tertiary structures been characterized because solved structures to date are limited to near perfect, straight duplexes in Argonautes (Agos). Here, we use structural modeling to examine the role of Ago dynamics in assembling viable eukaryotic miRNA-induced silencing complexes (miRISCs). We show that combinations of low-frequency, global modes of motion of Ago domains are required to accommodate RNA duplexes in model human and C. elegans Ago structures. Models of viable miRISCs imply that Ago adopts variable conformations at distinct target sites that generate distorted, imperfect miRNA-target duplexes. Ago's ability to accommodate a duplex is dependent on the region where structural distortions occur: distortions in solvent-exposed seed and 3′-end regions are less likely to produce steric clashes than those in the central duplex region. Energetic analyses of assembled miRISCs indicate that target recognition is also driven by favorable Ago-duplex interactions. Such structural insights into Ago loading and target recognition mechanisms may provide a more accurate assessment of miRNA function. PMID:26432829

  13. F-RAG: Generating Atomic Coordinates from RNA Graphs by Fragment Assembly.

    PubMed

    Jain, Swati; Schlick, Tamar

    2017-11-24

    Coarse-grained models represent attractive approaches to analyze and simulate ribonucleic acid (RNA) molecules, for example, for structure prediction and design, as they simplify the RNA structure to reduce the conformational search space. Our structure prediction protocol RAGTOP (RNA-As-Graphs Topology Prediction) represents RNA structures as tree graphs and samples graph topologies to produce candidate graphs. However, for a more detailed study and analysis, construction of atomic from coarse-grained models is required. Here we present our graph-based fragment assembly algorithm (F-RAG) to convert candidate three-dimensional (3D) tree graph models, produced by RAGTOP into atomic structures. We use our related RAG-3D utilities to partition graphs into subgraphs and search for structurally similar atomic fragments in a data set of RNA 3D structures. The fragments are edited and superimposed using common residues, full atomic models are scored using RAGTOP's knowledge-based potential, and geometries of top scoring models is optimized. To evaluate our models, we assess all-atom RMSDs and Interaction Network Fidelity (a measure of residue interactions) with respect to experimentally solved structures and compare our results to other fragment assembly programs. For a set of 50 RNA structures, we obtain atomic models with reasonable geometries and interactions, particularly good for RNAs containing junctions. Additional improvements to our protocol and databases are outlined. These results provide a good foundation for further work on RNA structure prediction and design applications. Copyright © 2017 Elsevier Ltd. All rights reserved.

  14. Accurate multiple sequence-structure alignment of RNA sequences using combinatorial optimization.

    PubMed

    Bauer, Markus; Klau, Gunnar W; Reinert, Knut

    2007-07-27

    The discovery of functional non-coding RNA sequences has led to an increasing interest in algorithms related to RNA analysis. Traditional sequence alignment algorithms, however, fail at computing reliable alignments of low-homology RNA sequences. The spatial conformation of RNA sequences largely determines their function, and therefore RNA alignment algorithms have to take structural information into account. We present a graph-based representation for sequence-structure alignments, which we model as an integer linear program (ILP). We sketch how we compute an optimal or near-optimal solution to the ILP using methods from combinatorial optimization, and present results on a recently published benchmark set for RNA alignments. The implementation of our algorithm yields better alignments in terms of two published scores than the other programs that we tested: This is especially the case with an increasing number of input sequences. Our program LARA is freely available for academic purposes from http://www.planet-lisa.net.

  15. Functional Advantages of Conserved Intrinsic Disorder in RNA-Binding Proteins.

    PubMed

    Varadi, Mihaly; Zsolyomi, Fruzsina; Guharoy, Mainak; Tompa, Peter

    2015-01-01

    Proteins form large macromolecular assemblies with RNA that govern essential molecular processes. RNA-binding proteins have often been associated with conformational flexibility, yet the extent and functional implications of their intrinsic disorder have never been fully assessed. Here, through large-scale analysis of comprehensive protein sequence and structure datasets we demonstrate the prevalence of intrinsic structural disorder in RNA-binding proteins and domains. We addressed their functionality through a quantitative description of the evolutionary conservation of disordered segments involved in binding, and investigated the structural implications of flexibility in terms of conformational stability and interface formation. We conclude that the functional role of intrinsically disordered protein segments in RNA-binding is two-fold: first, these regions establish extended, conserved electrostatic interfaces with RNAs via induced fit. Second, conformational flexibility enables them to target different RNA partners, providing multi-functionality, while also ensuring specificity. These findings emphasize the functional importance of intrinsically disordered regions in RNA-binding proteins.

  16. Fragmentation of the large subunit ribosomal RNA gene in oyster mitochondrial genomes.

    PubMed

    Milbury, Coren A; Lee, Jung C; Cannone, Jamie J; Gaffney, Patrick M; Gutell, Robin R

    2010-09-02

    Discontinuous genes have been observed in bacteria, archaea, and eukaryotic nuclei, mitochondria and chloroplasts. Gene discontinuity occurs in multiple forms: the two most frequent forms result from introns that are spliced out of the RNA and the resulting exons are spliced together to form a single transcript, and fragmented gene transcripts that are not covalently attached post-transcriptionally. Within the past few years, fragmented ribosomal RNA (rRNA) genes have been discovered in bilateral metazoan mitochondria, all within a group of related oysters. In this study, we have characterized this fragmentation with comparative analysis and experimentation. We present secondary structures, modeled using comparative sequence analysis of the discontinuous mitochondrial large subunit rRNA genes of the cupped oysters C. virginica, C. gigas, and C. hongkongensis. Comparative structure models for the large subunit rRNA in each of the three oyster species are generally similar to those for other bilateral metazoans. We also used RT-PCR and analyzed ESTs to determine if the two fragmented LSU rRNAs are spliced together. The two segments are transcribed separately, and not spliced together although they still form functional rRNAs and ribosomes. Although many examples of discontinuous ribosomal genes have been documented in bacteria and archaea, as well as the nuclei, chloroplasts, and mitochondria of eukaryotes, oysters are some of the first characterized examples of fragmented bilateral animal mitochondrial rRNA genes. The secondary structures of the oyster LSU rRNA fragments have been predicted on the basis of previous comparative metazoan mitochondrial LSU rRNA structure models.

  17. Structure of an archaeal non-discriminating glutamyl-tRNA synthetase: a missing link in the evolution of Gln-tRNAGln formation.

    PubMed

    Nureki, Osamu; O'Donoghue, Patrick; Watanabe, Nobuhisa; Ohmori, Atsuhiko; Oshikane, Hiroyuki; Araiso, Yuhei; Sheppard, Kelly; Söll, Dieter; Ishitani, Ryuichiro

    2010-11-01

    The molecular basis of the genetic code relies on the specific ligation of amino acids to their cognate tRNA molecules. However, two pathways exist for the formation of Gln-tRNA(Gln). The evolutionarily older indirect route utilizes a non-discriminating glutamyl-tRNA synthetase (ND-GluRS) that can form both Glu-tRNA(Glu) and Glu-tRNA(Gln). The Glu-tRNA(Gln) is then converted to Gln-tRNA(Gln) by an amidotransferase. Since the well-characterized bacterial ND-GluRS enzymes recognize tRNA(Glu) and tRNA(Gln) with an unrelated α-helical cage domain in contrast to the β-barrel anticodon-binding domain in archaeal and eukaryotic GluRSs, the mode of tRNA(Glu)/tRNA(Gln) discrimination in archaea and eukaryotes was unknown. Here, we present the crystal structure of the Methanothermobacter thermautotrophicus ND-GluRS, which is the evolutionary predecessor of both the glutaminyl-tRNA synthetase (GlnRS) and the eukaryotic discriminating GluRS. Comparison with the previously solved structure of the Escherichia coli GlnRS-tRNA(Gln) complex reveals the structural determinants responsible for specific tRNA(Gln) recognition by GlnRS compared to promiscuous recognition of both tRNAs by the ND-GluRS. The structure also shows the amino acid recognition pocket of GluRS is more variable than that found in GlnRS. Phylogenetic analysis is used to reconstruct the key events in the evolution from indirect to direct genetic encoding of glutamine.

  18. In vivo genome-wide profiling of RNA secondary structure reveals novel regulatory features.

    PubMed

    Ding, Yiliang; Tang, Yin; Kwok, Chun Kit; Zhang, Yu; Bevilacqua, Philip C; Assmann, Sarah M

    2014-01-30

    RNA structure has critical roles in processes ranging from ligand sensing to the regulation of translation, polyadenylation and splicing. However, a lack of genome-wide in vivo RNA structural data has limited our understanding of how RNA structure regulates gene expression in living cells. Here we present a high-throughput, genome-wide in vivo RNA structure probing method, structure-seq, in which dimethyl sulphate methylation of unprotected adenines and cytosines is identified by next-generation sequencing. Application of this method to Arabidopsis thaliana seedlings yielded the first in vivo genome-wide RNA structure map at nucleotide resolution for any organism, with quantitative structural information across more than 10,000 transcripts. Our analysis reveals a three-nucleotide periodic repeat pattern in the structure of coding regions, as well as a less-structured region immediately upstream of the start codon, and shows that these features are strongly correlated with translation efficiency. We also find patterns of strong and weak secondary structure at sites of alternative polyadenylation, as well as strong secondary structure at 5' splice sites that correlates with unspliced events. Notably, in vivo structures of messenger RNAs annotated for stress responses are poorly predicted in silico, whereas mRNA structures of genes related to cell function maintenance are well predicted. Global comparison of several structural features between these two categories shows that the mRNAs associated with stress responses tend to have more single-strandedness, longer maximal loop length and higher free energy per nucleotide, features that may allow these RNAs to undergo conformational changes in response to environmental conditions. Structure-seq allows the RNA structurome and its biological roles to be interrogated on a genome-wide scale and should be applicable to any organism.

  19. LCS-TA to identify similar fragments in RNA 3D structures.

    PubMed

    Wiedemann, Jakub; Zok, Tomasz; Milostan, Maciej; Szachniuk, Marta

    2017-10-23

    In modern structural bioinformatics, comparison of molecular structures aimed to identify and assess similarities and differences between them is one of the most commonly performed procedures. It gives the basis for evaluation of in silico predicted models. It constitutes the preliminary step in searching for structural motifs. In particular, it supports tracing the molecular evolution. Faced with an ever-increasing amount of available structural data, researchers need a range of methods enabling comparative analysis of the structures from either global or local perspective. Herein, we present a new, superposition-independent method which processes pairs of RNA 3D structures to identify their local similarities. The similarity is considered in the context of structure bending and bonds' rotation which are described by torsion angles. In the analyzed RNA structures, the method finds the longest continuous segments that show similar torsion within a user-defined threshold. The length of the segment is provided as local similarity measure. The method has been implemented as LCS-TA algorithm (Longest Continuous Segments in Torsion Angle space) and is incorporated into our MCQ4Structures application, freely available for download from http://www.cs.put.poznan.pl/tzok/mcq/ . The presented approach ties torsion-angle-based method of structure analysis with the idea of local similarity identification by handling continuous 3D structure segments. The first method, implemented in MCQ4Structures, has been successfully utilized in RNA-Puzzles initiative. The second one, originally applied in Euclidean space, is a component of LGA (Local-Global Alignment) algorithm commonly used in assessing protein models submitted to CASP. This unique combination of concepts implemented in LCS-TA provides a new perspective on structure quality assessment in local and quantitative aspect. A series of computational experiments show the first results of applying our method to comparison of RNA 3D models. LCS-TA can be used for identifying strengths and weaknesses in the prediction of RNA tertiary structures.

  20. Thermodynamics and kinetics of RNA tertiary structure formation in the junctionless hairpin ribozyme.

    PubMed

    White, Neil A; Hoogstraten, Charles G

    2017-09-01

    The hairpin ribozyme consists of two RNA internal loops that interact to form the catalytically active structure. This docking transition is a rare example of intermolecular formation of RNA tertiary structure without coupling to helix annealing. We have used temperature-dependent surface plasmon resonance (SPR) to characterize the thermodynamics and kinetics of RNA tertiary structure formation for the junctionless form of the ribozyme, in which loops A and B reside on separate molecules. We find docking to be strongly enthalpy-driven and to be accompanied by substantial activation barriers for association and dissociation, consistent with the structural reorganization of both internal loops upon complex formation. Comparisons with the parallel analysis of a ribozyme variant carrying a 2'-O-methyl modification at the self-cleavage site and with published data in other systems reveal a surprising diversity of thermodynamic signatures, emphasizing the delicate balance of contributions to the free energy of formation of RNA tertiary structure. Copyright © 2017 Elsevier B.V. All rights reserved.

  1. The origin and evolution of tRNA inferred from phylogenetic analysis of structure.

    PubMed

    Sun, Feng-Jie; Caetano-Anollés, Gustavo

    2008-01-01

    The evolutionary history of the two structural and functional domains of tRNA is controversial but harbors the secrets of early translation and the genetic code. To explore the origin and evolution of tRNA, we reconstructed phylogenetic trees directly from molecular structure. Forty-two structural characters describing the geometry of 571 tRNAs and three statistical parameters describing thermodynamic and mechanical features of molecules quantitatively were used to derive phylogenetic trees of molecules and molecular substructures. Trees of molecules failed to group tRNA according to amino acid specificity and did not reveal the tripartite nature of life, probably due to loss of phylogenetic signal or because tRNA diversification predated organismal diversification. Trees of substructures derived from both structural and statistical characters support the origin of tRNA in the acceptor arm and the hypothesis that the top half domain composed of acceptor and pseudouridine (TPsiC) arms is more ancient than the bottom half domain composed of dihydrouridine (DHU) and anticodon arms. This constitutes the cornerstone of the genomic tag hypothesis that postulates tRNAs were ancient telomeres in the RNA world. The trees of substructures suggest a model for the evolution of the major functional and structural components of tRNA. In this model, short RNA hairpins with stems homologous to the acceptor arm of present day tRNAs were extended with regions homologous to TPsiC and anticodon arms. The DHU arm was then incorporated into the resulting three-stemmed structure to form a proto-cloverleaf structure. The variable region was the last structural addition to the molecular repertoire of evolving tRNA substructures.

  2. Structure of Arabidopsis HYPONASTIC LEAVES1 and Its Molecular Implications for miRNA Processing

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Yang, S.; Chen, H; Yang, J

    2010-01-01

    The Arabidopsis HYPONASTIC LEAVES1 (HYL1) is a double-stranded RNA-binding protein that forms a complex with DICER-LIKE1 (DCL1) and SERRATE to facilitate processing of primary miRNAs into microRNAs (miRNAs). However, the structural mechanisms of miRNA maturation by this complex are poorly understood. Here, we present the crystal structures of double-stranded RNA binding domains (dsRBD1 and dsRBD2) of HYL1 and HYL1 dsRBD1 (HR1)/dsRNA complex as well as human TRBP2 dsRBD2 (TR2)/dsRNA complex for comparison analysis. Structural and functional study demonstrates that both HR1 and TR2 are canonical dsRBDs for dsRNA binding, whereas HR2 of HYL1 is a non-canonical dsRBD harboring a putativemore » dimerization interface. Domain swapping within the context of HYL1 demonstrates that TR2 can supplant the function of HR1 in vitro and in vivo. Further biochemical analyses suggest that HYL1 probably binds to the miRNA/miRNA* region of precursors as a dimer mediated by HR2.« less

  3. Inhibitor-induced structural change in the HCV IRES domain IIa RNA

    PubMed Central

    Paulsen, Ryan B.; Seth, Punit P.; Swayze, Eric E.; Griffey, Richard H.; Skalicky, Jack J.; Cheatham, Thomas E.; Davis, Darrell R.

    2010-01-01

    Translation of the hepatitis C virus (HCV) RNA is initiated from a highly structured internal ribosomal entry site (IRES) in the 5′ untranslated region (5′ UTR) of the RNA genome. An important structural feature of the native RNA is an approximately 90° helical bend localized to domain IIa that positions the apical loop of domain IIb of the IRES near the 40S ribosomal E-site to promote eIF2-GDP release, facilitating 80S ribosome assembly. We report here the NMR structure of a domain IIa construct in complex with a potent small-molecule inhibitor of HCV replication. Molecular dynamics refinement in explicit solvent and subsequent energetic analysis indicated that each inhibitor stereoisomer bound with comparable affinity and in an equivalent binding mode. The in silico analysis was substantiated by fluorescence-based assays showing that the relative binding free energies differed by only 0.7 kcal/mol. Binding of the inhibitor displaces key nucleotide residues within the bulge region, effecting a major conformational change that eliminates the bent RNA helical trajectory, providing a mechanism for the antiviral activity of this inhibitor class. PMID:20360559

  4. SMMRNA: a database of small molecule modulators of RNA

    PubMed Central

    Mehta, Ankita; Sonam, Surabhi; Gouri, Isha; Loharch, Saurabh; Sharma, Deepak K.; Parkesh, Raman

    2014-01-01

    We have developed SMMRNA, an interactive database, available at http://www.smmrna.org, with special focus on small molecule ligands targeting RNA. Currently, SMMRNA consists of ∼770 unique ligands along with structural images of RNA molecules. Each ligand in the SMMRNA contains information such as Kd, Ki, IC50, ΔTm, molecular weight (MW), hydrogen donor and acceptor count, XlogP, number of rotatable bonds, number of aromatic rings and 2D and 3D structures. These parameters can be explored using text search, advanced search, substructure and similarity-based analysis tools that are embedded in SMMRNA. A structure editor is provided for 3D visualization of ligands. Advance analysis can be performed using substructure and OpenBabel-based chemical similarity fingerprints. Upload facility for both RNA and ligands is also provided. The physicochemical properties of the ligands were further examined using OpenBabel descriptors, hierarchical clustering, binning partition and multidimensional scaling. We have also generated a 3D conformation database of ligands to support the structure and ligand-based screening. SMMRNA provides comprehensive resource for further design, development and refinement of small molecule modulators for selective targeting of RNA molecules. PMID:24163098

  5. Time-Resolved Hydroxyl Radical Footprinting of RNA with X-Rays.

    PubMed

    Hao, Yumeng; Bohon, Jen; Hulscher, Ryan; Rappé, Mollie C; Gupta, Sayan; Adilakshmi, Tadepalli; Woodson, Sarah A

    2018-06-01

    RNA footprinting by hydroxyl radical cleavage provides 'snapshots' of RNA tertiary structure or protein interactions that bury the RNA backbone. Generation of hydroxyl radicals with a high-flux synchrotron X-ray beam provides analysis on a short timescale (5-100 msec), which enables the structures of folding intermediates or other transient conformational states to be determined in biochemical solutions or cells. This article provides protocols for using synchrotron beamlines for hydroxyl radical footprinting. © 2018 by John Wiley & Sons, Inc. © 2018 John Wiley & Sons, Inc.

  6. A computational proposal for designing structured RNA pools for in vitro selection of RNAs.

    PubMed

    Kim, Namhee; Gan, Hin Hark; Schlick, Tamar

    2007-04-01

    Although in vitro selection technology is a versatile experimental tool for discovering novel synthetic RNA molecules, finding complex RNA molecules is difficult because most RNAs identified from random sequence pools are simple motifs, consistent with recent computational analysis of such sequence pools. Thus, enriching in vitro selection pools with complex structures could increase the probability of discovering novel RNAs. Here we develop an approach for engineering sequence pools that links RNA sequence space regions with corresponding structural distributions via a "mixing matrix" approach combined with a graph theory analysis. We define five classes of mixing matrices motivated by covariance mutations in RNA; these constructs define nucleotide transition rates and are applied to chosen starting sequences to yield specific nonrandom pools. We examine the coverage of sequence space as a function of the mixing matrix and starting sequence via clustering analysis. We show that, in contrast to random sequences, which are associated only with a local region of sequence space, our designed pools, including a structured pool for GTP aptamers, can target specific motifs. It follows that experimental synthesis of designed pools can benefit from using optimized starting sequences, mixing matrices, and pool fractions associated with each of our constructed pools as a guide. Automation of our approach could provide practical tools for pool design applications for in vitro selection of RNAs and related problems.

  7. JPRS Report, Science and Technology USSR: Life Sciences.

    DTIC Science & Technology

    1990-07-16

    4 1 VETERINARY MEDICINE Primary Structure of RNA Polymerase Gene of Foot-and-Mouth Disease Virus ( FMDV ...neering were used to obtain cDNA corresponding to the Primary Structure of RNA Polymerase Gene of RNA polymerase gene to FMDV A 2 2 , with a map of the...Foot-and-Mouth Disease Virus ( FMDV ) A22 primary nucleotide sequence of the cDNA provided. 18400538F Moscow BIOORGANICHESKA YA Analysis of the data

  8. Detection of 224 candidate structured RNAs by comparative analysis of specific subsets of intergenic regions

    PubMed Central

    Lünse, Christina E.; Corbino, Keith A.; Ames, Tyler D.; Nelson, James W.; Roth, Adam; Perkins, Kevin R.; Sherlock, Madeline E.

    2017-01-01

    Abstract The discovery of structured non-coding RNAs (ncRNAs) in bacteria can reveal new facets of biology and biochemistry. Comparative genomics analyses executed by powerful computer algorithms have successfully been used to uncover many novel bacterial ncRNA classes in recent years. However, this general search strategy favors the discovery of more common ncRNA classes, whereas progressively rarer classes are correspondingly more difficult to identify. In the current study, we confront this problem by devising several methods to select subsets of intergenic regions that can concentrate these rare RNA classes, thereby increasing the probability that comparative sequence analysis approaches will reveal their existence. By implementing these methods, we discovered 224 novel ncRNA classes, which include ROOL RNA, an RNA class averaging 581 nt and present in multiple phyla, several highly conserved and widespread ncRNA classes with properties that suggest sophisticated biochemical functions and a multitude of putative cis-regulatory RNA classes involved in a variety of biological processes. We expect that further research on these newly found RNA classes will reveal additional aspects of novel biology, and allow for greater insights into the biochemistry performed by ncRNAs. PMID:28977401

  9. Thermodynamics of RNA duplexes modified with unlocked nucleic acid nucleotides

    PubMed Central

    Pasternak, Anna; Wengel, Jesper

    2010-01-01

    Thermodynamics provides insights into the influence of modified nucleotide residues on stability of nucleic acids and is crucial for designing duplexes with given properties. In this article, we introduce detailed thermodynamic analysis of RNA duplexes modified with unlocked nucleic acid (UNA) nucleotide residues. We investigate UNA single substitutions as well as model mismatch and dangling end effects. UNA residues placed in a central position makes RNA duplex structure less favourable by 4.0–6.6 kcal/mol. Slight destabilization, by ∼0.5–1.5 kcal/mol, is observed for 5′- or 3′-terminal UNA residues. Furthermore, thermodynamic effects caused by UNA residues are extremely additive with ΔG°37 conformity up to 98%. Direct mismatches involving UNA residues decrease the thermodynamic stability less than unmodified mismatches in RNA duplexes. Additionally, the presence of UNA residues adjacent to unpaired RNA residues reduces mismatch discrimination. Thermodynamic analysis of UNA 5′- and 3′-dangling ends revealed that stacking interactions of UNA residues are always less favourable than that of RNA residues. Finally, circular dichroism spectra imply no changes in overall A-form structure of UNA–RNA/RNA duplexes relative to the unmodified RNA duplexes. PMID:20562222

  10. Oxidative damage of 18S and 5S ribosomal RNA in digestive gland of mussels exposed to trace metals.

    PubMed

    Kournoutou, Georgia G; Giannopoulou, Panagiota C; Sazakli, Eleni; Leotsinidis, Michel; Kalpaxis, Dimitrios L

    2017-11-01

    Numerous studies have shown the ability of trace metals to accumulate in marine organisms and cause oxidative stress that leads to perturbations in many important intracellular processes, including protein synthesis. This study is mainly focused on the exploration of structural changes, like base modifications, scissions, and conformational changes, caused in 18S and 5S ribosomal RNA (rRNA) isolated from the mussel Mytilus galloprovincialis exposed to 40μg/L Cu, 30μg/L Hg, or 100μg/L Cd, for 5 or 15days. 18S rRNA and 5S rRNA are components of the small and large ribosomal subunit, respectively, found in complex with ribosomal proteins, translation factors and other auxiliary components (metal ions, toxins etc). 18S rRNA plays crucial roles in all stages of protein synthesis, while 5S rRNA serves as a master signal transducer between several functional regions of 28S rRNA. Therefore, structural changes in these ribosomal constituents could affect the basic functions of ribosomes and hence the normal metabolism of cells. Especially, 18S rRNA along with ribosomal proteins forms the decoding centre that ensures the correct codon-anticodon pairing. As exemplified by ELISA, primer extension analysis and DMS footprinting analysis, each metal caused oxidative damage to rRNA, depending on the nature of metal ion and the duration of exposure. Interestingly, exposure of mussels to Cu or Hg caused structural alterations in 5S rRNA, localized in paired regions and within loops A, B, C, and E, leading to a continuous progressive loss of the 5S RNA structural integrity. In contrast, structural impairments of 5S rRNA in mussels exposed to Cd were accumulating for the initial 5days, and then progressively decreased to almost the normal level by day 15, probably due to the parallel elevation of metallothionein content that depletes the pools of free Cd. Regions of interest in 18S rRNA, such as the decoding centre, sites implicated in the binding of tRNAs (A- and P-sites) or translation factors, and areas related to translation fidelity, were found to undergo significant metal-induced conformational alterations, leading either to loosening of their structure or to more compact folding. These modifications were associated with parallel alterations in the translation process at multiple levels, a fact suggesting that structural perturbations in ribosomes, caused by metals, pose significant hurdles in translational efficiency and fidelity. Copyright © 2017 Elsevier B.V. All rights reserved.

  11. RNA structural constraints in the evolution of the influenza A virus genome NP segment

    PubMed Central

    Gultyaev, Alexander P; Tsyganov-Bodounov, Anton; Spronken, Monique IJ; van der Kooij, Sander; Fouchier, Ron AM; Olsthoorn, René CL

    2014-01-01

    Conserved RNA secondary structures were predicted in the nucleoprotein (NP) segment of the influenza A virus genome using comparative sequence and structure analysis. A number of structural elements exhibiting nucleotide covariations were identified over the whole segment length, including protein-coding regions. Calculations of mutual information values at the paired nucleotide positions demonstrate that these structures impose considerable constraints on the virus genome evolution. Functional importance of a pseudoknot structure, predicted in the NP packaging signal region, was confirmed by plaque assays of the mutant viruses with disrupted structure and those with restored folding using compensatory substitutions. Possible functions of the conserved RNA folding patterns in the influenza A virus genome are discussed. PMID:25180940

  12. Quantitative analysis of RNA-protein interactions on a massively parallel array for mapping biophysical and evolutionary landscapes

    PubMed Central

    Buenrostro, Jason D.; Chircus, Lauren M.; Araya, Carlos L.; Layton, Curtis J.; Chang, Howard Y.; Snyder, Michael P.; Greenleaf, William J.

    2015-01-01

    RNA-protein interactions drive fundamental biological processes and are targets for molecular engineering, yet quantitative and comprehensive understanding of the sequence determinants of affinity remains limited. Here we repurpose a high-throughput sequencing instrument to quantitatively measure binding and dissociation of MS2 coat protein to >107 RNA targets generated on a flow-cell surface by in situ transcription and inter-molecular tethering of RNA to DNA. We decompose the binding energy contributions from primary and secondary RNA structure, finding that differences in affinity are often driven by sequence-specific changes in association rates. By analyzing the biophysical constraints and modeling mutational paths describing the molecular evolution of MS2 from low- to high-affinity hairpins, we quantify widespread molecular epistasis, and a long-hypothesized structure-dependent preference for G:U base pairs over C:A intermediates in evolutionary trajectories. Our results suggest that quantitative analysis of RNA on a massively parallel array (RNAMaP) relationships across molecular variants. PMID:24727714

  13. A small stem-loop structure of the Ebola virus trailer is essential for replication and interacts with heat-shock protein A8.

    PubMed

    Sztuba-Solinska, Joanna; Diaz, Larissa; Kumar, Mia R; Kolb, Gaëlle; Wiley, Michael R; Jozwick, Lucas; Kuhn, Jens H; Palacios, Gustavo; Radoshitzky, Sheli R; J Le Grice, Stuart F; Johnson, Reed F

    2016-11-16

    Ebola virus (EBOV) is a single-stranded negative-sense RNA virus belonging to the Filoviridae family. The leader and trailer non-coding regions of the EBOV genome likely regulate its transcription, replication, and progeny genome packaging. We investigated the cis-acting RNA signals involved in RNA-RNA and RNA-protein interactions that regulate replication of eGFP-encoding EBOV minigenomic RNA and identified heat shock cognate protein family A (HSC70) member 8 (HSPA8) as an EBOV trailer-interacting host protein. Mutational analysis of the trailer HSPA8 binding motif revealed that this interaction is essential for EBOV minigenome replication. Selective 2'-hydroxyl acylation analyzed by primer extension analysis of the secondary structure of the EBOV minigenomic RNA indicates formation of a small stem-loop composed of the HSPA8 motif, a 3' stem-loop (nucleotides 1868-1890) that is similar to a previously identified structure in the replicative intermediate (RI) RNA and a panhandle domain involving a trailer-to-leader interaction. Results of minigenome assays and an EBOV reverse genetic system rescue support a role for both the panhandle domain and HSPA8 motif 1 in virus replication. Published by Oxford University Press on behalf of Nucleic Acids Research 2016. This work is written by (a) US Government employee(s) and is in the public domain in the US.

  14. RNA Polymerase III promoter screen uncovers a novel noncoding RNA family conserved in Caenorhabditis and other clade V nematodes.

    PubMed

    Gruber, Andreas R

    2014-07-10

    RNA Polymerase III is a highly specialized enzyme complex responsible for the transcription of a very distinct set of housekeeping noncoding RNAs including tRNAs, 7SK snRNA, Y RNAs, U6 snRNA, and the RNA components of RNaseP and RNaseMRP. In this work we have utilized the conserved promoter structure of known RNA Polymerase III transcripts consisting of characteristic sequence elements termed proximal sequence elements (PSE) A and B and a TATA-box to uncover a novel RNA Polymerase III-transcribed, noncoding RNA family found to be conserved in Caenorhabditis as well as other clade V nematode species. Homology search in combination with detailed sequence and secondary structure analysis revealed that members of this novel ncRNA family evolve rapidly, and only maintain a potentially functional small stem structure that links the 5' end to the very 3' end of the transcript and a small hairpin structure at the 3' end. This is most likely required for efficient transcription termination. In addition, our study revealed evidence that canonical C/D box snoRNAs are also transcribed from a PSE A-PSE B-TATA-box promoter in Caenorhabditis elegans. Copyright © 2014 Elsevier B.V. All rights reserved.

  15. Template-Based Modeling of Protein-RNA Interactions

    PubMed Central

    Zheng, Jinfang; Kundrotas, Petras J.; Vakser, Ilya A.

    2016-01-01

    Protein-RNA complexes formed by specific recognition between RNA and RNA-binding proteins play an important role in biological processes. More than a thousand of such proteins in human are curated and many novel RNA-binding proteins are to be discovered. Due to limitations of experimental approaches, computational techniques are needed for characterization of protein-RNA interactions. Although much progress has been made, adequate methodologies reliably providing atomic resolution structural details are still lacking. Although protein-RNA free docking approaches proved to be useful, in general, the template-based approaches provide higher quality of predictions. Templates are key to building a high quality model. Sequence/structure relationships were studied based on a representative set of binary protein-RNA complexes from PDB. Several approaches were tested for pairwise target/template alignment. The analysis revealed a transition point between random and correct binding modes. The results showed that structural alignment is better than sequence alignment in identifying good templates, suitable for generating protein-RNA complexes close to the native structure, and outperforms free docking, successfully predicting complexes where the free docking fails, including cases of significant conformational change upon binding. A template-based protein-RNA interaction modeling protocol PRIME was developed and benchmarked on a representative set of complexes. PMID:27662342

  16. The Drosophila hnRNP F/H Homolog Glorund Uses Two Distinct RNA-Binding Modes to Diversify Target Recognition.

    PubMed

    Tamayo, Joel V; Teramoto, Takamasa; Chatterjee, Seema; Hall, Traci M Tanaka; Gavis, Elizabeth R

    2017-04-04

    The Drosophila hnRNP F/H homolog, Glorund (Glo), regulates nanos mRNA translation by interacting with a structured UA-rich motif in the nanos 3' untranslated region. Glo regulates additional RNAs, however, and mammalian homologs bind G-tract sequences to regulate alternative splicing, suggesting that Glo also recognizes G-tract RNA. To gain insight into how Glo recognizes both structured UA-rich and G-tract RNAs, we used mutational analysis guided by crystal structures of Glo's RNA-binding domains and identified two discrete RNA-binding surfaces that allow Glo to recognize both RNA motifs. By engineering Glo variants that favor a single RNA-binding mode, we show that a subset of Glo's functions in vivo is mediated solely by the G-tract binding mode, whereas regulation of nanos requires both recognition modes. Our findings suggest a molecular mechanism for the evolution of dual RNA motif recognition in Glo that may be applied to understanding the functional diversity of other RNA-binding proteins. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.

  17. The Drosophila hnRNP F/H homolog glorund uses two distinct RNA-binding modes to diversify target recognition

    DOE PAGES

    Tamayo, Joel V.; Teramoto, Takamasa; Chatterjee, Seema; ...

    2017-04-04

    The Drosophila hnRNP F/H homolog, Glorund (Glo), regulates nanos mRNA translation by interacting with a structured UA-rich motif in the nanos 3' untranslated region. Glo regulates additional RNAs, however, and mammalian homologs bind G-tract sequences to regulate alternative splicing, suggesting that Glo also recognizes G-tract RNA. To gain insight into how Glo recognizes both structured UA-rich and G-tract RNAs, we used mutational analysis guided by crystal structures of Glo’s RNA-binding domains and identified two discrete RNA-binding surfaces that allow Glo to recognize both RNA motifs. By engineering Glo variants that favor a single RNA-binding mode, we show that a subsetmore » of Glo’s functions in vivo is mediated solely by the G-tract binding mode, whereas regulation of nanos requires both recognition modes. Lastly, our findings suggest a molecular mechanism for the evolution of dual RNA motif recognition in Glo that may be applied to understanding the functional diversity of other RNA-binding proteins.« less

  18. The Drosophila hnRNP F/H Homolog Glorund Uses Two Distinct RNA-Binding Modes to Diversify Target Recognition

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tamayo, Joel V.; Teramoto, Takamasa; Chatterjee, Seema

    The Drosophila hnRNP F/H homolog, Glorund (Glo), regulates nanos mRNA translation by interacting with a structured UA-rich motif in the nanos 3' untranslated region. Glo regulates additional RNAs, however, and mammalian homologs bind G-tract sequences to regulate alternative splicing, suggesting that Glo also recognizes G-tract RNA. To gain insight into how Glo recognizes both structured UA-rich and G-tract RNAs, we used mutational analysis guided by crystal structures of Glo’s RNA-binding domains and identified two discrete RNA-binding surfaces that allow Glo to recognize both RNA motifs. By engineering Glo variants that favor a single RNA-binding mode, we show that a subsetmore » of Glo’s functions in vivo is mediated solely by the G-tract binding mode, whereas regulation of nanos requires both recognition modes. Our findings suggest a molecular mechanism for the evolution of dual RNA motif recognition in Glo that may be applied to understanding the functional diversity of other RNA-binding proteins.« less

  19. The Drosophila hnRNP F/H homolog glorund uses two distinct RNA-binding modes to diversify target recognition

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tamayo, Joel V.; Teramoto, Takamasa; Chatterjee, Seema

    The Drosophila hnRNP F/H homolog, Glorund (Glo), regulates nanos mRNA translation by interacting with a structured UA-rich motif in the nanos 3' untranslated region. Glo regulates additional RNAs, however, and mammalian homologs bind G-tract sequences to regulate alternative splicing, suggesting that Glo also recognizes G-tract RNA. To gain insight into how Glo recognizes both structured UA-rich and G-tract RNAs, we used mutational analysis guided by crystal structures of Glo’s RNA-binding domains and identified two discrete RNA-binding surfaces that allow Glo to recognize both RNA motifs. By engineering Glo variants that favor a single RNA-binding mode, we show that a subsetmore » of Glo’s functions in vivo is mediated solely by the G-tract binding mode, whereas regulation of nanos requires both recognition modes. Lastly, our findings suggest a molecular mechanism for the evolution of dual RNA motif recognition in Glo that may be applied to understanding the functional diversity of other RNA-binding proteins.« less

  20. Spliced leader RNA of trypanosomes: in vivo mutational analysis reveals extensive and distinct requirements for trans splicing and cap4 formation.

    PubMed Central

    Lücke, S; Xu, G L; Palfi, Z; Cross, M; Bellofatto, V; Bindereif, A

    1996-01-01

    In trypanosomes mRNAs are generated through trans splicing. The spliced leader (SL) RNA, which donates the 5'-terminal mini-exon to each of the protein coding exons, plays a central role in the trans splicing process. We have established in vivo assays to study in detail trans splicing, cap4 modification, and RNP assembly of the SL RNA in the trypanosomatid species Leptomonas seymouri. First, we found that extensive sequences within the mini-exon are required for SL RNA function in vivo, although a conserved length of 39 nt is not essential. In contrast, the intron sequence appears to be surprisingly tolerant to mutation; only the stem-loop II structure is indispensable. The asymmetry of the sequence requirements in the stem I region suggests that this domain may exist in different functional conformations. Second, distinct mini-exon sequences outside the modification site are important for efficient cap4 formation. Third, all SL RNA mutations tested allowed core RNP assembly, suggesting flexible requirements for core protein binding. In sum, the results of our mutational analysis provide evidence for a discrete domain structure of the SL RNA and help to explain the strong phylogenetic conservation of the mini-exon sequence and of the overall SL RNA secondary structure; they also suggest that there may be certain differences between trans splicing in nematodes and trypanosomes. This approach provides a basis for studying RNA-RNA interactions in the trans spliceosome. Images PMID:8861965

  1. The 1.3 A resolution structure of the RNA tridecamer r(GCGUUUGAAACGC): metal ion binding correlates with base unstacking and groove contraction.

    PubMed

    Timsit, Youri; Bombard, Sophie

    2007-12-01

    Metal ions play a key role in RNA folding and activity. Elucidating the rules that govern the binding of metal ions is therefore an essential step for better understanding the RNA functions. High-resolution data are a prerequisite for a detailed structural analysis of ion binding on RNA and, in particular, the observation of monovalent cations. Here, the high-resolution crystal structures of the tridecamer duplex r(GCGUUUGAAACGC) crystallized under different conditions provides new structural insights on ion binding on GAAA/UUU sequences that exhibit both unusual structural and functional properties in RNA. The present study extends the repertory of RNA ion binding sites in showing that the two first bases of UUU triplets constitute a specific site for sodium ions. A striking asymmetric pattern of metal ion binding in the two equivalent halves of the palindromic sequence demonstrates that sequence and its environment act together to bind metal ions. A highly ionophilic half that binds six metal ions allows, for the first time, the observation of a disodium cluster in RNA. The comparison of the equivalent halves of the duplex provides experimental evidences that ion binding correlates with structural alterations and groove contraction.

  2. Molecular structure of r/GCG/d/TATACGC/ - A DNA-RNA hybrid helix joined to double helical DNA

    NASA Technical Reports Server (NTRS)

    Wang, A. H.-J.; Fujii, S.; Rich, A.; Van Boom, J. H.; Van Der Marel, G. A.; Van Boeckel, S. A. A.

    1982-01-01

    The molecule r(GCG)d(TATACGC) is self-complementary and forms two DNA-RNA hybrid segments surrounding a central region of double helical DNA; its molecular structure has been solved by X-ray analysis. All three parts of the molecule adopt a conformation which is close to that seen in the 11-fold RNA double helix. The conformation of the ribonucleotides is partly determined by water molecules bridging between the ribose O2' hydroxyl group and cytosine O2. The hybrid-DNA duplex junction contains no structural discontinuities. However, the central DNA TATA sequence has some structural irregularities.

  3. Transcriptome-Wide Analysis of UTRs in Non-Small Cell Lung Cancer Reveals Cancer-Related Genes with SNV-Induced Changes on RNA Secondary Structure and miRNA Target Sites

    PubMed Central

    Novotny, Peter; Tang, Xiaojia; Kalari, Krishna R.; Gorodkin, Jan

    2014-01-01

    Traditional mutation assessment methods generally focus on predicting disruptive changes in protein-coding regions rather than non-coding regulatory regions like untranslated regions (UTRs) of mRNAs. The UTRs, however, are known to have many sequence and structural motifs that can regulate translational and transcriptional efficiency and stability of mRNAs through interaction with RNA-binding proteins and other non-coding RNAs like microRNAs (miRNAs). In a recent study, transcriptomes of tumor cells harboring mutant and wild-type KRAS (V-Ki-ras2 Kirsten rat sarcoma viral oncogene homolog) genes in patients with non-small cell lung cancer (NSCLC) have been sequenced to identify single nucleotide variations (SNVs). About 40% of the total SNVs (73,717) identified were mapped to UTRs, but omitted in the previous analysis. To meet this obvious demand for analysis of the UTRs, we designed a comprehensive pipeline to predict the effect of SNVs on two major regulatory elements, secondary structure and miRNA target sites. Out of 29,290 SNVs in 6462 genes, we predict 472 SNVs (in 408 genes) affecting local RNA secondary structure, 490 SNVs (in 447 genes) affecting miRNA target sites and 48 that do both. Together these disruptive SNVs were present in 803 different genes, out of which 188 (23.4%) were previously known to be cancer-associated. Notably, this ratio is significantly higher (one-sided Fisher's exact test p-value = 0.032) than the ratio (20.8%) of known cancer-associated genes (n = 1347) in our initial data set (n = 6462). Network analysis shows that the genes harboring disruptive SNVs were involved in molecular mechanisms of cancer, and the signaling pathways of LPS-stimulated MAPK, IL-6, iNOS, EIF2 and mTOR. In conclusion, we have found hundreds of SNVs which are highly disruptive with respect to changes in the secondary structure and miRNA target sites within UTRs. These changes hold the potential to alter the expression of known cancer genes or genes linked to cancer-associated pathways. PMID:24416147

  4. Transcriptome-wide analysis of UTRs in non-small cell lung cancer reveals cancer-related genes with SNV-induced changes on RNA secondary structure and miRNA target sites.

    PubMed

    Sabarinathan, Radhakrishnan; Wenzel, Anne; Novotny, Peter; Tang, Xiaojia; Kalari, Krishna R; Gorodkin, Jan

    2014-01-01

    Traditional mutation assessment methods generally focus on predicting disruptive changes in protein-coding regions rather than non-coding regulatory regions like untranslated regions (UTRs) of mRNAs. The UTRs, however, are known to have many sequence and structural motifs that can regulate translational and transcriptional efficiency and stability of mRNAs through interaction with RNA-binding proteins and other non-coding RNAs like microRNAs (miRNAs). In a recent study, transcriptomes of tumor cells harboring mutant and wild-type KRAS (V-Ki-ras2 Kirsten rat sarcoma viral oncogene homolog) genes in patients with non-small cell lung cancer (NSCLC) have been sequenced to identify single nucleotide variations (SNVs). About 40% of the total SNVs (73,717) identified were mapped to UTRs, but omitted in the previous analysis. To meet this obvious demand for analysis of the UTRs, we designed a comprehensive pipeline to predict the effect of SNVs on two major regulatory elements, secondary structure and miRNA target sites. Out of 29,290 SNVs in 6462 genes, we predict 472 SNVs (in 408 genes) affecting local RNA secondary structure, 490 SNVs (in 447 genes) affecting miRNA target sites and 48 that do both. Together these disruptive SNVs were present in 803 different genes, out of which 188 (23.4%) were previously known to be cancer-associated. Notably, this ratio is significantly higher (one-sided Fisher's exact test p-value = 0.032) than the ratio (20.8%) of known cancer-associated genes (n = 1347) in our initial data set (n = 6462). Network analysis shows that the genes harboring disruptive SNVs were involved in molecular mechanisms of cancer, and the signaling pathways of LPS-stimulated MAPK, IL-6, iNOS, EIF2 and mTOR. In conclusion, we have found hundreds of SNVs which are highly disruptive with respect to changes in the secondary structure and miRNA target sites within UTRs. These changes hold the potential to alter the expression of known cancer genes or genes linked to cancer-associated pathways.

  5. Detailed analysis of RNA-protein interactions within the bacterial ribosomal protein L5/5S rRNA complex.

    PubMed

    Perederina, Anna; Nevskaya, Natalia; Nikonov, Oleg; Nikulin, Alexei; Dumas, Philippe; Yao, Min; Tanaka, Isao; Garber, Maria; Gongadze, George; Nikonov, Stanislav

    2002-12-01

    The crystal structure of ribosomal protein L5 from Thermus thermophilus complexed with a 34-nt fragment comprising helix III and loop C of Escherichia coli 5S rRNA has been determined at 2.5 A resolution. The protein specifically interacts with the bulged nucleotides at the top of loop C of 5S rRNA. The rRNA and protein contact surfaces are strongly stabilized by intramolecular interactions. Charged and polar atoms forming the network of conserved intermolecular hydrogen bonds are located in two narrow planar parallel layers belonging to the protein and rRNA, respectively. The regions, including these atoms conserved in Bacteria and Archaea, can be considered an RNA-protein recognition module. Comparison of the T. thermophilus L5 structure in the RNA-bound form with the isolated Bacillus stearothermophilus L5 structure shows that the RNA-recognition module on the protein surface does not undergo significant changes upon RNA binding. In the crystal of the complex, the protein interacts with another RNA molecule in the asymmetric unit through the beta-sheet concave surface. This protein/RNA interface simulates the interaction of L5 with 23S rRNA observed in the Haloarcula marismortui 50S ribosomal subunit.

  6. Detailed analysis of RNA-protein interactions within the bacterial ribosomal protein L5/5S rRNA complex.

    PubMed Central

    Perederina, Anna; Nevskaya, Natalia; Nikonov, Oleg; Nikulin, Alexei; Dumas, Philippe; Yao, Min; Tanaka, Isao; Garber, Maria; Gongadze, George; Nikonov, Stanislav

    2002-01-01

    The crystal structure of ribosomal protein L5 from Thermus thermophilus complexed with a 34-nt fragment comprising helix III and loop C of Escherichia coli 5S rRNA has been determined at 2.5 A resolution. The protein specifically interacts with the bulged nucleotides at the top of loop C of 5S rRNA. The rRNA and protein contact surfaces are strongly stabilized by intramolecular interactions. Charged and polar atoms forming the network of conserved intermolecular hydrogen bonds are located in two narrow planar parallel layers belonging to the protein and rRNA, respectively. The regions, including these atoms conserved in Bacteria and Archaea, can be considered an RNA-protein recognition module. Comparison of the T. thermophilus L5 structure in the RNA-bound form with the isolated Bacillus stearothermophilus L5 structure shows that the RNA-recognition module on the protein surface does not undergo significant changes upon RNA binding. In the crystal of the complex, the protein interacts with another RNA molecule in the asymmetric unit through the beta-sheet concave surface. This protein/RNA interface simulates the interaction of L5 with 23S rRNA observed in the Haloarcula marismortui 50S ribosomal subunit. PMID:12515387

  7. Structural virology. Near-atomic cryo-EM structure of the helical measles virus nucleocapsid.

    PubMed

    Gutsche, Irina; Desfosses, Ambroise; Effantin, Grégory; Ling, Wai Li; Haupt, Melina; Ruigrok, Rob W H; Sachse, Carsten; Schoehn, Guy

    2015-05-08

    Measles is a highly contagious human disease. We used cryo-electron microscopy and single particle-based helical image analysis to determine the structure of the helical nucleocapsid formed by the folded domain of the measles virus nucleoprotein encapsidating an RNA at a resolution of 4.3 angstroms. The resulting pseudoatomic model of the measles virus nucleocapsid offers important insights into the mechanism of the helical polymerization of nucleocapsids of negative-strand RNA viruses, in particular via the exchange subdomains of the nucleoprotein. The structure reveals the mode of the nucleoprotein-RNA interaction and explains why each nucleoprotein of measles virus binds six nucleotides, whereas the respiratory syncytial virus nucleoprotein binds seven. It provides a rational basis for further analysis of measles virus replication and transcription, and reveals potential targets for drug design. Copyright © 2015, American Association for the Advancement of Science.

  8. Transcripts with in silico predicted RNA structure are enriched everywhere in the mouse brain

    PubMed Central

    2012-01-01

    Background Post-transcriptional control of gene expression is mostly conducted by specific elements in untranslated regions (UTRs) of mRNAs, in collaboration with specific binding proteins and RNAs. In several well characterized cases, these RNA elements are known to form stable secondary structures. RNA secondary structures also may have major functional implications for long noncoding RNAs (lncRNAs). Recent transcriptional data has indicated the importance of lncRNAs in brain development and function. However, no methodical efforts to investigate this have been undertaken. Here, we aim to systematically analyze the potential for RNA structure in brain-expressed transcripts. Results By comprehensive spatial expression analysis of the adult mouse in situ hybridization data of the Allen Mouse Brain Atlas, we show that transcripts (coding as well as non-coding) associated with in silico predicted structured probes are highly and significantly enriched in almost all analyzed brain regions. Functional implications of these RNA structures and their role in the brain are discussed in detail along with specific examples. We observe that mRNAs with a structure prediction in their UTRs are enriched for binding, transport and localization gene ontology categories. In addition, after manual examination we observe agreement between RNA binding protein interaction sites near the 3’ UTR structures and correlated expression patterns. Conclusions Our results show a potential use for RNA structures in expressed coding as well as noncoding transcripts in the adult mouse brain, and describe the role of structured RNAs in the context of intracellular signaling pathways and regulatory networks. Based on this data we hypothesize that RNA structure is widely involved in transcriptional and translational regulatory mechanisms in the brain and ultimately plays a role in brain function. PMID:22651826

  9. Application of ion mobility-mass spectrometry to microRNA analysis.

    PubMed

    Takebayashi, Kosuke; Hirose, Kenji; Izumi, Yoshihiro; Bamba, Takeshi; Fukusaki, Eiichiro

    2013-03-01

    Liquid chromatography/mass spectrometry is widely used for studying sequence determination and modification analysis of small RNAs. However, the efficiency of liquid chromatography-based separation of intact small RNA species is insufficient, since the physiochemical properties among small RNAs are very similar. In this study, we focused on ion mobility-mass spectrometry (IM-MS), which is a gas-phase separation technique coupled with mass spectrometry; we have evaluated the utility of IM-MS for microRNA (miRNA) analysis. A multiply charged deprotonated ion derived from an 18-24-nt-long miRNA was formed by electrospray ionization, and then the time, called the "drift time", taken by each ion to migrate through a buffer gas was measured. Each multivalent ion was temporally separated on the basis of the charge state and structural formation; 3 types of unique mass-mobility correlation patterns (i.e., chainlike-form, hairpin-form, and dimer-form) were present on the two-dimensional mobility-mass spectrum. Moreover, we found that the ion size (sequence length) and the secondary structures of the small RNAs strongly contributed to the IM-MS-based separation, although solvent conditions such as pH had no effect. Therefore, sequence isomers could also be discerned by the selection of each specific charged ion, i.e., the 6(-) charged ion reflected a majority among chainlike-, hairpin-, and other structures. We concluded that the IM-MS provides additional capability for separation; thus, this analytical method will be a powerful tool for comprehensive small RNA analysis. Copyright © 2012. Published by Elsevier B.V.

  10. High-Density Proximity Mapping Reveals the Subcellular Organization of mRNA-Associated Granules and Bodies.

    PubMed

    Youn, Ji-Young; Dunham, Wade H; Hong, Seo Jung; Knight, James D R; Bashkurov, Mikhail; Chen, Ginny I; Bagci, Halil; Rathod, Bhavisha; MacLeod, Graham; Eng, Simon W M; Angers, Stéphane; Morris, Quaid; Fabian, Marc; Côté, Jean-François; Gingras, Anne-Claude

    2018-02-01

    mRNA processing, transport, translation, and ultimately degradation involve a series of dedicated protein complexes that often assemble into large membraneless structures such as stress granules (SGs) and processing bodies (PBs). Here, systematic in vivo proximity-dependent biotinylation (BioID) analysis of 119 human proteins associated with different aspects of mRNA biology uncovers 7424 unique proximity interactions with 1,792 proteins. Classical bait-prey analysis reveals connections of hundreds of proteins to distinct mRNA-associated processes or complexes, including the splicing and transcriptional elongation machineries (protein phosphatase 4) and the CCR4-NOT deadenylase complex (CEP85, RNF219, and KIAA0355). Analysis of correlated patterns between endogenous preys uncovers the spatial organization of RNA regulatory structures and enables the definition of 144 core components of SGs and PBs. We report preexisting contacts between most core SG proteins under normal growth conditions and demonstrate that several core SG proteins (UBAP2L, CSDE1, and PRRC2C) are critical for the formation of microscopically visible SGs. Copyright © 2017 Elsevier Inc. All rights reserved.

  11. The Crystal Structure and RNA-Binding of an Orthomyxovirus Nucleoprotein

    PubMed Central

    Zheng, Wenjie; Olson, John; Vakharia, Vikram; Tao, Yizhi Jane

    2013-01-01

    Genome packaging for viruses with segmented genomes is often a complex problem. This is particularly true for influenza viruses and other orthomyxoviruses, whose genome consists of multiple negative-sense RNAs encapsidated as ribonucleoprotein (RNP) complexes. To better understand the structural features of orthomyxovirus RNPs that allow them to be packaged, we determined the crystal structure of the nucleoprotein (NP) of a fish orthomyxovirus, the infectious salmon anemia virus (ISAV) (genus Isavirus). As the major protein component of the RNPs, ISAV-NP possesses a bi-lobular structure similar to the influenza virus NP. Because both RNA-free and RNA-bound ISAV NP forms stable dimers in solution, we were able to measure the NP RNA binding affinity as well as the stoichiometry using recombinant proteins and synthetic oligos. Our RNA binding analysis revealed that each ISAV-NP binds ∼12 nts of RNA, shorter than the 24–28 nts originally estimated for the influenza A virus NP based on population average. The 12-nt stoichiometry was further confirmed by results from electron microscopy and dynamic light scattering. Considering that RNPs of ISAV and the influenza viruses have similar morphologies and dimensions, our findings suggest that NP-free RNA may exist on orthomyxovirus RNPs, and selective RNP packaging may be accomplished through direct RNA-RNA interactions. PMID:24068932

  12. Thermal Stability of RNA Structures with Bulky Cations in Mixed Aqueous Solutions.

    PubMed

    Nakano, Shu-Ichi; Tanino, Yuichi; Hirayama, Hidenobu; Sugimoto, Naoki

    2016-10-04

    Bulky cations are used to develop nucleic-acid-based technologies for medical and technological applications in which nucleic acids function under nonaqueous conditions. In this study, the thermal stability of RNA structures was measured in the presence of various bulky cations in aqueous mixtures with organic solvents or polymer additives. The stability of oligonucleotide, transfer RNA, and polynucleotide structures was decreased in the presence of salts of tetrabutylammonium and tetrapentylammonium ions, and the stability and salt concentration dependences were dependent on cation sizes. The degree to which stability was dependent on salt concentration was correlated with reciprocals of the dielectric constants of mixed solutions, regardless of interactions between the cosolutes and RNA. Our results show that organic solvents affect the strength of electrostatic interactions between RNA and cations. Analysis of ion binding to RNA indicated greater enhancement of cation binding to RNA single strands than to duplexes in media with low dielectric constants. Furthermore, background bulky ions changed the dependence of RNA duplex stability on the concentration of metal ion salts. These unique properties of large tetraalkylammonium ions are useful for controlling the stability of RNA structures and its sensitivity to metal ion salts. Copyright © 2016 Biophysical Society. Published by Elsevier Inc. All rights reserved.

  13. Mg2+ Effect on Argonaute and RNA Duplex by Molecular Dynamics and Bioinformatics Implications

    PubMed Central

    Nam, Seungyoon; Ryu, Hyojung; Son, Won-joon; Kim, Yon Hui; Kim, Kyung Tae; Balch, Curt; Nephew, Kenneth P.; Lee, Jinhyuk

    2014-01-01

    RNA interference (RNAi), mediated by small non-coding RNAs (e.g., miRNAs, siRNAs), influences diverse cellular functions. Highly complementary miRNA-target RNA (or siRNA-target RNA) duplexes are recognized by an Argonaute family protein (Ago2), and recent observations indicate that the concentration of Mg2+ ions influences miRNA targeting of specific mRNAs, thereby modulating miRNA-mRNA networks. In the present report, we studied the thermodynamic effects of differential [Mg2+] on slicing (RNA silencing cycle) through molecular dynamics simulation analysis, and its subsequent statistical analysis. Those analyses revealed different structural conformations of the RNA duplex in Ago2, depending on Mg2+ concentration. We also demonstrate that cation effects on Ago2 structural flexibility are critical to its catalytic/functional activity, with low [Mg2+] favoring greater Ago2 flexibility (e.g., greater entropy) and less miRNA/mRNA duplex stability, thus favoring slicing. The latter finding was supported by a negative correlation between expression of an Mg2+ influx channel, TRPM7, and one miRNA’s (miR-378) ability to downregulate its mRNA target, TMEM245. These results imply that thermodynamics could be applied to siRNA-based therapeutic strategies, using highly complementary binding targets, because Ago2 is also involved in RNAi slicing by exogenous siRNAs. However, the efficacy of a siRNA-based approach will differ, to some extent, based on the Mg2+ concentration even within the same disease type; therefore, different siRNA-based approaches might be considered for patient-to-patient needs. PMID:25330448

  14. Structural Rearrangement in an RsmA/CsrA Ortholog of Pseudomonas aeruginosa Creates a Dimeric RNA-Binding Protein, RsmN

    PubMed Central

    Morris, Elizabeth R.; Hall, Gareth; Li, Chan; Heeb, Stephan; Kulkarni, Rahul V.; Lovelock, Laura; Silistre, Hazel; Messina, Marco; Cámara, Miguel; Emsley, Jonas; Williams, Paul; Searle, Mark S.

    2013-01-01

    Summary In bacteria, the highly conserved RsmA/CsrA family of RNA-binding proteins functions as global posttranscriptional regulators acting on mRNA translation and stability. Through phenotypic complementation of an rsmA mutant in Pseudomonas aeruginosa, we discovered a family member, termed RsmN. Elucidation of the RsmN crystal structure and that of the complex with a hairpin from the sRNA, RsmZ, reveals a uniquely inserted α helix, which redirects the polypeptide chain to form a distinctly different protein fold to the domain-swapped dimeric structure of RsmA homologs. The overall β sheet structure required for RNA recognition is, however, preserved with compensatory sequence and structure differences, allowing the RsmN dimer to target binding motifs in both structured hairpin loops and flexible disordered RNAs. Phylogenetic analysis indicates that, although RsmN appears unique to P. aeruginosa, homologous proteins with the inserted α helix are more widespread and arose as a consequence of a gene duplication event. PMID:23954502

  15. Discrete RNA libraries from pseudo-torsional space

    PubMed Central

    Humphris-Narayanan, Elisabeth

    2012-01-01

    The discovery that RNA molecules can fold into complex structures and carry out diverse cellular roles has led to interest in developing tools for modeling RNA tertiary structure. While significant progress has been made in establishing that the RNA backbone is rotameric, few libraries of discrete conformations specifically for use in RNA modeling have been validated. Here, we present six libraries of discrete RNA conformations based on a simplified pseudo-torsional notation of the RNA backbone, comparable to phi and psi in the protein backbone. We evaluate the ability of each library to represent single nucleotide backbone conformations and we show how individual library fragments can be assembled into dinucleotides that are consistent with established RNA backbone descriptors spanning from sugar to sugar. We then use each library to build all-atom models of 20 test folds and we show how the composition of a fragment library can limit model quality. Despite the limitations inherent in using discretized libraries, we find that several hundred discrete fragments can rebuild RNA folds up to 174 nucleotides in length with atomic-level accuracy (<1.5Å RMSD). We anticipate the libraries presented here could easily be incorporated into RNA structural modeling, analysis, or refinement tools. PMID:22425640

  16. Tris-borate is a poor counterion for RNA: a cautionary tale for RNA folding studies

    PubMed Central

    Buchmueller, Karen L.; Weeks, Kevin M.

    2004-01-01

    Native polyacrylamide gel electrophoresis is a powerful approach for visualizing RNA folding states and folding intermediates. Tris-borate has a high-buffering capacity and is therefore widely used in electrophoresis-based investigations of RNA structure and folding. However, the effectiveness of Tris-borate as a counterion for RNA has not been systematically investigated. In a recirculated Hepes/KCl buffer, the catalytic core of the bI5 group I intron RNA undergoes a conformational collapse characterized by a bulk transition midpoint, or Mg1/2, of ∼3 mM, consistent with extensive independent biochemical experiments. In contrast, in Tris-borate, RNA collapse has a much smaller apparent Mg1/2, equal to 0.1 mM, because in this buffer the RNA undergoes a different, large amplitude, folding transition at low Mg2+ concentrations. Analysis of structural neighbors using a short-lived, RNA-tethered, photocrosslinker indicates that the global RNA structure eventually converges in the two buffer systems, as the divalent ion concentration approaches ∼1 mM Mg2+. The weak capacity of Tris-borate to stabilize RNA folding may reflect relatively unfavorable interactions between the bulky Tris-borate ion and RNA or partial coordination of RNA functional groups by borate. Under some conditions, Tris-borate is a poor counterion for RNA and its use merits careful evaluation in RNA folding studies. PMID:15601995

  17. RNApdbee--a webserver to derive secondary structures from pdb files of knotted and unknotted RNAs.

    PubMed

    Antczak, Maciej; Zok, Tomasz; Popenda, Mariusz; Lukasiak, Piotr; Adamiak, Ryszard W; Blazewicz, Jacek; Szachniuk, Marta

    2014-07-01

    In RNA structural biology and bioinformatics an access to correct RNA secondary structure and its proper representation is of crucial importance. This is true especially in the field of secondary and 3D RNA structure prediction. Here, we introduce RNApdbee-a new tool that allows to extract RNA secondary structure from the pdb file, and presents it in both textual and graphical form. RNApdbee supports processing of knotted and unknotted structures of large RNAs, also within protein complexes. The method works not only for first but also for high order pseudoknots, and gives an information about canonical and non-canonical base pairs. A combination of these features is unique among existing applications for RNA structure analysis. Additionally, a function of converting between the text notations, i.e. BPSEQ, CT and extended dot-bracket, is provided. In order to facilitate a more comprehensive study, the webserver integrates the functionality of RNAView, MC-Annotate and 3DNA/DSSR, being the most common tools used for automated identification and classification of RNA base pairs. RNApdbee is implemented as a publicly available webserver with an intuitive interface and can be freely accessed at http://rnapdbee.cs.put.poznan.pl/. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  18. Accelerated probabilistic inference of RNA structure evolution

    PubMed Central

    Holmes, Ian

    2005-01-01

    Background Pairwise stochastic context-free grammars (Pair SCFGs) are powerful tools for evolutionary analysis of RNA, including simultaneous RNA sequence alignment and secondary structure prediction, but the associated algorithms are intensive in both CPU and memory usage. The same problem is faced by other RNA alignment-and-folding algorithms based on Sankoff's 1985 algorithm. It is therefore desirable to constrain such algorithms, by pre-processing the sequences and using this first pass to limit the range of structures and/or alignments that can be considered. Results We demonstrate how flexible classes of constraint can be imposed, greatly reducing the computational costs while maintaining a high quality of structural homology prediction. Any score-attributed context-free grammar (e.g. energy-based scoring schemes, or conditionally normalized Pair SCFGs) is amenable to this treatment. It is now possible to combine independent structural and alignment constraints of unprecedented general flexibility in Pair SCFG alignment algorithms. We outline several applications to the bioinformatics of RNA sequence and structure, including Waterman-Eggert N-best alignments and progressive multiple alignment. We evaluate the performance of the algorithm on test examples from the RFAM database. Conclusion A program, Stemloc, that implements these algorithms for efficient RNA sequence alignment and structure prediction is available under the GNU General Public License. PMID:15790387

  19. Abstract shapes of RNA.

    PubMed

    Giegerich, Robert; Voss, Björn; Rehmsmeier, Marc

    2004-01-01

    The function of a non-protein-coding RNA is often determined by its structure. Since experimental determination of RNA structure is time-consuming and expensive, its computational prediction is of great interest, and efficient solutions based on thermodynamic parameters are known. Frequently, however, the predicted minimum free energy structures are not the native ones, leading to the necessity of generating suboptimal solutions. While this can be accomplished by a number of programs, the user is often confronted with large outputs of similar structures, although he or she is interested in structures with more fundamental differences, or, in other words, with different abstract shapes. Here, we formalize the concept of abstract shapes and introduce their efficient computation. Each shape of an RNA molecule comprises a class of similar structures and has a representative structure of minimal free energy within the class. Shape analysis is implemented in the program RNAshapes. We applied RNAshapes to the prediction of optimal and suboptimal abstract shapes of several RNAs. For a given energy range, the number of shapes is considerably smaller than the number of structures, and in all cases, the native structures were among the top shape representatives. This demonstrates that the researcher can quickly focus on the structures of interest, without processing up to thousands of near-optimal solutions. We complement this study with a large-scale analysis of the growth behaviour of structure and shape spaces. RNAshapes is available for download and as an online version on the Bielefeld Bioinformatics Server.

  20. Functional and structural analysis of a highly-expressed Yersinia pestis small RNA following infection of cultured macrophages

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Li, Nan; Hennelly, Scott P.; Stubben, Chris J.

    Non-coding small RNAs (sRNAs) are found in practically all bacterial genomes and play important roles in regulating gene expression to impact bacterial metabolism, growth, and virulence. We performed transcriptomics analysis to identify sRNAs that are differentially expressed in Yersinia pestis that invaded the human macrophage cell line THP-1, compared to pathogens that remained extracellular in the presence of host. Using ultra high-throughput sequencing, we identified 37 novel and 143 previously known sRNAs in Y. pestis. In particular, the sRNA Ysr170 was highly expressed in intracellular Yersinia and exhibited a log2 fold change ~3.6 higher levels compared to extracellular bacteria. Wemore » found that knock-down of Ysr170 expression attenuated infection efficiency in cell culture and growth rate in response to different stressors. In addition, we applied selective 2’-hydroxyl acylation analyzed by primer extension (SHAPE) analysis to determine the secondary structure of Ysr170 and observed structural changes resulting from interactions with the aminoglycoside antibiotic gentamycin and the RNA chaperone Hfq. Interestingly, gentamicin stabilized helix 4 of Ysr170, which structurally resembles the native gentamicin 16S ribosomal binding site. Lastly, we modeled the tertiary structure of Ysr170 binding to gentamycin using RNA motif modeling. Integration of these experimental and structural methods can provide further insight into the design of small molecules that can inhibit function of sRNAs required for pathogen virulence.« less

  1. Functional and structural analysis of a highly-expressed Yersinia pestis small RNA following infection of cultured macrophages

    DOE PAGES

    Li, Nan; Hennelly, Scott P.; Stubben, Chris J.; ...

    2016-12-28

    Non-coding small RNAs (sRNAs) are found in practically all bacterial genomes and play important roles in regulating gene expression to impact bacterial metabolism, growth, and virulence. We performed transcriptomics analysis to identify sRNAs that are differentially expressed in Yersinia pestis that invaded the human macrophage cell line THP-1, compared to pathogens that remained extracellular in the presence of host. Using ultra high-throughput sequencing, we identified 37 novel and 143 previously known sRNAs in Y. pestis. In particular, the sRNA Ysr170 was highly expressed in intracellular Yersinia and exhibited a log2 fold change ~3.6 higher levels compared to extracellular bacteria. Wemore » found that knock-down of Ysr170 expression attenuated infection efficiency in cell culture and growth rate in response to different stressors. In addition, we applied selective 2’-hydroxyl acylation analyzed by primer extension (SHAPE) analysis to determine the secondary structure of Ysr170 and observed structural changes resulting from interactions with the aminoglycoside antibiotic gentamycin and the RNA chaperone Hfq. Interestingly, gentamicin stabilized helix 4 of Ysr170, which structurally resembles the native gentamicin 16S ribosomal binding site. Lastly, we modeled the tertiary structure of Ysr170 binding to gentamycin using RNA motif modeling. Integration of these experimental and structural methods can provide further insight into the design of small molecules that can inhibit function of sRNAs required for pathogen virulence.« less

  2. RNA-LIM: a novel procedure for analyzing protein/single-stranded RNA propensity data with concomitant estimation of interface structure.

    PubMed

    Hall, Damien; Li, Songling; Yamashita, Kazuo; Azuma, Ryuzo; Carver, John A; Standley, Daron M

    2015-03-01

    RNA-LIM is a procedure that can analyze various pseudo-potentials describing the affinity between single-stranded RNA (ssRNA) ribonucleotides and surface amino acids to produce a coarse-grained estimate of the structure of the ssRNA at the protein interface. The search algorithm works by evolving an ssRNA chain, of known sequence, as a series of walks between fixed sites on a protein surface. Optimal routes are found by application of a set of minimal "limiting" restraints derived jointly from (i) selective sampling of the ribonucleotide amino acid affinity pseudo-potential data, (ii) limited surface path exploration by prior determination of surface arc lengths, and (iii) RNA structural specification obtained from a statistical potential gathered from a library of experimentally determined ssRNA structures. We describe the general approach using a NAST (Nucleic Acid Simulation Tool)-like approximation of the ssRNA chain and a generalized pseudo-potential reflecting the location of nucleic acid binding residues. Minimum and maximum performance indicators of the methodology are established using both synthetic data, for which the pseudo-potential defining nucleic acid binding affinity is systematically degraded, and a representative real case, where the RNA binding sites are predicted by the amplified antisense RNA (aaRNA) method. Some potential uses and extensions of the routine are discussed. RNA-LIM analysis programs along with detailed instructions for their use are available on request from the authors. Crown Copyright © 2014. Published by Elsevier Inc. All rights reserved.

  3. A small stem-loop structure of the Ebola virus trailer is essential for replication and interacts with heat-shock protein A8

    PubMed Central

    Sztuba-Solinska, Joanna; Diaz, Larissa; Kumar, Mia R.; Kolb, Gaëlle; Wiley, Michael R.; Jozwick, Lucas; Kuhn, Jens H.; Palacios, Gustavo; Radoshitzky, Sheli R.; J. Le Grice, Stuart F.; Johnson, Reed F.

    2016-01-01

    Ebola virus (EBOV) is a single-stranded negative-sense RNA virus belonging to the Filoviridae family. The leader and trailer non-coding regions of the EBOV genome likely regulate its transcription, replication, and progeny genome packaging. We investigated the cis-acting RNA signals involved in RNA–RNA and RNA–protein interactions that regulate replication of eGFP-encoding EBOV minigenomic RNA and identified heat shock cognate protein family A (HSC70) member 8 (HSPA8) as an EBOV trailer-interacting host protein. Mutational analysis of the trailer HSPA8 binding motif revealed that this interaction is essential for EBOV minigenome replication. Selective 2′-hydroxyl acylation analyzed by primer extension analysis of the secondary structure of the EBOV minigenomic RNA indicates formation of a small stem-loop composed of the HSPA8 motif, a 3′ stem-loop (nucleotides 1868–1890) that is similar to a previously identified structure in the replicative intermediate (RI) RNA and a panhandle domain involving a trailer-to-leader interaction. Results of minigenome assays and an EBOV reverse genetic system rescue support a role for both the panhandle domain and HSPA8 motif 1 in virus replication. PMID:27651462

  4. Insights into molecular plasticity in protein complexes from Trm9-Trm112 tRNA modifying enzyme crystal structure

    PubMed Central

    Létoquart, Juliette; van Tran, Nhan; Caroline, Vonny; Aleksandrov, Alexey; Lazar, Noureddine; van Tilbeurgh, Herman; Liger, Dominique; Graille, Marc

    2015-01-01

    Most of the factors involved in translation (tRNA, rRNA and proteins) are subject to post-transcriptional and post-translational modifications, which participate in the fine-tuning and tight control of ribosome and protein synthesis processes. In eukaryotes, Trm112 acts as an obligate activating platform for at least four methyltransferases (MTase) involved in the modification of 18S rRNA (Bud23), tRNA (Trm9 and Trm11) and translation termination factor eRF1 (Mtq2). Trm112 is then at a nexus between ribosome synthesis and function. Here, we present a structure-function analysis of the Trm9-Trm112 complex, which is involved in the 5-methoxycarbonylmethyluridine (mcm5U) modification of the tRNA anticodon wobble position and hence promotes translational fidelity. We also compare the known crystal structures of various Trm112-MTase complexes, highlighting the structural plasticity allowing Trm112 to interact through a very similar mode with its MTase partners, although those share less than 20% sequence identity. PMID:26438534

  5. Solution structure of a DNA mimicking motif of an RNA aptamer against transcription factor AML1 Runt domain.

    PubMed

    Nomura, Yusuke; Tanaka, Yoichiro; Fukunaga, Jun-ichi; Fujiwara, Kazuya; Chiba, Manabu; Iibuchi, Hiroaki; Tanaka, Taku; Nakamura, Yoshikazu; Kawai, Gota; Kozu, Tomoko; Sakamoto, Taiichi

    2013-12-01

    AML1/RUNX1 is an essential transcription factor involved in the differentiation of hematopoietic cells. AML1 binds to the Runt-binding double-stranded DNA element (RDE) of target genes through its N-terminal Runt domain. In a previous study, we obtained RNA aptamers against the AML1 Runt domain by systematic evolution of ligands by exponential enrichment and revealed that RNA aptamers exhibit higher affinity for the Runt domain than that for RDE and possess the 5'-GCGMGNN-3' and 5'-N'N'CCAC-3' conserved motif (M: A or C; N and N' form Watson-Crick base pairs) that is important for Runt domain binding. In this study, to understand the structural basis of recognition of the Runt domain by the aptamer motif, the solution structure of a 22-mer RNA was determined using nuclear magnetic resonance. The motif contains the AH(+)-C mismatch and base triple and adopts an unusual backbone structure. Structural analysis of the aptamer motif indicated that the aptamer binds to the Runt domain by mimicking the RDE sequence and structure. Our data should enhance the understanding of the structural basis of DNA mimicry by RNA molecules.

  6. Direct detection of RNA in vitro and in situ by target-primed RCA: The impact of E. coli RNase III on the detection efficiency of RNA sequences distanced far from the 3'-end.

    PubMed

    Merkiene, Egle; Gaidamaviciute, Edita; Riauba, Laurynas; Janulaitis, Arvydas; Lagunavicius, Arunas

    2010-08-01

    We improved the target RNA-primed RCA technique for direct detection and analysis of RNA in vitro and in situ. Previously we showed that the 3' --> 5' single-stranded RNA exonucleolytic activity of Phi29 DNA polymerase converts the target RNA into a primer and uses it for RCA initiation. However, in some cases, the single-stranded RNA exoribonucleolytic activity of the polymerase is hindered by strong double-stranded structures at the 3'-end of target RNAs. We demonstrate that in such hampered cases, the double-stranded RNA-specific Escherichia coli RNase III efficiently assists Phi29 DNA polymerase in converting the target RNA into a primer. These observations extend the target RNA-primed RCA possibilities to test RNA sequences distanced far from the 3'-end and customize this technique for the inner RNA sequence analysis.

  7. Generation and development of RNA ligase ribozymes with modular architecture through "design and selection".

    PubMed

    Fujita, Yuki; Ishikawa, Junya; Furuta, Hiroyuki; Ikawa, Yoshiya

    2010-08-26

    In vitro selection with long random RNA libraries has been used as a powerful method to generate novel functional RNAs, although it often requires laborious structural analysis of isolated RNA molecules. Rational RNA design is an attractive alternative to avoid this laborious step, but rational design of catalytic modules is still a challenging task. A hybrid strategy of in vitro selection and rational design has been proposed. With this strategy termed "design and selection," new ribozymes can be generated through installation of catalytic modules onto RNA scaffolds with defined 3D structures. This approach, the concept of which was inspired by the modular architecture of naturally occurring ribozymes, allows prediction of the overall architectures of the resulting ribozymes, and the structural modularity of the resulting ribozymes allows modification of their structures and functions. In this review, we summarize the design, generation, properties, and engineering of four classes of ligase ribozyme generated by design and selection.

  8. Structural basis for viral 5′-PPP-RNA recognition by human IFIT proteins

    PubMed Central

    Abbas, Yazan M.; Pichlmair, Andreas; Górna, Maria W.; Superti-Furga, Giulio; Nagar, Bhushan

    2016-01-01

    IFIT proteins are interferon-inducible, innate immune effector molecules that are thought to confer antiviral defence through disruption of protein-protein interactions in the host translation initiation machinery. However, recently it was discovered that IFITs could directly recognize viral RNA bearing a 5′-triphosphate group (PPP-RNA), which is a molecular signature that distinguishes it from host RNA. Here, we report crystal structures of human IFIT5, its complex with PPP-RNAs, and an N-terminal fragment of IFIT1. The structures reveal a new helical domain that houses a positively charged cavity designed to specifically engage only single stranded PPP-RNA, thus distinguishing it from the canonical cytosolic sensor of double stranded viral PPP-RNA, RIG-I. Mutational analysis, proteolysis and gel-shift assays reveal that PPP-RNA is bound in a non-sequence specific manner and requires approximately a 3-nucleotide 5′-overhang. Abrogation of PPP-RNA binding in IFIT1 and IFIT5 were found to cause a defect in the anti-viral response by HEK cells. These results demonstrate the mechanism by which IFIT proteins selectively recognize viral RNA and lend insight into their downstream effector function. PMID:23334420

  9. Accurate prediction of RNA-binding protein residues with two discriminative structural descriptors.

    PubMed

    Sun, Meijian; Wang, Xia; Zou, Chuanxin; He, Zenghui; Liu, Wei; Li, Honglin

    2016-06-07

    RNA-binding proteins participate in many important biological processes concerning RNA-mediated gene regulation, and several computational methods have been recently developed to predict the protein-RNA interactions of RNA-binding proteins. Newly developed discriminative descriptors will help to improve the prediction accuracy of these prediction methods and provide further meaningful information for researchers. In this work, we designed two structural features (residue electrostatic surface potential and triplet interface propensity) and according to the statistical and structural analysis of protein-RNA complexes, the two features were powerful for identifying RNA-binding protein residues. Using these two features and other excellent structure- and sequence-based features, a random forest classifier was constructed to predict RNA-binding residues. The area under the receiver operating characteristic curve (AUC) of five-fold cross-validation for our method on training set RBP195 was 0.900, and when applied to the test set RBP68, the prediction accuracy (ACC) was 0.868, and the F-score was 0.631. The good prediction performance of our method revealed that the two newly designed descriptors could be discriminative for inferring protein residues interacting with RNAs. To facilitate the use of our method, a web-server called RNAProSite, which implements the proposed method, was constructed and is freely available at http://lilab.ecust.edu.cn/NABind .

  10. CryoEM structures of two spliceosomal complexes: starter and dessert at the spliceosome feast.

    PubMed

    Nguyen, Thi Hoang Duong; Galej, Wojciech P; Fica, Sebastian M; Lin, Pei-Chun; Newman, Andrew J; Nagai, Kiyoshi

    2016-02-01

    The spliceosome is formed on pre-mRNA substrates from five small nuclear ribonucleoprotein particles (U1, U2, U4/U6 and U5 snRNPs), and numerous non-snRNP factors. Saccharomyces cerevisiae U4/U6.U5 tri-snRNP comprises U5 snRNA, U4/U6 snRNA duplex and approximately 30 proteins and represents a substantial part of the spliceosome before activation. Schizosaccharomyces pombe U2.U6.U5 spliceosomal complex is a post-catalytic intron lariat spliceosome containing U2 and U5 snRNPs, NTC (nineteen complex), NTC-related proteins (NTR), U6 snRNA, and an RNA intron lariat. Two recent papers describe near-complete atomic structures of these complexes based on cryoEM single-particle analysis. The U4/U6.U5 tri-snRNP structure provides crucial insight into the activation mechanism of the spliceosome. The U2.U6.U5 complex reveals the striking architecture of NTC and NTR and important features of the group II intron-like catalytic RNA core remaining after spliced mRNA is released. These two structures greatly advance our understanding of the mechanism of pre-mRNA splicing. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.

  11. The ribosome uses two active mechanisms to unwind messenger RNA during translation.

    PubMed

    Qu, Xiaohui; Wen, Jin-Der; Lancaster, Laura; Noller, Harry F; Bustamante, Carlos; Tinoco, Ignacio

    2011-07-06

    The ribosome translates the genetic information encoded in messenger RNA into protein. Folded structures in the coding region of an mRNA represent a kinetic barrier that lowers the peptide elongation rate, as the ribosome must disrupt structures it encounters in the mRNA at its entry site to allow translocation to the next codon. Such structures are exploited by the cell to create diverse strategies for translation regulation, such as programmed frameshifting, the modulation of protein expression levels, ribosome localization and co-translational protein folding. Although strand separation activity is inherent to the ribosome, requiring no exogenous helicases, its mechanism is still unknown. Here, using a single-molecule optical tweezers assay on mRNA hairpins, we find that the translation rate of identical codons at the decoding centre is greatly influenced by the GC content of folded structures at the mRNA entry site. Furthermore, force applied to the ends of the hairpin to favour its unfolding significantly speeds translation. Quantitative analysis of the force dependence of its helicase activity reveals that the ribosome, unlike previously studied helicases, uses two distinct active mechanisms to unwind mRNA structure: it destabilizes the helical junction at the mRNA entry site by biasing its thermal fluctuations towards the open state, increasing the probability of the ribosome translocating unhindered; and it mechanically pulls apart the mRNA single strands of the closed junction during the conformational changes that accompany ribosome translocation. The second of these mechanisms ensures a minimal basal rate of translation in the cell; specialized, mechanically stable structures are required to stall the ribosome temporarily. Our results establish a quantitative mechanical basis for understanding the mechanism of regulation of the elongation rate of translation by structured mRNAs. ©2011 Macmillan Publishers Limited. All rights reserved

  12. RNA-seq mixology: designing realistic control experiments to compare protocols and analysis methods

    PubMed Central

    Holik, Aliaksei Z.; Law, Charity W.; Liu, Ruijie; Wang, Zeya; Wang, Wenyi; Ahn, Jaeil; Asselin-Labat, Marie-Liesse; Smyth, Gordon K.

    2017-01-01

    Abstract Carefully designed control experiments provide a gold standard for benchmarking different genomics research tools. A shortcoming of many gene expression control studies is that replication involves profiling the same reference RNA sample multiple times. This leads to low, pure technical noise that is atypical of regular studies. To achieve a more realistic noise structure, we generated a RNA-sequencing mixture experiment using two cell lines of the same cancer type. Variability was added by extracting RNA from independent cell cultures and degrading particular samples. The systematic gene expression changes induced by this design allowed benchmarking of different library preparation kits (standard poly-A versus total RNA with Ribozero depletion) and analysis pipelines. Data generated using the total RNA kit had more signal for introns and various RNA classes (ncRNA, snRNA, snoRNA) and less variability after degradation. For differential expression analysis, voom with quality weights marginally outperformed other popular methods, while for differential splicing, DEXSeq was simultaneously the most sensitive and the most inconsistent method. For sample deconvolution analysis, DeMix outperformed IsoPure convincingly. Our RNA-sequencing data set provides a valuable resource for benchmarking different protocols and data pre-processing workflows. The extra noise mimics routine lab experiments more closely, ensuring any conclusions are widely applicable. PMID:27899618

  13. Selective 2′-hydroxyl acylation analyzed by primer extension and mutational profiling (SHAPE-MaP) for direct, versatile, and accurate RNA structure analysis

    PubMed Central

    Smola, Matthew J.; Rice, Greggory M.; Busan, Steven; Siegfried, Nathan A.; Weeks, Kevin M.

    2016-01-01

    SHAPE chemistries exploit small electrophilic reagents that react with the 2′-hydroxyl group to interrogate RNA structure at single-nucleotide resolution. Mutational profiling (MaP) identifies modified residues based on the ability of reverse transcriptase to misread a SHAPE-modified nucleotide and then counting the resulting mutations by massively parallel sequencing. The SHAPE-MaP approach measures the structure of large and transcriptome-wide systems as accurately as for simple model RNAs. This protocol describes the experimental steps, implemented over three days, required to perform SHAPE probing and construct multiplexed SHAPE-MaP libraries suitable for deep sequencing. These steps include RNA folding and SHAPE structure probing, mutational profiling by reverse transcription, library construction, and sequencing. Automated processing of MaP sequencing data is accomplished using two software packages. ShapeMapper converts raw sequencing files into mutational profiles, creates SHAPE reactivity plots, and provides useful troubleshooting information, often within an hour. SuperFold uses these data to model RNA secondary structures, identify regions with well-defined structures, and visualize probable and alternative helices, often in under a day. We illustrate these algorithms with the E. coli thiamine pyrophosphate riboswitch, E. coli 16S rRNA, and HIV-1 genomic RNAs. SHAPE-MaP can be used to make nucleotide-resolution biophysical measurements of individual RNA motifs, rare components of complex RNA ensembles, and entire transcriptomes. The straightforward MaP strategy greatly expands the number, length, and complexity of analyzable RNA structures. PMID:26426499

  14. Visualizing the global secondary structure of a viral RNA genome with cryo-electron microscopy

    PubMed Central

    Garmann, Rees F.; Gopal, Ajaykumar; Athavale, Shreyas S.; Knobler, Charles M.; Gelbart, William M.; Harvey, Stephen C.

    2015-01-01

    The lifecycle, and therefore the virulence, of single-stranded (ss)-RNA viruses is regulated not only by their particular protein gene products, but also by the secondary and tertiary structure of their genomes. The secondary structure of the entire genomic RNA of satellite tobacco mosaic virus (STMV) was recently determined by selective 2′-hydroxyl acylation analyzed by primer extension (SHAPE). The SHAPE analysis suggested a single highly extended secondary structure with much less branching than occurs in the ensemble of structures predicted by purely thermodynamic algorithms. Here we examine the solution-equilibrated STMV genome by direct visualization with cryo-electron microscopy (cryo-EM), using an RNA of similar length transcribed from the yeast genome as a control. The cryo-EM data reveal an ensemble of branching patterns that are collectively consistent with the SHAPE-derived secondary structure model. Thus, our results both elucidate the statistical nature of the secondary structure of large ss-RNAs and give visual support for modern RNA structure determination methods. Additionally, this work introduces cryo-EM as a means to distinguish between competing secondary structure models if the models differ significantly in terms of the number and/or length of branches. Furthermore, with the latest advances in cryo-EM technology, we suggest the possibility of developing methods that incorporate restraints from cryo-EM into the next generation of algorithms for the determination of RNA secondary and tertiary structures. PMID:25752599

  15. A Sequence-Independent, Unstructured Internal Ribosome Entry Site Is Responsible for Internal Expression of the Coat Protein of Turnip Crinkle Virus

    PubMed Central

    May, Jared; Johnson, Philip; Saleem, Huma

    2017-01-01

    ABSTRACT To maximize the coding potential of viral genomes, internal ribosome entry sites (IRES) can be used to bypass the traditional requirement of a 5′ cap and some/all of the associated translation initiation factors. Although viral IRES typically contain higher-order RNA structure, an unstructured sequence of about 84 nucleotides (nt) immediately upstream of the Turnip crinkle virus (TCV) coat protein (CP) open reading frame (ORF) has been found to promote internal expression of the CP from the genomic RNA (gRNA) both in vitro and in vivo. An absence of extensive RNA structure was predicted using RNA folding algorithms and confirmed by selective 2′-hydroxyl acylation analyzed by primer extension (SHAPE) RNA structure probing. Analysis of the IRES region in vitro by use of both the TCV gRNA and reporter constructs did not reveal any sequence-specific elements but rather suggested that an overall lack of structure was an important feature for IRES activity. The CP IRES is A-rich, independent of orientation, and strongly conserved among viruses in the same genus. The IRES was dependent on eIF4G, but not eIF4E, for activity. Low levels of CP accumulated in vivo in the absence of detectable TCV subgenomic RNAs, strongly suggesting that the IRES was active in the gRNA in vivo. Since the TCV CP also serves as the viral silencing suppressor, early translation of the CP from the viral gRNA is likely important for countering host defenses. Cellular mRNA IRES also lack extensive RNA structures or sequence conservation, suggesting that this viral IRES and cellular IRES may have similar strategies for internal translation initiation. IMPORTANCE Cap-independent translation is a common strategy among positive-sense, single-stranded RNA viruses for bypassing the host cell requirement of a 5′ cap structure. Viral IRES, in general, contain extensive secondary structure that is critical for activity. In contrast, we demonstrate that a region of viral RNA devoid of extensive secondary structure has IRES activity and produces low levels of viral coat protein in vitro and in vivo. Our findings may be applicable to cellular mRNA IRES that also have little or no sequences/structures in common. PMID:28179526

  16. Annealing to sequences within the primer binding site loop promotes an HIV-1 RNA conformation favoring RNA dimerization and packaging

    PubMed Central

    Seif, Elias; Niu, Meijuan; Kleiman, Lawrence

    2013-01-01

    The 5′ untranslated region (5′ UTR) of HIV-1 genomic RNA (gRNA) includes structural elements that regulate reverse transcription, transcription, translation, tRNALys3 annealing to the gRNA, and gRNA dimerization and packaging into viruses. It has been reported that gRNA dimerization and packaging are regulated by changes in the conformation of the 5′-UTR RNA. In this study, we show that annealing of tRNALys3 or a DNA oligomer complementary to sequences within the primer binding site (PBS) loop of the 5′ UTR enhances its dimerization in vitro. Structural analysis of the 5′-UTR RNA using selective 2′-hydroxyl acylation analyzed by primer extension (SHAPE) shows that the annealing promotes a conformational change of the 5′ UTR that has been previously reported to favor gRNA dimerization and packaging into virus. The model predicted by SHAPE analysis is supported by antisense experiments designed to test which annealed sequences will promote or inhibit gRNA dimerization. Based on reports showing that the gRNA dimerization favors its incorporation into viruses, we tested the ability of a mutant gRNA unable to anneal to tRNALys3 to be incorporated into virions. We found a ∼60% decrease in mutant gRNA packaging compared with wild-type gRNA. Together, these data further support a model for viral assembly in which the initial annealing of tRNALys3 to gRNA is cytoplasmic, which in turn aids in the promotion of gRNA dimerization and its incorporation into virions. PMID:23960173

  17. In-cell RNA structure probing with SHAPE-MaP.

    PubMed

    Smola, Matthew J; Weeks, Kevin M

    2018-06-01

    This protocol is an extension to: Nat. Protoc. 10, 1643-1669 (2015); doi:10.1038/nprot.2015.103; published online 01 October 2015RNAs play key roles in many cellular processes. The underlying structure of RNA is an important determinant of how transcripts function, are processed, and interact with RNA-binding proteins and ligands. RNA structure analysis by selective 2'-hydroxyl acylation analyzed by primer extension (SHAPE) takes advantage of the reactivity of small electrophilic chemical probes that react with the 2'-hydroxyl group to assess RNA structure at nucleotide resolution. When coupled with mutational profiling (MaP), in which modified nucleotides are detected as internal miscodings during reverse transcription and then read out by massively parallel sequencing, SHAPE yields quantitative per-nucleotide measurements of RNA structure. Here, we provide an extension to our previous in vitro SHAPE-MaP protocol with detailed guidance for undertaking and analyzing SHAPE-MaP probing experiments in live cells. The MaP strategy works for both abundant-transcriptome experiments and for cellular RNAs of low to moderate abundance, which are not well examined by whole-transcriptome methods. In-cell SHAPE-MaP, performed in roughly 3 d, can be applied in cell types ranging from bacteria to cultured mammalian cells and is compatible with a variety of structure-probing reagents. We detail several strategies by which in-cell SHAPE-MaP can inform new biological hypotheses and emphasize downstream analyses that reveal sequence or structure motifs important for RNA interactions in cells.

  18. Molecular dynamics simulations of viral RNA polymerases link conserved and correlated motions of functional elements to fidelity

    PubMed Central

    Moustafa, Ibrahim M.; Shen, Hujun; Morton, Brandon; Colina, Coray M.; Cameron, Craig E.

    2011-01-01

    The viral RNA-dependent RNA polymerase (RdRp) is essential for multiplication of all RNA viruses. The sequence diversity of an RNA virus population contributes to its ability to infect the host. This diversity emanates from errors made by the RdRp during RNA synthesis. The physical basis for RdRp fidelity is unclear but is linked to conformational changes occurring during the nucleotide-addition cycle. To understand RdRp dynamics that might influence RdRp function, we have analyzed all-atom molecular dynamics (MD) simulations on the nanosecond timescale of four RdRps from the picornavirus family that exhibit 30–74% sequence identity. Principal component analysis showed that the major motions observed during the simulations derived from conserved structural motifs and regions of known function. Dynamics of residues participating in the same biochemical property, for example RNA binding, nucleotide binding or catalysis, were correlated even when spatially distant on the RdRp structure. The conserved and correlated dynamics of functional, structural elements suggest co-evolution of dynamics with structure and function of the RdRp. Crystal structures of all picornavirus RdRps exhibit a template-nascent RNA duplex channel too small to fully accommodate duplex RNA. Simulations revealed opening and closing motions of the RNA and NTP channels, which might be relevant to NTP entry, PPi exit and translocation. A role for nanosecond timescale dynamics in RdRp fidelity is supported by altered dynamics of the high-fidelity G64S derivative of PV RdRp relative to wild-type enzyme. PMID:21575642

  19. Dual RNA regulatory control of a Staphylococcus aureus virulence factor.

    PubMed

    Chabelskaya, Svetlana; Bordeau, Valérie; Felden, Brice

    2014-04-01

    In pathogens, the accurate programming of virulence gene expression is essential for infection. It is achieved by sophisticated arrays of regulatory proteins and ribonucleic acids (sRNAs), but in many cases their contributions and connections are not yet known. Based on genetic, biochemical and structural evidence, we report that the expression pattern of a Staphylococcus aureus host immune evasion protein is enabled by the collaborative actions of RNAIII and small pathogenicity island RNA D (SprD). Their combined expression profiles during bacterial growth permit early and transient synthesis of Sbi to avoid host immune responses. Together, these two sRNAs use antisense mechanisms to monitor Sbi expression at the translational level. Deletion analysis combined with structural analysis of RNAIII in complex with its novel messenger RNA (mRNA) target indicate that three distant RNAIII domains interact with distinct sites of the sbi mRNA and that two locations are deep in the sbi coding region. Through distinct domains, RNAIII lowers production of two proteins required for avoiding innate host immunity, staphylococcal protein A and Sbi. Toeprints and in vivo mutational analysis reveal a novel regulatory module within RNAIII essential for attenuation of Sbi translation. The sophisticated translational control of mRNA by two differentially expressed sRNAs ensures supervision of host immune escape by a major pathogen.

  20. Spliced leader RNA trans-splicing discovered in copepods

    NASA Astrophysics Data System (ADS)

    Yang, Feifei; Xu, Donghui; Zhuang, Yunyun; Yi, Xiaoyan; Huang, Yousong; Chen, Hongju; Lin, Senjie; Campbell, David A.; Sturm, Nancy R.; Liu, Guangxing; Zhang, Huan

    2015-12-01

    Copepods are one of the most abundant metazoans in the marine ecosystem, constituting a critical link in aquatic food webs and contributing significantly to the global carbon budget, yet molecular mechanisms of their gene expression are not well understood. Here we report the detection of spliced leader (SL) trans-splicing in calanoid copepods. We have examined nine species of wild-caught copepods from Jiaozhou Bay, China that represent the major families of the calanoids. All these species contained a common 46-nt SL (CopepodSL). We further determined the size of CopepodSL precursor RNA (slRNA; 108-158 nt) through genomic analysis and 3‧-RACE technique, which was confirmed by RNA blot analysis. Structure modeling showed that the copepod slRNA folded into typical slRNA secondary structures. Using a CopepodSL-based primer set, we selectively enriched and sequenced copepod full-length cDNAs, which led to the characterization of copepod transcripts and the cataloging of the complete set of 79 eukaryotic cytoplasmic ribosomal proteins (cRPs) for a single copepod species. We uncovered the SL trans-splicing in copepod natural populations, and demonstrated that CopepodSL was a sensitive and specific tool for copepod transcriptomic studies at both the individual and population levels and that it would be useful for metatranscriptomic analysis of copepods.

  1. Structural similarities and functional differences clarify evolutionary relationships between tRNA healing enzymes and the myelin enzyme CNPase.

    PubMed

    Muruganandam, Gopinath; Raasakka, Arne; Myllykoski, Matti; Kursula, Inari; Kursula, Petri

    2017-05-16

    Eukaryotic tRNA splicing is an essential process in the transformation of a primary tRNA transcript into a mature functional tRNA molecule. 5'-phosphate ligation involves two steps: a healing reaction catalyzed by polynucleotide kinase (PNK) in association with cyclic phosphodiesterase (CPDase), and a sealing reaction catalyzed by an RNA ligase. The enzymes that catalyze tRNA healing in yeast and higher eukaryotes are homologous to the members of the 2H phosphoesterase superfamily, in particular to the vertebrate myelin enzyme 2',3'-cyclic nucleotide 3'-phosphodiesterase (CNPase). We employed different biophysical and biochemical methods to elucidate the overall structural and functional features of the tRNA healing enzymes yeast Trl1 PNK/CPDase and lancelet PNK/CPDase and compared them with vertebrate CNPase. The yeast and the lancelet enzymes have cyclic phosphodiesterase and polynucleotide kinase activity, while vertebrate CNPase lacks PNK activity. In addition, we also show that the healing enzymes are structurally similar to the vertebrate CNPase by applying synchrotron radiation circular dichroism spectroscopy and small-angle X-ray scattering. We provide a structural analysis of the tRNA healing enzyme PNK and CPDase domains together. Our results support evolution of vertebrate CNPase from tRNA healing enzymes with a loss of function at its N-terminal PNK-like domain.

  2. Probing the structural dynamics of the CRISPR-Cas9 RNA-guided DNA-cleavage system by coarse-grained modeling.

    PubMed

    Zheng, Wenjun

    2017-02-01

    In the adaptive immune systems of many bacteria and archaea, the Cas9 endonuclease forms a complex with specific guide/scaffold RNA to identify and cleave complementary target sequences in foreign DNA. This DNA targeting machinery has been exploited in numerous applications of genome editing and transcription control. However, the molecular mechanism of the Cas9 system is still obscure. Recently, high-resolution structures have been solved for Cas9 in different structural forms (e.g., unbound forms, RNA-bound binary complexes, and RNA-DNA-bound tertiary complexes, corresponding to an inactive state, a pre-target-bound state, and a cleavage-competent or product state), which offered key structural insights to the Cas9 mechanism. To further probe the structural dynamics of Cas9 interacting with RNA and DNA at the amino-acid level of details, we have performed systematic coarse-grained modeling using an elastic network model and related analyses. Our normal mode analysis predicted a few key modes of collective motions that capture the observed conformational changes featuring large domain motions triggered by binding of RNA and DNA. Our flexibility analysis identified specific regions with high or low flexibility that coincide with key functional sites (such as DNA/RNA-binding sites, nuclease cleavage sites, and key hinges). We also identified a small set of hotspot residues that control the energetics of functional motions, which overlap with known functional sites and offer promising targets for future mutagenesis efforts to improve the specificity of Cas9. Finally, we modeled the conformational transitions of Cas9 from the unbound form to the binary complex and then the tertiary complex, and predicted a distinct sequence of domain motions. In sum, our findings have offered rich structural and dynamic details relevant to the Cas9 machinery, and will guide future investigation and engineering of the Cas9 systems. Proteins 2017; 85:342-353. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.

  3. Steric interactions lead to collective tilting motion in the ribosome during mRNA-tRNA translocation

    NASA Astrophysics Data System (ADS)

    Nguyen, Kien; Whitford, Paul C.

    2016-02-01

    Translocation of mRNA and tRNA through the ribosome is associated with large-scale rearrangements of the head domain in the 30S ribosomal subunit. To elucidate the relationship between 30S head dynamics and mRNA-tRNA displacement, we apply molecular dynamics simulations using an all-atom structure-based model. Here we provide a statistical analysis of 250 spontaneous transitions between the A/P-P/E and P/P-E/E ensembles. Consistent with structural studies, the ribosome samples a chimeric ap/P-pe/E intermediate, where the 30S head is rotated ~18°. It then transiently populates a previously unreported intermediate ensemble, which is characterized by a ~10° tilt of the head. To identify the origins of head tilting, we analyse 781 additional simulations in which specific steric features are perturbed. These calculations show that head tilting may be attributed to specific steric interactions between tRNA and the 30S subunit (PE loop and protein S13). Taken together, this study demonstrates how molecular structure can give rise to large-scale collective rearrangements.

  4. Designing synthetic RNA for delivery by nanoparticles

    NASA Astrophysics Data System (ADS)

    Jedrzejczyk, Dominika; Gendaszewska-Darmach, Edyta; Pawlowska, Roza; Chworos, Arkadiusz

    2017-03-01

    The rapid development of synthetic biology and nanobiotechnology has led to the construction of various synthetic RNA nanoparticles of different functionalities and potential applications. As they occur naturally, nucleic acids are an attractive construction material for biocompatible nanoscaffold and nanomachine design. In this review, we provide an overview of the types of RNA and nucleic acid’s nanoparticle design, with the focus on relevant nanostructures utilized for gene-expression regulation in cellular models. Structural analysis and modeling is addressed along with the tools available for RNA structural prediction. The functionalization of RNA-based nanoparticles leading to prospective applications of such constructs in potential therapies is shown. The route from the nanoparticle design and modeling through synthesis and functionalization to cellular application is also described. For a better understanding of the fate of targeted RNA after delivery, an overview of RNA processing inside the cell is also provided.

  5. RNase MRP and the RNA processing cascade in the eukaryotic ancestor.

    PubMed

    Woodhams, Michael D; Stadler, Peter F; Penny, David; Collins, Lesley J

    2007-02-08

    Within eukaryotes there is a complex cascade of RNA-based macromolecules that process other RNA molecules, especially mRNA, tRNA and rRNA. An example is RNase MRP processing ribosomal RNA (rRNA) in ribosome biogenesis. One hypothesis is that this complexity was present early in eukaryotic evolution; an alternative is that an initial simpler network later gained complexity by gene duplication in lineages that led to animals, fungi and plants. Recently there has been a rapid increase in support for the complexity-early theory because the vast majority of these RNA-processing reactions are found throughout eukaryotes, and thus were likely to be present in the last common ancestor of living eukaryotes, herein called the Eukaryotic Ancestor. We present an overview of the RNA processing cascade in the Eukaryotic Ancestor and investigate in particular, RNase MRP which was previously thought to have evolved later in eukaryotes due to its apparent limited distribution in fungi and animals and plants. Recent publications, as well as our own genomic searches, find previously unknown RNase MRP RNAs, indicating that RNase MRP has a wide distribution in eukaryotes. Combining secondary structure and promoter region analysis of RNAs for RNase MRP, along with analysis of the target substrate (rRNA), allows us to discuss this distribution in the light of eukaryotic evolution. We conclude that RNase MRP can now be placed in the RNA-processing cascade of the Eukaryotic Ancestor, highlighting the complexity of RNA-processing in early eukaryotes. Promoter analyses of MRP-RNA suggest that regulation of the critical processes of rRNA cleavage can vary, showing that even these key cellular processes (for which we expect high conservation) show some species-specific variability. We present our consensus MRP-RNA secondary structure as a useful model for further searches.

  6. Protein structure and the sequential structure of mRNA: alpha-helix and beta-sheet signals at the nucleotide level.

    PubMed

    Brunak, S; Engelbrecht, J

    1996-06-01

    A direct comparison of experimentally determined protein structures and their corresponding protein coding mRNA sequences has been performed. We examine whether real world data support the hypothesis that clusters of rare codons correlate with the location of structural units in the resulting protein. The degeneracy of the genetic code allows for a biased selection of codons which may control the translational rate of the ribosome, and may thus in vivo have a catalyzing effect on the folding of the polypeptide chain. A complete search for GenBank nucleotide sequences coding for structural entries in the Brookhaven Protein Data Bank produced 719 protein chains with matching mRNA sequence, amino acid sequence, and secondary structure assignment. By neural network analysis, we found strong signals in mRNA sequence regions surrounding helices and sheets. These signals do not originate from the clustering of rare codons, but from the similarity of codons coding for very abundant amino acid residues at the N- and C-termini of helices and sheets. No correlation between the positioning of rare codons and the location of structural units was found. The mRNA signals were also compared with conserved nucleotide features of 16S-like ribosomal RNA sequences and related to mechanisms for maintaining the correct reading frame by the ribosome.

  7. Lessons on RNA Silencing Mechanisms in Plants from Eukaryotic Argonaute Structures[W

    PubMed Central

    Poulsen, Christian; Vaucheret, Hervé; Brodersen, Peter

    2013-01-01

    RNA silencing refers to a collection of gene regulatory mechanisms that use small RNAs for sequence specific repression. These mechanisms rely on ARGONAUTE (AGO) proteins that directly bind small RNAs and thereby constitute the central component of the RNA-induced silencing complex (RISC). AGO protein function has been probed extensively by mutational analyses, particularly in plants where large allelic series of several AGO proteins have been isolated. Structures of entire human and yeast AGO proteins have only very recently been obtained, and they allow more precise analyses of functional consequences of mutations obtained by forward genetics. To a large extent, these analyses support current models of regions of particular functional importance of AGO proteins. Interestingly, they also identify previously unrecognized parts of AGO proteins with profound structural and functional importance and provide the first hints at structural elements that have important functions specific to individual AGO family members. A particularly important outcome of the analysis concerns the evidence for existence of Gly-Trp (GW) repeat interactors of AGO proteins acting in the plant microRNA pathway. The parallel analysis of AGO structures and plant AGO mutations also suggests that such interactions with GW proteins may be a determinant of whether an endonucleolytically competent RISC is formed. PMID:23303917

  8. Lessons on RNA silencing mechanisms in plants from eukaryotic argonaute structures.

    PubMed

    Poulsen, Christian; Vaucheret, Hervé; Brodersen, Peter

    2013-01-01

    RNA silencing refers to a collection of gene regulatory mechanisms that use small RNAs for sequence specific repression. These mechanisms rely on ARGONAUTE (AGO) proteins that directly bind small RNAs and thereby constitute the central component of the RNA-induced silencing complex (RISC). AGO protein function has been probed extensively by mutational analyses, particularly in plants where large allelic series of several AGO proteins have been isolated. Structures of entire human and yeast AGO proteins have only very recently been obtained, and they allow more precise analyses of functional consequences of mutations obtained by forward genetics. To a large extent, these analyses support current models of regions of particular functional importance of AGO proteins. Interestingly, they also identify previously unrecognized parts of AGO proteins with profound structural and functional importance and provide the first hints at structural elements that have important functions specific to individual AGO family members. A particularly important outcome of the analysis concerns the evidence for existence of Gly-Trp (GW) repeat interactors of AGO proteins acting in the plant microRNA pathway. The parallel analysis of AGO structures and plant AGO mutations also suggests that such interactions with GW proteins may be a determinant of whether an endonucleolytically competent RISC is formed.

  9. Special Focus

    PubMed Central

    Nawrocki, Eric P.; Burge, Sarah W.

    2013-01-01

    The development of RNA bioinformatic tools began more than 30 y ago with the description of the Nussinov and Zuker dynamic programming algorithms for single sequence RNA secondary structure prediction. Since then, many tools have been developed for various RNA sequence analysis problems such as homology search, multiple sequence alignment, de novo RNA discovery, read-mapping, and many more. In this issue, we have collected a sampling of reviews and original research that demonstrate some of the many ways bioinformatics is integrated with current RNA biology research. PMID:23948768

  10. TBI server: a web server for predicting ion effects in RNA folding.

    PubMed

    Zhu, Yuhong; He, Zhaojian; Chen, Shi-Jie

    2015-01-01

    Metal ions play a critical role in the stabilization of RNA structures. Therefore, accurate prediction of the ion effects in RNA folding can have a far-reaching impact on our understanding of RNA structure and function. Multivalent ions, especially Mg²⁺, are essential for RNA tertiary structure formation. These ions can possibly become strongly correlated in the close vicinity of RNA surface. Most of the currently available software packages, which have widespread success in predicting ion effects in biomolecular systems, however, do not explicitly account for the ion correlation effect. Therefore, it is important to develop a software package/web server for the prediction of ion electrostatics in RNA folding by including ion correlation effects. The TBI web server http://rna.physics.missouri.edu/tbi_index.html provides predictions for the total electrostatic free energy, the different free energy components, and the mean number and the most probable distributions of the bound ions. A novel feature of the TBI server is its ability to account for ion correlation and ion distribution fluctuation effects. By accounting for the ion correlation and fluctuation effects, the TBI server is a unique online tool for computing ion-mediated electrostatic properties for given RNA structures. The results can provide important data for in-depth analysis for ion effects in RNA folding including the ion-dependence of folding stability, ion uptake in the folding process, and the interplay between the different energetic components.

  11. Covariant Evolutionary Event Analysis for Base Interaction Prediction Using a Relational Database Management System for RNA.

    PubMed

    Xu, Weijia; Ozer, Stuart; Gutell, Robin R

    2009-01-01

    With an increasingly large amount of sequences properly aligned, comparative sequence analysis can accurately identify not only common structures formed by standard base pairing but also new types of structural elements and constraints. However, traditional methods are too computationally expensive to perform well on large scale alignment and less effective with the sequences from diversified phylogenetic classifications. We propose a new approach that utilizes coevolutional rates among pairs of nucleotide positions using phylogenetic and evolutionary relationships of the organisms of aligned sequences. With a novel data schema to manage relevant information within a relational database, our method, implemented with a Microsoft SQL Server 2005, showed 90% sensitivity in identifying base pair interactions among 16S ribosomal RNA sequences from Bacteria, at a scale 40 times bigger and 50% better sensitivity than a previous study. The results also indicated covariation signals for a few sets of cross-strand base stacking pairs in secondary structure helices, and other subtle constraints in the RNA structure.

  12. Covariant Evolutionary Event Analysis for Base Interaction Prediction Using a Relational Database Management System for RNA

    PubMed Central

    Xu, Weijia; Ozer, Stuart; Gutell, Robin R.

    2010-01-01

    With an increasingly large amount of sequences properly aligned, comparative sequence analysis can accurately identify not only common structures formed by standard base pairing but also new types of structural elements and constraints. However, traditional methods are too computationally expensive to perform well on large scale alignment and less effective with the sequences from diversified phylogenetic classifications. We propose a new approach that utilizes coevolutional rates among pairs of nucleotide positions using phylogenetic and evolutionary relationships of the organisms of aligned sequences. With a novel data schema to manage relevant information within a relational database, our method, implemented with a Microsoft SQL Server 2005, showed 90% sensitivity in identifying base pair interactions among 16S ribosomal RNA sequences from Bacteria, at a scale 40 times bigger and 50% better sensitivity than a previous study. The results also indicated covariation signals for a few sets of cross-strand base stacking pairs in secondary structure helices, and other subtle constraints in the RNA structure. PMID:20502534

  13. Structural variations of single and tandem mismatches in RNA duplexes: a joint MD simulation and crystal structure database analysis.

    PubMed

    Halder, Sukanya; Bhattacharyya, Dhananjay

    2012-10-04

    Internal loops within RNA duplex regions are formed by single or tandem basepairing mismatches with flanking canonical Watson-Crick basepairs on both sides. They are the most common motif observed in RNA secondary structures and play integral functional and structural roles. In this report, we have studied the structural features of 1 × 1, 2 × 2, and 3 × 3 internal loops using all-atom molecular dynamics (MD) simulation technique with explicit solvent model. As MD simulation is intricately dependent on the choice of force-field and these are often rather approximate, we have used both the most popular force-fields for nucleic acids-CHARMM27 and AMBER94-for a comparative analysis. We find that tandem noncanonical basepairs forming 2 × 2 and 3 × 3 internal loops are considerably more stable than the single mismatches forming 1 × 1 internal loops, irrespective of the force field. We have also analyzed crystal structure database to study the conservation of these helical fragments in the corresponding sets of RNA structures. We observe that the nature of stability in MD simulations mimic their fluctuating natures in crystal data sets also, probably indicating reliable natures of both the force fields to reproduce experimental results. We also notice significant structural changes in the wobble G:U basepairs present in these double helical stretches, leading to a biphasic stability for these wobble pairs to release the deformational strains introduced by internal loops within duplex regions.

  14. Biomarker MicroRNAs for Diagnosis of Oral Squamous Cell Carcinoma Identified Based on Gene Expression Data and MicroRNA-mRNA Network Analysis

    PubMed Central

    Zhang, Hui; Li, Tangxin; Zheng, Linqing

    2017-01-01

    Oral squamous cell carcinoma is one of the most malignant tumors with high mortality rate worldwide. Biomarker discovery is critical for early diagnosis and precision treatment of this disease. MicroRNAs are small noncoding RNA molecules which often regulate essential biological processes and are good candidates for biomarkers. By integrative analysis of both the cancer-associated gene expression data and microRNA-mRNA network, miR-148b-3p, miR-629-3p, miR-27a-3p, and miR-142-3p were screened as novel diagnostic biomarkers for oral squamous cell carcinoma based on their unique regulatory abilities in the network structure of the conditional microRNA-mRNA network and their important functions. These findings were confirmed by literature verification and functional enrichment analysis. Future experimental validation is expected for the further investigation of their molecular mechanisms. PMID:29098014

  15. A mRNA-Responsive G-Quadruplex-Based Drug Release System

    PubMed Central

    Yaku, Hidenobu; Murashima, Takashi; Miyoshi, Daisuke; Sugimoto, Naoki

    2015-01-01

    G-quadruplex-based drug delivery carriers (GDDCs) were designed to capture and release a telomerase inhibitor in response to a target mRNA. Hybridization between a loop on the GDDC structure and the mRNA should cause the G-quadruplex structure of the GDDC to unfold and release the bound inhibitor, anionic copper(II) phthalocyanine (CuAPC). As a proof of concept, GDDCs were designed with a 10-30-mer loop, which can hybridize with a target sequence in epidermal growth factor receptor (EGFR) mRNA. Structural analysis using circular dichroism (CD) spectroscopy showed that the GDDCs form a (3 + 1) type G-quadruplex structure in 100 mM KCl and 10 mM MgCl2 in the absence of the target RNA. Visible absorbance titration experiments showed that the GDDCs bind to CuAPC with Ka values of 1.5 × 105 to 5.9 × 105 M−1 (Kd values of 6.7 to 1.7 μM) at 25 °C, depending on the loop length. Fluorescence titration further showed that the G-quadruplex structure unfolds upon binding to the target RNA with Ka values above 1.0 × 108 M−1 (Kd values below 0.01 μM) at 25 °C. These results suggest the carrier can sense and bind to the target RNA, which should result in release of the bound drug. Finally, visible absorbance titration experiments demonstrated that the GDDC release CuAPC in response to the target RNA. PMID:25905703

  16. In vivo tmRNA protection by SmpB and pre-ribosome binding conformation in solution.

    PubMed

    Ranaei-Siadat, Ehsan; Mérigoux, Cécile; Seijo, Bili; Ponchon, Luc; Saliou, Jean-Michel; Bernauer, Julie; Sanglier-Cianférani, Sarah; Dardel, Fréderic; Vachette, Patrice; Nonin-Lecomte, Sylvie

    2014-10-01

    TmRNA is an abundant RNA in bacteria with tRNA and mRNA features. It is specialized in trans-translation, a translation rescuing system. We demonstrate that its partner protein SmpB binds the tRNA-like region (TLD) in vivo and chaperones the fold of the TLD-H2 region. We use an original approach combining the observation of tmRNA degradation pathways in a heterologous system, the analysis of the tmRNA digests by MS and NMR, and co-overproduction assays of tmRNA and SmpB. We study the conformation in solution of tmRNA alone or in complex with one SmpB before ribosome binding using SAXS. Our data show that Mg(2+) drives compaction of the RNA structure and that, in the absence of Mg(2+), SmpB has a similar effect albeit to a lesser extent. Our results show that tmRNA is intrinsically structured in solution with identical topology to that observed on complexes on ribosomes which should facilitate its subsequent recruitment by the 70S ribosome, free or preloaded with one SmpB molecule. © 2014 Ranaei-Siadat et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  17. Structural and Functional Analysis of an mRNP Complex That Mediates the High Stability of Human β-Globin mRNA

    PubMed Central

    Yu, Jia; Russell, J. Eric

    2001-01-01

    Human globins are encoded by mRNAs exhibiting high stabilities in transcriptionally silenced erythrocyte progenitors. Unlike α-globin mRNA, whose stability is enhanced by assembly of a specific messenger RNP (mRNP) α complex on its 3′ untranslated region (UTR), neither the structure(s) nor the mechanism(s) that effects the high-level stability of human β-globin mRNA has been identified. The present work describes an mRNP complex assembling on the 3′ UTR of the β-globin mRNA that exhibits many of the properties of the stability-enhancing α complex. The β-globin mRNP complex is shown to contain one or more factors homologous to αCP, a 39-kDa RNA-binding protein that is integral to α-complex assembly. Sequence analysis implicates a specific 14-nucleotide pyrimidine-rich track within its 3′ UTR as the site of β-globin mRNP assembly. The importance of this track to mRNA stability is subsequently verified in vivo using mice expressing human β-globin transgenes that contain informative mutations in this region. In combination, the in vitro and in vivo analyses indicate that the high stabilities of the α- and β-globin mRNAs are maintained through related mRNP complexes that may share a common regulatory pathway. PMID:11486027

  18. HCV RNA traffic and association with NS5A in living cells

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Fiches, Guillaume N.; Eyre, Nicholas S.; Aloia, Amanda L.

    The spatiotemporal dynamics of Hepatitis C Virus (HCV) RNA localisation are poorly understood. To address this we engineered HCV genomes harbouring MS2 bacteriophage RNA stem-loops within the 3′-untranslated region to allow tracking of HCV RNA via specific interaction with a MS2-Coat-mCherry fusion protein. Despite the impact of these insertions on viral fitness, live imaging revealed that replication of tagged-HCV genomes induced specific redistribution of the mCherry-tagged-MS2-Coat protein to motile and static foci. Further analysis showed that HCV RNA was associated with NS5A in both static and motile structures while a subset of motile NS5A structures was devoid of HCV RNA.more » Further investigation of viral RNA traffic with respect to lipid droplets (LDs) revealed HCV RNA-positive structures in close association with LDs. These studies provide new insights into the dynamics of HCV RNA traffic with NS5A and LDs and provide a platform for future investigations of HCV replication and assembly. - Highlights: • HCV can tolerate can bacteriophage MS2 stem-loop insertions within the 3′ UTR. • MS2 stem-loop containing HCV genomes allow for real-time imaging of HCV RNA. • HCV RNA is both static and motile and associates with NS5A and lipid droplets.« less

  19. Expanding RNA binding specificity and affinity of engineered PUF domains.

    PubMed

    Zhao, Yang-Yang; Mao, Miao-Wei; Zhang, Wen-Jing; Wang, Jue; Li, Hai-Tao; Yang, Yi; Wang, Zefeng; Wu, Jia-Wei

    2018-05-18

    Specific manipulation of RNA is necessary for the research in biotechnology and medicine. The RNA-binding domains of Pumilio/fem-3 mRNA binding factors (PUF domains) are programmable RNA binding scaffolds used to engineer artificial proteins that specifically modulate RNAs. However, the native PUF domains generally recognize 8-nt RNAs, limiting their applications. Here, we modify the PUF domain of human Pumilio1 to engineer PUFs that recognize RNA targets of different length. The engineered PUFs bind to their RNA targets specifically and PUFs with more repeats have higher binding affinity than the canonical eight-repeat domains; however, the binding affinity reaches the peak at those with 9 and 10 repeats. Structural analysis on PUF with nine repeats reveals a higher degree of curvature, and the RNA binding unexpectedly and dramatically opens the curved structure. Investigation of the residues positioned in between two RNA bases demonstrates that tyrosine and arginine have favored stacking interactions. Further tests on the availability of the engineered PUFs in vitro and in splicing function assays indicate that our engineered PUFs bind RNA targets with high affinity in a programmable way.

  20. Expanding RNA binding specificity and affinity of engineered PUF domains

    PubMed Central

    Zhao, Yang-Yang; Zhang, Wen-Jing; Wang, Jue; Li, Hai-Tao; Yang, Yi; Wang, Zefeng; Wu, Jia-Wei

    2018-01-01

    Abstract Specific manipulation of RNA is necessary for the research in biotechnology and medicine. The RNA-binding domains of Pumilio/fem-3 mRNA binding factors (PUF domains) are programmable RNA binding scaffolds used to engineer artificial proteins that specifically modulate RNAs. However, the native PUF domains generally recognize 8-nt RNAs, limiting their applications. Here, we modify the PUF domain of human Pumilio1 to engineer PUFs that recognize RNA targets of different length. The engineered PUFs bind to their RNA targets specifically and PUFs with more repeats have higher binding affinity than the canonical eight-repeat domains; however, the binding affinity reaches the peak at those with 9 and 10 repeats. Structural analysis on PUF with nine repeats reveals a higher degree of curvature, and the RNA binding unexpectedly and dramatically opens the curved structure. Investigation of the residues positioned in between two RNA bases demonstrates that tyrosine and arginine have favored stacking interactions. Further tests on the availability of the engineered PUFs in vitro and in splicing function assays indicate that our engineered PUFs bind RNA targets with high affinity in a programmable way. PMID:29490074

  1. RNA-Rocket: an RNA-Seq analysis resource for infectious disease research

    PubMed Central

    Warren, Andrew S.; Aurrecoechea, Cristina; Brunk, Brian; Desai, Prerak; Emrich, Scott; Giraldo-Calderón, Gloria I.; Harb, Omar; Hix, Deborah; Lawson, Daniel; Machi, Dustin; Mao, Chunhong; McClelland, Michael; Nordberg, Eric; Shukla, Maulik; Vosshall, Leslie B.; Wattam, Alice R.; Will, Rebecca; Yoo, Hyun Seung; Sobral, Bruno

    2015-01-01

    Motivation: RNA-Seq is a method for profiling transcription using high-throughput sequencing and is an important component of many research projects that wish to study transcript isoforms, condition specific expression and transcriptional structure. The methods, tools and technologies used to perform RNA-Seq analysis continue to change, creating a bioinformatics challenge for researchers who wish to exploit these data. Resources that bring together genomic data, analysis tools, educational material and computational infrastructure can minimize the overhead required of life science researchers. Results: RNA-Rocket is a free service that provides access to RNA-Seq and ChIP-Seq analysis tools for studying infectious diseases. The site makes available thousands of pre-indexed genomes, their annotations and the ability to stream results to the bioinformatics resources VectorBase, EuPathDB and PATRIC. The site also provides a combination of experimental data and metadata, examples of pre-computed analysis, step-by-step guides and a user interface designed to enable both novice and experienced users of RNA-Seq data. Availability and implementation: RNA-Rocket is available at rnaseq.pathogenportal.org. Source code for this project can be found at github.com/cidvbi/PathogenPortal. Contact: anwarren@vt.edu Supplementary information: Supplementary materials are available at Bioinformatics online. PMID:25573919

  2. RNA-Rocket: an RNA-Seq analysis resource for infectious disease research.

    PubMed

    Warren, Andrew S; Aurrecoechea, Cristina; Brunk, Brian; Desai, Prerak; Emrich, Scott; Giraldo-Calderón, Gloria I; Harb, Omar; Hix, Deborah; Lawson, Daniel; Machi, Dustin; Mao, Chunhong; McClelland, Michael; Nordberg, Eric; Shukla, Maulik; Vosshall, Leslie B; Wattam, Alice R; Will, Rebecca; Yoo, Hyun Seung; Sobral, Bruno

    2015-05-01

    RNA-Seq is a method for profiling transcription using high-throughput sequencing and is an important component of many research projects that wish to study transcript isoforms, condition specific expression and transcriptional structure. The methods, tools and technologies used to perform RNA-Seq analysis continue to change, creating a bioinformatics challenge for researchers who wish to exploit these data. Resources that bring together genomic data, analysis tools, educational material and computational infrastructure can minimize the overhead required of life science researchers. RNA-Rocket is a free service that provides access to RNA-Seq and ChIP-Seq analysis tools for studying infectious diseases. The site makes available thousands of pre-indexed genomes, their annotations and the ability to stream results to the bioinformatics resources VectorBase, EuPathDB and PATRIC. The site also provides a combination of experimental data and metadata, examples of pre-computed analysis, step-by-step guides and a user interface designed to enable both novice and experienced users of RNA-Seq data. RNA-Rocket is available at rnaseq.pathogenportal.org. Source code for this project can be found at github.com/cidvbi/PathogenPortal. anwarren@vt.edu Supplementary materials are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.

  3. Insights into molecular plasticity in protein complexes from Trm9-Trm112 tRNA modifying enzyme crystal structure.

    PubMed

    Létoquart, Juliette; van Tran, Nhan; Caroline, Vonny; Aleksandrov, Alexey; Lazar, Noureddine; van Tilbeurgh, Herman; Liger, Dominique; Graille, Marc

    2015-12-15

    Most of the factors involved in translation (tRNA, rRNA and proteins) are subject to post-transcriptional and post-translational modifications, which participate in the fine-tuning and tight control of ribosome and protein synthesis processes. In eukaryotes, Trm112 acts as an obligate activating platform for at least four methyltransferases (MTase) involved in the modification of 18S rRNA (Bud23), tRNA (Trm9 and Trm11) and translation termination factor eRF1 (Mtq2). Trm112 is then at a nexus between ribosome synthesis and function. Here, we present a structure-function analysis of the Trm9-Trm112 complex, which is involved in the 5-methoxycarbonylmethyluridine (mcm(5)U) modification of the tRNA anticodon wobble position and hence promotes translational fidelity. We also compare the known crystal structures of various Trm112-MTase complexes, highlighting the structural plasticity allowing Trm112 to interact through a very similar mode with its MTase partners, although those share less than 20% sequence identity. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  4. Structural basis for viral 5'-PPP-RNA recognition by human IFIT proteins.

    PubMed

    Abbas, Yazan M; Pichlmair, Andreas; Górna, Maria W; Superti-Furga, Giulio; Nagar, Bhushan

    2013-02-07

    Interferon-induced proteins with tetratricopeptide repeats (IFITs) are innate immune effector molecules that are thought to confer antiviral defence through disruption of protein-protein interactions in the host translation-initiation machinery. However, it was recently discovered that IFITs can directly recognize viral RNA bearing a 5'-triphosphate group (PPP-RNA), which is a molecular signature that distinguishes it from host RNA. Here we report crystal structures of human IFIT5, its complex with PPP-RNAs, and an amino-terminal fragment of IFIT1. The structures reveal a new helical domain that houses a positively charged cavity designed to specifically engage only single-stranded PPP-RNA, thus distinguishing it from the canonical cytosolic sensor of double-stranded viral PPP-RNA, retinoic acid-inducible gene I (RIG-I, also known as DDX58). Mutational analysis, proteolysis and gel-shift assays reveal that PPP-RNA is bound in a non-sequence-specific manner and requires a 5'-overhang of approximately three nucleotides. Abrogation of PPP-RNA binding in IFIT1 and IFIT5 was found to cause a defect in the antiviral response by human embryonic kidney cells. These results demonstrate the mechanism by which IFIT proteins selectively recognize viral RNA, and lend insight into their downstream effector function.

  5. Novel cis-acting element within the capsid-coding region enhances flavivirus viral-RNA replication by regulating genome cyclization.

    PubMed

    Liu, Zhong-Yu; Li, Xiao-Feng; Jiang, Tao; Deng, Yong-Qiang; Zhao, Hui; Wang, Hong-Jiang; Ye, Qing; Zhu, Shun-Ya; Qiu, Yang; Zhou, Xi; Qin, E-De; Qin, Cheng-Feng

    2013-06-01

    cis-Acting elements in the viral genome RNA (vRNA) are essential for the translation, replication, and/or encapsidation of RNA viruses. In this study, a novel conserved cis-acting element was identified in the capsid-coding region of mosquito-borne flavivirus. The downstream of 5' cyclization sequence (5'CS) pseudoknot (DCS-PK) element has a three-stem pseudoknot structure, as demonstrated by structure prediction and biochemical analysis. Using dengue virus as a model, we show that DCS-PK enhances vRNA replication and that its function depends on its secondary structure and specific primary sequence. Mutagenesis revealed that the highly conserved stem 1 and loop 2, which are involved in potential loop-helix interactions, are crucial for DCS-PK function. A predicted loop 1-stem 3 base triple interaction is important for the structural stability and function of DCS-PK. Moreover, the function of DCS-PK depends on its position relative to the 5'CS, and the presence of DCS-PK facilitates the formation of 5'-3' RNA complexes. Taken together, our results reveal that the cis-acting element DCS-PK enhances vRNA replication by regulating genome cyclization, and DCS-PK might interplay with other cis-acting elements to form a functional vRNA cyclization domain, thus playing critical roles during the flavivirus life cycle and evolution.

  6. Novel cis-Acting Element within the Capsid-Coding Region Enhances Flavivirus Viral-RNA Replication by Regulating Genome Cyclization

    PubMed Central

    Liu, Zhong-Yu; Li, Xiao-Feng; Jiang, Tao; Deng, Yong-Qiang; Zhao, Hui; Wang, Hong-Jiang; Ye, Qing; Zhu, Shun-Ya; Qiu, Yang; Zhou, Xi; Qin, E-De

    2013-01-01

    cis-Acting elements in the viral genome RNA (vRNA) are essential for the translation, replication, and/or encapsidation of RNA viruses. In this study, a novel conserved cis-acting element was identified in the capsid-coding region of mosquito-borne flavivirus. The downstream of 5′ cyclization sequence (5′CS) pseudoknot (DCS-PK) element has a three-stem pseudoknot structure, as demonstrated by structure prediction and biochemical analysis. Using dengue virus as a model, we show that DCS-PK enhances vRNA replication and that its function depends on its secondary structure and specific primary sequence. Mutagenesis revealed that the highly conserved stem 1 and loop 2, which are involved in potential loop-helix interactions, are crucial for DCS-PK function. A predicted loop 1-stem 3 base triple interaction is important for the structural stability and function of DCS-PK. Moreover, the function of DCS-PK depends on its position relative to the 5′CS, and the presence of DCS-PK facilitates the formation of 5′-3′ RNA complexes. Taken together, our results reveal that the cis-acting element DCS-PK enhances vRNA replication by regulating genome cyclization, and DCS-PK might interplay with other cis-acting elements to form a functional vRNA cyclization domain, thus playing critical roles during the flavivirus life cycle and evolution. PMID:23576500

  7. RNAPattMatch: a web server for RNA sequence/structure motif detection based on pattern matching with flexible gaps

    PubMed Central

    Drory Retwitzer, Matan; Polishchuk, Maya; Churkin, Elena; Kifer, Ilona; Yakhini, Zohar; Barash, Danny

    2015-01-01

    Searching for RNA sequence-structure patterns is becoming an essential tool for RNA practitioners. Novel discoveries of regulatory non-coding RNAs in targeted organisms and the motivation to find them across a wide range of organisms have prompted the use of computational RNA pattern matching as an enhancement to sequence similarity. State-of-the-art programs differ by the flexibility of patterns allowed as queries and by their simplicity of use. In particular—no existing method is available as a user-friendly web server. A general program that searches for RNA sequence-structure patterns is RNA Structator. However, it is not available as a web server and does not provide the option to allow flexible gap pattern representation with an upper bound of the gap length being specified at any position in the sequence. Here, we introduce RNAPattMatch, a web-based application that is user friendly and makes sequence/structure RNA queries accessible to practitioners of various background and proficiency. It also extends RNA Structator and allows a more flexible variable gaps representation, in addition to analysis of results using energy minimization methods. RNAPattMatch service is available at http://www.cs.bgu.ac.il/rnapattmatch. A standalone version of the search tool is also available to download at the site. PMID:25940619

  8. RNA recognition by a human antibody against brain cytoplasmic 200 RNA

    PubMed Central

    Jung, Euihan; Lee, Jungmin; Hong, Hyo Jeong; Park, Insoo; Lee, Younghoon

    2014-01-01

    Diverse functional RNAs participate in a wide range of cellular processes. The RNA structure is critical for function, either on its own or as a complex form with proteins and other ligands. Therefore, analysis of the RNA conformation in cells is essential for understanding their functional mechanisms. However, no appropriate methods have been established as yet. Here, we developed an efficient strategy for panning and affinity maturation of anti-RNA human monoclonal antibodies from a naïve antigen binding fragment (Fab) combinatorial phage library. Brain cytoplasmic 200 (BC200) RNA, which is also highly expressed in some tumors, was used as an RNA antigen. We identified MabBC200-A3 as the optimal binding antibody. Mutagenesis and SELEX experiments showed that the antibody recognized a domain of BC200 in a structure- and sequence-dependent manner. Various breast cancer cell lines were further examined for BC200 RNA expression using conventional hybridization and immunoanalysis with MabBC200-A3 to see whether the antibody specifically recognizes BC200 RNA among the total purified RNAs. The amounts of antibody-recognizable BC200 RNA were consistent with hybridization signals among the cell lines. Furthermore, the antibody was able to discriminate BC200 RNA from other RNAs, supporting the utility of this antibody as a specific RNA structure-recognizing probe. Intriguingly, however, when permeabilized cells were subjected to immunoanalysis instead of purified total RNA, the amount of antibody-recognizable RNA was not correlated with the cellular level of BC200 RNA, indicating that BC200 RNA exists as two distinct forms (antibody-recognizable and nonrecognizable) in breast cancer cells and that their distribution depends on the cell type. Our results clearly demonstrate that anti-RNA antibodies provide an effective novel tool for detecting and analyzing RNA conformation. PMID:24759090

  9. In vivo tmRNA protection by SmpB and pre-ribosome binding conformation in solution

    PubMed Central

    Ranaei-Siadat, Ehsan; Mérigoux, Cécile; Seijo, Bili; Ponchon, Luc; Saliou, Jean-Michel; Bernauer, Julie; Sanglier-Cianférani, Sarah; Dardel, Fréderic

    2014-01-01

    TmRNA is an abundant RNA in bacteria with tRNA and mRNA features. It is specialized in trans-translation, a translation rescuing system. We demonstrate that its partner protein SmpB binds the tRNA-like region (TLD) in vivo and chaperones the fold of the TLD-H2 region. We use an original approach combining the observation of tmRNA degradation pathways in a heterologous system, the analysis of the tmRNA digests by MS and NMR, and co-overproduction assays of tmRNA and SmpB. We study the conformation in solution of tmRNA alone or in complex with one SmpB before ribosome binding using SAXS. Our data show that Mg2+ drives compaction of the RNA structure and that, in the absence of Mg2+, SmpB has a similar effect albeit to a lesser extent. Our results show that tmRNA is intrinsically structured in solution with identical topology to that observed on complexes on ribosomes which should facilitate its subsequent recruitment by the 70S ribosome, free or preloaded with one SmpB molecule. PMID:25135523

  10. RNase MRP cleaves pre-tRNASer-Met in the tRNA maturation pathway.

    PubMed

    Saito, Yuichiro; Takeda, Jun; Adachi, Kousuke; Nobe, Yuko; Kobayashi, Junya; Hirota, Kouji; Oliveira, Douglas V; Taoka, Masato; Isobe, Toshiaki

    2014-01-01

    Ribonuclease mitochondrial RNA processing (RNase MRP) is a multifunctional ribonucleoprotein (RNP) complex that is involved in the maturation of various types of RNA including ribosomal RNA. RNase MRP consists of a potential catalytic RNA and several protein components, all of which are required for cell viability. We show here that the temperature-sensitive mutant of rmp1, the gene for a unique protein component of RNase MRP, accumulates the dimeric tRNA precursor, pre-tRNA(Ser-Met). To examine whether RNase MRP mediates tRNA maturation, we purified the RNase MRP holoenzyme from the fission yeast Schizosaccharomyces pombe and found that the enzyme directly and selectively cleaves pre-tRNA(Ser-Met), suggesting that RNase MRP participates in the maturation of specific tRNA in vivo. In addition, mass spectrometry-based ribonucleoproteomic analysis demonstrated that this RNase MRP consists of one RNA molecule and 11 protein components, including a previously unknown component Rpl701. Notably, limited nucleolysis of RNase MRP generated an active catalytic core consisting of partial mrp1 RNA fragments, which constitute "Domain 1" in the secondary structure of RNase MRP, and 8 proteins. Thus, the present study provides new insight into the structure and function of RNase MRP.

  11. 2-D Structure of the A Region of Xist RNA and Its Implication for PRC2 Association

    PubMed Central

    Maenner, Sylvain; Blaud, Magali; Fouillen, Laetitia; Savoye, Anne; Marchand, Virginie; Dubois, Agnès; Sanglier-Cianférani, Sarah; Van Dorsselaer, Alain; Clerc, Philippe; Avner, Philip; Visvikis, Athanase; Branlant, Christiane

    2010-01-01

    In placental mammals, inactivation of one of the X chromosomes in female cells ensures sex chromosome dosage compensation. The 17 kb non-coding Xist RNA is crucial to this process and accumulates on the future inactive X chromosome. The most conserved Xist RNA region, the A region, contains eight or nine repeats separated by U-rich spacers. It is implicated in the recruitment of late inactivated X genes to the silencing compartment and likely in the recruitment of complex PRC2. Little is known about the structure of the A region and more generally about Xist RNA structure. Knowledge of its structure is restricted to an NMR study of a single A repeat element. Our study is the first experimental analysis of the structure of the entire A region in solution. By the use of chemical and enzymatic probes and FRET experiments, using oligonucleotides carrying fluorescent dyes, we resolved problems linked to sequence redundancies and established a 2-D structure for the A region that contains two long stem-loop structures each including four repeats. Interactions formed between repeats and between repeats and spacers stabilize these structures. Conservation of the spacer terminal sequences allows formation of such structures in all sequenced Xist RNAs. By combination of RNP affinity chromatography, immunoprecipitation assays, mass spectrometry, and Western blot analysis, we demonstrate that the A region can associate with components of the PRC2 complex in mouse ES cell nuclear extracts. Whilst a single four-repeat motif is able to associate with components of this complex, recruitment of Suz12 is clearly more efficient when the entire A region is present. Our data with their emphasis on the importance of inter-repeat pairing change fundamentally our conception of the 2-D structure of the A region of Xist RNA and support its possible implication in recruitment of the PRC2 complex. PMID:20052282

  12. Recognition of chimeric small-subunit ribosomal DNAs composed of genes from uncultivated microorganisms

    NASA Technical Reports Server (NTRS)

    Kopczynski, E. D.; Bateson, M. M.; Ward, D. M.

    1994-01-01

    When PCR was used to recover small-subunit (SSU) rRNA genes from a hot spring cyanobacterial mat community, chimeric SSU rRNA sequences which exhibited little or no secondary structural abnormality were recovered. They were revealed as chimeras of SSU rRNA genes of uncultivated species through separate phylogenetic analysis of short sequence domains.

  13. Trick or TREAT: A Scary-Good New Approach for Single-Molecule mRNA Decay Analysis.

    PubMed

    Russo, Joseph; Wilusz, Jeffrey

    2017-11-02

    In this issue of Molecular Cell, Horvathova et al. (2017) have developed a powerful approach to single-molecule assessment of RNA decay in living cells by exploiting the ability of flavivirus RNA structural elements to trap XRN1 decay intermediates in dual-labeled reporter constructs. Copyright © 2017 Elsevier Inc. All rights reserved.

  14. Dithiothreitol (DTT) Acts as a Specific, UV-inducible Cross-linker in Elucidation of Protein–RNA Interactions*

    PubMed Central

    Zaman, Uzma; Richter, Florian M.; Hofele, Romina; Kramer, Katharina; Sachsenberg, Timo; Kohlbacher, Oliver; Lenz, Christof; Urlaub, Henning

    2015-01-01

    Protein–RNA cross-linking by UV irradiation at 254 nm wavelength has been established as an unbiased method to identify proteins in direct contact with RNA, and has been successfully applied to investigate the spatial arrangement of protein and RNA in large macromolecular assemblies, e.g. ribonucleoprotein-complex particles (RNPs). The mass spectrometric analysis of such peptide-RNA cross-links provides high resolution structural data to the point of mapping protein–RNA interactions to specific peptides or even amino acids. However, the approach suffers from the low yield of cross-linking products, which can be addressed by improving enrichment and analysis methods. In the present article, we introduce dithiothreitol (DTT) as a potent protein–RNA cross-linker. In order to evaluate the efficiency and specificity of DTT, we used two systems, a small synthetic peptide from smB protein incubated with U1 snRNA oligonucleotide and native ribonucleoprotein complexes from S. cerevisiae. Our results unambiguously show that DTT covalently participates in cysteine-uracil crosslinks, which is observable as a mass increment of 151.9966 Da (C4H8S2O2) upon mass spectrometric analysis. DTT presents advantages for cross-linking of cysteine containing regions of proteins. This is evidenced by comparison to experiments where (tris(2-carboxyethyl)phosphine) is used as reducing agent, and significantly less cross-links encompassing cysteine residues are found. We further propose insertion of DTT between the cysteine and uracil reactive sites as the most probable structure of the cross-linking products. PMID:26450613

  15. microRNAs Databases: Developmental Methodologies, Structural and Functional Annotations.

    PubMed

    Singh, Nagendra Kumar

    2017-09-01

    microRNA (miRNA) is an endogenous and evolutionary conserved non-coding RNA, involved in post-transcriptional process as gene repressor and mRNA cleavage through RNA-induced silencing complex (RISC) formation. In RISC, miRNA binds in complementary base pair with targeted mRNA along with Argonaut proteins complex, causes gene repression or endonucleolytic cleavage of mRNAs and results in many diseases and syndromes. After the discovery of miRNA lin-4 and let-7, subsequently large numbers of miRNAs were discovered by low-throughput and high-throughput experimental techniques along with computational process in various biological and metabolic processes. The miRNAs are important non-coding RNA for understanding the complex biological phenomena of organism because it controls the gene regulation. This paper reviews miRNA databases with structural and functional annotations developed by various researchers. These databases contain structural and functional information of animal, plant and virus miRNAs including miRNAs-associated diseases, stress resistance in plant, miRNAs take part in various biological processes, effect of miRNAs interaction on drugs and environment, effect of variance on miRNAs, miRNAs gene expression analysis, sequence of miRNAs, structure of miRNAs. This review focuses on the developmental methodology of miRNA databases such as computational tools and methods used for extraction of miRNAs annotation from different resources or through experiment. This study also discusses the efficiency of user interface design of every database along with current entry and annotations of miRNA (pathways, gene ontology, disease ontology, etc.). Here, an integrated schematic diagram of construction process for databases is also drawn along with tabular and graphical comparison of various types of entries in different databases. Aim of this paper is to present the importance of miRNAs-related resources at a single place.

  16. RNA-dependent RNA polymerase complex of Brome mosaic virus: analysis of the molecular structure with monoclonal antibodies.

    PubMed

    Dohi, Koji; Mise, Kazuyuki; Furusawa, Iwao; Okuno, Tetsuro

    2002-11-01

    Viral RNA-dependent RNA polymerase (RdRp) plays crucial roles in the genomic replication and subgenomic transcription of Brome mosaic virus (BMV), a positive-stranded RNA plant virus. BMV RdRp is a complex of virus-encoded 1a and 2a proteins and some cellular factors, and associates with the endoplasmic reticulum at an infection-specific structure in the cytoplasm of host cells. In this study, we investigate the gross structure of the active BMV RdRp complex using monoclonal antibodies raised against the 1a and 2a proteins. Immunoprecipitation experiments showed that the intermediate region between the N-terminal methyltransferase-like domain and the C-terminal helicase-like domain of 1a protein, and the N terminus region of 2a protein are exposed on the surface of the solubilized RdRp complex. Inhibition assays for membrane-bound RdRp suggested that the intermediate region between the methyltransferase-like and the helicase-like domains of 1a protein is located at the border of the region buried within a membrane structure or with membrane-associated material.

  17. Evolutionary plasticity of the NHL domain underlies distinct solutions to RNA recognition.

    PubMed

    Kumari, Pooja; Aeschimann, Florian; Gaidatzis, Dimos; Keusch, Jeremy J; Ghosh, Pritha; Neagu, Anca; Pachulska-Wieczorek, Katarzyna; Bujnicki, Janusz M; Gut, Heinz; Großhans, Helge; Ciosk, Rafal

    2018-04-19

    RNA-binding proteins regulate all aspects of RNA metabolism. Their association with RNA is mediated by RNA-binding domains, of which many remain uncharacterized. A recently reported example is the NHL domain, found in prominent regulators of cellular plasticity like the C. elegans LIN-41. Here we employ an integrative approach to dissect the RNA specificity of LIN-41. Using computational analysis, structural biology, and in vivo studies in worms and human cells, we find that a positively charged pocket, specific to the NHL domain of LIN-41 and its homologs (collectively LIN41), recognizes a stem-loop RNA element, whose shape determines the binding specificity. Surprisingly, the mechanism of RNA recognition by LIN41 is drastically different from that of its more distant relative, the fly Brat. Our phylogenetic analysis suggests that this reflects a rapid evolution of the domain, presenting an interesting example of a conserved protein fold that acquired completely different solutions to RNA recognition.

  18. Selective 2'-hydroxyl acylation analyzed by primer extension and mutational profiling (SHAPE-MaP) for direct, versatile and accurate RNA structure analysis.

    PubMed

    Smola, Matthew J; Rice, Greggory M; Busan, Steven; Siegfried, Nathan A; Weeks, Kevin M

    2015-11-01

    Selective 2'-hydroxyl acylation analyzed by primer extension (SHAPE) chemistries exploit small electrophilic reagents that react with 2'-hydroxyl groups to interrogate RNA structure at single-nucleotide resolution. Mutational profiling (MaP) identifies modified residues by using reverse transcriptase to misread a SHAPE-modified nucleotide and then counting the resulting mutations by massively parallel sequencing. The SHAPE-MaP approach measures the structure of large and transcriptome-wide systems as accurately as can be done for simple model RNAs. This protocol describes the experimental steps, implemented over 3 d, that are required to perform SHAPE probing and to construct multiplexed SHAPE-MaP libraries suitable for deep sequencing. Automated processing of MaP sequencing data is accomplished using two software packages. ShapeMapper converts raw sequencing files into mutational profiles, creates SHAPE reactivity plots and provides useful troubleshooting information. SuperFold uses these data to model RNA secondary structures, identify regions with well-defined structures and visualize probable and alternative helices, often in under 1 d. SHAPE-MaP can be used to make nucleotide-resolution biophysical measurements of individual RNA motifs, rare components of complex RNA ensembles and entire transcriptomes.

  19. RNA-Seq analysis of yak ovary: improving yak gene structure information and mining reproduction-related genes.

    PubMed

    Lan, DaoLiang; Xiong, XianRong; Wei, YanLi; Xu, Tong; Zhong, JinCheng; Zhi, XiangDong; Wang, Yong; Li, Jian

    2014-09-01

    RNA-Seq, a high-throughput (HT) sequencing technique, has been used effectively in large-scale transcriptomic studies, and is particularly useful for improving gene structure information and mining of new genes. In this study, RNA-Seq HT technology was employed to analyze the transcriptome of yak ovary. After Illumina-Solexa deep sequencing, 26826516 clean reads with a total of 4828772880 bp were obtained from the ovary library. Alignment analysis showed that 16992 yak genes mapped to the yak genome and 3734 of these genes were involved in alternative splicing. Gene structure refinement analysis showed that 7340 genes that were annotated in the yak genome could be extended at the 5' or 3' ends based on the alignments been the transcripts and the genome sequence. Novel transcript prediction analysis identified 6321 new transcripts with lengths ranging from 180 to 14884 bp, and 2267 of them were predicted to code proteins. BLAST analysis of the new transcripts showed that 1200?4933 mapped to the non-redundant (nr), nucleotide (nt) and/or SwissProt sequence databases. Comparative statistical analysis of the new mapped transcripts showed that the majority of them were similar to genes in Bos taurus (41.4%), Bos grunniens mutus (33.0%), Ovis aries (6.3%), Homo sapiens (2.8%), Mus musculus (1.6%) and other species. Functional analysis showed that these expressed genes were involved in various Gene Ontology (GO) categories and Kyoto Encyclopedia of Genes and Genomes pathways. GO analysis of the new transcripts found that the largest proportion of them was associated with reproduction. The results of this study will provide a basis for describing the normal transcriptome map of yak ovary and for future studies on yak breeding performance. Moreover, the results confirmed that RNA-Seq HT technology is highly advantageous in improving gene structure information and mining of new genes, as well as in providing valuable data to expand the yak genome information.

  20. Structures and functions of proteins and nucleic acids in protein biosynthesis

    NASA Astrophysics Data System (ADS)

    Miyazawa, Tatsuo; Yokoyama, Shigeyuki

    Infrared and Raman spectroscopy is useful for studying helical conformations of polypeptides, which are determined by molecular structure parameters. Nuclear magnetic resonance spectroscopy, as well as X-ray analysis, is now established to be important for conformation studies of proteins and nucleic acids in solution. This article is mainly concerned with the conformational aspect and function regulation in protein biosynthesis. The strict recognition of transfer ribonucleic acid (tRNA) by aminoacyl-tRNA synthetase (ARS) is achieved by multi-step mutual adaptation. The conformations of ARS-bound amino acids have been elucidated by transferred nuclear Overhauser effect analysis. Aminoacyl-tRNA takes the 3‧-isomeric form in the polypeptide chain elongation cycle. The regulation of codon recognition by post-transcriptional modification is achieved by conversion of the conformational characteristic of the anticodon of tRNA. The cytidine → lysidine modification of the anticodon of minor isoleucine tRNA concurrently converts the amino acid specificity and the codon specificity. As novel protein engineering, a basic strategy has been established for in vivo biosynthesis of proteins that are substituted with unnatural amino acids (alloproteins).

  1. New insights from cluster analysis methods for RNA secondary structure prediction

    PubMed Central

    Rogers, Emily; Heitsch, Christine

    2016-01-01

    A widening gap exists between the best practices for RNA secondary structure prediction developed by computational researchers and the methods used in practice by experimentalists. Minimum free energy (MFE) predictions, although broadly used, are outperformed by methods which sample from the Boltzmann distribution and data mine the results. In particular, moving beyond the single structure prediction paradigm yields substantial gains in accuracy. Furthermore, the largest improvements in accuracy and precision come from viewing secondary structures not at the base pair level but at lower granularity/higher abstraction. This suggests that random errors affecting precision and systematic ones affecting accuracy are both reduced by this “fuzzier” view of secondary structures. Thus experimentalists who are willing to adopt a more rigorous, multilayered approach to secondary structure prediction by iterating through these levels of granularity will be much better able to capture fundamental aspects of RNA base pairing. PMID:26971529

  2. Analysis of hairpin RNA transgene-induced gene silencing in Fusarium oxysporum

    PubMed Central

    2013-01-01

    Background Hairpin RNA (hpRNA) transgenes can be effective at inducing RNA silencing and have been exploited as a powerful tool for gene function analysis in many organisms. However, in fungi, expression of hairpin RNA transcripts can induce post-transcriptional gene silencing, but in some species can also lead to transcriptional gene silencing, suggesting a more complex interplay of the two pathways at least in some fungi. Because many fungal species are important pathogens, RNA silencing is a powerful technique to understand gene function, particularly when gene knockouts are difficult to obtain. We investigated whether the plant pathogenic fungus Fusarium oxysporum possesses a functional gene silencing machinery and whether hairpin RNA transcripts can be employed to effectively induce gene silencing. Results Here we show that, in the phytopathogenic fungus F. oxysporum, hpRNA transgenes targeting either a β-glucuronidase (Gus) reporter transgene (hpGus) or the endogenous gene Frp1 (hpFrp) did not induce significant silencing of the target genes. Expression analysis suggested that the hpRNA transgenes are prone to transcriptional inactivation, resulting in low levels of hpRNA and siRNA production. However, the hpGus RNA can be efficiently transcribed by promoters acquired either by recombination with a pre-existing, actively transcribed Gus transgene or by fortuitous integration near an endogenous gene promoter allowing siRNA production. These siRNAs effectively induced silencing of a target Gus transgene, which in turn appeared to also induce secondary siRNA production. Furthermore, our results suggested that hpRNA transcripts without poly(A) tails are efficiently processed into siRNAs to induce gene silencing. A convergent promoter transgene, designed to express poly(A)-minus sense and antisense Gus RNAs, without an inverted-repeat DNA structure, induced consistent Gus silencing in F. oxysporum. Conclusions These results indicate that F. oxysporum possesses functional RNA silencing machineries for siRNA production and target mRNA cleavage, but hpRNA transgenes may induce transcriptional self-silencing due to its inverted-repeat structure. Our results suggest that F. oxysporum possesses a similar gene silencing pathway to other fungi like fission yeast, and indicate a need for developing more effective RNA silencing technology for gene function studies in this fungal pathogen. PMID:23819794

  3. Two new mutations in the MT-TW gene leading to the disruption of the secondary structure of the tRNA(Trp) in patients with Leigh syndrome.

    PubMed

    Mkaouar-Rebai, Emna; Chamkha, Imen; Kammoun, Fatma; Kammoun, Thouraya; Aloulou, Hajer; Hachicha, Mongia; Triki, Chahnez; Fakhfakh, Faiza

    2009-07-01

    Leigh syndrome is a progressive neurodegenerative disorder occurring in infancy and childhood characterized in most cases by a psychomotor retardation, optic atrophy, ataxia, dystonia, failure to thrive, seizures and respiratory failure. In this study, we performed a systematic sequence analysis of mitochondrial genes associated with LS in Tunisian patients. We sequenced the encoded complex I units: ND2, ND3, ND4, ND5 and ND6 genes and the mitochondrial ATPase 6, tRNA(Val), tRNA(Leu(UUR)), tRNA(Trp) and tRNA(Lys) genes in 10 unrelated patients with Leigh syndrome. We revealed the presence of 34 reported polymorphisms, nine novel nucleotide variants and two new mutations (T5523G and A5559G) in the tested patients. These two mutations were localized in two conserved regions of the tRNA(Trp) and affect, respectively, the D-stem and the T-stem of the mitochondrial tRNA leading to a disruption of the secondary structure of this tRNA. SSP-PCR analysis showed that the T5523G and A5559G mutations were present with respective heteroplasmic rates of 66% and 43 %. We report here the first mutational screening of mitochondrial mutations in Tunisian patients with Leigh syndrome which described two novel mutations associated with this disorder.

  4. Global Maps of ProQ Binding In Vivo Reveal Target Recognition via RNA Structure and Stability Control at mRNA 3' Ends.

    PubMed

    Holmqvist, Erik; Li, Lei; Bischler, Thorsten; Barquist, Lars; Vogel, Jörg

    2018-05-15

    The conserved RNA-binding protein ProQ has emerged as the centerpiece of a previously unknown third large network of post-transcriptional control in enterobacteria. Here, we have used in vivo UV crosslinking and RNA sequencing (CLIP-seq) to map hundreds of ProQ binding sites in Salmonella enterica and Escherichia coli. Our analysis of these binding sites, many of which are conserved, suggests that ProQ recognizes its cellular targets through RNA structural motifs found in small RNAs (sRNAs) and at the 3' end of mRNAs. Using the cspE mRNA as a model for 3' end targeting, we reveal a function for ProQ in protecting mRNA against exoribonucleolytic activity. Taken together, our results underpin the notion that ProQ governs a post-transcriptional network distinct from those of the well-characterized sRNA-binding proteins, CsrA and Hfq, and suggest a previously unrecognized, sRNA-independent role of ProQ in stabilizing mRNAs. Copyright © 2018 Elsevier Inc. All rights reserved.

  5. Discrimination between Closely Related Cellular Metabolites by the SAM-I Riboswitch

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Montange, R.; Mondragon, E; van Tyne, D

    2010-01-01

    The SAM-I riboswitch is a cis-acting element of genetic control found in bacterial mRNAs that specifically binds S-adenosylmethionine (SAM). We previously determined the 2.9-{angstrom} X-ray crystal structure of the effector-binding domain of this RNA element, revealing details of RNA-ligand recognition. To improve this structure, variations were made to the RNA sequence to alter lattice contacts, resulting in a 0.5-{angstrom} improvement in crystallographic resolution and allowing for a more accurate refinement of the crystallographic model. The basis for SAM specificity was addressed by a structural analysis of the RNA complexed to S-adenosylhomocysteine (SAH) and sinefungin and by measuring the affinity ofmore » SAM and SAH for a series of mutants using isothermal titration calorimetry. These data illustrate the importance of two universally conserved base pairs in the RNA that form electrostatic interactions with the positively charged sulfonium group of SAM, thereby providing a basis for discrimination between SAM and SAH.« less

  6. A DNA sequence obtained by replacement of the dopamine RNA aptamer bases is not an aptamer.

    PubMed

    Álvarez-Martos, Isabel; Ferapontova, Elena E

    2017-08-05

    A unique specificity of the aptamer-ligand biorecognition and binding facilitates bioanalysis and biosensor development, contributing to discrimination of structurally related molecules, such as dopamine and other catecholamine neurotransmitters. The aptamer sequence capable of specific binding of dopamine is a 57 nucleotides long RNA sequence reported in 1997 (Biochemistry, 1997, 36, 9726). Later, it was suggested that the DNA homologue of the RNA aptamer retains the specificity of dopamine binding (Biochem. Biophys. Res. Commun., 2009, 388, 732). Here, we show that the DNA sequence obtained by the replacement of the RNA aptamer bases for their DNA analogues is not able of specific biorecognition of dopamine, in contrast to the original RNA aptamer sequence. This DNA sequence binds dopamine and structurally related catecholamine neurotransmitters non-specifically, as any DNA sequence, and, thus, is not an aptamer and cannot be used neither for in vivo nor in situ analysis of dopamine in the presence of structurally related neurotransmitters. Copyright © 2017 Elsevier Inc. All rights reserved.

  7. Assembly of RNA nanostructures on supported lipid bilayers

    NASA Astrophysics Data System (ADS)

    Dabkowska, Aleksandra P.; Michanek, Agnes; Jaeger, Luc; Rabe, Michael; Chworos, Arkadiusz; Höök, Fredrik; Nylander, Tommy; Sparr, Emma

    2014-12-01

    The assembly of nucleic acid nanostructures with controlled size and shape has large impact in the fields of nanotechnology, nanomedicine and synthetic biology. The directed arrangement of nano-structures at interfaces is important for many applications. In spite of this, the use of laterally mobile lipid bilayers to control RNA three-dimensional nanostructure formation on surfaces remains largely unexplored. Here, we direct the self-assembly of RNA building blocks into three-dimensional structures of RNA on fluid lipid bilayers composed of cationic 1,2-dioleoyl-3-trimethylammonium-propane (DOTAP) or mixtures of zwitterionic 1,2-dioleoyl-sn-glycero-3-phosphatidylcholine (DOPC) and cationic sphingosine. We demonstrate the stepwise supramolecular assembly of discrete building blocks through specific and selective RNA-RNA interactions, based on results from quartz crystal microbalance with dissipation (QCM-D), ellipsometry, fluorescence recovery after photobleaching (FRAP) and total internal reflection fluorescence microscopy (TIRF) experiments. The assembly can be controlled to give a densely packed single layer of RNA polyhedrons at the fluid lipid bilayer surface. We show that assembly of the 3D structure can be modulated by sequence specific interactions, surface charge and changes in the salt composition and concentration. In addition, the tertiary structure of the RNA polyhedron can be controllably switched from an extended structure to one that is dense and compact. The versatile approach to building up three-dimensional structures of RNA does not require modification of the surface or the RNA molecules, and can be used as a bottom-up means of nanofabrication of functionalized bio-mimicking surfaces.The assembly of nucleic acid nanostructures with controlled size and shape has large impact in the fields of nanotechnology, nanomedicine and synthetic biology. The directed arrangement of nano-structures at interfaces is important for many applications. In spite of this, the use of laterally mobile lipid bilayers to control RNA three-dimensional nanostructure formation on surfaces remains largely unexplored. Here, we direct the self-assembly of RNA building blocks into three-dimensional structures of RNA on fluid lipid bilayers composed of cationic 1,2-dioleoyl-3-trimethylammonium-propane (DOTAP) or mixtures of zwitterionic 1,2-dioleoyl-sn-glycero-3-phosphatidylcholine (DOPC) and cationic sphingosine. We demonstrate the stepwise supramolecular assembly of discrete building blocks through specific and selective RNA-RNA interactions, based on results from quartz crystal microbalance with dissipation (QCM-D), ellipsometry, fluorescence recovery after photobleaching (FRAP) and total internal reflection fluorescence microscopy (TIRF) experiments. The assembly can be controlled to give a densely packed single layer of RNA polyhedrons at the fluid lipid bilayer surface. We show that assembly of the 3D structure can be modulated by sequence specific interactions, surface charge and changes in the salt composition and concentration. In addition, the tertiary structure of the RNA polyhedron can be controllably switched from an extended structure to one that is dense and compact. The versatile approach to building up three-dimensional structures of RNA does not require modification of the surface or the RNA molecules, and can be used as a bottom-up means of nanofabrication of functionalized bio-mimicking surfaces. Electronic supplementary information (ESI) available: Table with sequences of tRNA units used in this study; schematic structures of the RNA polyhedron and its building blocks; gel electrophoresis characterization of the RNA polyhedron and squares; AFM characterization of RNA tectosquare; schematic structures of RNA-9 and RNA-10 and their association with lipid bilayers; QCM-D frequency and dissipation data (as function of time) for adsorption of RNA polyhedrons, RNA squares and RNA9-10 TIRF images of RNA with Gelstar after photobleaching with analysis; Correlation plot in change of shear viscosity for TS3 and TO3-4 models for the stoichiometry of TS; QCM-D dissipation data for the sequential experiment in Fig. 5a; QCM-D and for the assembly of building blocks at the bilayer scaffold at varying bulk concentrations; QCM-D of adsorption of TS3. See DOI: 10.1039/c4nr05968a

  8. Modulation of RNA function by aminoglycoside antibiotics.

    PubMed

    Schroeder, R; Waldsich, C; Wank, H

    2000-01-04

    One of the most important families of antibiotics are the aminoglycosides, including drugs such as neomycin B, paromomycin, gentamicin and streptomycin. With the discovery of the catalytic potential of RNA, these antibiotics became very popular due to their RNA-binding capacity. They serve for the analysis of RNA function as well as for the study of RNA as a potential therapeutic target. Improvements in RNA structure determination recently provided first insights into the decoding site of the ribosome at high resolution and how aminoglycosides might induce misreading of the genetic code. In addition to inhibiting prokaryotic translation, aminoglycosides inhibit several catalytic RNAs such as self-splicing group I introns, RNase P and small ribozymes in vitro. Furthermore, these antibiotics interfere with human immunodeficiency virus (HIV) replication by disrupting essential RNA-protein contacts. Most exciting is the potential of many RNA-binding antibiotics to stimulate RNA activities, conceiving small-molecule partners for the hypothesis of an ancient RNA world. SELEX (systematic evolution of ligands by exponential enrichment) has been used in this evolutionary game leading to small synthetic RNAs, whose NMR structures gave valuable information on how aminoglycosides interact with RNA, which could possibly be used in applied science.

  9. A Data Driven Model for Predicting RNA-Protein Interactions based on Gradient Boosting Machine.

    PubMed

    Jain, Dharm Skandh; Gupte, Sanket Rajan; Aduri, Raviprasad

    2018-06-22

    RNA protein interactions (RPI) play a pivotal role in the regulation of various biological processes. Experimental validation of RPI has been time-consuming, paving the way for computational prediction methods. The major limiting factor of these methods has been the accuracy and confidence of the predictions, and our in-house experiments show that they fail to accurately predict RPI involving short RNA sequences such as TERRA RNA. Here, we present a data-driven model for RPI prediction using a gradient boosting classifier. Amino acids and nucleotides are classified based on the high-resolution structural data of RNA protein complexes. The minimum structural unit consisting of five residues is used as the descriptor. Comparative analysis of existing methods shows the consistently higher performance of our method irrespective of the length of RNA present in the RPI. The method has been successfully applied to map RPI networks involving both long noncoding RNA as well as TERRA RNA. The method is also shown to successfully predict RNA and protein hubs present in RPI networks of four different organisms. The robustness of this method will provide a way for predicting RPI networks of yet unknown interactions for both long noncoding RNA and microRNA.

  10. New Era of Studying RNA Secondary Structure and Its Influence on Gene Regulation in Plants.

    PubMed

    Yang, Xiaofei; Yang, Minglei; Deng, Hongjing; Ding, Yiliang

    2018-01-01

    The dynamic structure of RNA plays a central role in post-transcriptional regulation of gene expression such as RNA maturation, degradation, and translation. With the rise of next-generation sequencing, the study of RNA structure has been transformed from in vitro low-throughput RNA structure probing methods to in vivo high-throughput RNA structure profiling. The development of these methods enables incremental studies on the function of RNA structure to be performed, revealing new insights of novel regulatory mechanisms of RNA structure in plants. Genome-wide scale RNA structure profiling allows us to investigate general RNA structural features over 10s of 1000s of mRNAs and to compare RNA structuromes between plant species. Here, we provide a comprehensive and up-to-date overview of: (i) RNA structure probing methods; (ii) the biological functions of RNA structure; (iii) genome-wide RNA structural features corresponding to their regulatory mechanisms; and (iv) RNA structurome evolution in plants.

  11. Predicting RNA Duplex Dimerization Free-Energy Changes upon Mutations Using Molecular Dynamics Simulations.

    PubMed

    Sakuraba, Shun; Asai, Kiyoshi; Kameda, Tomoshi

    2015-11-05

    The dimerization free energies of RNA-RNA duplexes are fundamental values that represent the structural stability of RNA complexes. We report a comparative analysis of RNA-RNA duplex dimerization free-energy changes upon mutations, estimated from a molecular dynamics simulation and experiments. A linear regression for nine pairs of double-stranded RNA sequences, six base pairs each, yielded a mean absolute deviation of 0.55 kcal/mol and an R(2) value of 0.97, indicating quantitative agreement between simulations and experimental data. The observed accuracy indicates that the molecular dynamics simulation with the current molecular force field is capable of estimating the thermodynamic properties of RNA molecules.

  12. Sounds of silence: synonymous nucleotides as a key to biological regulation and complexity

    PubMed Central

    Shabalina, Svetlana A.; Spiridonov, Nikolay A.; Kashina, Anna

    2013-01-01

    Messenger RNA is a key component of an intricate regulatory network of its own. It accommodates numerous nucleotide signals that overlap protein coding sequences and are responsible for multiple levels of regulation and generation of biological complexity. A wealth of structural and regulatory information, which mRNA carries in addition to the encoded amino acid sequence, raises the question of how these signals and overlapping codes are delineated along non-synonymous and synonymous positions in protein coding regions, especially in eukaryotes. Silent or synonymous codon positions, which do not determine amino acid sequences of the encoded proteins, define mRNA secondary structure and stability and affect the rate of translation, folding and post-translational modifications of nascent polypeptides. The RNA level selection is acting on synonymous sites in both prokaryotes and eukaryotes and is more common than previously thought. Selection pressure on the coding gene regions follows three-nucleotide periodic pattern of nucleotide base-pairing in mRNA, which is imposed by the genetic code. Synonymous positions of the coding regions have a higher level of hybridization potential relative to non-synonymous positions, and are multifunctional in their regulatory and structural roles. Recent experimental evidence and analysis of mRNA structure and interspecies conservation suggest that there is an evolutionary tradeoff between selective pressure acting at the RNA and protein levels. Here we provide a comprehensive overview of the studies that define the role of silent positions in regulating RNA structure and processing that exert downstream effects on proteins and their functions. PMID:23293005

  13. Bioinformatics of cardiovascular miRNA biology.

    PubMed

    Kunz, Meik; Xiao, Ke; Liang, Chunguang; Viereck, Janika; Pachel, Christina; Frantz, Stefan; Thum, Thomas; Dandekar, Thomas

    2015-12-01

    MicroRNAs (miRNAs) are small ~22 nucleotide non-coding RNAs and are highly conserved among species. Moreover, miRNAs regulate gene expression of a large number of genes associated with important biological functions and signaling pathways. Recently, several miRNAs have been found to be associated with cardiovascular diseases. Thus, investigating the complex regulatory effect of miRNAs may lead to a better understanding of their functional role in the heart. To achieve this, bioinformatics approaches have to be coupled with validation and screening experiments to understand the complex interactions of miRNAs with the genome. This will boost the subsequent development of diagnostic markers and our understanding of the physiological and therapeutic role of miRNAs in cardiac remodeling. In this review, we focus on and explain different bioinformatics strategies and algorithms for the identification and analysis of miRNAs and their regulatory elements to better understand cardiac miRNA biology. Starting with the biogenesis of miRNAs, we present approaches such as LocARNA and miRBase for combining sequence and structure analysis including phylogenetic comparisons as well as detailed analysis of RNA folding patterns, functional target prediction, signaling pathway as well as functional analysis. We also show how far bioinformatics helps to tackle the unprecedented level of complexity and systemic effects by miRNA, underlining the strong therapeutic potential of miRNA and miRNA target structures in cardiovascular disease. In addition, we discuss drawbacks and limitations of bioinformatics algorithms and the necessity of experimental approaches for miRNA target identification. This article is part of a Special Issue entitled 'Non-coding RNAs'. Copyright © 2014 Elsevier Ltd. All rights reserved.

  14. Nuclear matrix and hnRNP share a common structural constituent associated with premessenger RNA.

    PubMed Central

    Gallinaro, H; Puvion, E; Kister, L; Jacob, M

    1983-01-01

    Nuclear matrix and heterogeneous nuclear ribonucleoprotein (hnRNP) were compared to establish whether premessenger RNA (premRNA) was associated with a same constituent in both structures. The isolation of nuclear matrix included the removal of chromatin and of 0.4 M KCl-soluble material. HnRNP, isolated by a standard method was also treated by 0.4 M KCl. Both isolation procedures caused the removal of DNA, histones, a fraction of small nuclear RNA and of nonhistone proteins including the hnRNP proteins in the 30 000-40 000 mol. wt. range. High resolution autoradiography showed that hnRNA remained associated with the residual fibrils in both structures. They both contained the same premRNA and maturation products as shown by the analysis of the transcripts of the early region 3 of adenovirus 2. In addition, the small nuclear RNA and protein of the salt-resistant complexes were also present in the matrix. The results are compatible with the idea that the salt-resistant complexes from hnRNP constitute the fibrils associated with premRNA in the nucleoplasmic matrix. The fibrils may be the basic unit of splicing and their organization in matrix might provide the spatial configuration necessary for regulation. Images Fig. 2. Fig. 3. Fig. 4. Fig. 5. Fig. 7. PMID:6557026

  15. PRince: a web server for structural and physicochemical analysis of protein-RNA interface.

    PubMed

    Barik, Amita; Mishra, Abhishek; Bahadur, Ranjit Prasad

    2012-07-01

    We have developed a web server, PRince, which analyzes the structural features and physicochemical properties of the protein-RNA interface. Users need to submit a PDB file containing the atomic coordinates of both the protein and the RNA molecules in complex form (in '.pdb' format). They should also mention the chain identifiers of interacting protein and RNA molecules. The size of the protein-RNA interface is estimated by measuring the solvent accessible surface area buried in contact. For a given protein-RNA complex, PRince calculates structural, physicochemical and hydration properties of the interacting surfaces. All these parameters generated by the server are presented in a tabular format. The interacting surfaces can also be visualized with software plug-in like Jmol. In addition, the output files containing the list of the atomic coordinates of the interacting protein, RNA and interface water molecules can be downloaded. The parameters generated by PRince are novel, and users can correlate them with the experimentally determined biophysical and biochemical parameters for better understanding the specificity of the protein-RNA recognition process. This server will be continuously upgraded to include more parameters. PRince is publicly accessible and free for use. Available at http://www.facweb.iitkgp.ernet.in/~rbahadur/prince/home.html.

  16. Population and allelic variation of A-to-I RNA editing in human transcriptomes.

    PubMed

    Park, Eddie; Guo, Jiguang; Shen, Shihao; Demirdjian, Levon; Wu, Ying Nian; Lin, Lan; Xing, Yi

    2017-07-28

    A-to-I RNA editing is an important step in RNA processing in which specific adenosines in some RNA molecules are post-transcriptionally modified to inosines. RNA editing has emerged as a widespread mechanism for generating transcriptome diversity. However, there remain significant knowledge gaps about the variation and function of RNA editing. In order to determine the influence of genetic variation on A-to-I RNA editing, we integrate genomic and transcriptomic data from 445 human lymphoblastoid cell lines by combining an RNA editing QTL (edQTL) analysis with an allele-specific RNA editing (ASED) analysis. We identify 1054 RNA editing events associated with cis genetic polymorphisms. Additionally, we find that a subset of these polymorphisms is linked to genome-wide association study signals of complex traits or diseases. Finally, compared to random cis polymorphisms, polymorphisms associated with RNA editing variation are located closer spatially to their respective editing sites and have a more pronounced impact on RNA secondary structure. Our study reveals widespread cis variation in RNA editing among genetically distinct individuals and sheds light on possible phenotypic consequences of such variation on complex traits and diseases.

  17. A high-throughput approach to profile RNA structure.

    PubMed

    Delli Ponti, Riccardo; Marti, Stefanie; Armaos, Alexandros; Tartaglia, Gian Gaetano

    2017-03-17

    Here we introduce the Computational Recognition of Secondary Structure (CROSS) method to calculate the structural profile of an RNA sequence (single- or double-stranded state) at single-nucleotide resolution and without sequence length restrictions. We trained CROSS using data from high-throughput experiments such as Selective 2΄-Hydroxyl Acylation analyzed by Primer Extension (SHAPE; Mouse and HIV transcriptomes) and Parallel Analysis of RNA Structure (PARS; Human and Yeast transcriptomes) as well as high-quality NMR/X-ray structures (PDB database). The algorithm uses primary structure information alone to predict experimental structural profiles with >80% accuracy, showing high performances on large RNAs such as Xist (17 900 nucleotides; Area Under the ROC Curve AUC of 0.75 on dimethyl sulfate (DMS) experiments). We integrated CROSS in thermodynamics-based methods to predict secondary structure and observed an increase in their predictive power by up to 30%. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  18. Structural, Functional, and Genetic Analysis of Sorangicin Inhibition of Bacterial RNA Polymerase

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Campbell,E.; Pavlova, O.; Zenkin, N.

    2005-01-01

    A combined structural, functional, and genetic approach was used to investigate inhibition of bacterial RNA polymerase (RNAP) by sorangicin (Sor), a macrolide polyether antibiotic. Sor lacks chemical and structural similarity to the ansamycin rifampicin (Rif), an RNAP inhibitor widely used to treat tuberculosis. Nevertheless, structural analysis revealed Sor binds in the same RNAP {beta} subunit pocket as Rif, with almost complete overlap of RNAP binding determinants, and functional analysis revealed that both antibiotics inhibit transcription by directly blocking the path of the elongating transcript at a length of 2-3 nucleotides. Genetic analysis indicates that Rif binding is extremely sensitive tomore » mutations expected to change the shape of the antibiotic binding pocket, while Sor is not. We suggest that conformational flexibility of Sor, in contrast to the rigid conformation of Rif, allows Sor to adapt to changes in the binding pocket. This has important implications for drug design against rapidly mutating targets.« less

  19. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Jackson, R.N.; Robinson, H.; Klauer, A. A.

    The essential RNA helicase, Mtr4, performs a critical role in RNA processing and degradation as an activator of the nuclear exosome. The molecular basis for this vital function is not understood and detailed analysis is significantly limited by the lack of structural data. In this study, we present the crystal structure of Mtr4. The structure reveals a new arch-like domain that is specific to Mtr4 and Ski2 (the cytosolic homologue of Mtr4). In vivo and in vitro analyses demonstrate that the Mtr4 arch domain is required for proper 5.8S rRNA processing, and suggest that the arch functions independently of canonicalmore » helicase activity. In addition, extensive conservation along the face of the putative RNA exit site highlights a potential interface with the exosome. These studies provide a molecular framework for understanding fundamental aspects of helicase function in exosome activation, and more broadly define the molecular architecture of Ski2-like helicases.« less

  20. Analysis of secondary structural elements in human microRNA hairpin precursors.

    PubMed

    Liu, Biao; Childs-Disney, Jessica L; Znosko, Brent M; Wang, Dan; Fallahi, Mohammad; Gallo, Steven M; Disney, Matthew D

    2016-03-01

    MicroRNAs (miRNAs) regulate gene expression by targeting complementary mRNAs for destruction or translational repression. Aberrant expression of miRNAs has been associated with various diseases including cancer, thus making them interesting therapeutic targets. The composite of secondary structural elements that comprise miRNAs could aid the design of small molecules that modulate their function. We analyzed the secondary structural elements, or motifs, present in all human miRNA hairpin precursors and compared them to highly expressed human RNAs with known structures and other RNAs from various organisms. Amongst human miRNAs, there are 3808 are unique motifs, many residing in processing sites. Further, we identified motifs in miRNAs that are not present in other highly expressed human RNAs, desirable targets for small molecules. MiRNA motifs were incorporated into a searchable database that is freely available. We also analyzed the most frequently occurring bulges and internal loops for each RNA class and found that the smallest loops possible prevail. However, the distribution of loops and the preferred closing base pairs were unique to each class. Collectively, we have completed a broad survey of motifs found in human miRNA precursors, highly expressed human RNAs, and RNAs from other organisms. Interestingly, unique motifs were identified in human miRNA processing sites, binding to which could inhibit miRNA maturation and hence function.

  1. Improved systematic tRNA gene annotation allows new insights into the evolution of mitochondrial tRNA structures and into the mechanisms of mitochondrial genome rearrangements

    PubMed Central

    Jühling, Frank; Pütz, Joern; Bernt, Matthias; Donath, Alexander; Middendorf, Martin; Florentz, Catherine; Stadler, Peter F.

    2012-01-01

    Transfer RNAs (tRNAs) are present in all types of cells as well as in organelles. tRNAs of animal mitochondria show a low level of primary sequence conservation and exhibit ‘bizarre’ secondary structures, lacking complete domains of the common cloverleaf. Such sequences are hard to detect and hence frequently missed in computational analyses and mitochondrial genome annotation. Here, we introduce an automatic annotation procedure for mitochondrial tRNA genes in Metazoa based on sequence and structural information in manually curated covariance models. The method, applied to re-annotate 1876 available metazoan mitochondrial RefSeq genomes, allows to distinguish between remaining functional genes and degrading ‘pseudogenes’, even at early stages of divergence. The subsequent analysis of a comprehensive set of mitochondrial tRNA genes gives new insights into the evolution of structures of mitochondrial tRNA sequences as well as into the mechanisms of genome rearrangements. We find frequent losses of tRNA genes concentrated in basal Metazoa, frequent independent losses of individual parts of tRNA genes, particularly in Arthropoda, and wide-spread conserved overlaps of tRNAs in opposite reading direction. Direct evidence for several recent Tandem Duplication-Random Loss events is gained, demonstrating that this mechanism has an impact on the appearance of new mitochondrial gene orders. PMID:22139921

  2. Methyltransferase That Modifies Guanine 966 of the 16 S rRNA: FUNCTIONAL IDENTIFICATION AND TERTIARY STRUCTURE*

    PubMed Central

    Lesnyak, Dmitry V.; Osipiuk, Jerzy; Skarina, Tatiana; Sergiev, Petr V.; Bogdanov, Alexey A.; Edwards, Aled; Savchenko, Alexei; Joachimiak, Andrzej; Dontsova, Olga A.

    2010-01-01

    N2-Methylguanine 966 is located in the loop of Escherichia coli 16 S rRNA helix 31, forming a part of the P-site tRNA-binding pocket. We found yhhF to be a gene encoding for m2G966 specific 16 S rRNA methyltransferase. Disruption of the yhhF gene by kanamycin resistance marker leads to a loss of modification at G966. The modification could be rescued by expression of recombinant protein from the plasmid carrying the yhhF gene. Moreover, purified m2G966 methyltransferase, in the presence of S-adenosylomethionine (AdoMet), is able to methylate 30 S ribosomal subunits that were purified from yhhF knock-out strain in vitro. The methylation is specific for G966 base of the 16 S rRNA. The m2G966 methyltransferase was crystallized, and its structure has been determined and refined to 2.05 Å. The structure closely resembles RsmC rRNA methyltransferase, specific for m2G1207 of the 16 S rRNA. Structural comparisons and analysis of the enzyme active site suggest modes for binding AdoMet and rRNA to m2G966 methyltransferase. Based on the experimental data and current nomenclature the protein expressed from the yhhF gene was renamed to RsmD. A model for interaction of RsmD with ribosome has been proposed. PMID:17189261

  3. Methyltransferase that modifies guanine 966 of the 16 S rRNA: functional identification and tertiary structure.

    PubMed

    Lesnyak, Dmitry V; Osipiuk, Jerzy; Skarina, Tatiana; Sergiev, Petr V; Bogdanov, Alexey A; Edwards, Aled; Savchenko, Alexei; Joachimiak, Andrzej; Dontsova, Olga A

    2007-02-23

    N(2)-Methylguanine 966 is located in the loop of Escherichia coli 16 S rRNA helix 31, forming a part of the P-site tRNA-binding pocket. We found yhhF to be a gene encoding for m(2)G966 specific 16 S rRNA methyltransferase. Disruption of the yhhF gene by kanamycin resistance marker leads to a loss of modification at G966. The modification could be rescued by expression of recombinant protein from the plasmid carrying the yhhF gene. Moreover, purified m(2)G966 methyltransferase, in the presence of S-adenosylomethionine (AdoMet), is able to methylate 30 S ribosomal subunits that were purified from yhhF knock-out strain in vitro. The methylation is specific for G966 base of the 16 S rRNA. The m(2)G966 methyltransferase was crystallized, and its structure has been determined and refined to 2.05A(.) The structure closely resembles RsmC rRNA methyltransferase, specific for m(2)G1207 of the 16 S rRNA. Structural comparisons and analysis of the enzyme active site suggest modes for binding AdoMet and rRNA to m(2)G966 methyltransferase. Based on the experimental data and current nomenclature the protein expressed from the yhhF gene was renamed to RsmD. A model for interaction of RsmD with ribosome has been proposed.

  4. Chemical and structural characterization of a model Post-Termination Complex (PoTC) for the ribosome recycling reaction: Evidence for the release of the mRNA by RRF and EF-G

    PubMed Central

    Iwakura, Nobuhiro; Yokoyama, Takeshi; Quaglia, Fabio; Mitsuoka, Kaoru; Mio, Kazuhiro; Shigematsu, Hideki; Shirouzu, Mikako; Kaji, Akira; Kaji, Hideko

    2017-01-01

    A model Post-Termination Complex (PoTC) used for the discovery of Ribosome Recycling Factor (RRF) was purified and characterized by cryo-electron microscopic analysis and biochemical methods. We established that the model PoTC has mostly one tRNA, at the P/E or P/P position, together with one mRNA. The structural studies were supported by the biochemical measurement of bound tRNA and mRNA. Using this substrate, we establish that the release of tRNA, release of mRNA and splitting of ribosomal subunits occur during the recycling reaction. Order of these events is tRNA release first followed by mRNA release and splitting almost simultaneously. Moreover, we demonstrate that IF3 is not involved in any of the recycling reactions but simply prevents the re-association of split ribosomal subunits. Our finding demonstrates that the important function of RRF includes the release of mRNA, which is often missed by the use of a short ORF with the Shine-Dalgarno sequence near the termination site. PMID:28542628

  5. Lost in folding space? Comparing four variants of the thermodynamic model for RNA secondary structure prediction.

    PubMed

    Janssen, Stefan; Schudoma, Christian; Steger, Gerhard; Giegerich, Robert

    2011-11-03

    Many bioinformatics tools for RNA secondary structure analysis are based on a thermodynamic model of RNA folding. They predict a single, "optimal" structure by free energy minimization, they enumerate near-optimal structures, they compute base pair probabilities and dot plots, representative structures of different abstract shapes, or Boltzmann probabilities of structures and shapes. Although all programs refer to the same physical model, they implement it with considerable variation for different tasks, and little is known about the effects of heuristic assumptions and model simplifications used by the programs on the outcome of the analysis. We extract four different models of the thermodynamic folding space which underlie the programs RNAFOLD, RNASHAPES, and RNASUBOPT. Their differences lie within the details of the energy model and the granularity of the folding space. We implement probabilistic shape analysis for all models, and introduce the shape probability shift as a robust measure of model similarity. Using four data sets derived from experimentally solved structures, we provide a quantitative evaluation of the model differences. We find that search space granularity affects the computed shape probabilities less than the over- or underapproximation of free energy by a simplified energy model. Still, the approximations perform similar enough to implementations of the full model to justify their continued use in settings where computational constraints call for simpler algorithms. On the side, we observe that the rarely used level 2 shapes, which predict the complete arrangement of helices, multiloops, internal loops and bulges, include the "true" shape in a rather small number of predicted high probability shapes. This calls for an investigation of new strategies to extract high probability members from the (very large) level 2 shape space of an RNA sequence. We provide implementations of all four models, written in a declarative style that makes them easy to be modified. Based on our study, future work on thermodynamic RNA folding may make a choice of model based on our empirical data. It can take our implementations as a starting point for further program development.

  6. Structural studies of bean pod mottle virus, capsid, and RNA in crystal and solution states by laser Raman spectroscopy

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Li, Tiansheng; Thomas, G.J. Jr.; Chen, Zhongguo

    Structures of protein and RNA components of bean pod mottle virus (BPMV) have been investigated by use of laser Raman spectroscopy. Raman spectra were collected from both aqueous solutions and single crystals of BPMV capsids (top component) and virions (middle and bottom components, which package, respectively, small and large RNA molecules). Analysis of the data permits the assignment of conformation-sensitive Raman bands to viral protein and RNA constituents and observation of structural similarities and differences between solution and crystalline states of BPMV components. The Raman results show that the protein subunits of the empty capsid contain between 45% and 55%more » {beta}-strand and {beta}-turn secondary structure, in agreement with the recently determined X-ray crystal structure, and that this total {beta}-strand content undergoes a small increase with packaging of RNA. A comparison of Raman spectra of crystal and solution states of the BPMV middle component reveals only minor structural differences between the two, and these are restricted almost exclusively to Raman bands of RNA in the region of assigned phosphodiester conformation markers. Although in both the crystal and solution only C3{prime} endo/anti nucleosides are detected, the crystal exhibits a weaker 813-cm{sup {minus}1} band and strong 870-cm{sup {minus}1} band, which suggests that {approximately}8% of the nucleotides have O-P-O torsions configured differently in the crystal from that in the solution.« less

  7. The structure of Zika virus NS5 reveals a conserved domain conformation

    DOE PAGES

    Wang, Boxiao; Tan, Xiao -Feng; Thurmond, Stephanie; ...

    2017-03-27

    The recent outbreak of Zika virus (ZIKV) has imposed a serious threat to public health. Here we report the crystal structure of the ZIKV NS5 protein in complex with S-adenosyl-L-homocysteine, in which the tandem methyltransferase (MTase) and RNA-dependent RNA polymerase (RdRp) domains stack into one of the two alternative conformations of flavivirus NS5 proteins. In conclusion, the activity of this NS5 protein is verified through a de novo RdRp assay on a subgenomic ZIKV RNA template. Importantly, our structural analysis leads to the identification of a potential drug-binding site of ZIKV NS5, which might facilitate the development of novel antiviralsmore » for ZIKV.« less

  8. Secondary RNA structure and its role in RNA interference to silence the respiratory syncytial virus fusion protein gene.

    PubMed

    Vig, Komal; Lewis, Nuruddeen; Moore, Eddie G; Pillai, Shreekumar; Dennis, Vida A; Singh, Shree R

    2009-11-01

    RNA interference (RNAi) is a post-transcriptional, gene silencing mechanism which uses small interfering RNA molecules (siRNA) for gene silencing. Respiratory Syncytial Virus (RSV) is an important respiratory pathogen of medical significance that causes high mortality in infants. The fusion (F) protein of RSV is a good target for therapeutic purposes as it is primarily responsible for penetration of the virus into host cells and subsequent syncytium formation during infection. In the present study, four siRNAs were designed and used individually as well as a mixture, to silence the RSV F gene. The relationship between siRNA design, target RNA structure, and their thermodynamics was also investigated. Silencing of F gene was observed using indirect immunofluorescence, western blot, reverse transcription PCR, and progeny viral titers. Our results show F gene silencing by all the four siRNAs individually and collectively. RT-PCR analysis revealed a decrease in mRNA level which corresponded to decreased F protein expression. siRNAs also inhibited RSV progeny as shown by viral titer estimation on infected HEp-2 cells. The present study demonstrates the silencing of the F gene using siRNA. Thermodynamic characteristics of the target RSV mRNA and siRNA seem to play an important role in siRNA gene silencing efficiency.

  9. A mRNA and cognate microRNAs localize in the nucleolus.

    PubMed

    Reyes-Gutierrez, Pablo; Ritland Politz, Joan C; Pederson, Thoru

    2014-01-01

    We previously discovered that a set of 5 microRNAs are concentrated in the nucleolus of rat myoblasts. We now report that several mRNAs are also localized in the nucleoli of these cells as determined by microarray analysis of RNA from purified nucleoli. Among the most abundant of these nucleolus-localized mRNAs is that encoding insulin-like growth factor 2 (IGF2), a regulator of myoblast proliferation and differentiation. The presence of IGF2 mRNA in nucleoli was confirmed by fluorescence in situ hybridization, and RT-PCR experiments demonstrated that these nucleolar transcripts are spliced, thus arriving from the nucleoplasm. Bioinformatics analysis predicted canonically structured, highly thermodynamically stable interactions between IGF2 mRNA and all 5 of the nucleolus-localized microRNAs. These results raise the possibility that the nucleolus is a staging site for setting up particular mRNA-microRNA interactions prior to export to the cytoplasm.

  10. Assessing the 5S ribosomal RNA heterogeneity in Arabidopsis thaliana using short RNA next generation sequencing data.

    PubMed

    Szymanski, Maciej; Karlowski, Wojciech M

    2016-01-01

    In eukaryotes, ribosomal 5S rRNAs are products of multigene families organized within clusters of tandemly repeated units. Accumulation of genomic data obtained from a variety of organisms demonstrated that the potential 5S rRNA coding sequences show a large number of variants, often incompatible with folding into a correct secondary structure. Here, we present results of an analysis of a large set of short RNA sequences generated by the next generation sequencing techniques, to address the problem of heterogeneity of the 5S rRNA transcripts in Arabidopsis and identification of potentially functional rRNA-derived fragments.

  11. Heart structure-specific transcriptomic atlas reveals conserved microRNA-mRNA interactions.

    PubMed

    Vacchi-Suzzi, Caterina; Hahne, Florian; Scheubel, Philippe; Marcellin, Magali; Dubost, Valerie; Westphal, Magdalena; Boeglen, Catherine; Büchmann-Møller, Stine; Cheung, Ming Sin; Cordier, André; De Benedetto, Christopher; Deurinck, Mark; Frei, Moritz; Moulin, Pierre; Oakeley, Edward; Grenet, Olivier; Grevot, Armelle; Stull, Robert; Theil, Diethilde; Moggs, Jonathan G; Marrer, Estelle; Couttet, Philippe

    2013-01-01

    MicroRNAs are short non-coding RNAs that regulate gene expression at the post-transcriptional level and play key roles in heart development and cardiovascular diseases. Here, we have characterized the expression and distribution of microRNAs across eight cardiac structures (left and right ventricles, apex, papillary muscle, septum, left and right atrium and valves) in rat, Beagle dog and cynomolgus monkey using microRNA sequencing. Conserved microRNA signatures enriched in specific heart structures across these species were identified for cardiac valve (miR-let-7c, miR-125b, miR-127, miR-199a-3p, miR-204, miR-320, miR-99b, miR-328 and miR-744) and myocardium (miR-1, miR-133b, miR-133a, miR-208b, miR-30e, miR-499-5p, miR-30e*). The relative abundance of myocardium-enriched (miR-1) and valve-enriched (miR-125b-5p and miR-204) microRNAs was confirmed using in situ hybridization. MicroRNA-mRNA interactions potentially relevant for cardiac functions were explored using anti-correlation expression analysis and microRNA target prediction algorithms. Interactions between miR-1/Timp3, miR-125b/Rbm24, miR-204/Tgfbr2 and miR-208b/Csnk2a2 were identified and experimentally investigated in human pulmonary smooth muscle cells and luciferase reporter assays. In conclusion, we have generated a high-resolution heart structure-specific mRNA/microRNA expression atlas for three mammalian species that provides a novel resource for investigating novel microRNA regulatory circuits involved in cardiac molecular physiopathology.

  12. Molecular insights into the specific recognition between the RNA binding domain qRRM2 of hnRNP F and G-tract RNA: A molecular dynamics study.

    PubMed

    Wang, Lingyun; Yan, Feng

    2017-12-09

    Heterogeneous nuclear ribonucleoprotein F (hnRNP F) controls the expression of various genes through regulating the alternative splicing of pre-mRNAs in the nucleus. It uses three quasi-RNA recognition motifs (qRRMs) to recognize G-tract RNA which contains at least three consecutive guanines. The structures containing qRRMs of hnRNP F in complex with G-tract RNA have been determined by nuclear magnetic resonance (NMR) spectroscopy, shedding light on the recognition mechanism of qRRMs with G-tract RNA. However, knowledge of the recognition details is still lacking. To investigate how qRRMs specifically bind with G-tract RNA and how the mutations of any guanine to an adenine in the G-tract affect the binding, molecular dynamics simulations with binding free energy analysis were performed based on the NMR structure of qRRM2 in complex with G-tract RNA. Simulation results demonstrate that qRRM2 binds strongly with G-tract RNA, but any mutation of the G-tract leads to a drastic reduction of the binding free energy. Further comparisons of the energetic components reveal that van der Waals and non-polar interactions play essential roles in the binding between qRRM2 and G-tract RNA, but the interactions are weakened by the effect of RNA mutations. Structural and dynamical analyses indicate that when qRRM2 binds with G-tract RNA, both qRRM2 and G-tract maintain stabilized structures and dynamics; however, the stability is disrupted by the mutations of the G-tract. These results provide novel insights into the recognition mechanism of qRRM2 with G-tract RNA that are not elucidated by the NMR technique. Copyright © 2017 Elsevier Inc. All rights reserved.

  13. Single-molecule FRET reveals a corkscrew RNA structure for the polymerase-bound influenza virus promoter.

    PubMed

    Tomescu, Alexandra I; Robb, Nicole C; Hengrung, Narin; Fodor, Ervin; Kapanidis, Achillefs N

    2014-08-12

    The influenza virus is a major human and animal pathogen responsible for seasonal epidemics and occasional pandemics. The genome of the influenza A virus comprises eight segments of single-stranded, negative-sense RNA with highly conserved 5' and 3' termini. These termini interact to form a double-stranded promoter structure that is recognized and bound by the viral RNA-dependent RNA polymerase (RNAP); however, no 3D structural information for the influenza polymerase-bound promoter exists. Functional studies have led to the proposal of several 2D models for the secondary structure of the bound promoter, including a corkscrew model in which the 5' and 3' termini form short hairpins. We have taken advantage of an insect-cell system to prepare large amounts of active recombinant influenza virus RNAP, and used this to develop a highly sensitive single-molecule FRET assay to measure distances between fluorescent dyes located on the promoter and map its structure both with and without the polymerase bound. These advances enabled the direct analysis of the influenza promoter structure in complex with the viral RNAP, and provided 3D structural information that is in agreement with the corkscrew model for the influenza virus promoter RNA. Our data provide insights into the mechanisms of promoter binding by the influenza RNAP and have implications for the understanding of the regulatory mechanisms involved in the transcription of viral genes and replication of the viral RNA genome. In addition, the simplicity of this system should translate readily to the study of any virus polymerase-promoter interaction.

  14. Single-molecule FRET reveals a corkscrew RNA structure for the polymerase-bound influenza virus promoter

    PubMed Central

    Tomescu, Alexandra I.; Robb, Nicole C.; Hengrung, Narin; Fodor, Ervin; Kapanidis, Achillefs N.

    2014-01-01

    The influenza virus is a major human and animal pathogen responsible for seasonal epidemics and occasional pandemics. The genome of the influenza A virus comprises eight segments of single-stranded, negative-sense RNA with highly conserved 5′ and 3′ termini. These termini interact to form a double-stranded promoter structure that is recognized and bound by the viral RNA-dependent RNA polymerase (RNAP); however, no 3D structural information for the influenza polymerase-bound promoter exists. Functional studies have led to the proposal of several 2D models for the secondary structure of the bound promoter, including a corkscrew model in which the 5′ and 3′ termini form short hairpins. We have taken advantage of an insect-cell system to prepare large amounts of active recombinant influenza virus RNAP, and used this to develop a highly sensitive single-molecule FRET assay to measure distances between fluorescent dyes located on the promoter and map its structure both with and without the polymerase bound. These advances enabled the direct analysis of the influenza promoter structure in complex with the viral RNAP, and provided 3D structural information that is in agreement with the corkscrew model for the influenza virus promoter RNA. Our data provide insights into the mechanisms of promoter binding by the influenza RNAP and have implications for the understanding of the regulatory mechanisms involved in the transcription of viral genes and replication of the viral RNA genome. In addition, the simplicity of this system should translate readily to the study of any virus polymerase–promoter interaction. PMID:25071209

  15. Sparse RNA folding revisited: space-efficient minimum free energy structure prediction.

    PubMed

    Will, Sebastian; Jabbari, Hosna

    2016-01-01

    RNA secondary structure prediction by energy minimization is the central computational tool for the analysis of structural non-coding RNAs and their interactions. Sparsification has been successfully applied to improve the time efficiency of various structure prediction algorithms while guaranteeing the same result; however, for many such folding problems, space efficiency is of even greater concern, particularly for long RNA sequences. So far, space-efficient sparsified RNA folding with fold reconstruction was solved only for simple base-pair-based pseudo-energy models. Here, we revisit the problem of space-efficient free energy minimization. Whereas the space-efficient minimization of the free energy has been sketched before, the reconstruction of the optimum structure has not even been discussed. We show that this reconstruction is not possible in trivial extension of the method for simple energy models. Then, we present the time- and space-efficient sparsified free energy minimization algorithm SparseMFEFold that guarantees MFE structure prediction. In particular, this novel algorithm provides efficient fold reconstruction based on dynamically garbage-collected trace arrows. The complexity of our algorithm depends on two parameters, the number of candidates Z and the number of trace arrows T; both are bounded by [Formula: see text], but are typically much smaller. The time complexity of RNA folding is reduced from [Formula: see text] to [Formula: see text]; the space complexity, from [Formula: see text] to [Formula: see text]. Our empirical results show more than 80 % space savings over RNAfold [Vienna RNA package] on the long RNAs from the RNA STRAND database (≥2500 bases). The presented technique is intentionally generalizable to complex prediction algorithms; due to their high space demands, algorithms like pseudoknot prediction and RNA-RNA-interaction prediction are expected to profit even stronger than "standard" MFE folding. SparseMFEFold is free software, available at http://www.bioinf.uni-leipzig.de/~will/Software/SparseMFEFold.

  16. Genome-Wide Comparative In Silico Analysis of the RNA Helicase Gene Family in Zea mays and Glycine max: A Comparison with Arabidopsis and Oryza sativa

    PubMed Central

    Huang, Jinguang; Zheng, Chengchao

    2013-01-01

    RNA helicases are enzymes that are thought to unwind double-stranded RNA molecules in an energy-dependent fashion through the hydrolysis of NTP. RNA helicases are associated with all processes involving RNA molecules, including nuclear transcription, editing, splicing, ribosome biogenesis, RNA export, and organelle gene expression. The involvement of RNA helicase in response to stress and in plant growth and development has been reported previously. While their importance in Arabidopsis and Oryza sativa has been partially studied, the function of RNA helicase proteins is poorly understood in Zea mays and Glycine max. In this study, we identified a total of RNA helicase genes in Arabidopsis and other crop species genome by genome-wide comparative in silico analysis. We classified the RNA helicase genes into three subfamilies according to the structural features of the motif II region, such as DEAD-box, DEAH-box and DExD/H-box, and different species showed different patterns of alternative splicing. Secondly, chromosome location analysis showed that the RNA helicase protein genes were distributed across all chromosomes with different densities in the four species. Thirdly, phylogenetic tree analyses identified the relevant homologs of DEAD-box, DEAH-box and DExD/H-box RNA helicase proteins in each of the four species. Fourthly, microarray expression data showed that many of these predicted RNA helicase genes were expressed in different developmental stages and different tissues under normal growth conditions. Finally, real-time quantitative PCR analysis showed that the expression levels of 10 genes in Arabidopsis and 13 genes in Zea mays were in close agreement with the microarray expression data. To our knowledge, this is the first report of a comparative genome-wide analysis of the RNA helicase gene family in Arabidopsis, Oryza sativa, Zea mays and Glycine max. This study provides valuable information for understanding the classification and putative functions of the RNA helicase gene family in crop growth and development. PMID:24265739

  17. Genome-wide comparative in silico analysis of the RNA helicase gene family in Zea mays and Glycine max: a comparison with Arabidopsis and Oryza sativa.

    PubMed

    Xu, Ruirui; Zhang, Shizhong; Huang, Jinguang; Zheng, Chengchao

    2013-01-01

    RNA helicases are enzymes that are thought to unwind double-stranded RNA molecules in an energy-dependent fashion through the hydrolysis of NTP. RNA helicases are associated with all processes involving RNA molecules, including nuclear transcription, editing, splicing, ribosome biogenesis, RNA export, and organelle gene expression. The involvement of RNA helicase in response to stress and in plant growth and development has been reported previously. While their importance in Arabidopsis and Oryza sativa has been partially studied, the function of RNA helicase proteins is poorly understood in Zea mays and Glycine max. In this study, we identified a total of RNA helicase genes in Arabidopsis and other crop species genome by genome-wide comparative in silico analysis. We classified the RNA helicase genes into three subfamilies according to the structural features of the motif II region, such as DEAD-box, DEAH-box and DExD/H-box, and different species showed different patterns of alternative splicing. Secondly, chromosome location analysis showed that the RNA helicase protein genes were distributed across all chromosomes with different densities in the four species. Thirdly, phylogenetic tree analyses identified the relevant homologs of DEAD-box, DEAH-box and DExD/H-box RNA helicase proteins in each of the four species. Fourthly, microarray expression data showed that many of these predicted RNA helicase genes were expressed in different developmental stages and different tissues under normal growth conditions. Finally, real-time quantitative PCR analysis showed that the expression levels of 10 genes in Arabidopsis and 13 genes in Zea mays were in close agreement with the microarray expression data. To our knowledge, this is the first report of a comparative genome-wide analysis of the RNA helicase gene family in Arabidopsis, Oryza sativa, Zea mays and Glycine max. This study provides valuable information for understanding the classification and putative functions of the RNA helicase gene family in crop growth and development.

  18. A dual-specific Glu-tRNA(Gln) and Asp-tRNA(Asn) amidotransferase is involved in decoding glutamine and asparagine codons in Acidithiobacillus ferrooxidans.

    PubMed

    Salazar, J C; Zúñiga, R; Raczniak, G; Becker, H; Söll, D; Orellana, O

    2001-07-06

    The gatC, gatA and gatB genes encoding the three subunits of glutamyl-tRNA(Gln) amidotransferase from Acidithiobacillus ferrooxidans, an acidophilic bacterium used in bioleaching of minerals, have been cloned and expressed in Escherichia coli. As in Bacillus subtilis the three gat genes are organized in an operon-like structure in A. ferrooxidans. The heterologously overexpressed enzyme converts Glu-tRNA(Gln) to Gln-tRNA(Gln) and Asp-tRNA(Asn) to Asn-tRNA(Asn). Biochemical analysis revealed that neither glutaminyl-tRNA synthetase nor asparaginyl-tRNA synthetase is present in A. ferrooxidans, but that glutamyl-tRNA synthetase and aspartyl-tRNA synthetase enzymes are present in the organism. These data suggest that the transamidation pathway is responsible for the formation of Gln-tRNA and Asn-tRNA in A. ferrooxidans.

  19. Overlapping local and long-range RNA-RNA interactions modulate dengue virus genome cyclization and replication.

    PubMed

    de Borba, Luana; Villordo, Sergio M; Iglesias, Nestor G; Filomatori, Claudia V; Gebhard, Leopoldo G; Gamarnik, Andrea V

    2015-03-01

    The dengue virus genome is a dynamic molecule that adopts different conformations in the infected cell. Here, using RNA folding predictions, chemical probing analysis, RNA binding assays, and functional studies, we identified new cis-acting elements present in the capsid coding sequence that facilitate cyclization of the viral RNA by hybridization with a sequence involved in a local dumbbell structure at the viral 3' untranslated region (UTR). The identified interaction differentially enhances viral replication in mosquito and mammalian cells. Copyright © 2015, American Society for Microbiology. All Rights Reserved.

  20. psRNATarget: a plant small RNA target analysis server

    PubMed Central

    Dai, Xinbin; Zhao, Patrick Xuechun

    2011-01-01

    Plant endogenous non-coding short small RNAs (20–24 nt), including microRNAs (miRNAs) and a subset of small interfering RNAs (ta-siRNAs), play important role in gene expression regulatory networks (GRNs). For example, many transcription factors and development-related genes have been reported as targets of these regulatory small RNAs. Although a number of miRNA target prediction algorithms and programs have been developed, most of them were designed for animal miRNAs which are significantly different from plant miRNAs in the target recognition process. These differences demand the development of separate plant miRNA (and ta-siRNA) target analysis tool(s). We present psRNATarget, a plant small RNA target analysis server, which features two important analysis functions: (i) reverse complementary matching between small RNA and target transcript using a proven scoring schema, and (ii) target-site accessibility evaluation by calculating unpaired energy (UPE) required to ‘open’ secondary structure around small RNA’s target site on mRNA. The psRNATarget incorporates recent discoveries in plant miRNA target recognition, e.g. it distinguishes translational and post-transcriptional inhibition, and it reports the number of small RNA/target site pairs that may affect small RNA binding activity to target transcript. The psRNATarget server is designed for high-throughput analysis of next-generation data with an efficient distributed computing back-end pipeline that runs on a Linux cluster. The server front-end integrates three simplified user-friendly interfaces to accept user-submitted or preloaded small RNAs and transcript sequences; and outputs a comprehensive list of small RNA/target pairs along with the online tools for batch downloading, key word searching and results sorting. The psRNATarget server is freely available at http://plantgrn.noble.org/psRNATarget/. PMID:21622958

  1. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wang, Boxiao; Tan, Xiao -Feng; Thurmond, Stephanie

    The recent outbreak of Zika virus (ZIKV) has imposed a serious threat to public health. Here we report the crystal structure of the ZIKV NS5 protein in complex with S-adenosyl-L-homocysteine, in which the tandem methyltransferase (MTase) and RNA-dependent RNA polymerase (RdRp) domains stack into one of the two alternative conformations of flavivirus NS5 proteins. In conclusion, the activity of this NS5 protein is verified through a de novo RdRp assay on a subgenomic ZIKV RNA template. Importantly, our structural analysis leads to the identification of a potential drug-binding site of ZIKV NS5, which might facilitate the development of novel antiviralsmore » for ZIKV.« less

  2. Identification of 15 candidate structured noncoding RNA motifs in fungi by comparative genomics.

    PubMed

    Li, Sanshu; Breaker, Ronald R

    2017-10-13

    With the development of rapid and inexpensive DNA sequencing, the genome sequences of more than 100 fungal species have been made available. This dataset provides an excellent resource for comparative genomics analyses, which can be used to discover genetic elements, including noncoding RNAs (ncRNAs). Bioinformatics tools similar to those used to uncover novel ncRNAs in bacteria, likewise, should be useful for searching fungal genomic sequences, and the relative ease of genetic experiments with some model fungal species could facilitate experimental validation studies. We have adapted a bioinformatics pipeline for discovering bacterial ncRNAs to systematically analyze many fungal genomes. This comparative genomics pipeline integrates information on conserved RNA sequence and structural features with alternative splicing information to reveal fungal RNA motifs that are candidate regulatory domains, or that might have other possible functions. A total of 15 prominent classes of structured ncRNA candidates were identified, including variant HDV self-cleaving ribozyme representatives, atypical snoRNA candidates, and possible structured antisense RNA motifs. Candidate regulatory motifs were also found associated with genes for ribosomal proteins, S-adenosylmethionine decarboxylase (SDC), amidase, and HexA protein involved in Woronin body formation. We experimentally confirm that the variant HDV ribozymes undergo rapid self-cleavage, and we demonstrate that the SDC RNA motif reduces the expression of SAM decarboxylase by translational repression. Furthermore, we provide evidence that several other motifs discovered in this study are likely to be functional ncRNA elements. Systematic screening of fungal genomes using a computational discovery pipeline has revealed the existence of a variety of novel structured ncRNAs. Genome contexts and similarities to known ncRNA motifs provide strong evidence for the biological and biochemical functions of some newly found ncRNA motifs. Although initial examinations of several motifs provide evidence for their likely functions, other motifs will require more in-depth analysis to reveal their functions.

  3. Protein functional features are reflected in the patterns of mRNA translation speed.

    PubMed

    López, Daniel; Pazos, Florencio

    2015-07-09

    The degeneracy of the genetic code makes it possible for the same amino acid string to be coded by different messenger RNA (mRNA) sequences. These "synonymous mRNAs" may differ largely in a number of aspects related to their overall translational efficiency, such as secondary structure content and availability of the encoded transfer RNAs (tRNAs). Consequently, they may render different yields of the translated polypeptides. These mRNA features related to translation efficiency are also playing a role locally, resulting in a non-uniform translation speed along the mRNA, which has been previously related to some protein structural features and also used to explain some dramatic effects of "silent" single-nucleotide-polymorphisms (SNPs). In this work we perform the first large scale analysis of the relationship between three experimental proxies of mRNA local translation efficiency and the local features of the corresponding encoded proteins. We found that a number of protein functional and structural features are reflected in the patterns of ribosome occupancy, secondary structure and tRNA availability along the mRNA. One or more of these proxies of translation speed have distinctive patterns around the mRNA regions coding for certain protein local features. In some cases the three patterns follow a similar trend. We also show specific examples where these patterns of translation speed point to the protein's important structural and functional features. This support the idea that the genome not only codes the protein functional features as sequences of amino acids, but also as subtle patterns of mRNA properties which, probably through local effects on the translation speed, have some consequence on the final polypeptide. These results open the possibility of predicting a protein's functional regions based on a single genomic sequence, and have implications for heterologous protein expression and fine-tuning protein function.

  4. Three-dimensional tertiary structure of yeast phenylalanine transfer RNA

    NASA Technical Reports Server (NTRS)

    Kim, S. H.; Sussman, J. L.; Suddath, F. L.; Quigley, G. J.; Mcpherson, A.; Wang, A. H. J.; Seeman, N. C.; Rich, A.

    1974-01-01

    Results of an analysis and interpretation of a 3-A electron density map of yeast phenylalanine transfer RNA. Some earlier detailed assignments of nucleotide residues to electron density peaks are found to be in error, even though the overall tracing of the backbone conformation of yeast phenylalanine transfer RNA was generally correct. A new, more comprehensive interpretation is made which makes it possible to define the tertiary interactions in the molecule. The new interpretation makes it possible to visualize a number of tertiary interactions which not only explain the structural role of most of the bases which are constant in transfer RNAs, but also makes it possible to understand in a direct and simple fashion the chemical modification data on transfer RNA. In addition, this pattern of tertiary interactions provides a basis for understanding the general three-dimensional folding of all transfer RNA molecules.

  5. Exploring RNA structure and dynamics through enhanced sampling simulations.

    PubMed

    Mlýnský, Vojtěch; Bussi, Giovanni

    2018-04-01

    RNA function is intimately related to its structural dynamics. Molecular dynamics simulations are useful for exploring biomolecular flexibility but are severely limited by the accessible timescale. Enhanced sampling methods allow this timescale to be effectively extended in order to probe biologically relevant conformational changes and chemical reactions. Here, we review the role of enhanced sampling techniques in the study of RNA systems. We discuss the challenges and promises associated with the application of these methods to force-field validation, exploration of conformational landscapes and ion/ligand-RNA interactions, as well as catalytic pathways. Important technical aspects of these methods, such as the choice of the biased collective variables and the analysis of multi-replica simulations, are examined in detail. Finally, a perspective on the role of these methods in the characterization of RNA dynamics is provided. Copyright © 2018 Elsevier Ltd. All rights reserved.

  6. DNA recognition by an RNA-guided bacterial Argonaute

    PubMed Central

    Doudna, Jennifer A.

    2017-01-01

    Argonaute (Ago) proteins are widespread in prokaryotes and eukaryotes and share a four-domain architecture capable of RNA- or DNA-guided nucleic acid recognition. Previous studies identified a prokaryotic Argonaute protein from the eubacterium Marinitoga piezophila (MpAgo), which binds preferentially to 5′-hydroxylated guide RNAs and cleaves single-stranded RNA (ssRNA) and DNA (ssDNA) targets. Here we present a 3.2 Å resolution crystal structure of MpAgo bound to a 21-nucleotide RNA guide and a complementary 21-nucleotide ssDNA substrate. Comparison of this ternary complex to other target-bound Argonaute structures reveals a unique orientation of the N-terminal domain, resulting in a straight helical axis of the entire RNA-DNA heteroduplex through the central cleft of the protein. Additionally, mismatches introduced into the heteroduplex reduce MpAgo cleavage efficiency with a symmetric profile centered around the middle of the helix. This pattern differs from the canonical mismatch tolerance of other Argonautes, which display decreased cleavage efficiency for substrates bearing sequence mismatches to the 5′ region of the guide strand. This structural analysis of MpAgo bound to a hybrid helix advances our understanding of the diversity of target recognition mechanisms by Argonaute proteins. PMID:28520746

  7. Unraveling the Web of Viroinformatics: Computational Tools and Databases in Virus Research

    PubMed Central

    Priyadarshini, Pragya; Vrati, Sudhanshu

    2014-01-01

    The beginning of the second century of research in the field of virology (the first virus was discovered in 1898) was marked by its amalgamation with bioinformatics, resulting in the birth of a new domain—viroinformatics. The availability of more than 100 Web servers and databases embracing all or specific viruses (for example, dengue virus, influenza virus, hepatitis virus, human immunodeficiency virus [HIV], hemorrhagic fever virus [HFV], human papillomavirus [HPV], West Nile virus, etc.) as well as distinct applications (comparative/diversity analysis, viral recombination, small interfering RNA [siRNA]/short hairpin RNA [shRNA]/microRNA [miRNA] studies, RNA folding, protein-protein interaction, structural analysis, and phylotyping and genotyping) will definitely aid the development of effective drugs and vaccines. However, information about their access and utility is not available at any single source or on any single platform. Therefore, a compendium of various computational tools and resources dedicated specifically to virology is presented in this article. PMID:25428870

  8. Unraveling the web of viroinformatics: computational tools and databases in virus research.

    PubMed

    Sharma, Deepak; Priyadarshini, Pragya; Vrati, Sudhanshu

    2015-02-01

    The beginning of the second century of research in the field of virology (the first virus was discovered in 1898) was marked by its amalgamation with bioinformatics, resulting in the birth of a new domain--viroinformatics. The availability of more than 100 Web servers and databases embracing all or specific viruses (for example, dengue virus, influenza virus, hepatitis virus, human immunodeficiency virus [HIV], hemorrhagic fever virus [HFV], human papillomavirus [HPV], West Nile virus, etc.) as well as distinct applications (comparative/diversity analysis, viral recombination, small interfering RNA [siRNA]/short hairpin RNA [shRNA]/microRNA [miRNA] studies, RNA folding, protein-protein interaction, structural analysis, and phylotyping and genotyping) will definitely aid the development of effective drugs and vaccines. However, information about their access and utility is not available at any single source or on any single platform. Therefore, a compendium of various computational tools and resources dedicated specifically to virology is presented in this article. Copyright © 2015, American Society for Microbiology. All Rights Reserved.

  9. Crystallographic and Computational Analyses of AUUCU Repeating RNA That Causes Spinocerebellar Ataxia Type 10 (SCA10).

    PubMed

    Park, HaJeung; González, Àlex L; Yildirim, Ilyas; Tran, Tuan; Lohman, Jeremy R; Fang, Pengfei; Guo, Min; Disney, Matthew D

    2015-06-23

    Spinocerebellar ataxia type 10 (SCA10) is caused by a pentanucleotide repeat expansion of r(AUUCU) within intron 9 of the ATXN10 pre-mRNA. The RNA causes disease by a gain-of-function mechanism in which it inactivates proteins involved in RNA biogenesis. Spectroscopic studies showed that r(AUUCU) repeats form a hairpin structure; however, there were no high-resolution structural models prior to this work. Herein, we report the first crystal structure of model r(AUUCU) repeats refined to 2.8 Å and analysis of the structure via molecular dynamics simulations. The r(AUUCU) tracts adopt an overall A-form geometry in which 3 × 3 nucleotide (5')UCU(3')/(3')UCU(5') internal loops are closed by AU pairs. Helical parameters of the refined structure as well as the corresponding electron density map on the crystallographic model reflect dynamic features of the internal loop. The computational analyses captured dynamic motion of the loop closing pairs, which can form single-stranded conformations with relatively low energies. Overall, the results presented here suggest the possibility for r(AUUCU) repeats to form metastable A-from structures, which can rearrange into single-stranded conformations and attract proteins such as heterogeneous nuclear ribonucleoprotein K (hnRNP K). The information presented here may aid in the rational design of therapeutics targeting this RNA.

  10. Crystallographic and Computational Analyses of AUUCU Repeating RNA That Causes Spinocerebellar Ataxia Type 10 (SCA10)

    PubMed Central

    Park, HaJeung; González, Àlex L.; Yildirim, Ilyas; Tran, Tuan; Lohman, Jeremy R.; Fang, Pengfei; Guo, Min; Disney, Matthew D.

    2016-01-01

    Spinocerebellar ataxia type 10 (SCA10) is caused by a pentanucleotide repeat expansion of r(AUUCU) within intron 9 of the ATXN10 pre-mRNA. The RNA causes disease by a gain-of-function mechanism in which it inactivates proteins involved in RNA biogenesis. Spectroscopic studies showed that r(AUUCU) repeats form a hairpin structure; however, there were no high-resolution structural models prior to this work. Herein, we report the first crystal structure of model r(AUUCU) repeats refined to 2.8 Å and analysis of the structure via molecular dynamics simulations. The r(AUUCU) tracts adopt an overall A-form geometry in which 3 × 3 nucleotide 5′UCU3′/3′UCU5′ internal loops are closed by AU pairs. Helical parameters of the refined structure as well as the corresponding electron density map on the crystallographic model reflect dynamic features of the internal loop. The computational analyses captured dynamic motion of the loop closing pairs, which can form single-stranded conformations with relatively low energies. Overall, the results presented here suggest the possibility for r(AUUCU) repeats to form metastable A-from structures, which can rearrange into single-stranded conformations and attract proteins such as heterogeneous nuclear ribonucleoprotein K (hnRNP K). The information presented here may aid in the rational design of therapeutics targeting this RNA. PMID:26039897

  11. Structural complexity of Dengue virus untranslated regions: cis-acting RNA motifs and pseudoknot interactions modulating functionality of the viral genome

    PubMed Central

    Sztuba-Solinska, Joanna; Teramoto, Tadahisa; Rausch, Jason W.; Shapiro, Bruce A.; Padmanabhan, Radhakrishnan; Le Grice, Stuart F. J.

    2013-01-01

    The Dengue virus (DENV) genome contains multiple cis-acting elements required for translation and replication. Previous studies indicated that a 719-nt subgenomic minigenome (DENV-MINI) is an efficient template for translation and (−) strand RNA synthesis in vitro. We performed a detailed structural analysis of DENV-MINI RNA, combining chemical acylation techniques, Pb2+ ion-induced hydrolysis and site-directed mutagenesis. Our results highlight protein-independent 5′–3′ terminal interactions involving hybridization between recognized cis-acting motifs. Probing analyses identified tandem dumbbell structures (DBs) within the 3′ terminus spaced by single-stranded regions, internal loops and hairpins with embedded GNRA-like motifs. Analysis of conserved motifs and top loops (TLs) of these dumbbells, and their proposed interactions with downstream pseudoknot (PK) regions, predicted an H-type pseudoknot involving TL1 of the 5′ DB and the complementary region, PK2. As disrupting the TL1/PK2 interaction, via ‘flipping’ mutations of PK2, previously attenuated DENV replication, this pseudoknot may participate in regulation of RNA synthesis. Computer modeling implied that this motif might function as autonomous structural/regulatory element. In addition, our studies targeting elements of the 3′ DB and its complementary region PK1 indicated that communication between 5′–3′ terminal regions strongly depends on structure and sequence composition of the 5′ cyclization region. PMID:23531545

  12. Modelling the structure of a ceRNA-theoretical, bipartite microRNA-mRNA interaction network regulating intestinal epithelial cellular pathways using R programming.

    PubMed

    Robinson, J M; Henderson, W A

    2018-01-12

    We report a method using functional-molecular databases and network modelling to identify hypothetical mRNA-miRNA interaction networks regulating intestinal epithelial barrier function. The model forms a data-analysis component of our cell culture experiments, which produce RNA expression data from Nanostring Technologies nCounter ® system. The epithelial tight-junction (TJ) and actin cytoskeleton interact as molecular components of the intestinal epithelial barrier. Upstream regulation of TJ-cytoskeleton interaction is effected by the Rac/Rock/Rho signaling pathway and other associated pathways which may be activated or suppressed by extracellular signaling from growth factors, hormones, and immune receptors. Pathway activations affect epithelial homeostasis, contributing to degradation of the epithelial barrier associated with osmotic dysregulation, inflammation, and tumor development. The complexity underlying miRNA-mRNA interaction networks represents a roadblock for prediction and validation of competing-endogenous RNA network function. We developed a network model to identify hypothetical co-regulatory motifs in a miRNA-mRNA interaction network related to epithelial function. A mRNA-miRNA interaction list was generated using KEGG and miRWalk2.0 databases. R-code was developed to quantify and visualize inherent network structures. We identified a sub-network with a high number of shared, targeting miRNAs, of genes associated with cellular proliferation and cancer, including c-MYC and Cyclin D.

  13. A structured sparse regression method for estimating isoform expression level from multi-sample RNA-seq data.

    PubMed

    Zhang, L; Liu, X J

    2016-06-03

    With the rapid development of next-generation high-throughput sequencing technology, RNA-seq has become a standard and important technique for transcriptome analysis. For multi-sample RNA-seq data, the existing expression estimation methods usually deal with each single-RNA-seq sample, and ignore that the read distributions are consistent across multiple samples. In the current study, we propose a structured sparse regression method, SSRSeq, to estimate isoform expression using multi-sample RNA-seq data. SSRSeq uses a non-parameter model to capture the general tendency of non-uniformity read distribution for all genes across multiple samples. Additionally, our method adds a structured sparse regularization, which not only incorporates the sparse specificity between a gene and its corresponding isoform expression levels, but also reduces the effects of noisy reads, especially for lowly expressed genes and isoforms. Four real datasets were used to evaluate our method on isoform expression estimation. Compared with other popular methods, SSRSeq reduced the variance between multiple samples, and produced more accurate isoform expression estimations, and thus more meaningful biological interpretations.

  14. Conserved structures formed by heterogeneous RNA sequences drive silencing of an inflammation responsive post-transcriptional operon

    PubMed Central

    Basu, Abhijit; Jain, Niyati; Tolbert, Blanton S.; Komar, Anton A.

    2017-01-01

    Abstract RNA–protein interactions with physiological outcomes usually rely on conserved sequences within the RNA element. By contrast, activity of the diverse gamma-interferon-activated inhibitor of translation (GAIT)-elements relies on the conserved RNA folding motifs rather than the conserved sequence motifs. These elements drive the translational silencing of a group of chemokine (CC/CXC) and chemokine receptor (CCR) mRNAs, thereby helping to resolve physiological inflammation. Despite sequence dissimilarity, these RNA elements adopt common secondary structures (as revealed by 2D-1H NMR spectroscopy), providing a basis for their interaction with the RNA-binding GAIT complex. However, many of these elements (e.g. those derived from CCL22, CXCL13, CCR4 and ceruloplasmin (Cp) mRNAs) have substantially different affinities for GAIT complex binding. Toeprinting analysis shows that different positions within the overall conserved GAIT element structure contribute to differential affinities of the GAIT protein complex towards the elements. Thus, heterogeneity of GAIT elements may provide hierarchical fine-tuning of the resolution of inflammation. PMID:29069516

  15. Optimized guide RNA structure for genome editing via Cas9

    PubMed Central

    Xu, Jianyong; Lian, Wei; Jia, Yuning; Li, Lingyun; Huang, Zhong

    2017-01-01

    The genome editing tool Cas9-gRNA (guide RNA) has been successfully applied in different cell types and organisms with high efficiency. However, more efforts need to be made to enhance both efficiency and specificity. In the current study, we optimized the guide RNA structure of Streptococcus pyogenes CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats)/Cas (CRISPR-associated) system to improve its genome editing efficiency. Comparing with the original functional structure of guide RNA, which is composed of crRNA and tracrRNA, the widely used chimeric gRNA has shorter crRNA and tracrRNA sequence. The deleted RNA sequence could form extra loop structure, which might enhance the stability of the guide RNA structure and subsequently the genome editing efficiency. Thus the genome editing efficiency of different forms of guide RNA was tested. And we found that the chimeric structure of gRNA with original full length of crRNA and tracrRNA showed higher genome editing efficiency than the conventional chimeric structure or other types of gRNA we tested. Therefore our data here uncovered the new type of gRNA structure with higher genome editing efficiency. PMID:29212218

  16. Computational RNomics of Drosophilids

    PubMed Central

    Rose, Dominic; Hackermüller, Jörg; Washietl, Stefan; Reiche, Kristin; Hertel, Jana; Findeiß, Sven; Stadler, Peter F; Prohaska, Sonja J

    2007-01-01

    Background Recent experimental and computational studies have provided overwhelming evidence for a plethora of diverse transcripts that are unrelated to protein-coding genes. One subclass consists of those RNAs that require distinctive secondary structure motifs to exert their biological function and hence exhibit distinctive patterns of sequence conservation characteristic for positive selection on RNA secondary structure. The deep-sequencing of 12 drosophilid species coordinated by the NHGRI provides an ideal data set of comparative computational approaches to determine those genomic loci that code for evolutionarily conserved RNA motifs. This class of loci includes the majority of the known small ncRNAs as well as structured RNA motifs in mRNAs. We report here on a genome-wide survey using RNAz. Results We obtain 16 000 high quality predictions among which we recover the majority of the known ncRNAs. Taking a pessimistically estimated false discovery rate of 40% into account, this implies that at least some ten thousand loci in the Drosophila genome show the hallmarks of stabilizing selection action of RNA structure, and hence are most likely functional at the RNA level. A subset of RNAz predictions overlapping with TRF1 and BRF binding sites [Isogai et al., EMBO J. 26: 79–89 (2007)], which are plausible candidates of Pol III transcripts, have been studied in more detail. Among these sequences we identify several "clusters" of ncRNA candidates with striking structural similarities. Conclusion The statistical evaluation of the RNAz predictions in comparison with a similar analysis of vertebrate genomes [Washietl et al., Nat. Biotech. 23: 1383–1390 (2005)] shows that qualitatively similar fractions of structured RNAs are found in introns, UTRs, and intergenic regions. The intergenic RNA structures, however, are concentrated much more closely around known protein-coding loci, suggesting that flies have significantly smaller complement of independent structured ncRNAs compared to mammals. PMID:17996037

  17. Structure and reconstitution of yeast Mpp6-nuclear exosome complexes reveals that Mpp6 stimulates RNA decay and recruits the Mtr4 helicase.

    PubMed

    Wasmuth, Elizabeth V; Zinder, John C; Zattas, Dimitrios; Das, Mom; Lima, Christopher D

    2017-07-25

    Nuclear RNA exosomes catalyze a range of RNA processing and decay activities that are coordinated in part by cofactors, including Mpp6, Rrp47, and the Mtr4 RNA helicase. Mpp6 interacts with the nine-subunit exosome core, while Rrp47 stabilizes the exoribonuclease Rrp6 and recruits Mtr4, but it is less clear if these cofactors work together. Using biochemistry with Saccharomyces cerevisiae proteins, we show that Rrp47 and Mpp6 stimulate exosome-mediated RNA decay, albeit with unique dependencies on elements within the nuclear exosome. Mpp6-exosomes can recruit Mtr4, while Mpp6 and Rrp47 each contribute to Mtr4-dependent RNA decay, with maximal Mtr4-dependent decay observed with both cofactors. The 3.3 Å structure of a twelve-subunit nuclear Mpp6 exosome bound to RNA shows the central region of Mpp6 bound to the exosome core, positioning its Mtr4 recruitment domain next to Rrp6 and the exosome central channel. Genetic analysis reveals interactions that are largely consistent with our model.

  18. Mutational analysis of three predicted 5'-proximal stem-loop structures in the genome of tick-borne encephalitis virus indicates different roles in RNA replication and translation

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Rouha, Harald; Hoenninger, Verena M.; Thurner, Caroline

    2011-08-15

    Flavivirus gene expression is modulated by RNA secondary structure elements at the terminal ends of the viral RNA molecule. For tick-borne encephalitis virus (TBEV), four stem-loop (SL) elements have been predicted in the first 180 nucleotides of the viral genome: 5'-SL1, 5'-SL2, 5'-SL3 and 5'-SL4. The last three of these appear to be unique to tick-borne flaviviruses. Here, we report their characterization by mutagenesis in a TBEV luciferase reporter system. By manipulating their thermodynamic properties, we found that an optimal stability of the 5'-SL2 is required for efficient RNA replication. 5'-SL3 formation is also important for viral RNA replication, butmore » although it contains the viral start codon, its formation is dispensable for RNA translation. 5'-SL4 appears to facilitate both RNA translation and replication. Our data suggest that maintenance of the balanced thermodynamic stability of these SL elements is important for temporal regulation of its different functions.« less

  19. Viral replication. Structural basis for RNA replication by the hepatitis C virus polymerase.

    PubMed

    Appleby, Todd C; Perry, Jason K; Murakami, Eisuke; Barauskas, Ona; Feng, Joy; Cho, Aesop; Fox, David; Wetmore, Diana R; McGrath, Mary E; Ray, Adrian S; Sofia, Michael J; Swaminathan, S; Edwards, Thomas E

    2015-02-13

    Nucleotide analog inhibitors have shown clinical success in the treatment of hepatitis C virus (HCV) infection, despite an incomplete mechanistic understanding of NS5B, the viral RNA-dependent RNA polymerase. Here we study the details of HCV RNA replication by determining crystal structures of stalled polymerase ternary complexes with enzymes, RNA templates, RNA primers, incoming nucleotides, and catalytic metal ions during both primed initiation and elongation of RNA synthesis. Our analysis revealed that highly conserved active-site residues in NS5B position the primer for in-line attack on the incoming nucleotide. A β loop and a C-terminal membrane-anchoring linker occlude the active-site cavity in the apo state, retract in the primed initiation assembly to enforce replication of the HCV genome from the 3' terminus, and vacate the active-site cavity during elongation. We investigated the incorporation of nucleotide analog inhibitors, including the clinically active metabolite formed by sofosbuvir, to elucidate key molecular interactions in the active site. Copyright © 2015, American Association for the Advancement of Science.

  20. Prevention of cross-talk in conserved regulatory systems: identification of specificity determinants in RNA-binding anti-termination proteins of the BglG family

    PubMed Central

    Hübner, Sebastian; Declerck, Nathalie; Diethmaier, Christine; Le Coq, Dominique; Aymerich, Stephane; Stülke, Jörg

    2011-01-01

    Each family of signal transduction systems requires specificity determinants that link individual signals to the correct regulatory output. In Bacillus subtilis, a family of four anti-terminator proteins controls the expression of genes for the utilisation of alternative sugars. These regulatory systems contain the anti-terminator proteins and a RNA structure, the RNA anti-terminator (RAT) that is bound by the anti-terminator proteins. We have studied three of these proteins (SacT, SacY, and LicT) to understand how they can transmit a specific signal in spite of their strong structural homology. A screen for random mutations that render SacT capable to bind a RNA structure recognized by LicT only revealed a substitution (P26S) at one of the few non-conserved residues that are in contact with the RNA. We have randomly modified this position in SacT together with another non-conserved RNA-contacting residue (Q31). Surprisingly, the mutant proteins could bind all RAT structures that are present in B. subtilis. In a complementary approach, reciprocal amino acid exchanges have been introduced in LicT and SacY at non-conserved positions of the RNA-binding site. This analysis revealed the key role of an arginine side-chain for both the high affinity and specificity of LicT for its cognate RAT. Introduction of this Arg at the equivalent position of SacY (A26) increased the RNA binding in vitro but also resulted in a relaxed specificity. Altogether our results suggest that this family of anti-termination proteins has evolved to reach a compromise between RNA binding efficacy and specific interaction with individual target sequences. PMID:21278164

  1. An empirical strategy to detect bacterial transcript structure from directional RNA-seq transcriptome data.

    PubMed

    Wang, Yejun; MacKenzie, Keith D; White, Aaron P

    2015-05-07

    As sequencing costs are being lowered continuously, RNA-seq has gradually been adopted as the first choice for comparative transcriptome studies with bacteria. Unlike microarrays, RNA-seq can directly detect cDNA derived from mRNA transcripts at a single nucleotide resolution. Not only does this allow researchers to determine the absolute expression level of genes, but it also conveys information about transcript structure. Few automatic software tools have yet been established to investigate large-scale RNA-seq data for bacterial transcript structure analysis. In this study, 54 directional RNA-seq libraries from Salmonella serovar Typhimurium (S. Typhimurium) 14028s were examined for potential relationships between read mapping patterns and transcript structure. We developed an empirical method, combined with statistical tests, to automatically detect key transcript features, including transcriptional start sites (TSSs), transcriptional termination sites (TTSs) and operon organization. Using our method, we obtained 2,764 TSSs and 1,467 TTSs for 1331 and 844 different genes, respectively. Identification of TSSs facilitated further discrimination of 215 putative sigma 38 regulons and 863 potential sigma 70 regulons. Combining the TSSs and TTSs with intergenic distance and co-expression information, we comprehensively annotated the operon organization in S. Typhimurium 14028s. Our results show that directional RNA-seq can be used to detect transcriptional borders at an acceptable resolution of ±10-20 nucleotides. Technical limitations of the RNA-seq procedure may prevent single nucleotide resolution. The automatic transcript border detection methods, statistical models and operon organization pipeline that we have described could be widely applied to RNA-seq studies in other bacteria. Furthermore, the TSSs, TTSs, operons, promoters and unstranslated regions that we have defined for S. Typhimurium 14028s may constitute valuable resources that can be used for comparative analyses with other Salmonella serotypes.

  2. Using secondary structure to identify ribosomal numts: cautionary examples from the human genome.

    PubMed

    Olson, Link E; Yoder, Anne D

    2002-01-01

    The identification of inadvertently sequenced mitochondrial pseudogenes (numts) is critical to any study employing mitochondrial DNA sequence data. Failure to discriminate numts correctly can confound phylogenetic reconstruction and studies of molecular evolution. This is especially problematic for ribosomal mtDNA genes. Unlike protein-coding loci, whose pseudogenes tend to accumulate diagnostic frameshift or premature stop mutations, functional ribosomal genes are not constrained to maintain a reading frame and can accumulate insertion-deletion events of varying length, particularly in nonpairing regions. Several authors have advocated using structural features of the transcribed rRNA molecule to differentiate functional mitochondrial rRNA genes from their nuclear paralogs. We explored this approach using the mitochondrial 12S rRNA gene and three known 12S numts from the human genome in the context of anthropoid phylogeny and the inferred secondary structure of primate 12S rRNA. Contrary to expectation, each of the three human numts exhibits striking concordance with secondary structure models, with little, if any, indication of their pseudogene status, and would likely escape detection based on structural criteria alone. Furthermore, we show that the unwitting inclusion of a particularly ancient (18-25 Myr old) and surprisingly cryptic human numt in a phylogenetic analysis would yield a well-supported but dramatically incorrect conclusion regarding anthropoid relationships. Though we endorse the use of secondary structure models for inferring positional homology wholeheartedly, we caution against reliance on structural criteria for the discrimination of rRNA numts, given the potential fallibility of this approach.

  3. RNase MRP Cleaves Pre-tRNASer-Met in the tRNA Maturation Pathway

    PubMed Central

    Adachi, Kousuke; Nobe, Yuko; Kobayashi, Junya; Hirota, Kouji; Oliveira, Douglas V.; Taoka, Masato; Isobe, Toshiaki

    2014-01-01

    Ribonuclease mitochondrial RNA processing (RNase MRP) is a multifunctional ribonucleoprotein (RNP) complex that is involved in the maturation of various types of RNA including ribosomal RNA. RNase MRP consists of a potential catalytic RNA and several protein components, all of which are required for cell viability. We show here that the temperature-sensitive mutant of rmp1, the gene for a unique protein component of RNase MRP, accumulates the dimeric tRNA precursor, pre-tRNASer-Met. To examine whether RNase MRP mediates tRNA maturation, we purified the RNase MRP holoenzyme from the fission yeast Schizosaccharomyces pombe and found that the enzyme directly and selectively cleaves pre-tRNASer-Met, suggesting that RNase MRP participates in the maturation of specific tRNA in vivo. In addition, mass spectrometry–based ribonucleoproteomic analysis demonstrated that this RNase MRP consists of one RNA molecule and 11 protein components, including a previously unknown component Rpl701. Notably, limited nucleolysis of RNase MRP generated an active catalytic core consisting of partial mrp1 RNA fragments, which constitute “Domain 1” in the secondary structure of RNase MRP, and 8 proteins. Thus, the present study provides new insight into the structure and function of RNase MRP. PMID:25401760

  4. 5S rRNA and ribosome.

    PubMed

    Gongadze, G M

    2011-12-01

    5S rRNA is an integral component of the ribosome of all living organisms. It is known that the ribosome without 5S rRNA is functionally inactive. However, the question about the specific role of this RNA in functioning of the translation apparatus is still open. This review presents a brief history of the discovery of 5S rRNA and studies of its origin and localization in the ribosome. The previously expressed hypotheses about the role of this RNA in the functioning of the ribosome are discussed considering the unique location of 5S rRNA in the ribosome and its intermolecular contacts. Based on analysis of the current data on ribosome structure and its functional complexes, the role of 5S rRNA as an intermediary between ribosome functional domains is discussed.

  5. Recognition of the murine coronavirus genomic RNA packaging signal depends on the second RNA-binding domain of the nucleocapsid protein.

    PubMed

    Kuo, Lili; Koetzner, Cheri A; Hurst, Kelley R; Masters, Paul S

    2014-04-01

    The coronavirus nucleocapsid (N) protein forms a helical ribonucleoprotein with the viral positive-strand RNA genome and binds to the principal constituent of the virion envelope, the membrane (M) protein, to facilitate assembly and budding. Besides these structural roles, N protein associates with a component of the replicase-transcriptase complex, nonstructural protein 3, at a critical early stage of infection. N protein has also been proposed to participate in the replication and selective packaging of genomic RNA and the transcription and translation of subgenomic mRNA. Coronavirus N proteins contain two structurally distinct RNA-binding domains, an unusual characteristic among RNA viruses. To probe the functions of these domains in the N protein of the model coronavirus mouse hepatitis virus (MHV), we constructed mutants in which each RNA-binding domain was replaced by its counterpart from the N protein of severe acute respiratory syndrome coronavirus (SARS-CoV). Mapping of revertants of the resulting chimeric viruses provided evidence for extensive intramolecular interactions between the two RNA-binding domains. Through analysis of viral RNA that was packaged into virions we identified the second of the two RNA-binding domains as a principal determinant of MHV packaging signal recognition. As expected, the interaction of N protein with M protein was not affected in either of the chimeric viruses. Moreover, the SARS-CoV N substitutions did not alter the fidelity of leader-body junction formation during subgenomic mRNA synthesis. These results more clearly delineate the functions of N protein and establish a basis for further exploration of the mechanism of genomic RNA packaging. This work describes the interactions of the two RNA-binding domains of the nucleocapsid protein of a model coronavirus, mouse hepatitis virus. The main finding is that the second of the two domains plays an essential role in recognizing the RNA structure that allows the selective packaging of genomic RNA into assembled virions.

  6. Structural and functional analysis of mRNA export regulation by the nuclear pore complex.

    PubMed

    Lin, Daniel H; Correia, Ana R; Cai, Sarah W; Huber, Ferdinand M; Jette, Claudia A; Hoelz, André

    2018-06-13

    The nuclear pore complex (NPC) controls the passage of macromolecules between the nucleus and cytoplasm, but how the NPC directly participates in macromolecular transport remains poorly understood. In the final step of mRNA export, the DEAD-box helicase DDX19 is activated by the nucleoporins Gle1, Nup214, and Nup42 to remove Nxf1•Nxt1 from mRNAs. Here, we report crystal structures of Gle1•Nup42 from three organisms that reveal an evolutionarily conserved binding mode. Biochemical reconstitution of the DDX19 ATPase cycle establishes that human DDX19 activation does not require IP 6 , unlike its fungal homologs, and that Gle1 stability affects DDX19 activation. Mutations linked to motor neuron diseases cause decreased Gle1 thermostability, implicating nucleoporin misfolding as a disease determinant. Crystal structures of human Gle1•Nup42•DDX19 reveal the structural rearrangements in DDX19 from an auto-inhibited to an RNA-binding competent state. Together, our results provide the foundation for further mechanistic analyses of mRNA export in humans.

  7. Efficient HIV-1 inhibition by a 16 nt-long RNA aptamer designed by combining in vitro selection and in silico optimisation strategies

    PubMed Central

    Sánchez-Luque, Francisco J.; Stich, Michael; Manrubia, Susanna; Briones, Carlos; Berzal-Herranz, Alfredo

    2014-01-01

    The human immunodeficiency virus type-1 (HIV-1) genome contains multiple, highly conserved structural RNA domains that play key roles in essential viral processes. Interference with the function of these RNA domains either by disrupting their structures or by blocking their interaction with viral or cellular factors may seriously compromise HIV-1 viability. RNA aptamers are amongst the most promising synthetic molecules able to interact with structural domains of viral genomes. However, aptamer shortening up to their minimal active domain is usually necessary for scaling up production, what requires very time-consuming, trial-and-error approaches. Here we report on the in vitro selection of 64 nt-long specific aptamers against the complete 5′-untranslated region of HIV-1 genome, which inhibit more than 75% of HIV-1 production in a human cell line. The analysis of the selected sequences and structures allowed for the identification of a highly conserved 16 nt-long stem-loop motif containing a common 8 nt-long apical loop. Based on this result, an in silico designed 16 nt-long RNA aptamer, termed RNApt16, was synthesized, with sequence 5′-CCCCGGCAAGGAGGGG-3′. The HIV-1 inhibition efficiency of such an aptamer was close to 85%, thus constituting the shortest RNA molecule so far described that efficiently interferes with HIV-1 replication. PMID:25175101

  8. Structure and stability of the consecutive stereoregulated chiral phosphorothioate DNA duplex.

    PubMed

    Kanaori, K; Tamura, Y; Wada, T; Nishi, M; Kanehara, H; Morii, T; Tajima, K; Makino, K

    1999-12-07

    The duplex structures of the stereoregulated phosphorothioate DNAs, [R(p),R(p)]- and [S(p),S(p)]-[d(GC(ps)T(ps)ACG)] (ps, phosphorothioate; PS-DNA), with their complementary RNA have been investigated by combined use of (1)H NMR and restrained molecular dynamics calculation. Compared to those obtained for the unmodified duplex structures (PO-DNA.RNA), the NOE cross-peak intensities are virtually identical for the PS-DNA.RNA hybrid duplexes. The structural analysis on the basis of the NOE restraints reveals that all of the three DNA.RNA duplexes take a A-form conformation and that there is no significant difference in the base stacking for the DNA.RNA hybrid duplexes. On the other hand, the NOE cross-peak intensities of the protons around the central T(ps)A step of the PS-DNA.DNA duplexes are apparently different from those of PO-DNA. DNA. The chemical shifts of H8/6 and H1' at the T(ps)A step are also largely different among PS-DNA.DNAs and PO-DNA.DNA, suggesting that the DNA.DNA structure is readily changed by the introduction of the phosphorothioate groups to the central T(p)A step. The structure calculations indicate that all of these DNA.DNA duplexes are B-form although there exist some small differences in helical parameters between the [R(p),R(p)]- and [S(p),S(p)]PS-DNA.DNA duplexes. The melting temperatures (T(m)) were determined for all of the duplexes by plotting the chemical shift change of isolated peaks as a function of temperature. For the PS-DNA.RNA hybrid duplexes, the [S(p),S(p)] isomer is less stable than the [R(p),R(p)] isomer while this trend is reversed for the PS-DNA.DNA duplexes. Consequently, although the PS-DNA.RNA duplexes take the similar A-form structure, the duplex stability is different between PS-DNA.RNA duplexes. The stability of the DNA.RNA duplexes may not be governed by the A-form structure itself but by some other factors such as the hydration around the phosphorothioate backbone, although the T(m) difference of the DNA.DNA duplexes could be explained by the structural factor.

  9. Avilamycin and evernimicin induce structural changes in rProteins uL16 and CTC that enhance the inhibition of A-site tRNA binding

    PubMed Central

    Krupkin, Miri; Wekselman, Itai; Matzov, Donna; Eyal, Zohar; Diskin Posner, Yael; Rozenberg, Haim; Zimmerman, Ella; Bashan, Anat; Yonath, Ada

    2016-01-01

    Two structurally unique ribosomal antibiotics belonging to the orthosomycin family, avilamycin and evernimicin, possess activity against Enterococci, Staphylococci, and Streptococci, and other Gram-positive bacteria. Here, we describe the high-resolution crystal structures of the eubacterial large ribosomal subunit in complex with them. Their extended binding sites span the A-tRNA entrance corridor, thus inhibiting protein biosynthesis by blocking the binding site of the A-tRNA elbow, a mechanism not shared with other known antibiotics. Along with using the ribosomal components that bind and discriminate the A-tRNA—namely, ribosomal RNA (rRNA) helices H89, H91, and ribosomal proteins (rProtein) uL16—these structures revealed novel interactions with domain 2 of the CTC protein, a feature typical to various Gram-positive bacteria. Furthermore, analysis of these structures explained how single nucleotide mutations and methylations in helices H89 and H91 confer resistance to orthosomycins and revealed the sequence variations in 23S rRNA nucleotides alongside the difference in the lengths of the eukaryotic and prokaryotic α1 helix of protein uL16 that play a key role in the selectivity of those drugs. The accurate interpretation of the crystal structures that could be performed beyond that recently reported in cryo-EM models provide structural insights that may be useful for the design of novel pathogen-specific antibiotics, and for improving the potency of orthosomycins. Because both drugs are extensively metabolized in vivo, their environmental toxicity is very low, thus placing them at the frontline of drugs with reduced ecological hazards. PMID:27791159

  10. A semi-supervised learning approach for RNA secondary structure prediction.

    PubMed

    Yonemoto, Haruka; Asai, Kiyoshi; Hamada, Michiaki

    2015-08-01

    RNA secondary structure prediction is a key technology in RNA bioinformatics. Most algorithms for RNA secondary structure prediction use probabilistic models, in which the model parameters are trained with reliable RNA secondary structures. Because of the difficulty of determining RNA secondary structures by experimental procedures, such as NMR or X-ray crystal structural analyses, there are still many RNA sequences that could be useful for training whose secondary structures have not been experimentally determined. In this paper, we introduce a novel semi-supervised learning approach for training parameters in a probabilistic model of RNA secondary structures in which we employ not only RNA sequences with annotated secondary structures but also ones with unknown secondary structures. Our model is based on a hybrid of generative (stochastic context-free grammars) and discriminative models (conditional random fields) that has been successfully applied to natural language processing. Computational experiments indicate that the accuracy of secondary structure prediction is improved by incorporating RNA sequences with unknown secondary structures into training. To our knowledge, this is the first study of a semi-supervised learning approach for RNA secondary structure prediction. This technique will be useful when the number of reliable structures is limited. Copyright © 2015 Elsevier Ltd. All rights reserved.

  11. Mutational analysis of the RNA-binding domain of the Prunus necrotic ringspot virus (PNRSV) movement protein reveals its requirement for cell-to-cell movement

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Carmen Herranz, Ma; Sanchez-Navarro, Jesus-Angel; Sauri, Ana

    2005-08-15

    The movement protein (MP) of Prunus necrotic ringspot virus (PNRSV) is required for cell-to-cell movement. MP subcellular localization studies using a GFP fusion protein revealed highly punctate structures between neighboring cells, believed to represent plasmodesmata. Deletion of the RNA-binding domain (RBD) of PNRSV MP abolishes the cell-to-cell movement. A mutational analysis on this RBD was performed in order to identify in vivo the features that govern viral transport. Loss of positive charges prevented the cell-to-cell movement even though all mutants showed a similar accumulation level in protoplasts to those observed with the wild-type (wt) MP. Synthetic peptides representing the mutantsmore » and wild-type RBDs were used to study RNA-binding affinities by EMSA assays being approximately 20-fold lower in the mutants. Circular dichroism analyses revealed that the secondary structure of the peptides was not significantly affected by mutations. The involvement of the affinity changes between the viral RNA and the MP in the viral cell-to-cell movement is discussed.« less

  12. Mutational analysis of the RNA-binding domain of the Prunus necrotic ringspot virus (PNRSV) movement protein reveals its requirement for cell-to-cell movement.

    PubMed

    Carmen Herranz, Ma; Sanchez-Navarro, Jesús-Angel; Saurí, Ana; Mingarro, Ismael; Pallás, Vicente

    2005-08-15

    The movement protein (MP) of Prunus necrotic ringspot virus (PNRSV) is required for cell-to-cell movement. MP subcellular localization studies using a GFP fusion protein revealed highly punctate structures between neighboring cells, believed to represent plasmodesmata. Deletion of the RNA-binding domain (RBD) of PNRSV MP abolishes the cell-to-cell movement. A mutational analysis on this RBD was performed in order to identify in vivo the features that govern viral transport. Loss of positive charges prevented the cell-to-cell movement even though all mutants showed a similar accumulation level in protoplasts to those observed with the wild-type (wt) MP. Synthetic peptides representing the mutants and wild-type RBDs were used to study RNA-binding affinities by EMSA assays being approximately 20-fold lower in the mutants. Circular dichroism analyses revealed that the secondary structure of the peptides was not significantly affected by mutations. The involvement of the affinity changes between the viral RNA and the MP in the viral cell-to-cell movement is discussed.

  13. Effect of PEG and mPEG-anthracene on tRNA aggregation and particle formation.

    PubMed

    Froehlich, E; Mandeville, J S; Arnold, D; Kreplak, L; Tajmir-Riahi, H A

    2012-01-09

    Poly(ethylene glycol) (PEG) and its derivatives are synthetic polymers with major applications in gene and drug delivery systems. Synthetic polymers are also used to transport miRNA and siRNA in vitro. We studied the interaction of tRNA with several PEGs of different compositions, such as PEG 3350, PEG 6000, and mPEG-anthracene under physiological conditions. FTIR, UV-visible, CD, and fluorescence spectroscopic methods as well as atomic force microscopy (AFM) were used to analyze the PEG binding mode, the binding constant, and the effects of polymer complexation on tRNA stability, aggregation, and particle formation. Structural analysis showed that PEG-tRNA interaction occurs via RNA bases and the backbone phosphate group with both hydrophilic and hydrophobic contacts. The overall binding constants of K(PEG 3350-tRNA)= 1.9 (±0.5) × 10(4) M(-1), K(PEG 6000-tRNA) = 8.9 (±1) × 10(4) M(-1), and K(mPEG-anthracene)= 1.2 (±0.40) × 10(3) M(-1) show stronger polymer-RNA complexation by PEG 6000 and by PEG 3350 than the mPEG-anthracene. AFM imaging showed that PEG complexes contain on average one tRNA with PEG 3350, five tRNA with PEG 6000, and ten tRNA molecules with mPEG-anthracene. tRNA aggregation and particle formation occurred at high polymer concentrations, whereas it remains in A-family structure.

  14. Structural imprints in vivo decode RNA regulatory mechanisms.

    PubMed

    Spitale, Robert C; Flynn, Ryan A; Zhang, Qiangfeng Cliff; Crisalli, Pete; Lee, Byron; Jung, Jong-Wha; Kuchelmeister, Hannes Y; Batista, Pedro J; Torre, Eduardo A; Kool, Eric T; Chang, Howard Y

    2015-03-26

    Visualizing the physical basis for molecular behaviour inside living cells is a great challenge for biology. RNAs are central to biological regulation, and the ability of RNA to adopt specific structures intimately controls every step of the gene expression program. However, our understanding of physiological RNA structures is limited; current in vivo RNA structure profiles include only two of the four nucleotides that make up RNA. Here we present a novel biochemical approach, in vivo click selective 2'-hydroxyl acylation and profiling experiment (icSHAPE), which enables the first global view, to our knowledge, of RNA secondary structures in living cells for all four bases. icSHAPE of the mouse embryonic stem cell transcriptome versus purified RNA folded in vitro shows that the structural dynamics of RNA in the cellular environment distinguish different classes of RNAs and regulatory elements. Structural signatures at translational start sites and ribosome pause sites are conserved from in vitro conditions, suggesting that these RNA elements are programmed by sequence. In contrast, focal structural rearrangements in vivo reveal precise interfaces of RNA with RNA-binding proteins or RNA-modification sites that are consistent with atomic-resolution structural data. Such dynamic structural footprints enable accurate prediction of RNA-protein interactions and N(6)-methyladenosine (m(6)A) modification genome wide. These results open the door for structural genomics of RNA in living cells and reveal key physiological structures controlling gene expression.

  15. Efficient pairwise RNA structure prediction using probabilistic alignment constraints in Dynalign

    PubMed Central

    2007-01-01

    Background Joint alignment and secondary structure prediction of two RNA sequences can significantly improve the accuracy of the structural predictions. Methods addressing this problem, however, are forced to employ constraints that reduce computation by restricting the alignments and/or structures (i.e. folds) that are permissible. In this paper, a new methodology is presented for the purpose of establishing alignment constraints based on nucleotide alignment and insertion posterior probabilities. Using a hidden Markov model, posterior probabilities of alignment and insertion are computed for all possible pairings of nucleotide positions from the two sequences. These alignment and insertion posterior probabilities are additively combined to obtain probabilities of co-incidence for nucleotide position pairs. A suitable alignment constraint is obtained by thresholding the co-incidence probabilities. The constraint is integrated with Dynalign, a free energy minimization algorithm for joint alignment and secondary structure prediction. The resulting method is benchmarked against the previous version of Dynalign and against other programs for pairwise RNA structure prediction. Results The proposed technique eliminates manual parameter selection in Dynalign and provides significant computational time savings in comparison to prior constraints in Dynalign while simultaneously providing a small improvement in the structural prediction accuracy. Savings are also realized in memory. In experiments over a 5S RNA dataset with average sequence length of approximately 120 nucleotides, the method reduces computation by a factor of 2. The method performs favorably in comparison to other programs for pairwise RNA structure prediction: yielding better accuracy, on average, and requiring significantly lesser computational resources. Conclusion Probabilistic analysis can be utilized in order to automate the determination of alignment constraints for pairwise RNA structure prediction methods in a principled fashion. These constraints can reduce the computational and memory requirements of these methods while maintaining or improving their accuracy of structural prediction. This extends the practical reach of these methods to longer length sequences. The revised Dynalign code is freely available for download. PMID:17445273

  16. An efficient method for the prediction of deleterious multiple-point mutations in the secondary structure of RNAs using suboptimal folding solutions

    PubMed Central

    Churkin, Alexander; Barash, Danny

    2008-01-01

    Background RNAmute is an interactive Java application which, given an RNA sequence, calculates the secondary structure of all single point mutations and organizes them into categories according to their similarity to the predicted structure of the wild type. The secondary structure predictions are performed using the Vienna RNA package. A more efficient implementation of RNAmute is needed, however, to extend from the case of single point mutations to the general case of multiple point mutations, which may often be desired for computational predictions alongside mutagenesis experiments. But analyzing multiple point mutations, a process that requires traversing all possible mutations, becomes highly expensive since the running time is O(nm) for a sequence of length n with m-point mutations. Using Vienna's RNAsubopt, we present a method that selects only those mutations, based on stability considerations, which are likely to be conformational rearranging. The approach is best examined using the dot plot representation for RNA secondary structure. Results Using RNAsubopt, the suboptimal solutions for a given wild-type sequence are calculated once. Then, specific mutations are selected that are most likely to cause a conformational rearrangement. For an RNA sequence of about 100 nts and 3-point mutations (n = 100, m = 3), for example, the proposed method reduces the running time from several hours or even days to several minutes, thus enabling the practical application of RNAmute to the analysis of multiple-point mutations. Conclusion A highly efficient addition to RNAmute that is as user friendly as the original application but that facilitates the practical analysis of multiple-point mutations is presented. Such an extension can now be exploited prior to site-directed mutagenesis experiments by virologists, for example, who investigate the change of function in an RNA virus via mutations that disrupt important motifs in its secondary structure. A complete explanation of the application, called MultiRNAmute, is available at [1]. PMID:18445289

  17. Identification and analysis of host proteins that interact with the 3'-untranslated region of tick-borne encephalitis virus genomic RNA.

    PubMed

    Muto, Memi; Kamitani, Wataru; Sakai, Mizuki; Hirano, Minato; Kobayashi, Shintaro; Kariwa, Hiroaki; Yoshii, Kentaro

    2018-04-02

    Tick-borne encephalitis virus (TBEV) causes severe neurological disease, but the pathogenetic mechanism is unclear. The conformational structure of the 3'-untranslated region (UTR) of TBEV is associated with its virulence. We tried to identify host proteins interacting with the 3'-UTR of TBEV. Cellular proteins of HEK293T cells were co-precipitated with biotinylated RNAs of the 3'-UTR of low- and high-virulence TBEV strains and subjected to mass spectrometry analysis. Fifteen host proteins were found to bind to the 3'-UTR of TBEV, four of which-cold shock domain containing-E1 (CSDE1), spermatid perinuclear RNA binding protein (STRBP), fragile X mental retardation protein (FMRP), and interleukin enhancer binding factor 3 (ILF3)-bound specifically to that of the low-virulence strain. An RNA immunoprecipitation and pull-down assay confirmed the interactions of the complete 3'-UTRs of TBEV genomic RNA with CSDE1, FMRP, and ILF3. Partial deletion of the stem loop (SL) 3 to SL 5 structure of the variable region of the 3'-UTR did not affect interactions with the host proteins, but the interactions were markedly suppressed by deletion of the complete SL 3, 4, and 5 structures, as in the high-virulence TBEV strain. Further analysis of the roles of host proteins in the neurologic pathogenicity of TBEV is warranted. Copyright © 2018 Elsevier B.V. All rights reserved.

  18. Modified RNA-seq method for microbial community and diversity analysis using rRNA in different types of environmental samples

    PubMed Central

    Yan, Yong-Wei; Zou, Bin; Zhu, Ting; Hozzein, Wael N.

    2017-01-01

    RNA-seq-based SSU (small subunit) rRNA (ribosomal RNA) analysis has provided a better understanding of potentially active microbial community within environments. However, for RNA-seq library construction, high quantities of purified RNA are typically required. We propose a modified RNA-seq method for SSU rRNA-based microbial community analysis that depends on the direct ligation of a 5’ adaptor to RNA before reverse-transcription. The method requires only a low-input quantity of RNA (10–100 ng) and does not require a DNA removal step. The method was initially tested on three mock communities synthesized with enriched SSU rRNA of archaeal, bacterial and fungal isolates at different ratios, and was subsequently used for environmental samples of high or low biomass. For high-biomass salt-marsh sediments, enriched SSU rRNA and total nucleic acid-derived RNA-seq datasets revealed highly consistent community compositions for all of the SSU rRNA sequences, and as much as 46.4%-59.5% of 16S rRNA sequences were suitable for OTU (operational taxonomic unit)-based community and diversity analyses with complete coverage of V1-V2 regions. OTU-based community structures for the two datasets were also highly consistent with those determined by all of the 16S rRNA reads. For low-biomass samples, total nucleic acid-derived RNA-seq datasets were analyzed, and highly active bacterial taxa were also identified by the OTU-based method, notably including members of the previously underestimated genus Nitrospira and phylum Acidobacteria in tap water, members of the phylum Actinobacteria on a shower curtain, and members of the phylum Cyanobacteria on leaf surfaces. More than half of the bacterial 16S rRNA sequences covered the complete region of primer 8F, and non-coverage rates as high as 38.7% were obtained for phylum-unclassified sequences, providing many opportunities to identify novel bacterial taxa. This modified RNA-seq method will provide a better snapshot of diverse microbial communities, most notably by OTU-based analysis, even communities with low-biomass samples. PMID:29016661

  19. StructRNAfinder: an automated pipeline and web server for RNA families prediction.

    PubMed

    Arias-Carrasco, Raúl; Vásquez-Morán, Yessenia; Nakaya, Helder I; Maracaja-Coutinho, Vinicius

    2018-02-17

    The function of many noncoding RNAs (ncRNAs) depend upon their secondary structures. Over the last decades, several methodologies have been developed to predict such structures or to use them to functionally annotate RNAs into RNA families. However, to fully perform this analysis, researchers should utilize multiple tools, which require the constant parsing and processing of several intermediate files. This makes the large-scale prediction and annotation of RNAs a daunting task even to researchers with good computational or bioinformatics skills. We present an automated pipeline named StructRNAfinder that predicts and annotates RNA families in transcript or genome sequences. This single tool not only displays the sequence/structural consensus alignments for each RNA family, according to Rfam database but also provides a taxonomic overview for each assigned functional RNA. Moreover, we implemented a user-friendly web service that allows researchers to upload their own nucleotide sequences in order to perform the whole analysis. Finally, we provided a stand-alone version of StructRNAfinder to be used in large-scale projects. The tool was developed under GNU General Public License (GPLv3) and is freely available at http://structrnafinder.integrativebioinformatics.me . The main advantage of StructRNAfinder relies on the large-scale processing and integrating the data obtained by each tool and database employed along the workflow, of which several files are generated and displayed in user-friendly reports, useful for downstream analyses and data exploration.

  20. CHRONOS: a time-varying method for microRNA-mediated subpathway enrichment analysis.

    PubMed

    Vrahatis, Aristidis G; Dimitrakopoulou, Konstantina; Balomenos, Panos; Tsakalidis, Athanasios K; Bezerianos, Anastasios

    2016-03-15

    In the era of network medicine and the rapid growth of paired time series mRNA/microRNA expression experiments, there is an urgent need for pathway enrichment analysis methods able to capture the time- and condition-specific 'active parts' of the biological circuitry as well as the microRNA impact. Current methods ignore the multiple dynamical 'themes'-in the form of enriched biologically relevant microRNA-mediated subpathways-that determine the functionality of signaling networks across time. To address these challenges, we developed time-vaRying enriCHment integrOmics Subpathway aNalysis tOol (CHRONOS) by integrating time series mRNA/microRNA expression data with KEGG pathway maps and microRNA-target interactions. Specifically, microRNA-mediated subpathway topologies are extracted and evaluated based on the temporal transition and the fold change activity of the linked genes/microRNAs. Further, we provide measures that capture the structural and functional features of subpathways in relation to the complete organism pathway atlas. Our application to synthetic and real data shows that CHRONOS outperforms current subpathway-based methods into unraveling the inherent dynamic properties of pathways. CHRONOS is freely available at http://biosignal.med.upatras.gr/chronos/ tassos.bezerianos@nus.edu.sg Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  1. Structural and functional analysis of virus factories purified from Rabbit vesivirus-infected Vero cells.

    PubMed

    Casais, Rosa; Molleda, Lorenzo González; Machín, Angeles; del Barrio, Gloria; Manso, Alberto García; Dalton, Kevin P; Coto, Ana; Alonso, José Manuel Martín; Prieto, Miguel; Parra, Francisco

    2008-10-01

    Rabbit vesivirus infection induces membrane modifications and accumulation of vesicular structures in the cytoplasm of infected Vero cells. Crude RaV replication complexes (RCs) have been purified and their structural and functional properties have been characterized. We show that calnexin, an ER-resident protein, RaV non-structural proteins 2AB-, 2C-, 3A-, 3B- and 3CD-like as well as viral RNAs co-localize within membranous structures which are able to replicate the endogenous RNA templates. The purified virus factories protected their viral RNA contents from microccocal nuclease degradation and were inaccessible to exogenously added synthetic transcripts. In addition, we have shown that RCs can be used to investigate uridylylation of native endogenous VPg. In contrast to the observation that the virus factories were inaccessible to RNAs, RCs were accessible to added recombinant VPg which was subsequently nucleotidylylated. Nevertheless no elongation of an RNA chain attached to native or recombinant VPg could be demonstrated.

  2. Cooperative gene regulation by microRNA pairs and their identification using a computational workflow

    PubMed Central

    Schmitz, Ulf; Lai, Xin; Winter, Felix; Wolkenhauer, Olaf; Vera, Julio; Gupta, Shailendra K.

    2014-01-01

    MicroRNAs (miRNAs) are an integral part of gene regulation at the post-transcriptional level. Recently, it has been shown that pairs of miRNAs can repress the translation of a target mRNA in a cooperative manner, which leads to an enhanced effectiveness and specificity in target repression. However, it remains unclear which miRNA pairs can synergize and which genes are target of cooperative miRNA regulation. In this paper, we present a computational workflow for the prediction and analysis of cooperating miRNAs and their mutual target genes, which we refer to as RNA triplexes. The workflow integrates methods of miRNA target prediction; triplex structure analysis; molecular dynamics simulations and mathematical modeling for a reliable prediction of functional RNA triplexes and target repression efficiency. In a case study we analyzed the human genome and identified several thousand targets of cooperative gene regulation. Our results suggest that miRNA cooperativity is a frequent mechanism for an enhanced target repression by pairs of miRNAs facilitating distinctive and fine-tuned target gene expression patterns. Human RNA triplexes predicted and characterized in this study are organized in a web resource at www.sbi.uni-rostock.de/triplexrna/. PMID:24875477

  3. Ab initio RNA folding by discrete molecular dynamics: From structure prediction to folding mechanisms

    PubMed Central

    Ding, Feng; Sharma, Shantanu; Chalasani, Poornima; Demidov, Vadim V.; Broude, Natalia E.; Dokholyan, Nikolay V.

    2008-01-01

    RNA molecules with novel functions have revived interest in the accurate prediction of RNA three-dimensional (3D) structure and folding dynamics. However, existing methods are inefficient in automated 3D structure prediction. Here, we report a robust computational approach for rapid folding of RNA molecules. We develop a simplified RNA model for discrete molecular dynamics (DMD) simulations, incorporating base-pairing and base-stacking interactions. We demonstrate correct folding of 150 structurally diverse RNA sequences. The majority of DMD-predicted 3D structures have <4 Å deviations from experimental structures. The secondary structures corresponding to the predicted 3D structures consist of 94% native base-pair interactions. Folding thermodynamics and kinetics of tRNAPhe, pseudoknots, and mRNA fragments in DMD simulations are in agreement with previous experimental findings. Folding of RNA molecules features transient, non-native conformations, suggesting non-hierarchical RNA folding. Our method allows rapid conformational sampling of RNA folding, with computational time increasing linearly with RNA length. We envision this approach as a promising tool for RNA structural and functional analyses. PMID:18456842

  4. Examination of Csr regulatory circuitry using epistasis analysis with RNA-seq (Epi-seq) confirms that CsrD affects gene expression via CsrA, CsrB and CsrC.

    PubMed

    Potts, Anastasia H; Leng, Yuanyuan; Babitzke, Paul; Romeo, Tony

    2018-03-29

    The Csr global regulatory system coordinates gene expression in response to metabolic status. This system utilizes the RNA binding protein CsrA to regulate gene expression by binding to transcripts of structural and regulatory genes, thus affecting their structure, stability, translation, and/or transcription elongation. CsrA activity is controlled by sRNAs, CsrB and CsrC, which sequester CsrA away from other transcripts. CsrB/C levels are partly determined by their rates of turnover, which requires CsrD to render them susceptible to RNase E cleavage. Previous epistasis analysis suggested that CsrD affects gene expression through the other Csr components, CsrB/C and CsrA. However, those conclusions were based on a limited analysis of reporters. Here, we reassessed the global behavior of the Csr circuitry using epistasis analysis with RNA seq (Epi-seq). Because CsrD effects on mRNA levels were entirely lost in the csrA mutant and largely eliminated in a csrB/C mutant under our experimental conditions, while the majority of CsrA effects persisted in the absence of csrD, the original model accounts for the global behavior of the Csr system. Our present results also reflect a more nuanced role of CsrA as terminal regulator of the Csr system than has been recognized.

  5. Small indels induced by CRISPR/Cas9 in the 5' region of microRNA lead to its depletion and Drosha processing retardance.

    PubMed

    Jiang, Qian; Meng, Xing; Meng, Lingwei; Chang, Nannan; Xiong, Jingwei; Cao, Huiqing; Liang, Zicai

    2014-01-01

    MicroRNA knockout by genome editing technologies is promising. In order to extend the application of the technology and to investigate the function of a specific miRNA, we used CRISPR/Cas9 to deplete human miR-93 from a cluster by targeting its 5' region in HeLa cells. Various small indels were induced in the targeted region containing the Drosha processing site and seed sequences. Interestingly, we found that even a single nucleotide deletion led to complete knockout of the target miRNA with high specificity. Functional knockout was confirmed by phenotype analysis. Furthermore, de novo microRNAs were not found by RNA-seq. Nevertheless, expression of the pri-microRNAs was increased. When combined with structural analysis, the data indicated that biogenesis was impaired. Altogether, we showed that small indels in the 5' region of a microRNA result in sequence depletion as well as Drosha processing retard.

  6. Unraveling the structural complexity in a single-stranded RNA tail: implications for efficient ligand binding in the prequeuosine riboswitch

    PubMed Central

    Eichhorn, Catherine D.; Feng, Jun; Suddala, Krishna C.; Walter, Nils G.; Brooks, Charles L.; Al-Hashimi, Hashim M.

    2012-01-01

    Single-stranded RNAs (ssRNAs) are ubiquitous RNA elements that serve diverse functional roles. Much of our understanding of ssRNA conformational behavior is limited to structures in which ssRNA directly engages in tertiary interactions or is recognized by proteins. Little is known about the structural and dynamic behavior of free ssRNAs at atomic resolution. Here, we report the collaborative application of nuclear magnetic resonance (NMR) and replica exchange molecular dynamics (REMD) simulations to characterize the 12 nt ssRNA tail derived from the prequeuosine riboswitch. NMR carbon spin relaxation data and residual dipolar coupling measurements reveal a flexible yet stacked core adopting an A-form-like conformation, with the level of order decreasing toward the terminal ends. An A-to-C mutation within the polyadenine tract alters the observed dynamics consistent with the introduction of a dynamic kink. Pre-ordering of the tail may increase the efficacy of ligand binding above that achieved by a random-coil ssRNA. The REMD simulations recapitulate important trends in the NMR data, but suggest more internal motions than inferred from the NMR analysis. Our study unmasks a previously unappreciated level of complexity in ssRNA, which we believe will also serve as an excellent model system for testing and developing computational force fields. PMID:22009676

  7. Structural imprints in vivo decode RNA regulatory mechanisms

    PubMed Central

    Spitale, Robert C.; Flynn, Ryan A.; Zhang, Qiangfeng Cliff; Crisalli, Pete; Lee, Byron; Jung, Jong-Wha; Kuchelmeister, Hannes Y.; Batista, Pedro J.; Torre, Eduardo A.; Kool, Eric T.; Chang, Howard Y.

    2015-01-01

    Visualizing the physical basis for molecular behavior inside living cells is a grand challenge in biology. RNAs are central to biological regulation, and RNA’s ability to adopt specific structures intimately controls every step of the gene expression program1. However, our understanding of physiological RNA structures is limited; current in vivo RNA structure profiles view only two of four nucleotides that make up RNA2,3. Here we present a novel biochemical approach, In Vivo Click SHAPE (icSHAPE), that enables the first global view of RNA secondary structures of all four bases in living cells. icSHAPE of mouse embryonic stem cell transcriptome versus purified RNA folded in vitro shows that the structural dynamics of RNA in the cellular environment distinguishes different classes of RNAs and regulatory elements. Structural signatures at translational start sites and ribosome pause sites are conserved from in vitro, suggesting that these RNA elements are programmed by sequence. In contrast, focal structural rearrangements in vivo reveal precise interfaces of RNA with RNA binding proteins or RNA modification sites that are consistent with atomic-resolution structural data. Such dynamic structural footprints enable accurate prediction of RNA-protein interactions and N6-methyladenosine (m6A) modification genome-wide. These results open the door for structural genomics of RNA in living cells and reveal key physiological structures controlling gene expression. PMID:25799993

  8. Arabidopsis DRB4, AGO1, AGO7, and RDR6 participate in a DCL4-initiated antiviral RNA silencing pathway negatively regulated by DCL1.

    PubMed

    Qu, Feng; Ye, Xiaohong; Morris, T Jack

    2008-09-23

    Plant RNA silencing machinery enlists four primary classes of proteins to achieve sequence-specific regulation of gene expression and mount an antiviral defense. These include Dicer-like ribonucleases (DCLs), Argonaute proteins (AGOs), dsRNA-binding proteins (DRBs), and RNA-dependent RNA polymerases (RDRs). Although at least four distinct endogenous RNA silencing pathways have been thoroughly characterized, a detailed understanding of the antiviral RNA silencing pathway is just emerging. In this report, we have examined the role of four DCLs, two AGOs, one DRB, and one RDR in controlling viral RNA accumulation in infected Arabidopsis plants by using a mutant virus lacking its silencing suppressor. Our results show that all four DCLs contribute to antiviral RNA silencing. We confirm previous reports implicating both DCL4 and DCL2 in this process and establish a minor role for DCL3. Surprisingly, we found that DCL1 represses antiviral RNA silencing through negatively regulating the expression of DCL4 and DCL3. We also implicate DRB4 in antiviral RNA silencing. Finally, we show that both AGO1 and AGO7 function to ensure efficient clearance of viral RNAs and establish that AGO1 is capable of targeting viral RNAs with more compact structures, whereas AGO7 and RDR6 favor less structured RNA targets. Our results resolve several key steps in the antiviral RNA silencing pathway and provide a basis for further in-depth analysis.

  9. RNA polymerase pausing and nascent RNA structure formation are linked through clamp domain movement

    PubMed Central

    Hein, Pyae P.; Kolb, Kellie E.; Windgassen, Tricia; Bellecourt, Michael J.; Darst, Seth A.; Mooney, Rachel A.; Landick, Robert

    2014-01-01

    The rates of RNA synthesis and nascent RNA folding into biologically active structures are linked via pausing by RNA polymerase (RNAP). Structures that form within the RNA exit channel can increase pausing by interacting with bacterial RNAP or decrease pausing by preventing backtracking. Conversely, pausing is required for proper folding of some RNAs. Opening of the RNAP clamp domain is proposed to mediate some effects of nascent RNA structures. However, the connections among RNA structure formation, clamp movement, and catalytic activity remain uncertain. We assayed exit-channel structure formation in Escherichia coli RNAP together with disulfide crosslinks that favor closed or open clamp conformations and found that clamp position directly influences RNA structure formation and catalytic activity. We report that exit-channel RNA structures slow pause escape by favoring clamp opening and through interactions with the flap that slow translocation. PMID:25108353

  10. A 3D puzzle approach to building protein-DNA structures.

    PubMed

    Hinton, Deborah M

    2017-03-15

    Despite recent advances in structural analysis, it is still challenging to obtain a high-resolution structure for a complex of RNA polymerase, transcriptional factors, and DNA. However, using biochemical constraints, 3D printed models of available structures, and computer modeling, one can build biologically relevant models of such supramolecular complexes.

  11. The novel regulatory ncRNA, NfiS, optimizes nitrogen fixation via base pairing with the nitrogenase gene nifK mRNA in Pseudomonas stutzeri A1501

    PubMed Central

    Zhan, Yuhua; Yan, Yongliang; Deng, Zhiping; Chen, Ming; Lu, Wei; Lu, Chao; Shang, Liguo; Yang, Zhimin; Zhang, Wei; Wang, Wei; Li, Yun; Ke, Qi; Lu, Jiasi; Xu, Yuquan; Zhang, Liwen; Xie, Zhihong; Cheng, Qi; Elmerich, Claudine; Lin, Min

    2016-01-01

    Unlike most Pseudomonas, the root-associated bacterium Pseudomonas stutzeri A1501 fixes nitrogen after the horizontal acquisition of a nitrogen-fixing (nif) island. A genome-wide search for small noncoding RNAs (ncRNAs) in P. stutzeri A1501 identified the novel P. stutzeri-specific ncRNA NfiS in the core genome, whose synthesis was significantly induced under nitrogen fixation or sorbitol stress conditions. The expression of NfiS was RNA chaperone Hfq-dependent and activated by the sigma factor RpoN/global nitrogen activator NtrC/nif-specific activator NifA regulatory cascade. The nfiS-deficient mutant displayed reduced nitrogenase activity, as well as increased sensitivity to multiple stresses, such as osmotic and oxidative stresses. Secondary structure prediction and complementation studies confirmed that a stem-loop structure was essential for NfiS to regulate the nitrogenase gene nifK mRNA synthesis and thus nitrogenase activity. Microscale thermophoresis and physiological analysis showed that NfiS directly pairs with nifK mRNA and ultimately enhances nitrogenase activity by increasing the translation efficiency and the half-life of nifK mRNA. Our data also suggest structural and functional divergence of NfiS evolution in diazotrophic and nondiazotrophic backgrounds. It is proposed that NfiS was recruited by nifK mRNA as a novel regulator to integrate the horizontally acquired nif island into host global networks. PMID:27407147

  12. A surprisingly large RNase P RNA in Candida glabrata

    PubMed Central

    KACHOURI, RYM; STRIBINSKIS, VILIUS; ZHU, YANGLONG; RAMOS, KENNETH S.; WESTHOF, ERIC; LI, YONG

    2005-01-01

    We have found an extremely large ribonuclease P (RNase P) RNA (RPR1) in the human pathogen Candida glabrata and verified that this molecule is expressed and present in the active enzyme complex of this hemiascomycete yeast. A structural alignment of the C. glabrata sequence with 36 other hemiascomycete RNase P RNAs (abbreviated as P RNAs) allows us to characterize the types of insertions. In addition, 15 P RNA sequences were newly characterized by searching in the recently sequenced genomes Candida albicans, C. glabrata, Debaryomyces hansenii, Eremothecium gossypii, Kluyveromyces lactis, Kluyveromyces waltii, Naumovia castellii, Saccharomyces kudriavzevii, Saccharomyces mikatae, and Yarrowia lipolytica; and by PCR amplification for other Candida species (Candida guilliermondii, Candida krusei, Candida parapsilosis, Candida stellatoidea, and Candida tropicalis). The phylogenetic comparative analysis identifies a hemiascomycete secondary structure consensus that presents a conserved core in all species with variable insertions or deletions. The most significant variability is found in C. glabrata P RNA in which three insertions exceeding in total 700 nt are present in the Specificity domain. This P RNA is more than twice the length of any other homologous P RNAs known in the three domains of life and is eight times the size of the smallest. RNase P RNA, therefore, represents one of the most diversified noncoding RNAs in terms of size variation and structural diversity. PMID:15987816

  13. [Comparative analysis of clustered regularly interspaced short palindromic repeats (CRISPRs) loci in the genomes of halophilic archaea].

    PubMed

    Zhang, Fan; Zhang, Bing; Xiang, Hua; Hu, Songnian

    2009-11-01

    Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) is a widespread system that provides acquired resistance against phages in bacteria and archaea. Here we aim to genome-widely analyze the CRISPR in extreme halophilic archaea, of which the whole genome sequences are available at present time. We used bioinformatics methods including alignment, conservation analysis, GC content and RNA structure prediction to analyze the CRISPR structures of 7 haloarchaeal genomes. We identified the CRISPR structures in 5 halophilic archaea and revealed a conserved palindromic motif in the flanking regions of these CRISPR structures. In addition, we found that the repeat sequences of large CRISPR structures in halophilic archaea were greatly conserved, and two types of predicted RNA secondary structures derived from the repeat sequences were likely determined by the fourth base of the repeat sequence. Our results support the proposal that the leader sequence may function as recognition site by having palindromic structures in flanking regions, and the stem-loop secondary structure formed by repeat sequences may function in mediating the interaction between foreign genetic elements and CAS-encoded proteins.

  14. Isosteric And Non-Isosteric Base Pairs In RNA Motifs: Molecular Dynamics And Bioinformatics Study Of The Sarcin-Ricin Internal Loop

    PubMed Central

    Havrila, Marek; Réblová, Kamila; Zirbel, Craig L.; Leontis, Neocles B.; Šponer, Jiří

    2013-01-01

    The Sarcin-Ricin RNA motif (SR motif) is one of the most prominent recurrent RNA building blocks that occurs in many different RNA contexts and folds autonomously, i.e., in a context-independent manner. In this study, we combined bioinformatics analysis with explicit-solvent molecular dynamics (MD) simulations to better understand the relation between the RNA sequence and the evolutionary patterns of SR motif. SHAPE probing experiment was also performed to confirm fidelity of MD simulations. We identified 57 instances of the SR motif in a non-redundant subset of the RNA X-ray structure database and analyzed their basepairing, base-phosphate, and backbone-backbone interactions. We extracted sequences aligned to these instances from large ribosomal RNA alignments to determine frequency of occurrence for different sequence variants. We then used a simple scoring scheme based on isostericity to suggest 10 sequence variants with highly variable expected degree of compatibility with the SR motif 3D structure. We carried out MD simulations of SR motifs with these base substitutions. Non isosteric base substitutions led to unstable structures, but so did isosteric substitutions which were unable to make key base-phosphate interactions. MD technique explains why some potentially isosteric SR motifs are not realized during evolution. We also found that inability to form stable cWW geometry is an important factor in case of the first base pair of the flexible region of the SR motif. Comparison of structural, bioinformatics, SHAPE probing and MD simulation data reveals that explicit solvent MD simulations neatly reflect viability of different sequence variants of the SR motif. Thus, MD simulations can efficiently complement bioinformatics tools in studies of conservation patterns of RNA motifs and provide atomistic insight into the role of their different signature interactions. PMID:24144333

  15. Evolutionary Origin and Conserved Structural Building Blocks of Riboswitches and Ribosomal RNAs: Riboswitches as Probable Target Sites for Aminoglycosides Interaction.

    PubMed

    Mehdizadeh Aghdam, Elnaz; Barzegar, Abolfazl; Hejazi, Mohammad Saeid

    2014-01-01

    Riboswitches, as noncoding RNA sequences, control gene expression through direct ligand binding. Sporadic reports on the structural relation of riboswitches with ribosomal RNAs (rRNA), raises an interest in possible similarity between riboswitches and rRNAs evolutionary origins. Since aminoglycoside antibiotics affect microbial cells through binding to functional sites of the bacterial rRNA, finding any conformational and functional relation between riboswitches/rRNAs is utmost important in both of medicinal and basic research. Analysis of the riboswitches structures were carried out using bioinformatics and computational tools. The possible functional similarity of riboswitches with rRNAs was evaluated based on the affinity of paromomycin antibiotic (targeting "A site" of 16S rRNA) to riboswitches via docking method. There was high structural similarity between riboswitches and rRNAs, but not any particular sequence based similarity between them was found. The building blocks including "hairpin loop containing UUU", "peptidyl transferase center conserved hairpin A loop"," helix 45" and "S2 (G8) hairpin" as high identical rRNA motifs were detected in all kinds of riboswitches. Surprisingly, binding energies of paromomycin with different riboswitches are considerably better than the binding energy of paromomycin with "16S rRNA A site". Therefore the high affinity of paromomycin to bind riboswitches in comparison with rRNA "A site" suggests a new insight about riboswitches as possible targets for aminoglycoside antibiotics. These findings are considered as a possible supporting evidence for evolutionary origin of riboswitches/rRNAs and also their role in the exertion of antibiotics effects to design new drugs based on the concomitant effects via rRNA/riboswitches.

  16. Structure analysis of the global metabolic regulator Crc from Pseudomonas aeruginosa.

    PubMed

    Wei, Yong; Zhang, Heng; Gao, Zeng-Qiang; Xu, Jian-Hua; Liu, Quan-Sheng; Dong, Yu-Hui

    2013-01-01

    The global metabolic regulator catabolite repression control (Crc) has recently been found to modulate the susceptibility to antibiotics and virulence in the opportunistic pathogen Pseudomonas aeruginosa and been suggested as a nonlethal target for novel antimicrobials. In P. aeruginosa, Crc couples with the CA motifs from the small RNA CrcZ to form a post-transcriptional regulator system and is removed from the 5'-end of the target mRNAs. In this study, we first reported the crystal structure of Crc from P. aeruginosa refined to 2.20 Å. The structure showed that it consists of two halves with similar overall topology and there are 11 β strands surrounded by 13 helices, forming a four-layered α/β-sandwich. The circular dichroism spectroscopy revealed that it is thermostable in solution and shares similar characteristics to that in crystal. Comprehensive structural analysis and comparison with the homologies of Crc showed high similarity with several known nucleases and consequently may be classified into a member exodeoxyribonuclease III. However, it shows distinct substrate specificity (RNA as the preferred substrate) compared to these DNA endonucleases. Structural comparisons also revealed potential RNA recognition and binding region mainly consisting of five flexible loops. Our structure study provided the basis for the future application of Crc as a target to develop new antibiotics. Copyright © 2012 International Union of Biochemistry and Molecular Biology, Inc.

  17. Highly sensitive and selective microRNA detection based on DNA-bio-bar-code and enzyme-assisted strand cycle exponential signal amplification.

    PubMed

    Dong, Haifeng; Meng, Xiangdan; Dai, Wenhao; Cao, Yu; Lu, Huiting; Zhou, Shufeng; Zhang, Xueji

    2015-04-21

    Herein, a highly sensitive and selective microRNA (miRNA) detection strategy using DNA-bio-bar-code amplification (BCA) and Nb·BbvCI nicking enzyme-assisted strand cycle for exponential signal amplification was designed. The DNA-BCA system contains a locked nucleic acid (LNA) modified DNA probe for improving hybridization efficiency, while a signal reported molecular beacon (MB) with an endonuclease recognition site was designed for strand cycle amplification. In the presence of target miRNA, the oligonucleotides functionalized magnetic nanoprobe (MNP-DNA) and gold nanoprobe (AuNP-DNA) with numerous reported probes (RP) can hybridize with target miRNA, respectively, to form a sandwich structure. After sandwich structures were separated from the solution by the magnetic field, the RP were released under high temperature to recognize the MB and cleaved the hairpin DNA to induce the dissociation of RP. The dissociated RP then triggered the next strand cycle to produce exponential fluorescent signal amplification for miRNA detection. Under optimized conditions, the exponential signal amplification system shows a good linear range of 6 orders of magnitude (from 0.3 pM to 3 aM) with limit of detection (LOD) down to 52.5 zM, while the sandwich structure renders the system with high selectivity. Meanwhile, the feasibility of the proposed strategy for cell miRNA detection was confirmed by analyzing miRNA-21 in HeLa lysates. Given the high-performance for miRNA analysis, the strategy has a promising application in biological detection and in clinical diagnosis.

  18. Noncanoncial signal recognition particle RNAs in a major eukaryotic phylum revealed by purification of SRP from the human pathogen Cryptococcus neoformans

    PubMed Central

    Dumesic, Phillip A.; Rosenblad, Magnus A.; Samuelsson, Tore; Nguyen, Tiffany; Moresco, James J.; Yates, John R.; Madhani, Hiten D.

    2015-01-01

    Despite conservation of the signal recognition particle (SRP) from bacteria to man, computational approaches have failed to identify SRP components from genomes of many lower eukaryotes, raising the possibility that they have been lost or altered in those lineages. We report purification and analysis of SRP in the human pathogen Cryptococcus neoformans, providing the first description of SRP in basidiomycetous yeast. The C. neoformans SRP RNA displays a predicted structure in which the universally conserved helix 8 contains an unprecedented stem-loop insertion. Guided by this sequence, we computationally identified 152 SRP RNAs throughout the phylum Basidiomycota. This analysis revealed additional helix 8 alterations including single and double stem-loop insertions as well as loop diminutions affecting RNA structural elements that are otherwise conserved from bacteria to man. Strikingly, these SRP RNA features in Basidiomycota are accompanied by phylum-specific alterations in the RNA-binding domain of Srp54, the SRP protein subunit that directly interacts with helix 8. Our findings reveal unexpected fungal SRP diversity and suggest coevolution of the two most conserved SRP features—SRP RNA helix 8 and Srp54—in basidiomycetes. Because members of this phylum include important human and plant pathogens, these noncanonical features provide new targets for antifungal compound development. PMID:26275773

  19. The ancient history of the structure of ribonuclease P and the early origins of Archaea

    PubMed Central

    2010-01-01

    Background Ribonuclease P is an ancient endonuclease that cleaves precursor tRNA and generally consists of a catalytic RNA subunit (RPR) and one or more proteins (RPPs). It represents an important macromolecular complex and model system that is universally distributed in life. Its putative origins have inspired fundamental hypotheses, including the proposal of an ancient RNA world. Results To study the evolution of this complex, we constructed rooted phylogenetic trees of RPR molecules and substructures and estimated RPP age using a cladistic method that embeds structure directly into phylogenetic analysis. The general approach was used previously to study the evolution of tRNA, SINE RNA and 5S rRNA, the origins of metabolism, and the evolution and complexity of the protein world, and revealed here remarkable evolutionary patterns. Trees of molecules uncovered the tripartite nature of life and the early origin of archaeal RPRs. Trees of substructures showed molecules originated in stem P12 and were accessorized with a catalytic P1-P4 core structure before the first substructure was lost in Archaea. This core currently interacts with RPPs and ancient segments of the tRNA molecule. Finally, a census of protein domain structure in hundreds of genomes established RPPs appeared after the rise of metabolic enzymes at the onset of the protein world. Conclusions The study provides a detailed account of the history and early diversification of a fundamental ribonucleoprotein and offers further evidence in support of the existence of a tripartite organismal world that originated by the segregation of archaeal lineages from an ancient community of primordial organisms. PMID:20334683

  20. Molecular cloning, expression pattern, and 3D structural prediction of the cold inducible RNA-binding protein (CIRP) in Japanese flounder ( Paralichthys olivaceus)

    NASA Astrophysics Data System (ADS)

    Yang, Xiao; Gao, Jinning; Ma, Liman; Li, Zan; Wang, Wenji; Wang, Zhongkai; Yu, Haiyang; Qi, Jie; Wang, Xubo; Wang, Zhigang; Zhang, Quanqi

    2015-02-01

    Cold-inducible RNA-binding protein (CIRP) is a kind of RNA binding proteins that plays important roles in many physiological processes. The CIRP has been widely studied in mammals and amphibians since it was first cloned from mammals. On the contrary, there are little reports in teleosts. In this study, the Po CIRP gene of the Japanese flounder was cloned and sequenced. The genomic sequence consists of seven exons and six introns. The putative PoCIRP protein of flounder was 198 amino acid residues long containing the RNA recognition motif (RRM). Phylogenetic analysis showed that the flounder PoCIRP is highly conserved with other teleost CIRPs. The 5' flanking sequence was cloned by genome walking and many transcription factor binding sites were identified. There is a CpGs region located in promoter and exon I region and the methylation state is low. Quantitative real-time PCR analysis uncovered that Po CIRP gene was widely expressed in adult tissues with the highest expression level in the ovary. The mRNA of the Po CIRP was maternally deposited and the expression level of the gene was regulated up during the gastrula and neurula stages. In order to gain the information how the protein interacts with mRNA, we performed the modeling of the 3D structure of the flounder PoCIRP. The results showed a cleft existing the surface of the molecular. Taken together, the results indicate that the CIRP is a multifunctional molecular in teleosts and the findings about the structure provide valuable information for understanding the basis of this protein's function.

  1. A statistical analysis of RNA folding algorithms through thermodynamic parameter perturbation.

    PubMed

    Layton, D M; Bundschuh, R

    2005-01-01

    Computational RNA secondary structure prediction is rather well established. However, such prediction algorithms always depend on a large number of experimentally measured parameters. Here, we study how sensitive structure prediction algorithms are to changes in these parameters. We found already that for changes corresponding to the actual experimental error to which these parameters have been determined, 30% of the structure are falsely predicted whereas the ground state structure is preserved under parameter perturbation in only 5% of all the cases. We establish that base-pairing probabilities calculated in a thermal ensemble are viable although not a perfect measure for the reliability of the prediction of individual structure elements. Here, a new measure of stability using parameter perturbation is proposed, and its limitations are discussed.

  2. In silico molecular docking analysis of the human Argonaute 2 PAZ domain reveals insights into RNA interference.

    PubMed

    Kandeel, Mahmoud; Kitade, Yukio

    2013-07-01

    RNA interference (RNAi) is a critical cellular pathway activated by double stranded RNA and regulates the gene expression of target mRNA. During RNAi, the 3' end of siRNA binds with the PAZ domain, followed by release and rebinding in a cyclic manner, which deemed essential for proper gene silencing. Recently, we provided the forces underlying the recognition of small interfering RNA by PAZ in a computational study based on the structure of Drosophila Argonaute 2 (Ago2) PAZ domain. We have now reanalyzed these data within the view of the new available structures from human Argonauts. While the parameters of weak binding are correlated with higher (RNAi) in the Drosophila model, a different profile is predicted with the human Ago2 PAZ domain. On the basis of the human Ago2 PAZ models, the indicators of stronger binding as the total binding energy and the free energy were associated with better RNAi efficacy. This discrepancy might be attributable to differences in the binding site topology and the difference in the conformation of the bound nucleotides.

  3. NMR Structural Profiling of Transcriptional Intermediates Reveals Riboswitch Regulation by Metastable RNA Conformations.

    PubMed

    Helmling, Christina; Wacker, Anna; Wolfinger, Michael T; Hofacker, Ivo L; Hengesbach, Martin; Fürtig, Boris; Schwalbe, Harald

    2017-02-22

    Gene repression induced by the formation of transcriptional terminators represents a prime example for the coupling of RNA synthesis, folding, and regulation. In this context, mapping the changes in available conformational space of transcription intermediates during RNA synthesis is important to understand riboswitch function. A majority of riboswitches, an important class of small metabolite-sensing regulatory RNAs, act as transcriptional regulators, but the dependence of ligand binding and the subsequent allosteric conformational switch on mRNA transcript length has not yet been investigated. We show a strict fine-tuning of binding and sequence-dependent alterations of conformational space by structural analysis of all relevant transcription intermediates at single-nucleotide resolution for the I-A type 2'dG-sensing riboswitch from Mesoplasma florum by NMR spectroscopy. Our results provide a general framework to dissect the coupling of synthesis and folding essential for riboswitch function, revealing the importance of metastable states for RNA-based gene regulation.

  4. Alignment of RNA molecules: Binding energy and statistical properties of random sequences

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Valba, O. V., E-mail: valbaolga@gmail.com; Nechaev, S. K., E-mail: sergei.nechaev@gmail.com; Tamm, M. V., E-mail: thumm.m@gmail.com

    2012-02-15

    A new statistical approach to the problem of pairwise alignment of RNA sequences is proposed. The problem is analyzed for a pair of interacting polymers forming an RNA-like hierarchical cloverleaf structures. An alignment is characterized by the numbers of matches, mismatches, and gaps. A weight function is assigned to each alignment; this function is interpreted as a free energy taking into account both direct monomer-monomer interactions and a combinatorial contribution due to formation of various cloverleaf secondary structures. The binding free energy is determined for a pair of RNA molecules. Statistical properties are discussed, including fluctuations of the binding energymore » between a pair of RNA molecules and loop length distribution in a complex. Based on an analysis of the free energy per nucleotide pair complexes of random RNAs as a function of the number of nucleotide types c, a hypothesis is put forward about the exclusivity of the alphabet c = 4 used by nature.« less

  5. Protocols for Molecular Dynamics Simulations of RNA Nanostructures.

    PubMed

    Kim, Taejin; Kasprzak, Wojciech K; Shapiro, Bruce A

    2017-01-01

    Molecular dynamics (MD) simulations have been used as one of the main research tools to study a wide range of biological systems and bridge the gap between X-ray crystallography or NMR structures and biological mechanism. In the field of RNA nanostructures, MD simulations have been used to fix steric clashes in computationally designed RNA nanostructures, characterize the dynamics, and investigate the interaction between RNA and other biomolecules such as delivery agents and membranes.In this chapter we present examples of computational protocols for molecular dynamics simulations in explicit and implicit solvent using the Amber Molecular Dynamics Package. We also show examples of post-simulation analysis steps and briefly mention selected tools beyond the Amber package. Limitations of the methods, tools, and protocols are also discussed. Most of the examples are illustrated for a small RNA duplex (helix), but the protocols are applicable to any nucleic acid structure, subject only to the computational speed and memory limitations of the hardware available to the user.

  6. Mechanism for priming DNA synthesis by yeast DNA Polymerase α

    PubMed Central

    Perera, Rajika L; Torella, Rubben; Klinge, Sebastian; Kilkenny, Mairi L; Maman, Joseph D; Pellegrini, Luca

    2013-01-01

    The DNA Polymerase α (Pol α)/primase complex initiates DNA synthesis in eukaryotic replication. In the complex, Pol α and primase cooperate in the production of RNA-DNA oligonucleotides that prime synthesis of new DNA. Here we report crystal structures of the catalytic core of yeast Pol α in unliganded form, bound to an RNA primer/DNA template and extending an RNA primer with deoxynucleotides. We combine the structural analysis with biochemical and computational data to demonstrate that Pol α specifically recognizes the A-form RNA/DNA helix and that the ensuing synthesis of B-form DNA terminates primer synthesis. The spontaneous release of the completed RNA-DNA primer by the Pol α/primase complex simplifies current models of primer transfer to leading- and lagging strand polymerases. The proposed mechanism of nucleotide polymerization by Pol α might contribute to genomic stability by limiting the amount of inaccurate DNA to be corrected at the start of each Okazaki fragment. DOI: http://dx.doi.org/10.7554/eLife.00482.001 PMID:23599895

  7. Direct observation of processive exoribonuclease motion using optical tweezers.

    PubMed

    Fazal, Furqan M; Koslover, Daniel J; Luisi, Ben F; Block, Steven M

    2015-12-08

    Bacterial RNases catalyze the turnover of RNA and are essential for gene expression and quality surveillance of transcripts. In Escherichia coli, the exoribonucleases RNase R and polynucleotide phosphorylase (PNPase) play critical roles in degrading RNA. Here, we developed an optical-trapping assay to monitor the translocation of individual enzymes along RNA-based substrates. Single-molecule records of motion reveal RNase R to be highly processive: one molecule can unwind over 500 bp of a structured substrate. However, enzyme progress is interrupted by pausing and stalling events that can slow degradation in a sequence-dependent fashion. We found that the distance traveled by PNPase through structured RNA is dependent on the A+U content of the substrate and that removal of its KH and S1 RNA-binding domains can reduce enzyme processivity without affecting the velocity. By a periodogram analysis of single-molecule records, we establish that PNPase takes discrete steps of six or seven nucleotides. These findings, in combination with previous structural and biochemical data, support an asymmetric inchworm mechanism for PNPase motion. The assay developed here for RNase R and PNPase is well suited to studies of other exonucleases and helicases.

  8. Design-Based Peptidomimetic Ligand Discovery to Target HIV TAR RNA Using Comparative Analysis of Different Docking Methods.

    PubMed

    Fu, Junjie; Xia, Amy; Dai, Yao; Qi, Xin

    2016-01-01

    Discovering molecules capable of binding to HIV trans-activation responsive region (TAR) RNA thereby disrupting its interaction with Tat protein is an attractive strategy for developing novel antiviral drugs. Computational docking is considered as a useful tool for predicting binding affinity and conducting virtual screening. Although great progress in predicting protein-ligand interactions has been achieved in the past few decades, modeling RNA-ligand interactions is still largely unexplored due to the highly flexible nature of RNA. In this work, we performed molecular docking study with HIV TAR RNA using previously identified cyclic peptide L22 and its analogues with varying affinities toward HIV-1 TAR RNA. Furthermore, sarcosine scan was conducted to generate derivatives of CGP64222, a peptide-peptoid hybrid with inhibitory activity on Tat/TAR RNA interaction. Each compound was docked using CDOCKER, Surflex-Dock and FlexiDock to compare the effectiveness of each method. It was found that FlexiDock energy values correlated well with the experimental Kd values and could be used to predict the affinity of the ligands toward HIV-1 TAR RNA with a superior accuracy. Our results based on comparative analysis of different docking methods in RNA-ligand modeling will facilitate the structure-based discovery of HIV TAR RNA ligands for antiviral therapy.

  9. Computational study of RNA folding kinetics and thermodynamics

    NASA Astrophysics Data System (ADS)

    Morgan, Steven Robert

    RNA in its many forms is involved in the processes of protein manufacture, gene splicing, catalysis and gene regulation. It is also the store of genetic information in some viruses. The function of the RNA is determined by its structure, and it is the purpose of this thesis to investigate kinetic and thermodynamic properties of RNA secondary structures in order to obtain a better understanding of their formation and function. Our main tenet is that kinetic formation of RNA structure is necessary to explain features found in natural RNA structures, as well as aspects of the biological function of RNA. Firstly we show that examination of the energies of fragments of RNA secondary structure provides evidence for kinetic formation of structure. Local regions of RNA of length less than about 100 nucleotides adopt a conformation with energy near or equal to the minimum possible for those regions, whilst the energies of larger domains are much further from the their respective minima. This is consistent with the patterns that would be expected if RNA structure is folded Idneticatic during transcription. A Monte-Carlo algorithm is then used to model the kinetic folding of RNA during transcriptional growth. The algorithm is capable of finding the correct structure of a natural RNA for which the minimum free energy approach is unsuccessful. In the viral phage MS2 Idneticatic formed RNA structure plays an important role in the regulation of gene expression. The folding algorithm can accurately model this by IdneticaUy controlling access to the gene initiation region. The algorithm is also successfully used to model the control of replication in the ColEl plasmid. Taking a different approach, we then use a simplified model of RNA secondary structure to investigate the size of energy barriers between degenerate minimum energy structures. This model has much in common with physical systems such as spin glasses, and in fact shows similar behaviour to these systems in that energy barriers between structures grow quickly with the length of the RNA sequence. These barriers will serve to trap RNA in non-optimal structures. Together these studies demonstrate the necessity of studying RNA secondary structure from a kinetic point of view, and provide clear directions in which further work may be taken. Kinetic models of RNA secondary structure should continue to prove useful in modelling the structure and function of RNA.

  10. The complete mitochondrial genome sequence of the maned wolf (Chrysocyon brachyurus).

    PubMed

    Zhao, Chao; Yang, Xiufeng; Zhang, Honghai; Zhang, Jin; Chen, Lei; Sha, Weilai; Liu, Guangshuai

    2016-01-01

    In this study, the complete mitochondrial genome of the maned wolf (Chrysocyon brachyurus), the unique species in Chrysocyon, was sequenced and reported for the first time using blood samples obtained from a female individual in Shanghai Zoo, China. Sequence analysis showed that the genome structure was in accordance with other Canidae species and it contained 12 S rRNA gene, 16 S rRNA gene, 22 tRNA genes, 13 protein-coding genes and 1 control region.

  11. Structure and reconstitution of yeast Mpp6-nuclear exosome complexes reveals that Mpp6 stimulates RNA decay and recruits the Mtr4 helicase

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wasmuth, Elizabeth V.; Zinder, John C.; Zattas, Dimitrios

    Nuclear RNA exosomes catalyze a range of RNA processing and decay activities that are coordinated in part by cofactors, including Mpp6, Rrp47, and the Mtr4 RNA helicase. Mpp6 interacts with the nine-subunit exosome core, while Rrp47 stabilizes the exoribonuclease Rrp6 and recruits Mtr4, but it is less clear if these cofactors work together. Using biochemistry with Saccharomyces cerevisiae proteins, we show that Rrp47 and Mpp6 stimulate exosome-mediated RNA decay, albeit with unique dependencies on elements within the nuclear exosome. Mpp6-exosomes can recruit Mtr4, while Mpp6 and Rrp47 each contribute to Mtr4-dependent RNA decay, with maximal Mtr4-dependent decay observed with bothmore » cofactors. The 3.3 Å structure of a twelve-subunit nuclear Mpp6 exosome bound to RNA shows the central region of Mpp6 bound to the exosome core, positioning its Mtr4 recruitment domain next to Rrp6 and the exosome central channel. Genetic analysis reveals interactions that are largely consistent with our model.« less

  12. The role of the mitochondrial ribosome in human disease: searching for mutations in 12S mitochondrial rRNA with high disruptive potential

    PubMed Central

    Smith, Paul M.; Elson, Joanna L.; Greaves, Laura C.; Wortmann, Saskia B.; Rodenburg, Richard J.T.; Lightowlers, Robert N.; Chrzanowska-Lightowlers, Zofia M.A.; Taylor, Robert W.; Vila-Sanjurjo, Antón

    2014-01-01

    Mutations of mitochondrial DNA are linked to many human diseases. Despite the identification of a large number of variants in the mitochondrially encoded rRNA (mt-rRNA) genes, the evidence supporting their pathogenicity is, at best, circumstantial. Establishing the pathogenicity of these variations is of major diagnostic importance. Here, we aim to estimate the disruptive effect of mt-rRNA variations on the function of the mitochondrial ribosome. In the absence of direct biochemical methods to study the effect of mt-rRNA variations, we relied on the universal conservation of the rRNA fold to infer their disruptive potential. Our method, named heterologous inferential analysis or HIA, combines conservational information with functional and structural data obtained from heterologous ribosomal sources. Thus, HIA's predictive power is superior to the traditional reliance on simple conservation indexes. By using HIA, we have been able to evaluate the disruptive potential for a subset of uncharacterized 12S mt-rRNA variations. Our analysis revealed the existence of variations in the rRNA component of the human mitoribosome with different degrees of disruptive power. In cases where sufficient information regarding the genetic and pathological manifestation of the mitochondrial phenotype is available, HIA data can be used to predict the pathogenicity of mt-rRNA mutations. In other cases, HIA analysis will allow the prioritization of variants for additional investigation. Eventually, HIA-inspired analysis of potentially pathogenic mt-rRNA variations, in the context of a scoring system specifically designed for these variants, could lead to a powerful diagnostic tool. PMID:24092330

  13. An efficient algorithm for planar drawing of RNA structures with pseudoknots of any type.

    PubMed

    Byun, Yanga; Han, Kyungsook

    2016-06-01

    An RNA pseudoknot is a tertiary structural element in which bases of a loop pair with complementary bases are outside the loop. A drawing of RNA secondary structures is a tree, but a drawing of RNA pseudoknots is a graph that has an inner cycle within a pseudoknot and possibly outer cycles formed between the pseudoknot and other structural elements. Visualizing a large-scale RNA structure with pseudoknots as a planar drawing is challenging because a planar drawing of an RNA structure requires both pseudoknots and an entire structure enclosing the pseudoknots to be embedded into a plane without overlapping or crossing. This paper presents an efficient heuristic algorithm for visualizing a pseudoknotted RNA structure as a planar drawing. The algorithm consists of several parts for finding crossing stems and page mapping the stems, for the layout of stem-loops and pseudoknots, and for overlap detection between structural elements and resolving it. Unlike previous algorithms, our algorithm generates a planar drawing for a large RNA structure with pseudoknots of any type and provides a bracket view of the structure. It generates a compact and aesthetic structure graph for a large pseudoknotted RNA structure in O([Formula: see text]) time, where n is the number of stems of the RNA structure.

  14. Computational strategies for the automated design of RNA nanoscale structures from building blocks using NanoTiler.

    PubMed

    Bindewald, Eckart; Grunewald, Calvin; Boyle, Brett; O'Connor, Mary; Shapiro, Bruce A

    2008-10-01

    One approach to designing RNA nanoscale structures is to use known RNA structural motifs such as junctions, kissing loops or bulges and to construct a molecular model by connecting these building blocks with helical struts. We previously developed an algorithm for detecting internal loops, junctions and kissing loops in RNA structures. Here we present algorithms for automating or assisting many of the steps that are involved in creating RNA structures from building blocks: (1) assembling building blocks into nanostructures using either a combinatorial search or constraint satisfaction; (2) optimizing RNA 3D ring structures to improve ring closure; (3) sequence optimisation; (4) creating a unique non-degenerate RNA topology descriptor. This effectively creates a computational pipeline for generating molecular models of RNA nanostructures and more specifically RNA ring structures with optimized sequences from RNA building blocks. We show several examples of how the algorithms can be utilized to generate RNA tecto-shapes.

  15. Computational strategies for the automated design of RNA nanoscale structures from building blocks using NanoTiler☆

    PubMed Central

    Bindewald, Eckart; Grunewald, Calvin; Boyle, Brett; O’Connor, Mary; Shapiro, Bruce A.

    2013-01-01

    One approach to designing RNA nanoscale structures is to use known RNA structural motifs such as junctions, kissing loops or bulges and to construct a molecular model by connecting these building blocks with helical struts. We previously developed an algorithm for detecting internal loops, junctions and kissing loops in RNA structures. Here we present algorithms for automating or assisting many of the steps that are involved in creating RNA structures from building blocks: (1) assembling building blocks into nanostructures using either a combinatorial search or constraint satisfaction; (2) optimizing RNA 3D ring structures to improve ring closure; (3) sequence optimisation; (4) creating a unique non-degenerate RNA topology descriptor. This effectively creates a computational pipeline for generating molecular models of RNA nanostructures and more specifically RNA ring structures with optimized sequences from RNA building blocks. We show several examples of how the algorithms can be utilized to generate RNA tecto-shapes. PMID:18838281

  16. Structure-seq2: sensitive and accurate genome-wide profiling of RNA structure in vivo

    PubMed Central

    Ritchey, Laura E.; Su, Zhao; Tang, Yin; Tack, David C.

    2017-01-01

    Abstract RNA serves many functions in biology such as splicing, temperature sensing, and innate immunity. These functions are often determined by the structure of RNA. There is thus a pressing need to understand RNA structure and how it changes during diverse biological processes both in vivo and genome-wide. Here, we present Structure-seq2, which provides nucleotide-resolution RNA structural information in vivo and genome-wide. This optimized version of our original Structure-seq method increases sensitivity by at least 4-fold and improves data quality by minimizing formation of a deleterious by-product, reducing ligation bias, and improving read coverage. We also present a variation of Structure-seq2 in which a biotinylated nucleotide is incorporated during reverse transcription, which greatly facilitates the protocol by eliminating two PAGE purification steps. We benchmark Structure-seq2 on both mRNA and rRNA structure in rice (Oryza sativa). We demonstrate that Structure-seq2 can lead to new biological insights. Our Structure-seq2 datasets uncover hidden breaks in chloroplast rRNA and identify a previously unreported N1-methyladenosine (m1A) in a nuclear-encoded Oryza sativa rRNA. Overall, Structure-seq2 is a rapid, sensitive, and unbiased method to probe RNA in vivo and genome-wide that facilitates new insights into RNA biology. PMID:28637286

  17. Dawn of the in vivo RNA structurome and interactome.

    PubMed

    Kwok, Chun Kit

    2016-10-15

    RNA is one of the most fascinating biomolecules in living systems given its structural versatility to fold into elaborate architectures for important biological functions such as gene regulation, catalysis, and information storage. Knowledge of RNA structures and interactions can provide deep insights into their functional roles in vivo For decades, RNA structural studies have been conducted on a transcript-by-transcript basis. The advent of next-generation sequencing (NGS) has enabled the development of transcriptome-wide structural probing methods to profile the global landscape of RNA structures and interactions, also known as the RNA structurome and interactome, which transformed our understanding of the RNA structure-function relationship on a transcriptomic scale. In this review, molecular tools and NGS methods used for RNA structure probing are presented, novel insights uncovered by RNA structurome and interactome studies are highlighted, and perspectives on current challenges and potential future directions are discussed. A more complete understanding of the RNA structures and interactions in vivo will help illuminate the novel roles of RNA in gene regulation, development, and diseases. © 2016 The Author(s); published by Portland Press Limited on behalf of the Biochemical Society.

  18. RNA 3D Structural Motifs: Definition, Identification, Annotation, and Database Searching

    NASA Astrophysics Data System (ADS)

    Nasalean, Lorena; Stombaugh, Jesse; Zirbel, Craig L.; Leontis, Neocles B.

    Structured RNA molecules resemble proteins in the hierarchical organization of their global structures, folding and broad range of functions. Structured RNAs are composed of recurrent modular motifs that play specific functional roles. Some motifs direct the folding of the RNA or stabilize the folded structure through tertiary interactions. Others bind ligands or proteins or catalyze chemical reactions. Therefore, it is desirable, starting from the RNA sequence, to be able to predict the locations of recurrent motifs in RNA molecules. Conversely, the potential occurrence of one or more known 3D RNA motifs may indicate that a genomic sequence codes for a structured RNA molecule. To identify known RNA structural motifs in new RNA sequences, precise structure-based definitions are needed that specify the core nucleotides of each motif and their conserved interactions. By comparing instances of each recurrent motif and applying base pair isosteriCity relations, one can identify neutral mutations that preserve its structure and function in the contexts in which it occurs.

  19. High-throughput Screening Identification of Poliovirus RNA-dependent RNA Polymerase Inhibitors

    PubMed Central

    Campagnola, Grace; Gong, Peng; Peersen, Olve B.

    2011-01-01

    Viral RNA-dependent RNA polymerase (RdRP) enzymes are essential for the replication of positive-strand RNA viruses and established targets for the development of selective antiviral therapeutics. In this work we have carried out a high-throughput screen of 154,267 compounds to identify poliovirus polymerase inhibitors using a fluorescence based RNA elongation assay. Screening and subsequent validation experiments using kinetic methods and RNA product analysis resulted in the identification of seven inhibitors that affect the RNA binding, initiation, or elongation activity of the polymerase. X-ray crystallography data show clear density for five of the compounds in the active site of the poliovirus polymerase elongation complex. The inhibitors occupy the NTP binding site by stacking on the priming nucleotide and interacting with the templating base, yet competition studies show fairly weak IC50 values in the low μM range. A comparison with nucleotide bound structures suggests that weak binding is likely due to the lack of a triphosphate group on the inhibitors. Consequently, the inhibitors are primarily effective at blocking polymerase initiation and do not effectively compete with NTP binding during processive elongation. These findings are discussed in the context of the polymerase elongation complex structure and allosteric control of the viral RdRP catalytic cycle. PMID:21722674

  20. Amino acid repeats avert mRNA folding through conservative substitutions and synonymous codons, regardless of codon bias.

    PubMed

    Barik, Sailen

    2017-12-01

    A significant number of proteins in all living species contains amino acid repeats (AARs) of various lengths and compositions, many of which play important roles in protein structure and function. Here, I have surveyed select homopolymeric single [(A)n] and double [(AB)n] AARs in the human proteome. A close examination of their codon pattern and analysis of RNA structure propensity led to the following set of empirical rules: (1) One class of amino acid repeats (Class I) uses a mixture of synonymous codons, some of which approximate the codon bias ratio in the overall human proteome; (2) The second class (Class II) disregards the codon bias ratio, and appears to have originated by simple repetition of the same codon (or just a few codons); and finally, (3) In all AARs (including Class I, Class II, and the in-betweens), the codons are chosen in a manner that precludes the formation of RNA secondary structure. It appears that the AAR genes have evolved by orchestrating a balance between codon usage and mRNA secondary structure. The insights gained here should provide a better understanding of AAR evolution and may assist in designing synthetic genes.

  1. Improve the prediction of RNA-binding residues using structural neighbours.

    PubMed

    Li, Quan; Cao, Zanxia; Liu, Haiyan

    2010-03-01

    The interactions between RNA-binding proteins (RBPs) with RNA play key roles in managing some of the cell's basic functions. The identification and prediction of RNA binding sites is important for understanding the RNA-binding mechanism. Computational approaches are being developed to predict RNA-binding residues based on the sequence- or structure-derived features. To achieve higher prediction accuracy, improvements on current prediction methods are necessary. We identified that the structural neighbors of RNA-binding and non-RNA-binding residues have different amino acid compositions. Combining this structure-derived feature with evolutionary (PSSM) and other structural information (secondary structure and solvent accessibility) significantly improves the predictions over existing methods. Using a multiple linear regression approach and 6-fold cross validation, our best model can achieve an overall correct rate of 87.8% and MCC of 0.47, with a specificity of 93.4%, correctly predict 52.4% of the RNA-binding residues for a dataset containing 107 non-homologous RNA-binding proteins. Compared with existing methods, including the amino acid compositions of structure neighbors lead to clearly improvement. A web server was developed for predicting RNA binding residues in a protein sequence (or structure),which is available at http://mcgill.3322.org/RNA/.

  2. Regulatory effects of cotranscriptional RNA structure formation and transitions.

    PubMed

    Liu, Sheng-Rui; Hu, Chun-Gen; Zhang, Jin-Zhi

    2016-09-01

    RNAs, which play significant roles in many fundamental biological processes of life, fold into sophisticated and precise structures. RNA folding is a dynamic and intricate process, which conformation transition of coding and noncoding RNAs form the primary elements of genetic regulation. The cellular environment contains various intrinsic and extrinsic factors that potentially affect RNA folding in vivo, and experimental and theoretical evidence increasingly indicates that the highly flexible features of the RNA structure are affected by these factors, which include the flanking sequence context, physiochemical conditions, cis RNA-RNA interactions, and RNA interactions with other molecules. Furthermore, distinct RNA structures have been identified that govern almost all steps of biological processes in cells, including transcriptional activation and termination, transcriptional mutagenesis, 5'-capping, splicing, 3'-polyadenylation, mRNA export and localization, and translation. Here, we briefly summarize the dynamic and complex features of RNA folding along with a wide variety of intrinsic and extrinsic factors that affect RNA folding. We then provide several examples to elaborate RNA structure-mediated regulation at the transcriptional and posttranscriptional levels. Finally, we illustrate the regulatory roles of RNA structure and discuss advances pertaining to RNA structure in plants. WIREs RNA 2016, 7:562-574. doi: 10.1002/wrna.1350 For further resources related to this article, please visit the WIREs website. © 2016 Wiley Periodicals, Inc.

  3. Structural and functional analyses of Saccharomyces cerevisiae wild-type and mutant RNA1 genes.

    PubMed Central

    Traglia, H M; Atkinson, N S; Hopper, A K

    1989-01-01

    The yeast gene RNA1 has been defined by the thermosensitive rna1-1 lesion. This lesion interferes with the processing and production of all major classes of RNA. Each class of RNA is affected at a distinct and presumably unrelated step. Furthermore, RNA does not appear to exit the nucleus. To investigate how the RNA1 gene product can pleiotropically affect disparate processes, we undertook a structural analysis of wild-type and mutant RNA1 genes. The wild-type gene was found to contain a 407-amino-acid open reading frame that encodes a hydrophilic protein. No clue regarding the function of the RNA1 protein was obtained by searching banks for similarity to other known gene products. Surprisingly, the rna1-1 lesion was found to code for two amino acid differences from wild type. We found that neither single-amino-acid change alone resulted in temperature sensitivity. The carboxy-terminal region of the RNA1 open reading frame contains a highly acidic domain extending from amino acids 334 to 400. We generated genomic deletions that removed C-terminal regions of this protein. Deletion of amino acids 397 to 407 did not appear to affect cell growth. Removal of amino acids 359 to 397, a region containing 24 acidic residues, caused temperature-sensitive growth. This allele, rna1-delta 359-397, defines a second conditional lesion of the RNA1 locus. We found that strains possessing the rna1-delta 359-397 allele did not show thermosensitive defects in pre-rRNA or pre-tRNA processing. Removal of amino acids 330 to 407 resulted in loss of viability. Images PMID:2674676

  4. Changes in beta-actin mRNA expression in remodeling canine myocardium.

    PubMed

    Carlyle, W C; Toher, C A; Vandervelde, J R; McDonald, K M; Homans, D C; Cohn, J N

    1996-01-01

    Beta-actin, a cytoskeletal protein important in the maintenance of cytoarchitecture, has long been thought to be expressed constitutively in myocardial tissue. As such, beta-actin mRNA has been used as a control gene in a wide range of experiments. However, we have uncovered consistent changes in beta-actin mRNA expression in canine myocardium remodeling as a result of insult to the left ventricle. The experimental canine models used were either DC shock damage to the left ventricle or volume overload resulting from severe mitral regurgitation. The remodeling process in both canine models is characterized by an increase in left ventricular mass. PCR amplification using primers designed to selectively amplify the 3' end and a portion of the 3' untranslated region of beta-actin mRNA resulted in the generation of a 297 base pair product predominant only in normal canine myocardium and a 472 base pair product that became increasingly prominent from 1 to 30 days after DC shock damage to the left ventricle and from 10 to 90 days after creation of mitral regurgitation. Northern analysis showed a three-fold increase in beta-actin mRNA after either DC shock or creation of mitral regurgitation. Western analysis revealed an early increase in beta-actin protein followed by an apparent decrease to below baseline levels. These observations suggest that changes in beta-actin mRNA expression accompany the structural alterations that occur in response to myocardial damage. Whether or not the changes in beta-actin mRNA expression play a role in mediating these structural alterations remains to be determined.

  5. The molecular variability analysis of the RNA 3 of fifteen isolates of Prunus necrotic ringspot virus sheds light on the minimal requirements for the synthesis of its subgenomic RNA.

    PubMed

    Aparicio, Frederic; Pallás, Vicente

    2002-01-01

    The nucleotide sequences of the RNA 3 of fifteen isolates of Prunus necrotic ringspot virus (PNRSV) varying in the symptomatology they cause in six different Prunus spp. were determined. Analysis of the molecular variability has allowed, in addition to study the phylogenetic relationships among them, to evaluate the minimal requirements for the synthesis of the subgenomic RNA in Ilarvirus genus and their comparison to other members of the Bromoviridae family. Computer assisted comparisons led recently to Jaspars (Virus Genes 17, 233-242, 1998) to propose that a hairpin structure in viral minus strand RNA is required for subgenomic promoter activity of viruses from at least two, and possibly all five, genera in the family of Bromoviridae. For PNRSV and Apple mosaic virus two stable hairpins were proposed whereas for the rest of Ilarviruses and the other four genera of the Bromoviridae family only one stable hairpin was predicted. Comparative analysis of this region among the fifteen PNRSV isolates characterized in this study revealed that two of them showed a 12-nt deletion that led to the disappearance of the most proximal hairpin to the initiation site. Interestingly, the only hairpin found in these two isolates is very similar in primary and secondary structure to the one previously shown in Brome mosaic virus to be required for the synthesis of the subgenomic RNA. In this hairpin, the molecular diversity was concentrated mostly at the loop whereas compensatory mutations were observed at the base of the stem strongly suggesting its functional relevance. The evolutionary implications of these observations are discussed.

  6. Cis elements and trans-acting factors involved in the RNA dimerization of the human immunodeficiency virus HIV-1.

    PubMed

    Darlix, J L; Gabus, C; Nugeyre, M T; Clavel, F; Barré-Sinoussi, F

    1990-12-05

    The retroviral genome consists of two identical RNA molecules joined at their 5' ends by the Dimer Linkage Structure (DLS). To study the mechanism of dimerization and the DLS of HIV-1 RNA, large amounts of bona fide HIV-1 RNA and of mutants have been synthesized in vitro. We report that HIV-1 RNA forms dimeric molecules and that viral nucleocapsid (NC) protein NCp15 greatly activates dimerization. Deletion mutagenesis in the RNA 5' 1333 nucleotides indicated that a small domain of 100 nucleotides, located between positions 311 to 415 from the 5' end, is necessary and sufficient to promote HIV-1 RNA dimerization. This dimerization domain encompasses an encapsidation element located between the 5' splice donor site and initiator AUG of gag and shows little sequence variations in different strains of HIV-1. Furthermore, cross-linking analysis of the interactions between NC and HIV-1 RNA (311 to 415) locates a major contact site in the encapsidation element of HIV-1 RNA. The genomic RNA dimer is tightly associated with nucleocapsid protein molecules in avian and murine retroviruses, and this ribonucleoprotein structure is believed to be the template for reverse transcription. Genomic RNA-protein interactions have been analyzed in human immunodeficiency virus (HIV) virions and results showed that NC protein molecules are tightly bound to the genomic RNA dimer. Since retroviral RNA dimerization and packaging appear to be under the control of the same cis element, the encapsidation sequences, and trans-acting factor, the NC protein, they are probably related events in the course of virion assembly.

  7. Active bacterial community structure along vertical redox gradients in Baltic Sea sediment

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Jansson, Janet; Edlund, Anna; Hardeman, Fredrik

    Community structures of active bacterial populations were investigated along a vertical redox profile in coastal Baltic Sea sediments by terminal-restriction fragment length polymorphism (T-RFLP) and clone library analysis. According to correspondence analysis of T-RFLP results and sequencing of cloned 16S rRNA genes, the microbial community structures at three redox depths (179 mV, -64 mV and -337 mV) differed significantly. The bacterial communities in the community DNA differed from those in bromodeoxyuridine (BrdU)-labeled DNA, indicating that the growing members of the community that incorporated BrdU were not necessarily the most dominant members. The structures of the actively growing bacterial communities weremore » most strongly correlated to organic carbon followed by total nitrogen and redox potentials. Bacterial identification by sequencing of 16S rRNA genes from clones of BrdU-labeled DNA and DNA from reverse transcription PCR (rt-PCR) showed that bacterial taxa involved in nitrogen and sulfur cycling were metabolically active along the redox profiles. Several sequences had low similarities to previously detected sequences indicating that novel lineages of bacteria are present in Baltic Sea sediments. Also, a high number of different 16S rRNA gene sequences representing different phyla were detected at all sampling depths.« less

  8. Transcriptome-wide Analysis of Exosome Targets

    PubMed Central

    Schneider, Claudia; Kudla, Grzegorz; Wlotzka, Wiebke; Tuck, Alex; Tollervey, David

    2012-01-01

    Summary The exosome plays major roles in RNA processing and surveillance but the in vivo target range and substrate acquisition mechanisms remain unclear. Here we apply in vivo RNA crosslinking (CRAC) to the nucleases (Rrp44, Rrp6), two structural subunits (Rrp41, Csl4) and a cofactor (Trf4) of the yeast exosome. Analysis of wild-type Rrp44 and catalytic mutants showed that both the CUT and SUT classes of non-coding RNA, snoRNAs and, most prominently, pre-tRNAs and other Pol III transcripts are targeted for oligoadenylation and exosome degradation. Unspliced pre-mRNAs were also identified as targets for Rrp44 and Rrp6. CRAC performed using cleavable proteins (split-CRAC) revealed that Rrp44 endonuclease and exonuclease activities cooperate on most substrates. Mapping oligoadenylated reads suggests that the endonuclease activity may release stalled exosome substrates. Rrp6 was preferentially associated with structured targets, which frequently did not associate with the core exosome indicating that substrates follow multiple pathways to the nucleases. PMID:23000172

  9. Identification and characterization of circular RNAs in zebrafish.

    PubMed

    Shen, Yudong; Guo, Xianwu; Wang, Weimin

    2017-01-01

    Circular RNA (circRNA), a class of RNAs with circular structure, has received little attention until recently, when some new features and functions were discovered. In the present study, we sequenced circRNAs in zebrafish (Danio rerio) and identified 3868 circRNAs using three algorithms (find_circ, CIRI, segemehl). The analysis of microRNA target sites on circRNAs shows that some circRNAs may function as miRNA sponges. Furthermore, we identified the existence of reverse complementary sequences in the flanking regions of only 25 (2.64%) exonic circRNAs, indicating that the mechanism of zebrafish exonic circRNA biogenesis might be different from that in mammals. Moreover, 1122 (29%) zebrafish circRNA sequences showed homology with human, mouse and coelacanth circRNAs. © 2016 Federation of European Biochemical Societies.

  10. Modulation of Active Gut Microbiota by Lactobacillus rhamnosus GG in a Diet Induced Obesity Murine Model.

    PubMed

    Ji, Yosep; Park, Soyoung; Park, Haryung; Hwang, Eunchong; Shin, Hyeunkil; Pot, Bruno; Holzapfel, Wilhelm H

    2018-01-01

    Gut microbiota play a key role in the development of metabolic disorders. Defining and correlating structural shifts in gut microbial assemblages with conditions related to metabolic syndrome have, however, been proven difficult. Results from 16S genomic DNA and 16S ribosomal RNA analyses of fecal samples may differ widely, leading to controversial information on the whole microbial community and metabolically active microbiota. Using a C57BL/6J murine model, we compared data from 16S genomic DNA and ribosomal RNA of the fecal microbiota. The study included three groups of experimental animals comprising two groups with high fat diet induced obesity (DIO) while a third group (control) received a low fat diet. One of the DIO groups was treated with the probiotic Lactobacillus rhamnosus GG (LGG). Compared to the data obtained by DNA analysis, a significantly higher abundance of OTUs was accounted for by RNA analysis. Moreover, rRNA based analysis showed a modulation of the active gut microbial population in the DIO group receiving LGG, thus reflecting a change in the induced obesity status of the host. As one of the most widely studied probiotics the functionality of LGG has been linked to the alleviation of metabolic syndrome, and, in some cases, to an impact on the microbiome. Yet, it appears that no study has reported thus far on modulation of the active microbiota by LGG treatment. It is postulated that the resulting impact on calorie consumption affects weight gain concomitantly with modulation of the functional structure of the gut microbial population. Using the 16S rRNA based approach therefore decisively increased the precision of gut microbiota metagenome analysis.

  11. Highly-sensitive microRNA detection based on bio-bar-code assay and catalytic hairpin assembly two-stage amplification.

    PubMed

    Tang, Songsong; Gu, Yuan; Lu, Huiting; Dong, Haifeng; Zhang, Kai; Dai, Wenhao; Meng, Xiangdan; Yang, Fan; Zhang, Xueji

    2018-04-03

    Herein, a highly-sensitive microRNA (miRNA) detection strategy was developed by combining bio-bar-code assay (BBA) with catalytic hairpin assembly (CHA). In the proposed system, two nanoprobes of magnetic nanoparticles functionalized with DNA probes (MNPs-DNA) and gold nanoparticles with numerous barcode DNA (AuNPs-DNA) were designed. In the presence of target miRNA, the MNP-DNA and AuNP-DNA hybridized with target miRNA to form a "sandwich" structure. After "sandwich" structures were separated from the solution by the magnetic field and dehybridized by high temperature, the barcode DNA sequences were released by dissolving AuNPs. The released barcode DNA sequences triggered the toehold strand displacement assembly of two hairpin probes, leading to recycle of barcode DNA sequences and producing numerous fluorescent CHA products for miRNA detection. Under the optimal experimental conditions, the proposed two-stage amplification system could sensitively detect target miRNA ranging from 10 pM to 10 aM with a limit of detection (LOD) down to 97.9 zM. It displayed good capability to discriminate single base and three bases mismatch due to the unique sandwich structure. Notably, it presented good feasibility for selective multiplexed detection of various combinations of synthetic miRNA sequences and miRNAs extracted from different cell lysates, which were in agreement with the traditional polymerase chain reaction analysis. The two-stage amplification strategy may be significant implication in the biological detection and clinical diagnosis. Copyright © 2017 Elsevier B.V. All rights reserved.

  12. Structural analysis of the Quaking homodimerization interface

    PubMed Central

    Beuck, Christine; Qu, Song; Fagg, W. Samuel; Ares, Manuel; Williamson, James R.

    2012-01-01

    Quaking is a prototypical member of the STAR protein family, which plays key roles in posttranscriptional gene regulation by controlling mRNA translation, stability and splicing. QkI-5 has been shown to regulate mRNA expression in the central nervous system, but little is known about its roles in other tissues. STAR proteins function as dimers and bind to bipartite RNA sequences, however, the structural and functional roles of homo- and hetero-dimerization are still unclear. Here, we present the crystal structure of the QkI dimerization domain, which adopts a similar stacked helix-turn-helix arrangement as its homologs GLD-1 and Sam68, but differs by an additional helix inserted in the dimer interface. Variability of the dimer interface residues likely ensures selective homodimerization by preventing association with non-cognate STAR family proteins in the cell. Mutations that inhibit dimerization also significantly impair RNA binding in vitro, alter QkI-5 protein levels, and impair QkI function in a splicing assay in vivo. Together our results indicate that a functional Qua1 homodimerization domain is required for QkI-5 function in mammalian cells. PMID:22982292

  13. Inforna 2.0: A Platform for the Sequence-Based Design of Small Molecules Targeting Structured RNAs.

    PubMed

    Disney, Matthew D; Winkelsas, Audrey M; Velagapudi, Sai Pradeep; Southern, Mark; Fallahi, Mohammad; Childs-Disney, Jessica L

    2016-06-17

    The development of small molecules that target RNA is challenging yet, if successful, could advance the development of chemical probes to study RNA function or precision therapeutics to treat RNA-mediated disease. Previously, we described Inforna, an approach that can mine motifs (secondary structures) within target RNAs, which is deduced from the RNA sequence, and compare them to a database of known RNA motif-small molecule binding partners. Output generated by Inforna includes the motif found in both the database and the desired RNA target, lead small molecules for that target, and other related meta-data. Lead small molecules can then be tested for binding and affecting cellular (dys)function. Herein, we describe Inforna 2.0, which incorporates all known RNA motif-small molecule binding partners reported in the scientific literature, a chemical similarity searching feature, and an improved user interface and is freely available via an online web server. By incorporation of interactions identified by other laboratories, the database has been doubled, containing 1936 RNA motif-small molecule interactions, including 244 unique small molecules and 1331 motifs. Interestingly, chemotype analysis of the compounds that bind RNA in the database reveals features in small molecule chemotypes that are privileged for binding. Further, this updated database expanded the number of cellular RNAs to which lead compounds can be identified.

  14. Rclick: a web server for comparison of RNA 3D structures.

    PubMed

    Nguyen, Minh N; Verma, Chandra

    2015-03-15

    RNA molecules play important roles in key biological processes in the cell and are becoming attractive for developing therapeutic applications. Since the function of RNA depends on its structure and dynamics, comparing and classifying the RNA 3D structures is of crucial importance to molecular biology. In this study, we have developed Rclick, a web server that is capable of superimposing RNA 3D structures by using clique matching and 3D least-squares fitting. Our server Rclick has been benchmarked and compared with other popular servers and methods for RNA structural alignments. In most cases, Rclick alignments were better in terms of structure overlap. Our server also recognizes conformational changes between structures. For this purpose, the server produces complementary alignments to maximize the extent of detectable similarity. Various examples showcase the utility of our web server for comparison of RNA, RNA-protein complexes and RNA-ligand structures. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  15. Phylogenetic analysis of several Thermus strains from Rehai of Tengchong, Yunnan, China.

    PubMed

    Lin, Lianbing; Zhang, Jie; Wei, Yunlin; Chen, Chaoyin; Peng, Qian

    2005-10-01

    Several Thermus strains were isolated from 10 hot springs of the Rehai geothermal area in Tengchong, Yunnan province. The diversity of Thermus strains was examined by sequencing the 16S rRNA genes and comparing their sequences. Phylogenetic analysis showed that the 16S rDNA sequences from the Rehai geothermal isolates form four branches in the phylogenetic tree and had greater than 95.9% similarity in the phylogroup. Secondary structure comparison also indicated that the 16S rRNA from the Rehai geothermal isolates have unique secondary structure characteristics in helix 6, helix 9, and helix 10 (reference to Escherichia coli). This research is the first attempt to reveal the diversity of Thermus strains that are distributed in the Rehai geothermal area.

  16. RNA structures as mediators of neurological diseases and as drug targets

    PubMed Central

    Bernat, Viachaslau; Disney, Matthew D.

    2015-01-01

    RNAs adopt diverse folded structures that are essential for function and thus play critical roles in cellular biology. A striking example of this is the ribosome, a complex, three-dimensionally folded macromolecular machine that orchestrates protein synthesis. Advances in RNA biochemistry, structural and molecular biology, and bioinformatics have revealed other non-coding RNAs whose functions are dictated by their structure. It is not surprising that aberrantly folded RNA structures contribute to disease. In this review, we provide a brief introduction into RNA structural biology and then describe how RNA structures function in cells and cause or contribute to neurological disease. Finally, we highlight successful applications of rational design principles to provide chemical probes and lead compounds targeting structured RNAs. Based on several examples of well-characterized RNA-driven neurological disorders, we demonstrate how designed small molecules can facilitate study of RNA dysfunction, elucidating previously unknown roles for RNA in disease, and provide lead therapeutics. PMID:26139368

  17. The conservation and function of RNA secondary structure in plants

    PubMed Central

    Vandivier, Lee E.; Anderson, Stephen J.; Foley, Shawn W.; Gregory, Brian D.

    2016-01-01

    RNA transcripts fold into secondary structures via intricate patterns of base pairing. These secondary structures impart catalytic, ligand binding, and scaffolding functions to a wide array of RNAs, forming a critical node of biological regulation. Among their many functions, RNA structural elements modulate epigenetic marks, alter mRNA stability and translation, regulate alternative splicing, transduce signals, and scaffold large macromolecular complexes. Thus, the study of RNA secondary structure is critical to understanding the function and regulation of RNA transcripts. Here, we review the origins, form, and function of RNA secondary structure, focusing on plants. We then provide an overview of methods for probing secondary structure, from physical methods such as X-ray crystallography and nuclear magnetic resonance imaging (NMR) to chemical and nuclease probing methods. Marriage with high-throughput sequencing has enabled these latter methods to scale across whole transcriptomes, yielding tremendous new insights into the form and function of RNA secondary structure. PMID:26865341

  18. R-chie: a web server and R package for visualizing RNA secondary structures

    PubMed Central

    Lai, Daniel; Proctor, Jeff R.; Zhu, Jing Yun A.; Meyer, Irmtraud M.

    2012-01-01

    Visually examining RNA structures can greatly aid in understanding their potential functional roles and in evaluating the performance of structure prediction algorithms. As many functional roles of RNA structures can already be studied given the secondary structure of the RNA, various methods have been devised for visualizing RNA secondary structures. Most of these methods depict a given RNA secondary structure as a planar graph consisting of base-paired stems interconnected by roundish loops. In this article, we present an alternative method of depicting RNA secondary structure as arc diagrams. This is well suited for structures that are difficult or impossible to represent as planar stem-loop diagrams. Arc diagrams can intuitively display pseudo-knotted structures, as well as transient and alternative structural features. In addition, they facilitate the comparison of known and predicted RNA secondary structures. An added benefit is that structure information can be displayed in conjunction with a corresponding multiple sequence alignments, thereby highlighting structure and primary sequence conservation and variation. We have implemented the visualization algorithm as a web server R-chie as well as a corresponding R package called R4RNA, which allows users to run the software locally and across a range of common operating systems. PMID:22434875

  19. RAG-3D: A search tool for RNA 3D substructures

    DOE PAGES

    Zahran, Mai; Sevim Bayrak, Cigdem; Elmetwaly, Shereef; ...

    2015-08-24

    In this study, to address many challenges in RNA structure/function prediction, the characterization of RNA's modular architectural units is required. Using the RNA-As-Graphs (RAG) database, we have previously explored the existence of secondary structure (2D) submotifs within larger RNA structures. Here we present RAG-3D—a dataset of RNA tertiary (3D) structures and substructures plus a web-based search tool—designed to exploit graph representations of RNAs for the goal of searching for similar 3D structural fragments. The objects in RAG-3D consist of 3D structures translated into 3D graphs, cataloged based on the connectivity between their secondary structure elements. Each graph is additionally describedmore » in terms of its subgraph building blocks. The RAG-3D search tool then compares a query RNA 3D structure to those in the database to obtain structurally similar structures and substructures. This comparison reveals conserved 3D RNA features and thus may suggest functional connections. Though RNA search programs based on similarity in sequence, 2D, and/or 3D structural elements are available, our graph-based search tool may be advantageous for illuminating similarities that are not obvious; using motifs rather than sequence space also reduces search times considerably. Ultimately, such substructuring could be useful for RNA 3D structure prediction, structure/function inference and inverse folding.« less

  20. RAG-3D: a search tool for RNA 3D substructures

    PubMed Central

    Zahran, Mai; Sevim Bayrak, Cigdem; Elmetwaly, Shereef; Schlick, Tamar

    2015-01-01

    To address many challenges in RNA structure/function prediction, the characterization of RNA's modular architectural units is required. Using the RNA-As-Graphs (RAG) database, we have previously explored the existence of secondary structure (2D) submotifs within larger RNA structures. Here we present RAG-3D—a dataset of RNA tertiary (3D) structures and substructures plus a web-based search tool—designed to exploit graph representations of RNAs for the goal of searching for similar 3D structural fragments. The objects in RAG-3D consist of 3D structures translated into 3D graphs, cataloged based on the connectivity between their secondary structure elements. Each graph is additionally described in terms of its subgraph building blocks. The RAG-3D search tool then compares a query RNA 3D structure to those in the database to obtain structurally similar structures and substructures. This comparison reveals conserved 3D RNA features and thus may suggest functional connections. Though RNA search programs based on similarity in sequence, 2D, and/or 3D structural elements are available, our graph-based search tool may be advantageous for illuminating similarities that are not obvious; using motifs rather than sequence space also reduces search times considerably. Ultimately, such substructuring could be useful for RNA 3D structure prediction, structure/function inference and inverse folding. PMID:26304547

  1. RAG-3D: A search tool for RNA 3D substructures

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zahran, Mai; Sevim Bayrak, Cigdem; Elmetwaly, Shereef

    In this study, to address many challenges in RNA structure/function prediction, the characterization of RNA's modular architectural units is required. Using the RNA-As-Graphs (RAG) database, we have previously explored the existence of secondary structure (2D) submotifs within larger RNA structures. Here we present RAG-3D—a dataset of RNA tertiary (3D) structures and substructures plus a web-based search tool—designed to exploit graph representations of RNAs for the goal of searching for similar 3D structural fragments. The objects in RAG-3D consist of 3D structures translated into 3D graphs, cataloged based on the connectivity between their secondary structure elements. Each graph is additionally describedmore » in terms of its subgraph building blocks. The RAG-3D search tool then compares a query RNA 3D structure to those in the database to obtain structurally similar structures and substructures. This comparison reveals conserved 3D RNA features and thus may suggest functional connections. Though RNA search programs based on similarity in sequence, 2D, and/or 3D structural elements are available, our graph-based search tool may be advantageous for illuminating similarities that are not obvious; using motifs rather than sequence space also reduces search times considerably. Ultimately, such substructuring could be useful for RNA 3D structure prediction, structure/function inference and inverse folding.« less

  2. ModeRNA server: an online tool for modeling RNA 3D structures.

    PubMed

    Rother, Magdalena; Milanowska, Kaja; Puton, Tomasz; Jeleniewicz, Jaroslaw; Rother, Kristian; Bujnicki, Janusz M

    2011-09-01

    The diverse functional roles of non-coding RNA molecules are determined by their underlying structure. ModeRNA server is an online tool for RNA 3D structure modeling by the comparative approach, based on a template RNA structure and a user-defined target-template sequence alignment. It offers an option to search for potential templates, given the target sequence. The server also provides tools for analyzing, editing and formatting of RNA structure files. It facilitates the use of the ModeRNA software and offers new options in comparison to the standalone program. ModeRNA server was implemented using the Python language and the Django web framework. It is freely available at http://iimcb.genesilico.pl/modernaserver. iamb@genesilico.pl.

  3. Insights into the Structural Dynamics of Nucleocytoplasmic Transport of tRNA by Exportin-t

    PubMed Central

    Gupta, Asmita; Kailasam, Senthilkumar; Bansal, Manju

    2016-01-01

    Exportin-t (Xpot) transports mature 5′- and 3′-end processed tRNA from the nucleus to the cytoplasm by associating with a small G-protein Ran (RAs-related nuclear protein), in the nucleus. The release of tRNA in cytoplasm involves RanGTP hydrolysis. Despite the availability of crystal structures of nuclear and cytosolic forms of Xpot, the molecular details regarding the sequential events leading to tRNA release and subsequent conformational changes occurring in Xpot remain unknown. We have performed a combination of classical all-atom and accelerated molecular dynamics simulations on a set of complexes involving Xpot to study a range of features including conformational flexibility of free and cargo-bound Xpot and functionally critical contacts between Xpot and its cargo. The systems investigated include free Xpot and its different complexes, bound either to Ran (GTP/GDP) or tRNA or both. This approach provided a statistically reliable estimate of structural dynamics of Xpot after cargo release. The mechanistic basis for Xpot opening after cargo release has been explained in terms of dynamic structural hinges, about which neighboring region could be displaced to facilitate the nuclear to cytosolic state transition. Post-RanGTP hydrolysis, a cascade of events including local conformational change in RanGTP and loss of critical contacts at Xpot/tRNA interface suggest factors responsible for eventual release of tRNA. The level of flexibility in different Xpot complexes varied depending on the arrangement of individual HEAT repeats. Current study provides one of the most comprehensive and robust analysis carried out on this protein using molecular dynamics schemes. PMID:27028637

  4. Structural control of caspase-generated glutamyl-tRNA synthetase by appended noncatalytic WHEP domains.

    PubMed

    Halawani, Dalia; Gogonea, Valentin; DiDonato, Joseph A; Pipich, Vitaliy; Yao, Peng; China, Arnab; Topbas, Celalettin; Vasu, Kommireddy; Arif, Abul; Hazen, Stanley L; Fox, Paul L

    2018-06-08

    Aminoacyl-tRNA synthetases are ubiquitous, evolutionarily conserved enzymes catalyzing the conjugation of amino acids onto cognate tRNAs. During eukaryotic evolution, tRNA synthetases have been the targets of persistent structural modifications. These modifications can be additive, as in the evolutionary acquisition of noncatalytic domains, or subtractive, as in the generation of truncated variants through regulated mechanisms such as proteolytic processing, alternative splicing, or coding region polyadenylation. A unique variant is the human glutamyl-prolyl-tRNA synthetase (EPRS) consisting of two fused synthetases joined by a linker containing three copies of the WHEP domain (termed by its presence in tryptophanyl-, histidyl-, and glutamyl-prolyl-tRNA synthetases). Here, we identify site-selective proteolysis as a mechanism that severs the linkage between the EPRS synthetases in vitro and in vivo Caspase action targeted Asp-929 in the third WHEP domain, thereby separating the two synthetases. Using a neoepitope antibody directed against the newly exposed C terminus, we demonstrate EPRS cleavage at Asp-929 in vitro and in vivo Biochemical and biophysical characterizations of the N-terminally generated EPRS proteoform containing the glutamyl-tRNA synthetase and most of the linker, including two WHEP domains, combined with structural analysis by small-angle neutron scattering, revealed a role for the WHEP domains in modulating conformations of the catalytic core and GSH- S -transferase-C-terminal-like (GST-C) domain. WHEP-driven conformational rearrangement altered GST-C domain interactions and conferred distinct oligomeric states in solution. Collectively, our results reveal long-range conformational changes imposed by the WHEP domains and illustrate how noncatalytic domains can modulate the global structure of tRNA synthetases in complex eukaryotic systems. © 2018 by The American Society for Biochemistry and Molecular Biology, Inc.

  5. An intergenic non-coding rRNA correlated with expression of the rRNA and frequency of an rRNA single nucleotide polymorphism in lung cancer cells.

    PubMed

    Shiao, Yih-Horng; Lupascu, Sorin T; Gu, Yuhan D; Kasprzak, Wojciech; Hwang, Christopher J; Fields, Janet R; Leighty, Robert M; Quiñones, Octavio; Shapiro, Bruce A; Alvord, W Gregory; Anderson, Lucy M

    2009-10-19

    Ribosomal RNA (rRNA) is a central regulator of cell growth and may control cancer development. A cis noncoding rRNA (nc-rRNA) upstream from the 45S rRNA transcription start site has recently been implicated in control of rRNA transcription in mouse fibroblasts. We investigated whether a similar nc-rRNA might be expressed in human cancer epithelial cells, and related to any genomic characteristics. Using quantitative rRNA measurement, we demonstrated that a nc-rRNA is transcribed in human lung epithelial and lung cancer cells, starting from approximately -1000 nucleotides upstream of the rRNA transcription start site (+1) and extending at least to +203. This nc-rRNA was significantly more abundant in the majority of lung cancer cell lines, relative to a nontransformed lung epithelial cell line. Its abundance correlated negatively with total 45S rRNA in 12 of 13 cell lines (P = 0.014). During sequence analysis from -388 to +306, we observed diverse, frequent intercopy single nucleotide polymorphisms (SNPs) in rRNA, with a frequency greater than predicted by chance at 12 sites. A SNP at +139 (U/C) in the 5' leader sequence varied among the cell lines and correlated negatively with level of the nc-rRNA (P = 0.014). Modelling of the secondary structure of the rRNA 5'-leader sequence indicated a small increase in structural stability due to the +139 U/C SNP and a minor shift in local configuration occurrences. The results demonstrate occurrence of a sense nc-rRNA in human lung epithelial and cancer cells, and imply a role in regulation of the rRNA gene, which may be affected by a +139 SNP in the 5' leader sequence of the primary rRNA transcript.

  6. The conserved CAAGAAAGA spacer sequence is an essential element for the formation of 3' termini of the sea urchin H3 histone mRNA by RNA processing.

    PubMed Central

    Georgiev, O; Birnstiel, M L

    1985-01-01

    Analysis of cDNA sequences obtained from the small nuclear RNA U7 has previously suggested specific contacts, by base pairing, between the conserved stem-loop structure and CAAGAAAGA sequence of the histone pre-mRNA and the 5'-terminal sequence of the U7 RNA during RNA processing. In order to test some aspects of the model we have created a series of linker scan, deletion and insertion mutants of the 3' terminus of a sea urchin H3 histone gene and have injected mutant DNAs or in vitro synthesized precursors into frog oocyte nuclei for interpretation. We find that, in addition to the stem-loop structure of the mRNA, the CAAGAAAGA spacer transcript within the histone pre-mRNA is required absolutely for RNA processing, as predicted from our model. Spacer sequences immediately downstream of the CAAGAAAGA motif are not complementary to U7 RNA. Nevertheless, they are necessary for obtaining a maximal rate of RNA processing, as is the ACCA sequence coding for the 3' terminus of the mature mRNA. An increase of distance between the mRNA palindrome and the CAAGAAAGA by as little as six nucleotides abolishes all processing. It may, therefore, be useful to regard both these sequence motifs as part of one and the same RNA processing signal with narrowly defined topologies. Interestingly, U7 RNA-dependent 3' processing of histone pre-mRNA can occur in RNA injection experiments only when the in vitro synthesized pre-mRNA contains sequence extensions well beyond the region of sequence complementarities to the U7 RNA. In addition to directing 3' processing the terminal mRNA sequences may have a role in histone mRNA stabilization in the cytoplasmic compartment. Images Fig. 3. Fig. 4. Fig. 5. Fig. 6. Fig. 7. PMID:2410259

  7. In silico methods for co-transcriptional RNA secondary structure prediction and for investigating alternative RNA structure expression.

    PubMed

    Meyer, Irmtraud M

    2017-05-01

    RNA transcripts are the primary products of active genes in any living organism, including many viruses. Their cellular destiny not only depends on primary sequence signals, but can also be determined by RNA structure. Recent experimental evidence shows that many transcripts can be assigned more than a single functional RNA structure throughout their cellular life and that structure formation happens co-transcriptionally, i.e. as the transcript is synthesised in the cell. Moreover, functional RNA structures are not limited to non-coding transcripts, but can also feature in coding transcripts. The picture that now emerges is that RNA structures constitute an additional layer of information that can be encoded in any RNA transcript (and on top of other layers of information such as protein-context) in order to exert a wide range of functional roles. Moreover, different encoded RNA structures can be expressed at different stages of a transcript's life in order to alter the transcript's behaviour depending on its actual cellular context. Similar to the concept of alternative splicing for protein-coding genes, where a single transcript can yield different proteins depending on cellular context, it is thus appropriate to propose the notion of alternative RNA structure expression for any given transcript. This review introduces several computational strategies that my group developed to detect different aspects of RNA structure expression in vivo. Two aspects are of particular interest to us: (1) RNA secondary structure features that emerge during co-transcriptional folding and (2) functional RNA structure features that are expressed at different times of a transcript's life and potentially mutually exclusive. Copyright © 2017. Published by Elsevier Inc.

  8. Computational prediction and biochemical characterization of novel RNA aptamers to Rift Valley fever virus nucleocapsid protein.

    PubMed

    Ellenbecker, Mary; St Goddard, Jeremy; Sundet, Alec; Lanchy, Jean-Marc; Raiford, Douglas; Lodmell, J Stephen

    2015-10-01

    Rift Valley fever virus (RVFV) is a potent human and livestock pathogen endemic to sub-Saharan Africa and the Arabian Peninsula that has potential to spread to other parts of the world. Although there is no proven effective and safe treatment for RVFV infections, a potential therapeutic target is the virally encoded nucleocapsid protein (N). During the course of infection, N binds to viral RNA, and perturbation of this interaction can inhibit viral replication. To gain insight into how N recognizes viral RNA specifically, we designed an algorithm that uses a distance matrix and multidimensional scaling to compare the predicted secondary structures of known N-binding RNAs, or aptamers, that were isolated and characterized in previous in vitro evolution experiment. These aptamers did not exhibit overt sequence or predicted structure similarity, so we employed bioinformatic methods to propose novel aptamers based on analysis and clustering of secondary structures. We screened and scored the predicted secondary structures of novel randomly generated RNA sequences in silico and selected several of these putative N-binding RNAs whose secondary structures were similar to those of known N-binding RNAs. We found that overall the in silico generated RNA sequences bound well to N in vitro. Furthermore, introduction of these RNAs into cells prior to infection with RVFV inhibited viral replication in cell culture. This proof of concept study demonstrates how the predictive power of bioinformatics and the empirical power of biochemistry can be jointly harnessed to discover, synthesize, and test new RNA sequences that bind tightly to RVFV N protein. The approach would be easily generalizable to other applications. Copyright © 2015 Elsevier Ltd. All rights reserved.

  9. Topological Structure of the Space of Phenotypes: The Case of RNA Neutral Networks

    PubMed Central

    Aguirre, Jacobo; Buldú, Javier M.; Stich, Michael; Manrubia, Susanna C.

    2011-01-01

    The evolution and adaptation of molecular populations is constrained by the diversity accessible through mutational processes. RNA is a paradigmatic example of biopolymer where genotype (sequence) and phenotype (approximated by the secondary structure fold) are identified in a single molecule. The extreme redundancy of the genotype-phenotype map leads to large ensembles of RNA sequences that fold into the same secondary structure and can be connected through single-point mutations. These ensembles define neutral networks of phenotypes in sequence space. Here we analyze the topological properties of neutral networks formed by 12-nucleotides RNA sequences, obtained through the exhaustive folding of sequence space. A total of 412 sequences fragments into 645 subnetworks that correspond to 57 different secondary structures. The topological analysis reveals that each subnetwork is far from being random: it has a degree distribution with a well-defined average and a small dispersion, a high clustering coefficient, and an average shortest path between nodes close to its minimum possible value, i.e. the Hamming distance between sequences. RNA neutral networks are assortative due to the correlation in the composition of neighboring sequences, a feature that together with the symmetries inherent to the folding process explains the existence of communities. Several topological relationships can be analytically derived attending to structural restrictions and generic properties of the folding process. The average degree of these phenotypic networks grows logarithmically with their size, such that abundant phenotypes have the additional advantage of being more robust to mutations. This property prevents fragmentation of neutral networks and thus enhances the navigability of sequence space. In summary, RNA neutral networks show unique topological properties, unknown to other networks previously described. PMID:22028856

  10. In silico design of ligand triggered RNA switches.

    PubMed

    Findeiß, Sven; Hammer, Stefan; Wolfinger, Michael T; Kühnl, Felix; Flamm, Christoph; Hofacker, Ivo L

    2018-04-13

    This contribution sketches a work flow to design an RNA switch that is able to adapt two structural conformations in a ligand-dependent way. A well characterized RNA aptamer, i.,e., knowing its K d and adaptive structural features, is an essential ingredient of the described design process. We exemplify the principles using the well-known theophylline aptamer throughout this work. The aptamer in its ligand-binding competent structure represents one structural conformation of the switch while an alternative fold that disrupts the binding-competent structure forms the other conformation. To keep it simple we do not incorporate any regulatory mechanism to control transcription or translation. We elucidate a commonly used design process by explicitly dissecting and explaining the necessary steps in detail. We developed a novel objective function which specifies the mechanistics of this simple, ligand-triggered riboswitch and describe an extensive in silico analysis pipeline to evaluate important kinetic properties of the designed sequences. This protocol and the developed software can be easily extended or adapted to fit novel design scenarios and thus can serve as a template for future needs. Copyright © 2018. Published by Elsevier Inc.

  11. The Shine-Dalgarno sequence of riboswitch-regulated single mRNAs shows ligand-dependent accessibility bursts

    NASA Astrophysics Data System (ADS)

    Rinaldi, Arlie J.; Lund, Paul E.; Blanco, Mario R.; Walter, Nils G.

    2016-01-01

    In response to intracellular signals in Gram-negative bacteria, translational riboswitches--commonly embedded in messenger RNAs (mRNAs)--regulate gene expression through inhibition of translation initiation. It is generally thought that this regulation originates from occlusion of the Shine-Dalgarno (SD) sequence upon ligand binding; however, little direct evidence exists. Here we develop Single Molecule Kinetic Analysis of RNA Transient Structure (SiM-KARTS) to investigate the ligand-dependent accessibility of the SD sequence of an mRNA hosting the 7-aminomethyl-7-deazaguanine (preQ1)-sensing riboswitch. Spike train analysis reveals that individual mRNA molecules alternate between two conformational states, distinguished by `bursts' of probe binding associated with increased SD sequence accessibility. Addition of preQ1 decreases the lifetime of the SD's high-accessibility (bursting) state and prolongs the time between bursts. In addition, ligand-jump experiments reveal imperfect riboswitching of single mRNA molecules. Such complex ligand sensing by individual mRNA molecules rationalizes the nuanced ligand response observed during bulk mRNA translation.

  12. RNA 3D Structure Modeling by Combination of Template-Based Method ModeRNA, Template-Free Folding with SimRNA, and Refinement with QRNAS.

    PubMed

    Piatkowski, Pawel; Kasprzak, Joanna M; Kumar, Deepak; Magnus, Marcin; Chojnowski, Grzegorz; Bujnicki, Janusz M

    2016-01-01

    RNA encompasses an essential part of all known forms of life. The functions of many RNA molecules are dependent on their ability to form complex three-dimensional (3D) structures. However, experimental determination of RNA 3D structures is laborious and challenging, and therefore, the majority of known RNAs remain structurally uncharacterized. To address this problem, computational structure prediction methods were developed that either utilize information derived from known structures of other RNA molecules (by way of template-based modeling) or attempt to simulate the physical process of RNA structure formation (by way of template-free modeling). All computational methods suffer from various limitations that make theoretical models less reliable than high-resolution experimentally determined structures. This chapter provides a protocol for computational modeling of RNA 3D structure that overcomes major limitations by combining two complementary approaches: template-based modeling that is capable of predicting global architectures based on similarity to other molecules but often fails to predict local unique features, and template-free modeling that can predict the local folding, but is limited to modeling the structure of relatively small molecules. Here, we combine the use of a template-based method ModeRNA with a template-free method SimRNA. ModeRNA requires a sequence alignment of the target RNA sequence to be modeled with a template of the known structure; it generates a model that predicts the structure of a conserved core and provides a starting point for modeling of variable regions. SimRNA can be used to fold small RNAs (<80 nt) without any additional structural information, and to refold parts of models for larger RNAs that have a correctly modeled core. ModeRNA can be either downloaded, compiled and run locally or run through a web interface at http://genesilico.pl/modernaserver/ . SimRNA is currently available to download for local use as a precompiled software package at http://genesilico.pl/software/stand-alone/simrna and as a web server at http://genesilico.pl/SimRNAweb . For model optimization we use QRNAS, available at http://genesilico.pl/qrnas .

  13. Closing loop base pairs in RNA loop-loop complexes: structural behavior, interaction energy and solvation analysis through molecular dynamics simulations.

    PubMed

    Golebiowski, Jérôme; Antonczak, Serge; Fernandez-Carmona, Juan; Condom, Roger; Cabrol-Bass, Daniel

    2004-12-01

    Nanosecond molecular dynamics using the Ewald summation method have been performed to elucidate the structural and energetic role of the closing base pair in loop-loop RNA duplexes neutralized by Mg2+ counterions in aqueous phases. Mismatches GA, CU and Watson-Crick GC base pairs have been considered for closing the loop of an RNA in complementary interaction with HIV-1 TAR. The simulations reveal that the mismatch GA base, mediated by a water molecule, leads to a complex that presents the best compromise between flexibility and energetic contributions. The mismatch CU base pair, in spite of the presence of an inserted water molecule, is too short to achieve a tight interaction at the closing-loop junction and seems to force TAR to reorganize upon binding. An energetic analysis has allowed us to quantify the strength of the interactions of the closing and the loop-loop pairs throughout the simulations. Although the water-mediated GA closing base pair presents an interaction energy similar to that found on fully geometry-optimized structure, the water-mediated CU closing base pair energy interaction reaches less than half the optimal value.

  14. Freiburg RNA tools: a central online resource for RNA-focused research and teaching.

    PubMed

    Raden, Martin; Ali, Syed M; Alkhnbashi, Omer S; Busch, Anke; Costa, Fabrizio; Davis, Jason A; Eggenhofer, Florian; Gelhausen, Rick; Georg, Jens; Heyne, Steffen; Hiller, Michael; Kundu, Kousik; Kleinkauf, Robert; Lott, Steffen C; Mohamed, Mostafa M; Mattheis, Alexander; Miladi, Milad; Richter, Andreas S; Will, Sebastian; Wolff, Joachim; Wright, Patrick R; Backofen, Rolf

    2018-05-21

    The Freiburg RNA tools webserver is a well established online resource for RNA-focused research. It provides a unified user interface and comprehensive result visualization for efficient command line tools. The webserver includes RNA-RNA interaction prediction (IntaRNA, CopraRNA, metaMIR), sRNA homology search (GLASSgo), sequence-structure alignments (LocARNA, MARNA, CARNA, ExpaRNA), CRISPR repeat classification (CRISPRmap), sequence design (antaRNA, INFO-RNA, SECISDesign), structure aberration evaluation of point mutations (RaSE), and RNA/protein-family models visualization (CMV), and other methods. Open education resources offer interactive visualizations of RNA structure and RNA-RNA interaction prediction as well as basic and advanced sequence alignment algorithms. The services are freely available at http://rna.informatik.uni-freiburg.de.

  15. RNA secondary structure prediction with pseudoknots: Contribution of algorithm versus energy model.

    PubMed

    Jabbari, Hosna; Wark, Ian; Montemagno, Carlo

    2018-01-01

    RNA is a biopolymer with various applications inside the cell and in biotechnology. Structure of an RNA molecule mainly determines its function and is essential to guide nanostructure design. Since experimental structure determination is time-consuming and expensive, accurate computational prediction of RNA structure is of great importance. Prediction of RNA secondary structure is relatively simpler than its tertiary structure and provides information about its tertiary structure, therefore, RNA secondary structure prediction has received attention in the past decades. Numerous methods with different folding approaches have been developed for RNA secondary structure prediction. While methods for prediction of RNA pseudoknot-free structure (structures with no crossing base pairs) have greatly improved in terms of their accuracy, methods for prediction of RNA pseudoknotted secondary structure (structures with crossing base pairs) still have room for improvement. A long-standing question for improving the prediction accuracy of RNA pseudoknotted secondary structure is whether to focus on the prediction algorithm or the underlying energy model, as there is a trade-off on computational cost of the prediction algorithm versus the generality of the method. The aim of this work is to argue when comparing different methods for RNA pseudoknotted structure prediction, the combination of algorithm and energy model should be considered and a method should not be considered superior or inferior to others if they do not use the same scoring model. We demonstrate that while the folding approach is important in structure prediction, it is not the only important factor in prediction accuracy of a given method as the underlying energy model is also as of great value. Therefore we encourage researchers to pay particular attention in comparing methods with different energy models.

  16. Evolution of introns in the archaeal world.

    PubMed

    Tocchini-Valentini, Giuseppe D; Fruscoloni, Paolo; Tocchini-Valentini, Glauco P

    2011-03-22

    The self-splicing group I introns are removed by an autocatalytic mechanism that involves a series of transesterification reactions. They require RNA binding proteins to act as chaperones to correctly fold the RNA into an active intermediate structure in vivo. Pre-tRNA introns in Bacteria and in higher eukaryote plastids are typical examples of self-splicing group I introns. By contrast, two striking features characterize RNA splicing in the archaeal world. First, self-splicing group I introns cannot be found, to this date, in that kingdom. Second, the RNA splicing scenario in Archaea is uniform: All introns, whether in pre-tRNA or elsewhere, are removed by tRNA splicing endonucleases. We suggest that in Archaea, the protein recruited for splicing is the preexisting tRNA splicing endonuclease and that this enzyme, together with the ligase, takes over the task of intron removal in a more efficient fashion than the ribozyme. The extinction of group I introns in Archaea would then be a consequence of recruitment of the tRNA splicing endonuclease. We deal here with comparative genome analysis, focusing specifically on the integration of introns into genes coding for 23S rRNA molecules, and how this newly acquired intron has to be removed to regenerate a functional RNA molecule. We show that all known oligomeric structures of the endonuclease can recognize and cleave a ribosomal intron, even when the endonuclease derives from a strain lacking rRNA introns. The persistence of group I introns in mitochondria and chloroplasts would be explained by the inaccessibility of these introns to the endonuclease.

  17. Comparative genomics of metabolic capacities of regulons controlled by cis-regulatory RNA motifs in bacteria.

    PubMed

    Sun, Eric I; Leyn, Semen A; Kazanov, Marat D; Saier, Milton H; Novichkov, Pavel S; Rodionov, Dmitry A

    2013-09-02

    In silico comparative genomics approaches have been efficiently used for functional prediction and reconstruction of metabolic and regulatory networks. Riboswitches are metabolite-sensing structures often found in bacterial mRNA leaders controlling gene expression on transcriptional or translational levels.An increasing number of riboswitches and other cis-regulatory RNAs have been recently classified into numerous RNA families in the Rfam database. High conservation of these RNA motifs provides a unique advantage for their genomic identification and comparative analysis. A comparative genomics approach implemented in the RegPredict tool was used for reconstruction and functional annotation of regulons controlled by RNAs from 43 Rfam families in diverse taxonomic groups of Bacteria. The inferred regulons include ~5200 cis-regulatory RNAs and more than 12000 target genes in 255 microbial genomes. All predicted RNA-regulated genes were classified into specific and overall functional categories. Analysis of taxonomic distribution of these categories allowed us to establish major functional preferences for each analyzed cis-regulatory RNA motif family. Overall, most RNA motif regulons showed predictable functional content in accordance with their experimentally established effector ligands. Our results suggest that some RNA motifs (including thiamin pyrophosphate and cobalamin riboswitches that control the cofactor metabolism) are widespread and likely originated from the last common ancestor of all bacteria. However, many more analyzed RNA motifs are restricted to a narrow taxonomic group of bacteria and likely represent more recent evolutionary innovations. The reconstructed regulatory networks for major known RNA motifs substantially expand the existing knowledge of transcriptional regulation in bacteria. The inferred regulons can be used for genetic experiments, functional annotations of genes, metabolic reconstruction and evolutionary analysis. The obtained genome-wide collection of reference RNA motif regulons is available in the RegPrecise database (http://regprecise.lbl.gov/).

  18. Expression characteristics of long noncoding RNA uc.322 and its effects on pancreatic islet function.

    PubMed

    Zhao, Xiaoqin; Rong, Can; Pan, Fenghui; Xiang, Lizhi; Wang, Xinlei; Hu, Yun

    2018-06-28

    Increasing evidence indicates that long noncoding RNAs (lncRNAs) perform special biological functions by regulating gene expression through multiple pathways and molecular mechanisms. The aim of this study was to explore the expression characteristics of lncRNA uc.322 in pancreatic islet cells and its effects on the secretion function of islet cells. Bioinformatics analysis was used to detect the lncRNA uc.322 sequence, location, and structural features. Expression of lncRNA uc.322 in different tissues was detected by quantitative polymerase chain reaction analyses. Quantitative polymerase chain reaction, Western blot analysis, adenosine triphosphate determination, glucose-stimulated insulin secretion, and enzyme-linked immunosorbent assay were used to evaluate the effects of lncRNA uc.322 on insulin secretion. The results showed that the full-length of lncRNA uc.322 is 224 bp and that it is highly conserved in various species. Bioinformatics analysis revealed that lncRNA uc.322 is located on chr7:122893196-122893419 (GRCH37/hg19) within the SRY-related HMG-box 6 gene exon region. Compared with other tissues, lncRNA uc.322 is highly expressed in pancreatic tissue. Upregulation of lncRNA uc.322 expression increases the insulin transcription factors pancreatic and duodenal homeobox 1 and Forkhead box O1 expression, promotes insulin secretion in the extracellular fluid of Min6 cells, and increases the adenosine triphosphate concentration. On the other hand, knockdown of lncRNA uc.322 has opposite effects on Min6 cells. Overall, this study showed that upregulation of lncRNA uc.322 in islet β-cells can increase the expression of insulin transcription factors and promote insulin secretion, and it may be a new therapeutic target for diabetes. © 2018 Wiley Periodicals, Inc.

  19. High-throughput determination of RNA structure by proximity ligation.

    PubMed

    Ramani, Vijay; Qiu, Ruolan; Shendure, Jay

    2015-09-01

    We present an unbiased method to globally resolve RNA structures through pairwise contact measurements between interacting regions. RNA proximity ligation (RPL) uses proximity ligation of native RNA followed by deep sequencing to yield chimeric reads with ligation junctions in the vicinity of structurally proximate bases. We apply RPL in both baker's yeast (Saccharomyces cerevisiae) and human cells and generate contact probability maps for ribosomal and other abundant RNAs, including yeast snoRNAs, the RNA subunit of the signal recognition particle and the yeast U2 spliceosomal RNA homolog. RPL measurements correlate with established secondary structures for these RNA molecules, including stem-loop structures and long-range pseudoknots. We anticipate that RPL will complement the current repertoire of computational and experimental approaches in enabling the high-throughput determination of secondary and tertiary RNA structures.

  20. Accurate Classification of RNA Structures Using Topological Fingerprints

    PubMed Central

    Li, Kejie; Gribskov, Michael

    2016-01-01

    While RNAs are well known to possess complex structures, functionally similar RNAs often have little sequence similarity. While the exact size and spacing of base-paired regions vary, functionally similar RNAs have pronounced similarity in the arrangement, or topology, of base-paired stems. Furthermore, predicted RNA structures often lack pseudoknots (a crucial aspect of biological activity), and are only partially correct, or incomplete. A topological approach addresses all of these difficulties. In this work we describe each RNA structure as a graph that can be converted to a topological spectrum (RNA fingerprint). The set of subgraphs in an RNA structure, its RNA fingerprint, can be compared with the fingerprints of other RNA structures to identify and correctly classify functionally related RNAs. Topologically similar RNAs can be identified even when a large fraction, up to 30%, of the stems are omitted, indicating that highly accurate structures are not necessary. We investigate the performance of the RNA fingerprint approach on a set of eight highly curated RNA families, with diverse sizes and functions, containing pseudoknots, and with little sequence similarity–an especially difficult test set. In spite of the difficult test set, the RNA fingerprint approach is very successful (ROC AUC > 0.95). Due to the inclusion of pseudoknots, the RNA fingerprint approach both covers a wider range of possible structures than methods based only on secondary structure, and its tolerance for incomplete structures suggests that it can be applied even to predicted structures. Source code is freely available at https://github.rcac.purdue.edu/mgribsko/XIOS_RNA_fingerprint. PMID:27755571

  1. Diversity in mRNA expression of the serine-type carboxypeptidase ocpG in Aspergillus oryzae through intron retention.

    PubMed

    Ishida, Ken; Kuboshima, Megumi; Morita, Hiroto; Maeda, Hiroshi; Okamoto, Ayako; Takeuchi, Michio; Yamagata, Youhei

    2014-01-01

    Alternative splicing is thought to be a means for diversification of products by mRNA modification. Although some intron retentions are predicted by transcriptome analysis in Aspergillus oryzae, its physiological significance remains unknown. We found that intron retention occurred occasionally in the serine-type carboxypeptidase gene, ocpG. Analysis under various culture conditions revealed that extracellular nitrogen conditions influence splicing patterns; this suggested that there might be a correlation between splicing efficiency and the necessity of OcpG activity for obtaining a nitrogen source. Since further analysis showed that splicing occurred independently in each intron, we constructed ocpG intron-exchanging strain by interchanging the positions of intron-1 and intron-2. The splicing pattern indicated the probability that ocpG intron retention was affected by the secondary structures of intronic mRNA.

  2. SimRNA: a coarse-grained method for RNA folding simulations and 3D structure prediction.

    PubMed

    Boniecki, Michal J; Lach, Grzegorz; Dawson, Wayne K; Tomala, Konrad; Lukasz, Pawel; Soltysinski, Tomasz; Rother, Kristian M; Bujnicki, Janusz M

    2016-04-20

    RNA molecules play fundamental roles in cellular processes. Their function and interactions with other biomolecules are dependent on the ability to form complex three-dimensional (3D) structures. However, experimental determination of RNA 3D structures is laborious and challenging, and therefore, the majority of known RNAs remain structurally uncharacterized. Here, we present SimRNA: a new method for computational RNA 3D structure prediction, which uses a coarse-grained representation, relies on the Monte Carlo method for sampling the conformational space, and employs a statistical potential to approximate the energy and identify conformations that correspond to biologically relevant structures. SimRNA can fold RNA molecules using only sequence information, and, on established test sequences, it recapitulates secondary structure with high accuracy, including correct prediction of pseudoknots. For modeling of complex 3D structures, it can use additional restraints, derived from experimental or computational analyses, including information about secondary structure and/or long-range contacts. SimRNA also can be used to analyze conformational landscapes and identify potential alternative structures. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  3. ModeRNA: a tool for comparative modeling of RNA 3D structure

    PubMed Central

    Rother, Magdalena; Rother, Kristian; Puton, Tomasz; Bujnicki, Janusz M.

    2011-01-01

    RNA is a large group of functionally important biomacromolecules. In striking analogy to proteins, the function of RNA depends on its structure and dynamics, which in turn is encoded in the linear sequence. However, while there are numerous methods for computational prediction of protein three-dimensional (3D) structure from sequence, with comparative modeling being the most reliable approach, there are very few such methods for RNA. Here, we present ModeRNA, a software tool for comparative modeling of RNA 3D structures. As an input, ModeRNA requires a 3D structure of a template RNA molecule, and a sequence alignment between the target to be modeled and the template. It must be emphasized that a good alignment is required for successful modeling, and for large and complex RNA molecules the development of a good alignment usually requires manual adjustments of the input data based on previous expertise of the respective RNA family. ModeRNA can model post-transcriptional modifications, a functionally important feature analogous to post-translational modifications in proteins. ModeRNA can also model DNA structures or use them as templates. It is equipped with many functions for merging fragments of different nucleic acid structures into a single model and analyzing their geometry. Windows and UNIX implementations of ModeRNA with comprehensive documentation and a tutorial are freely available. PMID:21300639

  4. [Analysis of the primary and secondary structure of the mitochondrial serine transfer RNA in seven species of Lutzomyia].

    PubMed

    Vivero, Rafael José; Contreras-Gutiérrez, Maria Angélica; Bejarano, Eduar Elías

    2007-09-01

    Lutzomyia sand flies are involved in the transmission of the parasite Leishmania spp. in America. The taxonomy of these vectors is traditionally based on morphological features of the adult stage, particularly the paired structures of the head and genitalia. Although these characters are useful to distinguish most species of Lutzomyia, morphological identification may be complicated by the similarities within subgenera and species group. To evaluate the utility of mitochondrial serine transfer RNA tRNA Ser for taxonomic identification of Lutzomyia. Seven sand fly species, each representing one of the 27 taxonomic subdivisions in genus Lutzomyia, were analyzed including L. trinidadensis (Oswaldoi group), L. (Psychodopygus) panamensis, L.(Micropygomyia) cayennensis cayennensis, L. dubitans (Migonei group), L. (Lutzomyia) gomezi, L. rangeliana (ungrouped) and L. evansi (Verrucarum group). The mitochondrial tRNA Ser gene, flanked by the cytochrome b and NAD dehydrogenase subunit one genes, was extracted, amplified and sequenced from each specimen. Secondary structure of the tRNA Ser was predicted by comparisons with previously described homologous structures from other dipteran species. The tRNA Ser gene ranged in size from 66 base pairs in L. gomezi to 69 base pairs in L. trinidadensis. Fourteen polymorphic sites, including four insertion-deletion events, were observed in the aligned 70 nucleotide positions. The majority of the substitutions were located in the dihydrouridine, ribothymidine-pseudouridine-cytosine and variable loops, as well as in the basal extreme of the anticodon arm. Changes of primary sequence of the tRNASer provided useful molecular characters for taxonomic identification of the sand fly species under consideration.

  5. Distinct families of cis-acting RNA replication elements epsilon from hepatitis B viruses

    PubMed Central

    Chen, Augustine; Brown, Chris

    2012-01-01

    The hepadnavirus encapsidation signal, epsilon (ε), is an RNA structure located at the 5′ end of the viral pregenomic RNA. It is essential for viral replication and functions in polymerase protein binding and priming. This structure could also have potential regulatory roles in controlling the expression of viral replicative proteins. In addition to its structure, the primary sequence of this RNA element has crucial functional roles in the viral lifecycle. Although the ε elements in hepadnaviruses share common critical functions, there are some significant differences in mammalian and avian hepadnaviruses, which include both sequence and structural variations.   Here we present several covariance models for ε elements from the Hepadnaviridae. The model building included experimentally determined data from previous studies using chemical probing and NMR analysis. These models have sufficient similarity to comprise a clan. The clan has in common a highly conserved overall structure consisting of a lower-stem, bulge, upper-stem and apical-loop. The models differ in functionally critical regions—notably the two types of avian ε elements have a tetra-loop (UGUU) including a non-canonical UU base pair, while the hepatitis B virus (HBV) epsilon has a tri-loop (UGU). The avian epsilon elements have a less stable dynamic structure in the upper stem. Comparisons between these models and all other Rfam models, and searches of genomes, showed these structures are specific to the Hepadnaviridae. Two family models and the clan are available from the Rfam database. PMID:22418844

  6. RNA Characterization by Solid-State NMR Spectroscopy.

    PubMed

    Yang, Yufei; Wang, Shenlin

    2018-06-21

    The structures of RNAs, which play critical roles in various biological processes, provide important clues and insights into the biological functions of these molecules. However, RNA structure determination remains a challenging topic. In recent years, magic-angle-spinning solid-state NMR (MAS SSNMR) has emerged as an alternative technique for structural and dynamic characterization of RNA. MAS SSNMR has been successfully applied to provide atomic-level structural information about several RNA molecules and RNA-protein complexes. In this Minireview, we give an overview of recent progress in the field of MAS SSNMR based RNA structural characterization, and introduce sample preparation strategies and SSNMR spectroscopic techniques that have been incorporated to identify RNA structural elements. We also highlight a few impressive examples of RNAs that have been investigated extensively by SSNMR. Finally, we briefly discuss future technical trends in the use of MAS SSNMR to facilitate RNA structure determination. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

  7. NoFold: RNA structure clustering without folding or alignment.

    PubMed

    Middleton, Sarah A; Kim, Junhyong

    2014-11-01

    Structures that recur across multiple different transcripts, called structure motifs, often perform a similar function-for example, recruiting a specific RNA-binding protein that then regulates translation, splicing, or subcellular localization. Identifying common motifs between coregulated transcripts may therefore yield significant insight into their binding partners and mechanism of regulation. However, as most methods for clustering structures are based on folding individual sequences or doing many pairwise alignments, this results in a tradeoff between speed and accuracy that can be problematic for large-scale data sets. Here we describe a novel method for comparing and characterizing RNA secondary structures that does not require folding or pairwise alignment of the input sequences. Our method uses the idea of constructing a distance function between two objects by their respective distances to a collection of empirical examples or models, which in our case consists of 1973 Rfam family covariance models. Using this as a basis for measuring structural similarity, we developed a clustering pipeline called NoFold to automatically identify and annotate structure motifs within large sequence data sets. We demonstrate that NoFold can simultaneously identify multiple structure motifs with an average sensitivity of 0.80 and precision of 0.98 and generally exceeds the performance of existing methods. We also perform a cross-validation analysis of the entire set of Rfam families, achieving an average sensitivity of 0.57. We apply NoFold to identify motifs enriched in dendritically localized transcripts and report 213 enriched motifs, including both known and novel structures. © 2014 Middleton and Kim; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  8. SONAR Discovers RNA-Binding Proteins from Analysis of Large-Scale Protein-Protein Interactomes.

    PubMed

    Brannan, Kristopher W; Jin, Wenhao; Huelga, Stephanie C; Banks, Charles A S; Gilmore, Joshua M; Florens, Laurence; Washburn, Michael P; Van Nostrand, Eric L; Pratt, Gabriel A; Schwinn, Marie K; Daniels, Danette L; Yeo, Gene W

    2016-10-20

    RNA metabolism is controlled by an expanding, yet incomplete, catalog of RNA-binding proteins (RBPs), many of which lack characterized RNA binding domains. Approaches to expand the RBP repertoire to discover non-canonical RBPs are currently needed. Here, HaloTag fusion pull down of 12 nuclear and cytoplasmic RBPs followed by quantitative mass spectrometry (MS) demonstrates that proteins interacting with multiple RBPs in an RNA-dependent manner are enriched for RBPs. This motivated SONAR, a computational approach that predicts RNA binding activity by analyzing large-scale affinity precipitation-MS protein-protein interactomes. Without relying on sequence or structure information, SONAR identifies 1,923 human, 489 fly, and 745 yeast RBPs, including over 100 human candidate RBPs that contain zinc finger domains. Enhanced CLIP confirms RNA binding activity and identifies transcriptome-wide RNA binding sites for SONAR-predicted RBPs, revealing unexpected RNA binding activity for disease-relevant proteins and DNA binding proteins. Copyright © 2016 Elsevier Inc. All rights reserved.

  9. Global RNA association with the transcriptionally active chromosome of chloroplasts.

    PubMed

    Lehniger, Marie-Kristin; Finster, Sabrina; Melonek, Joanna; Oetke, Svenja; Krupinska, Karin; Schmitz-Linneweber, Christian

    2017-10-01

    Processed chloroplast RNAs are co-enriched with preparations of the chloroplast transcriptionally active chromosome. Chloroplast genomes are organized as a polyploid DNA-protein structure called the nucleoid. Transcriptionally active chloroplast DNA together with tightly bound protein factors can be purified by gel filtration as a functional entity called the transcriptionally active chromosome (TAC). Previous proteomics analyses of nucleoids and of TACs demonstrated a considerable overlap in protein composition including RNA binding proteins. Therefore the RNA content of TAC preparations from Nicotiana tabacum was determined using whole genome tiling arrays. A large number of chloroplast RNAs was found to be associated with the TAC. The pattern of RNAs attached to the TAC consists of RNAs produced by different chloroplast RNA polymerases and differs from the pattern of RNA found in input controls. An analysis of RNA splicing and RNA editing of selected RNA species demonstrated that TAC-associated RNAs are processed to a similar extent as the RNA in input controls. Thus, TAC fractions contain a specific subset of the processed chloroplast transcriptome.

  10. Stem-Loop RNA Hairpins in Giant Viruses: Invading rRNA-Like Repeats and a Template Free RNA

    PubMed Central

    Seligmann, Hervé; Raoult, Didier

    2018-01-01

    We examine the hypothesis that de novo template-free RNAs still form spontaneously, as they did at the origins of life, invade modern genomes, contribute new genetic material. Previously, analyses of RNA secondary structures suggested that some RNAs resembling ancestral (t)RNAs formed recently de novo, other parasitic sequences cluster with rRNAs. Here positive control analyses of additional RNA secondary structures confirm ancestral and de novo statuses of RNA grouped according to secondary structure. Viroids with branched stems resemble de novo RNAs, rod-shaped viroids resemble rRNA secondary structures, independently of GC contents. 5′ UTR leading regions of West Nile and Dengue flavivirid viruses resemble de novo and rRNA structures, respectively. An RNA homologous with Megavirus, Dengue and West Nile genomes, copperhead snake microsatellites and levant cotton repeats, not templated by Mimivirus' genome, persists throughout Mimivirus' infection. Its secondary structure clusters with candidate de novo RNAs. The saltatory phyletic distribution and secondary structure of Mimivirus' peculiar RNA suggest occasional template-free polymerization of this sequence, rather than noncanonical transcriptions (swinger polymerization, posttranscriptional editing). PMID:29449833

  11. ε, a new subunit of RNA polymerase found in gram-positive bacteria.

    PubMed

    Keller, Andrew N; Yang, Xiao; Wiedermannová, Jana; Delumeau, Olivier; Krásný, Libor; Lewis, Peter J

    2014-10-01

    RNA polymerase in bacteria is a multisubunit protein complex that is essential for gene expression. We have identified a new subunit of RNA polymerase present in the high-A+T Firmicutes phylum of Gram-positive bacteria and have named it ε. Previously ε had been identified as a small protein (ω1) that copurified with RNA polymerase. We have solved the structure of ε by X-ray crystallography and show that it is not an ω subunit. Rather, ε bears remarkable similarity to the Gp2 family of phage proteins involved in the inhibition of host cell transcription following infection. Deletion of ε shows no phenotype and has no effect on the transcriptional profile of the cell. Determination of the location of ε within the assembly of RNA polymerase core by single-particle analysis suggests that it binds toward the downstream side of the DNA binding cleft. Due to the structural similarity of ε with Gp2 and the fact they bind similar regions of RNA polymerase, we hypothesize that ε may serve a role in protection from phage infection. Copyright © 2014, American Society for Microbiology. All Rights Reserved.

  12. Analysis of the contributions of ring current and electric field effects to the chemical shifts of RNA bases.

    PubMed

    Sahakyan, Aleksandr B; Vendruscolo, Michele

    2013-02-21

    Ring current and electric field effects can considerably influence NMR chemical shifts in biomolecules. Understanding such effects is particularly important for the development of accurate mappings between chemical shifts and the structures of nucleic acids. In this work, we first analyzed the Pople and the Haigh-Mallion models in terms of their ability to describe nitrogen base conjugated ring effects. We then created a database (DiBaseRNA) of three-dimensional arrangements of RNA base pairs from X-ray structures, calculated the corresponding chemical shifts via a hybrid density functional theory approach and used the results to parametrize the ring current and electric field effects in RNA bases. Next, we studied the coupling of the electric field and ring current effects for different inter-ring arrangements found in RNA bases using linear model fitting, with joint electric field and ring current, as well as only electric field and only ring current approximations. Taken together, our results provide a characterization of the interdependence of ring current and electric field geometric factors, which is shown to be especially important for the chemical shifts of non-hydrogen atoms in RNA bases.

  13. Altering the orientation of a fused protein to the RNA-binding ribosomal protein L7Ae and its derivatives through circular permutation.

    PubMed

    Ohuchi, Shoji J; Sagawa, Fumihiko; Sakamoto, Taiichi; Inoue, Tan

    2015-10-23

    RNA-protein complexes (RNPs) are useful for constructing functional nano-objects because a variety of functional proteins can be displayed on a designed RNA scaffold. Here, we report circular permutations of an RNA-binding protein L7Ae based on the three-dimensional structure information to alter the orientation of the displayed proteins on the RNA scaffold. An electrophoretic mobility shift assay and atomic force microscopy (AFM) analysis revealed that most of the designed circular permutants formed an RNP nano-object. Moreover, the alteration of the enhanced green fluorescent protein (EGFP) orientation was confirmed with AFM by employing EGFP on the L7Ae permutant on the RNA. The results demonstrate that targeted fine-tuning of the stereo-specific fixation of a protein on a protein-binding RNA is feasible by using the circular permutation technique. Copyright © 2015 Elsevier Inc. All rights reserved.

  14. Altering the orientation of a fused protein to the RNA-binding ribosomal protein L7Ae and its derivatives through circular permutation

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ohuchi, Shoji J.; Sagawa, Fumihiko; Sakamoto, Taiichi

    RNA-protein complexes (RNPs) are useful for constructing functional nano-objects because a variety of functional proteins can be displayed on a designed RNA scaffold. Here, we report circular permutations of an RNA-binding protein L7Ae based on the three-dimensional structure information to alter the orientation of the displayed proteins on the RNA scaffold. An electrophoretic mobility shift assay and atomic force microscopy (AFM) analysis revealed that most of the designed circular permutants formed an RNP nano-object. Moreover, the alteration of the enhanced green fluorescent protein (EGFP) orientation was confirmed with AFM by employing EGFP on the L7Ae permutant on the RNA. Themore » results demonstrate that targeted fine-tuning of the stereo-specific fixation of a protein on a protein-binding RNA is feasible by using the circular permutation technique.« less

  15. Vfold: a web server for RNA structure and folding thermodynamics prediction.

    PubMed

    Xu, Xiaojun; Zhao, Peinan; Chen, Shi-Jie

    2014-01-01

    The ever increasing discovery of non-coding RNAs leads to unprecedented demand for the accurate modeling of RNA folding, including the predictions of two-dimensional (base pair) and three-dimensional all-atom structures and folding stabilities. Accurate modeling of RNA structure and stability has far-reaching impact on our understanding of RNA functions in human health and our ability to design RNA-based therapeutic strategies. The Vfold server offers a web interface to predict (a) RNA two-dimensional structure from the nucleotide sequence, (b) three-dimensional structure from the two-dimensional structure and the sequence, and (c) folding thermodynamics (heat capacity melting curve) from the sequence. To predict the two-dimensional structure (base pairs), the server generates an ensemble of structures, including loop structures with the different intra-loop mismatches, and evaluates the free energies using the experimental parameters for the base stacks and the loop entropy parameters given by a coarse-grained RNA folding model (the Vfold model) for the loops. To predict the three-dimensional structure, the server assembles the motif scaffolds using structure templates extracted from the known PDB structures and refines the structure using all-atom energy minimization. The Vfold-based web server provides a user friendly tool for the prediction of RNA structure and stability. The web server and the source codes are freely accessible for public use at "http://rna.physics.missouri.edu".

  16. Crystal Structures of the Mango-II RNA Aptamer Reveal Heterogeneous Fluorophore Binding and Guide Engineering of Variants with Improved Selectivity and Brightness.

    PubMed

    Trachman, Robert J; Abdolahzadeh, Amir; Andreoni, Alessio; Cojocaru, Razvan; Knutson, Jay R; Ryckelynck, Michael; Unrau, Peter J; Ferré-D'Amaré, Adrian R

    2018-05-24

    Several RNA aptamers that bind small molecules and enhance their fluorescence have been successfully used to tag and track RNAs in vivo, but these genetically encodable tags have not yet achieved single-fluorophore resolution. Recently, Mango-II, an RNA that binds TO1-Biotin with ∼1 nM affinity and enhances its fluorescence by >1500-fold, was isolated by fluorescence selection from the pool that yielded the original RNA Mango. We determined the crystal structures of Mango-II in complex with two fluorophores, TO1-Biotin and TO3-Biotin, and found that despite their high affinity, the ligands adopt multiple distinct conformations, indicative of a binding pocket with modest stereoselectivity. Mutational analysis of the binding site led to Mango-II(A22U), which retains high affinity for TO1-Biotin but now discriminates >5-fold against TO3-biotin. Moreover, fluorescence enhancement of TO1-Biotin increases by 18%, while that of TO3-Biotin decreases by 25%. Crystallographic, spectroscopic, and analogue studies show that the A22U mutation improves conformational homogeneity and shape complementarity of the fluorophore-RNA interface. Our work demonstrates that even after extensive functional selection, aptamer RNAs can be further improved through structure-guided engineering.

  17. Rigid 2',4'-difluororibonucleosides: synthesis, conformational analysis, and incorporation into nascent RNA by HCV polymerase.

    PubMed

    Martínez-Montero, Saúl; Deleavey, Glen F; Kulkarni, Anupriya; Martín-Pintado, Nerea; Lindovska, Petra; Thomson, Michael; González, Carlos; Götte, Matthias; Damha, Masad J

    2014-06-20

    We report on the synthesis and conformational properties of 2'-deoxy-2',4'-difluorouridine (2',4'-diF-rU) and cytidine (2',4'-diF-rC) nucleosides. NMR analysis and quantum mechanical calculations show that the strong stereoelectronic effects induced by the two fluorines essentially "lock" the conformation of the sugar in the North region of the pseudorotational cycle. Our studies also demonstrate that NS5B HCV RNA polymerase was able to accommodate 2',4'-diF-rU 5'-triphosphate (2',4'-diF-rUTP) and to link the monophosphate to the RNA primer strand. 2',4'-diF-rUTP inhibited RNA synthesis in dinucleotide-primed reactions, although with relatively high half-maximal inhibitory concentrations (IC50 > 50 μM). 2',4'-diF-rU/C represents rare examples of "locked" ribonucleoside mimics that lack a bicyclic ring structure.

  18. Solution structure and thermodynamics of 2',5' RNA intercalation.

    PubMed

    Horowitz, Eric D; Lilavivat, Seth; Holladay, Benjamin W; Germann, Markus W; Hud, Nicholas V

    2009-04-29

    As a means to explore the influence of the nucleic acid backbone on the intercalative binding of ligands to DNA and RNA, we have determined the solution structure of a proflavine-bound 2',5'-linked octamer duplex with the sequence GCCGCGGC. This structure represents the first NMR structure of an intercalated RNA duplex, of either backbone structural isomer. By comparison with X-ray crystal structures, we have identified similarities and differences between intercalated 3',5' and 2',5'-linked RNA duplexes. First, the two forms of RNA have different sugar pucker geometries at the intercalated nucleotide steps, yet have the same interphosphate distances. Second, as in intercalated 3',5' RNA, the phosphate backbone angle zeta at the 2',5' RNA intercalation site prefers to be in the trans conformation, whereas unintercalated 2',5' and 3',5' RNA prefer the -gauche conformation. These observations provide new insights regarding the transitions required for intercalation of a phosphodiester-ribose backbone and suggest a possible contribution of the backbone to the origin of the nearest-neighbor exclusion principle. Thermodynamic studies presented for intercalation of both structural RNA isomers also reveal a surprising sensitivity of intercalator binding enthalpy and entropy to the details of RNA backbone structure.

  19. Interaction of zanamivir with DNA and RNA: Models for drug DNA and drug RNA bindings

    NASA Astrophysics Data System (ADS)

    Nafisi, Shohreh; Kahangi, Fatemeh Ghoreyshi; Azizi, Ebrahim; Zebarjad, Nader; Tajmir-Riahi, Heidar-Ali

    2007-03-01

    Zanamivir (ZAN) is the first of a new generation of influenza virus-specific drugs known as neuraminidase inhibitors, which acts by interfering with life cycles of influenza viruses A and B. It prevents the virus spreading infection to other cells by blocking the neuraminidase enzyme present on the surface of the virus. The aim of this study was to examine the stability and structural features of calf thymus DNA and yeast RNA complexes with zanamivir in aqueous solution, using constant DNA or RNA concentration (12.5 mM) and various zanamivir/polynucleotide ( P) ratios of 1/20, 1/10, 1/4, and 1/2. FTIR and UV-visible spectroscopy are used to determine the drug external binding modes, the binding constant and the stability of zanamivir-DNA and RNA complexes in aqueous solution. Structural analysis showed major interaction of zanamivir with G-C (major groove) and A-T (minor groove) base pairs and minor perturbations of the backbone PO 2 group with overall binding constants of Kzanamivir-DNA = 1.30 × 10 4 M -1 and Kzanamivir-RNA = 1.38 × 10 4 M -1. The drug interaction induces a partial B to A-DNA transition, while RNA remains in A-conformation.

  20. A genetically encoded fluorescent tRNA is active in live-cell protein synthesis

    PubMed Central

    Masuda, Isao; Igarashi, Takao; Sakaguchi, Reiko; Nitharwal, Ram G.; Takase, Ryuichi; Han, Kyu Young; Leslie, Benjamin J.; Liu, Cuiping; Gamper, Howard; Ha, Taekjip; Sanyal, Suparna

    2017-01-01

    Abstract Transfer RNAs (tRNAs) perform essential tasks for all living cells. They are major components of the ribosomal machinery for protein synthesis and they also serve in non-ribosomal pathways for regulation and signaling metabolism. We describe the development of a genetically encoded fluorescent tRNA fusion with the potential for imaging in live Escherichia coli cells. This tRNA fusion carries a Spinach aptamer that becomes fluorescent upon binding of a cell-permeable and non-toxic fluorophore. We show that, despite having a structural framework significantly larger than any natural tRNA species, this fusion is a viable probe for monitoring tRNA stability in a cellular quality control mechanism that degrades structurally damaged tRNA. Importantly, this fusion is active in E. coli live-cell protein synthesis allowing peptidyl transfer at a rate sufficient to support cell growth, indicating that it is accommodated by translating ribosomes. Imaging analysis shows that this fusion and ribosomes are both excluded from the nucleoid, indicating that the fusion and ribosomes are in the cytosol together possibly engaged in protein synthesis. This fusion methodology has the potential for developing new tools for live-cell imaging of tRNA with the unique advantage of both stoichiometric labeling and broader application to all cells amenable to genetic engineering. PMID:27956502

  1. Design and implementation of a synthetic pre-miR switch for controlling miRNA biogenesis in mammals

    PubMed Central

    Atanasov, Janina; Groher, Florian

    2017-01-01

    Abstract Synthetic RNA-based systems have increasingly been used for the regulation of eukaryotic gene expression. Due to their structural properties, riboregulators provide a convenient basis for the development of ligand-dependent controllable systems. Here, we demonstrate reversible conditional control of miRNA biogenesis with an aptamer domain as a sensing unit connected to a natural miRNA precursor for the first time. For the design of the pre-miR switch, we replaced the natural terminal loop with the TetR aptamer. Thus, the TetR aptamer was positioned close to the Dicer cleavage sites, which allowed sterical control over pre-miR processing by Dicer. Our design proved to be highly versatile, allowing us to regulate the biogenesis of three structurally different miRNAs: miR-126, -34a and -199a. Dicer cleavage was inhibited up to 143-fold via co-expression of the TetR protein, yet could be completely restored upon addition of doxycycline. Moreover, we showed the functionality of the pre-miR switches for gene regulation through the interaction of the respective miRNA with its specific target sequence. Our designed device is capable of robust and reversible control of miRNA abundance. Thus, we offer a novel investigational tool for functional miRNA analysis. PMID:29036355

  2. NMR studies of protein-nucleic acid interactions.

    PubMed

    Varani, Gabriele; Chen, Yu; Leeper, Thomas C

    2004-01-01

    Protein-DNA and protein-RNA complexes play key functional roles in every living organism. Therefore, the elucidation of their structure and dynamics is an important goal of structural and molecular biology. Nuclear magnetic resonance (NMR) studies of protein and nucleic acid complexes have common features with studies of protein-protein complexes: the interaction surfaces between the molecules must be carefully delineated, the relative orientation of the two species needs to be accurately and precisely determined, and close intermolecular contacts defined by nuclear Overhauser effects (NOEs) must be obtained. However, differences in NMR properties (e.g., chemical shifts) and biosynthetic pathways for sample productions generate important differences. Chemical shift differences between the protein and nucleic acid resonances can aid the NMR structure determination process; however, the relatively limited dispersion of the RNA ribose resonances makes the process of assigning intermolecular NOEs more difficult. The analysis of the resulting structures requires computational tools unique to nucleic acid interactions. This chapter summarizes the most important elements of the structure determination by NMR of protein-nucleic acid complexes and their analysis. The main emphasis is on recent developments (e.g., residual dipolar couplings and new Web-based analysis tools) that have facilitated NMR studies of these complexes and expanded the type of biological problems to which NMR techniques of structural elucidation can now be applied.

  3. Optimization of oligonucleotide arrays and RNA amplification protocols for analysis of transcript structure and alternative splicing.

    PubMed

    Castle, John; Garrett-Engele, Phil; Armour, Christopher D; Duenwald, Sven J; Loerch, Patrick M; Meyer, Michael R; Schadt, Eric E; Stoughton, Roland; Parrish, Mark L; Shoemaker, Daniel D; Johnson, Jason M

    2003-01-01

    Microarrays offer a high-resolution means for monitoring pre-mRNA splicing on a genomic scale. We have developed a novel, unbiased amplification protocol that permits labeling of entire transcripts. Also, hybridization conditions, probe characteristics, and analysis algorithms were optimized for detection of exons, exon-intron edges, and exon junctions. These optimized protocols can be used to detect small variations and isoform mixtures, map the tissue specificity of known human alternative isoforms, and provide a robust, scalable platform for high-throughput discovery of alternative splicing.

  4. Optimization of oligonucleotide arrays and RNA amplification protocols for analysis of transcript structure and alternative splicing

    PubMed Central

    Castle, John; Garrett-Engele, Phil; Armour, Christopher D; Duenwald, Sven J; Loerch, Patrick M; Meyer, Michael R; Schadt, Eric E; Stoughton, Roland; Parrish, Mark L; Shoemaker, Daniel D; Johnson, Jason M

    2003-01-01

    Microarrays offer a high-resolution means for monitoring pre-mRNA splicing on a genomic scale. We have developed a novel, unbiased amplification protocol that permits labeling of entire transcripts. Also, hybridization conditions, probe characteristics, and analysis algorithms were optimized for detection of exons, exon-intron edges, and exon junctions. These optimized protocols can be used to detect small variations and isoform mixtures, map the tissue specificity of known human alternative isoforms, and provide a robust, scalable platform for high-throughput discovery of alternative splicing. PMID:14519201

  5. RNA Structures as Mediators of Neurological Diseases and as Drug Targets.

    PubMed

    Bernat, Viachaslau; Disney, Matthew D

    2015-07-01

    RNAs adopt diverse folded structures that are essential for function and thus play critical roles in cellular biology. A striking example of this is the ribosome, a complex, three-dimensionally folded macromolecular machine that orchestrates protein synthesis. Advances in RNA biochemistry, structural and molecular biology, and bioinformatics have revealed other non-coding RNAs whose functions are dictated by their structure. It is not surprising that aberrantly folded RNA structures contribute to disease. In this Review, we provide a brief introduction into RNA structural biology and then describe how RNA structures function in cells and cause or contribute to neurological disease. Finally, we highlight successful applications of rational design principles to provide chemical probes and lead compounds targeting structured RNAs. Based on several examples of well-characterized RNA-driven neurological disorders, we demonstrate how designed small molecules can facilitate the study of RNA dysfunction, elucidating previously unknown roles for RNA in disease, and provide lead therapeutics. Copyright © 2015 Elsevier Inc. All rights reserved.

  6. RNA-Puzzles Round II: assessment of RNA structure prediction programs applied to three large RNA structures

    PubMed Central

    Miao, Zhichao; Adamiak, Ryszard W.; Blanchet, Marc-Frédérick; Boniecki, Michal; Bujnicki, Janusz M.; Chen, Shi-Jie; Cheng, Clarence; Chojnowski, Grzegorz; Chou, Fang-Chieh; Cordero, Pablo; Cruz, José Almeida; Ferré-D'Amaré, Adrian R.; Das, Rhiju; Ding, Feng; Dokholyan, Nikolay V.; Dunin-Horkawicz, Stanislaw; Kladwang, Wipapat; Krokhotin, Andrey; Lach, Grzegorz; Magnus, Marcin; Major, François; Mann, Thomas H.; Masquida, Benoît; Matelska, Dorota; Meyer, Mélanie; Peselis, Alla; Popenda, Mariusz; Purzycka, Katarzyna J.; Serganov, Alexander; Stasiewicz, Juliusz; Szachniuk, Marta; Tandon, Arpit; Tian, Siqi; Wang, Jian; Xiao, Yi; Xu, Xiaojun; Zhang, Jinwei; Zhao, Peinan; Zok, Tomasz; Westhof, Eric

    2015-01-01

    This paper is a report of a second round of RNA-Puzzles, a collective and blind experiment in three-dimensional (3D) RNA structure prediction. Three puzzles, Puzzles 5, 6, and 10, represented sequences of three large RNA structures with limited or no homology with previously solved RNA molecules. A lariat-capping ribozyme, as well as riboswitches complexed to adenosylcobalamin and tRNA, were predicted by seven groups using RNAComposer, ModeRNA/SimRNA, Vfold, Rosetta, DMD, MC-Fold, 3dRNA, and AMBER refinement. Some groups derived models using data from state-of-the-art chemical-mapping methods (SHAPE, DMS, CMCT, and mutate-and-map). The comparisons between the predictions and the three subsequently released crystallographic structures, solved at diffraction resolutions of 2.5–3.2 Å, were carried out automatically using various sets of quality indicators. The comparisons clearly demonstrate the state of present-day de novo prediction abilities as well as the limitations of these state-of-the-art methods. All of the best prediction models have similar topologies to the native structures, which suggests that computational methods for RNA structure prediction can already provide useful structural information for biological problems. However, the prediction accuracy for non-Watson–Crick interactions, key to proper folding of RNAs, is low and some predicted models had high Clash Scores. These two difficulties point to some of the continuing bottlenecks in RNA structure prediction. All submitted models are available for download at http://ahsoka.u-strasbg.fr/rnapuzzles/. PMID:25883046

  7. Partial DNA-guided Cas9 enables genome editing with reduced off-target activity

    PubMed Central

    Yin, Hao; Song, Chun-Qing; Suresh, Sneha; Kwan, Suet-Yan; Wu, Qiongqiong; Walsh, Stephen; Ding, Junmei; Bogorad, Roman L; Zhu, Lihua Julie; Wolfe, Scot A; Koteliansky, Victor; Xue, Wen; Langer, Robert; Anderson, Daniel G

    2018-01-01

    CRISPR–Cas9 is a versatile RNA-guided genome editing tool. Here we demonstrate that partial replacement of RNA nucleotides with DNA nucleotides in CRISPR RNA (crRNA) enables efficient gene editing in human cells. This strategy of partial DNA replacement retains on-target activity when used with both crRNA and sgRNA, as well as with multiple guide sequences. Partial DNA replacement also works for crRNA of Cpf1, another CRISPR system. We find that partial DNA replacement in the guide sequence significantly reduces off-target genome editing through focused analysis of off-target cleavage, measurement of mismatch tolerance and genome-wide profiling of off-target sites. Using the structure of the Cas9–sgRNA complex as a guide, the majority of the 3′ end of crRNA can be replaced with DNA nucleotide, and the 5 - and 3′-DNA-replaced crRNA enables efficient genome editing. Cas9 guided by a DNA–RNA chimera may provide a generalized strategy to reduce both the cost and the off-target genome editing in human cells. PMID:29377001

  8. Decreased A-to-I RNA editing as a source of keratinocytes' dsRNA in psoriasis.

    PubMed

    Shallev, Lea; Kopel, Eli; Feiglin, Ariel; Leichner, Gil S; Avni, Dror; Sidi, Yechezkel; Eisenberg, Eli; Barzilai, Aviv; Levanon, Erez Y; Greenberger, Shoshana

    2018-06-01

    Recognition of dsRNA molecules activates the MDA5-MAVS pathway and plays a critical role in stimulating type-I interferon responses in psoriasis. However, the source of the dsRNA accumulation in psoriatic keratinocytes remains largely unknown. A-to-I RNA editing is a common co- or post-transcriptional modification that diversifies adenosine in dsRNA, and leads to unwinding of dsRNA structures. Thus, impaired RNA editing activity can result in an increased load of endogenous dsRNAs. Here we provide a transcriptome-wide analysis of RNA editing across dozens of psoriasis patients, and we demonstrate a global editing reduction in psoriatic lesions. In addition to the global alteration, we also detect editing changes in functional recoding sites located in the IGFBP7 , COPA , and FLNA genes. Accretion of dsRNA activates autoimmune responses, and therefore the results presented here, linking for the first time an autoimmune disease to reduction in global editing level, are relevant to a wide range of autoimmune diseases. © 2018 Shallev et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  9. The presence of highly disruptive 16S rRNA mutations in clinical samples indicates a wider role for mutations of the mitochondrial ribosome in human disease

    PubMed Central

    Elson, Joanna L.; Smith, Paul M.; Greaves, Laura C.; Lightowlers, Robert N.; Chrzanowska-Lightowlers, Zofia M.A.; Taylor, Robert W.; Vila-Sanjurjo, Antón

    2015-01-01

    Mitochondrial DNA mutations are well recognized as an important cause of disease, with over two hundred variants in the protein encoding and mt-tRNA genes associated with human disorders. In contrast, the two genes encoding the mitochondrial rRNAs (mt-rRNAs) have been studied in far less detail. This is because establishing the pathogenicity of mt-rRNA mutations is a major diagnostic challenge. Only two disease causing mutations have been identified at these loci, both mapping to the small subunit (SSU). On the large subunit (LSU), however, the evidence for the presence of pathogenic LSU mt-rRNA changes is particularly sparse. We have previously expanded the list of deleterious SSU mt-rRNA mutations by identifying highly disruptive base changes capable of blocking the activity of the mitoribosomal SSU. To do this, we used a new methodology named heterologous inferential analysis (HIA). The recent arrival of near-atomic-resolution structures of the human mitoribosomal LSU, has enhanced the power of our approach by permitting the analysis of the corresponding sites of mutation within their natural structural context. Here, we have used these tools to determine whether LSU mt-rRNA mutations found in the context of human disease and/or ageing could disrupt the function of the mitoribosomal LSU. Our results clearly show that, much like the for SSU mt-rRNA, LSU mt-rRNAs mutations capable of compromising the function of the mitoribosomal LSU are indeed present in clinical samples. Thus, our work constitutes an important contribution to an emerging view of the mitoribosome as an important element in human health. PMID:26349026

  10. Sequence heterogeneities of genes encoding 16S rRNAs in Paenibacillus polymyxa detected by temperature gradient gel electrophoresis.

    PubMed Central

    Nübel, U; Engelen, B; Felske, A; Snaidr, J; Wieshuber, A; Amann, R I; Ludwig, W; Backhaus, H

    1996-01-01

    Sequence heterogeneities in 16S rRNA genes from individual strains of Paenibacillus polymyxa were detected by sequence-dependent separation of PCR products by temperature gradient gel electrophoresis (TGGE). A fragment of the 16S rRNA genes, comprising variable regions V6 to V8, was used as a target sequence for amplifications. PCR products from P. polymyxa (type strain) emerged as a well-defined pattern of bands in the gradient gel. Six plasmids with different inserts, individually demonstrating the migration characteristics of single bands of the pattern, were obtained by cloning the PCR products. Their sequences were analyzed as a representative sample of the total heterogeneity. An amount of 10 variant nucleotide positions in the fragment of 347 bp was observed, with all substitutions conserving the relevant secondary structures of the V6 and V8 regions in the RNA molecules. Hybridizations with specifically designed probes demonstrated different chromosomal locations of the respective rRNA genes. Amplifications of reverse-transcribed rRNA from ribosome preparations, as well as whole-cell hybridizations, revealed a predominant representation of particular sequences in ribosomes of exponentially growing laboratory cultures. Different strains of P. polymyxa showed not only remarkably differing patterns of PCR products in TGGE analysis but also discriminative whole-cell labeling with the designed oligonucleotide probes, indicating the different representation of individual sequences in active ribosomes. Our results demonstrate the usefulness of TGGE for the structural analysis of heterogeneous rRNA genes together with their expression, stress problems of the generation of meaningful data for 16S rRNA sequences and probe designs, and might have consequences for evolutionary concepts. PMID:8824607

  11. Isolation of temperature-sensitive mutants of 16 S rRNA in Escherichia coli.

    PubMed

    Triman, K; Becker, E; Dammel, C; Katz, J; Mori, H; Douthwaite, S; Yapijakis, C; Yoast, S; Noller, H F

    1989-10-20

    Temperature-sensitive mutants have been isolated following hydroxylamine mutagenesis of a plasmid containing Escherichia coli rRNA genes carrying selectable markers for spectinomycin resistance (U1192 in 16 S rRNA) and erythromycin resistance (G2058 in 23 S rRNA). These antibiotic resistance alleles, originally identified by Morgan and co-workers, enable us to follow expression of cloned rRNA genes in vivo. Recessive mutations causing the loss of expression of the cloned 16 S rRNA gene were identified by the loss of the ability of cells to survive on media containing spectinomycin. The mutations were localized by in vitro restriction fragment replacement followed by in vivo marker rescue and were identified by DNA sequence analysis. We report here seven single-base alterations in 16 S rRNA (A146, U153, A350, A359, A538, A1292 and U1293), five of which produce temperature-sensitive spectinomycin resistance and two that produce unconditional loss of resistance. In each case, loss of ribosomal function can be accounted for by disruption of base-pairing in the secondary structure of 16 S rRNA. For the temperature-sensitive mutants, there is a lag period of about two generations between a shift to the restrictive temperature and cessation of growth, implying that the structural defects cause impairment of ribosome assembly.

  12. Structure of RNA polymerase complex and genome within a dsRNA virus provides insights into the mechanisms of transcription and assembly.

    PubMed

    Wang, Xurong; Zhang, Fuxian; Su, Rui; Li, Xiaowu; Chen, Wenyuan; Chen, Qingxiu; Yang, Tao; Wang, Jiawei; Liu, Hongrong; Fang, Qin; Cheng, Lingpeng

    2018-06-25

    Most double-stranded RNA (dsRNA) viruses transcribe RNA plus strands within a common innermost capsid shell. This process requires coordinated efforts by RNA-dependent RNA polymerase (RdRp) together with other capsid proteins and genomic RNA. Here we report the near-atomic resolution structure of the RdRp protein VP2 in complex with its cofactor protein VP4 and genomic RNA within an aquareovirus capsid using 200-kV cryoelectron microscopy and symmetry-mismatch reconstruction. The structure of these capsid proteins enabled us to observe the elaborate nonicosahedral structure within the double-layered icosahedral capsid. Our structure shows that the RdRp complex is anchored at the inner surface of the capsid shell and interacts with genomic dsRNA and four of the five asymmetrically arranged N termini of the capsid shell proteins under the fivefold axis, implying roles for these N termini in virus assembly. The binding site of the RNA end at VP2 is different from the RNA cap binding site identified in the crystal structure of orthoreovirus RdRp λ3, although the structures of VP2 and λ3 are almost identical. A loop, which was thought to separate the RNA template and transcript, interacts with an apical domain of the capsid shell protein, suggesting a mechanism for regulating RdRp replication and transcription. A conserved nucleoside triphosphate binding site was localized in our RdRp cofactor protein VP4 structure, and interactions between the VP4 and the genomic RNA were identified.

  13. Phylogenetic Network Analysis Revealed the Occurrence of Horizontal Gene Transfer of 16S rRNA in the Genus Enterobacter

    PubMed Central

    Sato, Mitsuharu; Miyazaki, Kentaro

    2017-01-01

    Horizontal gene transfer (HGT) is a ubiquitous genetic event in bacterial evolution, but it seldom occurs for genes involved in highly complex supramolecules (or biosystems), which consist of many gene products. The ribosome is one such supramolecule, but several bacteria harbor dissimilar and/or chimeric 16S rRNAs in their genomes, suggesting the occurrence of HGT of this gene. However, we know little about whether the genes actually experience HGT and, if so, the frequency of such a transfer. This is primarily because the methods currently employed for phylogenetic analysis (e.g., neighbor-joining, maximum likelihood, and maximum parsimony) of 16S rRNA genes assume point mutation-driven tree-shape evolution as an evolutionary model, which is intrinsically inappropriate to decipher the evolutionary history for genes driven by recombination. To address this issue, we applied a phylogenetic network analysis, which has been used previously for detection of genetic recombination in homologous alleles, to the 16S rRNA gene. We focused on the genus Enterobacter, whose phylogenetic relationships inferred by multi-locus sequence alignment analysis and 16S rRNA sequences are incompatible. All 10 complete genomic sequences were retrieved from the NCBI database, in which 71 16S rRNA genes were included. Neighbor-joining analysis demonstrated that the genes residing in the same genomes clustered, indicating the occurrence of intragenomic recombination. However, as suggested by the low bootstrap values, evolutionary relationships between the clusters were uncertain. We then applied phylogenetic network analysis to representative sequences from each cluster. We found three ancestral 16S rRNA groups; the others were likely created through recursive recombination between the ancestors and chimeric descendants. Despite the large sequence changes caused by the recombination events, the RNA secondary structures were conserved. Successive intergenomic and intragenomic recombination thus shaped the evolution of 16S rRNA genes in the genus Enterobacter. PMID:29180992

  14. Crystal structure of archaeal ribonuclease P protein Ph1771p from Pyrococcus horikoshii OT3: An archaeal homolog of eukaryotic ribonuclease P protein Rpp29

    PubMed Central

    NUMATA, TOMOYUKI; ISHIMATSU, IKUKO; KAKUTA, YOSHIMITSU; TANAKA, ISAO; KIMURA, MAKOTO

    2004-01-01

    Ribonuclease P (RNase P) is the endonuclease responsible for the removal of 5′ leader sequences from tRNA precursors. The crystal structure of an archaeal RNase P protein, Ph1771p (residues 36–127) from hyperthermophilic archaeon Pyrococcus horikoshii OT3 was determined at 2.0 Å resolution by X-ray crystallography. The structure is composed of four helices (α1–α4) and a six-stranded antiparallel β-sheet (β1–β6) with a protruding β-strand (β7) at the C-terminal region. The strand β7 forms an antiparallel β-sheet by interacting with strand β4 in a symmetry-related molecule, suggesting that strands β4 and β7 could be involved in protein-protein interactions with other RNase P proteins. Structural comparison showed that the β-barrel structure of Ph1771p has a topological resemblance to those of Staphylococcus aureus translational regulator Hfq and Haloarcula marismortui ribosomal protein L21E, suggesting that these RNA binding proteins have a common ancestor and then diverged to specifically bind to their cognate RNAs. The structure analysis as well as structural comparison suggested two possible RNA binding sites in Ph1771p, one being a concave surface formed by terminal α-helices (α1–α4) and β-strand β6, where positively charged residues are clustered. A second possible RNA binding site is at a loop region connecting strands β2 and β3, where conserved hydrophilic residues are exposed to the solvent and interact specifically with sulfate ion. These two potential sites for RNA binding are located in close proximity. The crystal structure of Ph1771p provides insight into the structure and function relationships of archaeal and eukaryotic RNase P. PMID:15317976

  15. Functional 5' UTR mRNA structures in eukaryotic translation regulation and how to find them.

    PubMed

    Leppek, Kathrin; Das, Rhiju; Barna, Maria

    2018-03-01

    RNA molecules can fold into intricate shapes that can provide an additional layer of control of gene expression beyond that of their sequence. In this Review, we discuss the current mechanistic understanding of structures in 5' untranslated regions (UTRs) of eukaryotic mRNAs and the emerging methodologies used to explore them. These structures may regulate cap-dependent translation initiation through helicase-mediated remodelling of RNA structures and higher-order RNA interactions, as well as cap-independent translation initiation through internal ribosome entry sites (IRESs), mRNA modifications and other specialized translation pathways. We discuss known 5' UTR RNA structures and how new structure probing technologies coupled with prospective validation, particularly compensatory mutagenesis, are likely to identify classes of structured RNA elements that shape post-transcriptional control of gene expression and the development of multicellular organisms.

  16. Inverted repeat Alu elements in the human lincRNA-p21 adopt a conserved secondary structure that regulates RNA function

    PubMed Central

    Chillón, Isabel; Pyle, Anna M.

    2016-01-01

    LincRNA-p21 is a long intergenic non-coding RNA (lincRNA) involved in the p53-mediated stress response. We sequenced the human lincRNA-p21 (hLincRNA-p21) and found that it has a single exon that includes inverted repeat Alu elements (IRAlus). Sense and antisense Alu elements fold independently of one another into a secondary structure that is conserved in lincRNA-p21 among primates. Moreover, the structures formed by IRAlus are involved in the localization of hLincRNA-p21 in the nucleus, where hLincRNA-p21 colocalizes with paraspeckles. Our results underscore the importance of IRAlus structures for the function of hLincRNA-p21 during the stress response. PMID:27378782

  17. Small-interfering RNA (siRNA)-based functional micro- and nanostructures for efficient and selective gene silencing.

    PubMed

    Lee, Soo Hyeon; Chung, Bong Hyun; Park, Tae Gwan; Nam, Yoon Sung; Mok, Hyejung

    2012-07-17

    Because of RNA's ability to encode structure and functional information, researchers have fabricated diverse geometric structures from this polymer at the micro- and nanoscale. With their tunable structures, rigidity, and biocompatibility, novel two-dimensional and three-dimensional RNA structures can serve as a fundamental platform for biomedical applications, including engineered tissues, biosensors, and drug delivery vehicles. The discovery of the potential of small-interfering RNA (siRNA) has underscored the applications of RNA-based micro- and nanostructures in medicine. Small-interfering RNA (siRNA), synthetic double-stranded RNA consisting of approximately 21 base pairs, suppresses problematic target genes in a sequence-specific manner via inherent RNA interference (RNAi) processing. As a result, siRNA offers a potential strategy for treatment of many human diseases. However, due to inefficient delivery to cells and off-target effects, the clinical application of therapeutic siRNA has been very challenging. To address these issues, researchers have studied a variety of nanocarrier systems for siRNA delivery. In this Account, we describe several strategies for efficient siRNA delivery and selective gene silencing. We took advantage of facile chemical conjugation and complementary hybridization to design novel siRNA-based micro- and nanostructures. Using chemical crosslinkers and hydrophobic/hydrophilic polymers at the end of siRNA, we produced various RNA-based structures, including siRNA block copolymers, micelles, linear siRNA homopolymers, and microhydrogels. Because of their increased charge density and flexibility compared with conventional siRNA, these micro- and nanostructures can form polyelectrolyte complexes with poorly charged and biocompatible cationic carriers that are both more condensed and more homogenous than the complexes formed in other carrier systems. In addition, the fabricated siRNA-based structures are linked by cleavable disulfide bonds for facile generation of original siRNA in the cytosol and for target-specific gene silencing. These newly developed siRNA-based structures greatly enhance intracellular uptake and gene silencing both in vitro and in vivo, making them promising biomaterials for siRNA therapeutics.

  18. Combining laser microdissection and RNA-seq to chart the transcriptional landscape of fungal development

    PubMed Central

    2012-01-01

    Background During sexual development, filamentous ascomycetes form complex, three-dimensional fruiting bodies for the protection and dispersal of sexual spores. Fruiting bodies contain a number of cell types not found in vegetative mycelium, and these morphological differences are thought to be mediated by changes in gene expression. However, little is known about the spatial distribution of gene expression in fungal development. Here, we used laser microdissection (LM) and RNA-seq to determine gene expression patterns in young fruiting bodies (protoperithecia) and non-reproductive mycelia of the ascomycete Sordaria macrospora. Results Quantitative analysis showed major differences in the gene expression patterns between protoperithecia and total mycelium. Among the genes strongly up-regulated in protoperithecia were the pheromone precursor genes ppg1 and ppg2. The up-regulation was confirmed by fluorescence microscopy of egfp expression under the control of ppg1 regulatory sequences. RNA-seq analysis of protoperithecia from the sterile mutant pro1 showed that many genes that are differentially regulated in these structures are under the genetic control of transcription factor PRO1. Conclusions We have generated transcriptional profiles of young fungal sexual structures using a combination of LM and RNA-seq. This allowed a high spatial resolution and sensitivity, and yielded a detailed picture of gene expression during development. Our data revealed significant differences in gene expression between protoperithecia and non-reproductive mycelia, and showed that the transcription factor PRO1 is involved in the regulation of many genes expressed specifically in sexual structures. The LM/RNA-seq approach will also be relevant to other eukaryotic systems in which multicellular development is investigated. PMID:23016559

  19. Widespread signatures of local mRNA folding structure selection in four Dengue virus serotypes

    PubMed Central

    2015-01-01

    Background It is known that mRNA folding can affect and regulate various gene expression steps both in living organisms and in viruses. Previous studies have recognized functional RNA structures in the genome of the Dengue virus. However, these studies usually focused either on the viral untranslated regions or on very specific and limited regions at the beginning of the coding sequences, in a limited number of strains, and without considering evolutionary selection. Results Here we performed the first large scale comprehensive genomics analysis of selection for local mRNA folding strength in the Dengue virus coding sequences, based on a total of 1,670 genomes and 4 serotypes. Our analysis identified clusters of positions along the coding regions that may undergo a conserved evolutionary selection for strong or weak local folding maintained across different viral variants. Specifically, 53-66 clusters for strong folding and 49-73 clusters for weak folding (depending on serotype) aggregated of positions with a significant conservation of folding energy signals (related to partially overlapping local genomic regions) were recognized. In addition, up to 7% of these positions were found to be conserved in more than 90% of the viral genomes. Although some of the identified positions undergo frequent synonymous / non-synonymous substitutions, the selection for folding strength therein is preserved, and thus cannot be trivially explained based on sequence conservation alone. Conclusions The fact that many of the positions with significant folding related signals are conserved among different Dengue variants suggests that a better understanding of the mRNA structures in the corresponding regions may promote the development of prospective anti- Dengue vaccination strategies. The comparative genomics approach described here can be employed in the future for detecting functional regions in other pathogens with very high mutations rates. PMID:26449467

  20. The role of RNA structure in the interaction of U1A protein with U1 hairpin II RNA

    PubMed Central

    Law, Michael J.; Rice, Andrew J.; Lin, Patti; Laird-Offringa, Ite A.

    2006-01-01

    The N-terminal RNA Recognition Motif (RRM1) of the spliceosomal protein U1A interacting with its target U1 hairpin II (U1hpII) has been used as a paradigm for RRM-containing proteins interacting with their RNA targets. U1A binds to U1hpII via direct interactions with a 7-nucleotide (nt) consensus binding sequence at the 5′ end of a 10-nt loop, and via hydrogen bonds with the closing C–G base pair at the top of the RNA stem. Using surface plasmon resonance (Biacore), we have examined the role of structural features of U1hpII in binding to U1A RRM1. Mutational analysis of the closing base pair suggests it plays a minor role in binding and mainly prevents “breathing” of the loop. Lengthening the stem and nontarget part of the loop suggests that the increased negative charge of the RNA might slightly aid association. However, this is offset by an increase in dissociation, which may be caused by attraction of the RRM to nontarget parts of the RNA. Studies of a single stranded target and RNAs with untethered loops indicate that structure is not very relevant for association but is important for complex stability. In particular, breaking the link between the stem and the 5′ side of the loop greatly increases complex dissociation, presumably by hindering simultaneous contacts between the RRM and stem and loop nucleotides. While binding of U1A to a single stranded target is much weaker than to U1hpII, it occurs with nanomolar affinity, supporting recent evidence that binding of unstructured RNA by U1A has physiological significance. PMID:16738410

  1. The role of RNA structure in the interaction of U1A protein with U1 hairpin II RNA.

    PubMed

    Law, Michael J; Rice, Andrew J; Lin, Patti; Laird-Offringa, Ite A

    2006-07-01

    The N-terminal RNA Recognition Motif (RRM1) of the spliceosomal protein U1A interacting with its target U1 hairpin II (U1hpII) has been used as a paradigm for RRM-containing proteins interacting with their RNA targets. U1A binds to U1hpII via direct interactions with a 7-nucleotide (nt) consensus binding sequence at the 5' end of a 10-nt loop, and via hydrogen bonds with the closing C-G base pair at the top of the RNA stem. Using surface plasmon resonance (Biacore), we have examined the role of structural features of U1hpII in binding to U1A RRM1. Mutational analysis of the closing base pair suggests it plays a minor role in binding and mainly prevents "breathing" of the loop. Lengthening the stem and nontarget part of the loop suggests that the increased negative charge of the RNA might slightly aid association. However, this is offset by an increase in dissociation, which may be caused by attraction of the RRM to nontarget parts of the RNA. Studies of a single stranded target and RNAs with untethered loops indicate that structure is not very relevant for association but is important for complex stability. In particular, breaking the link between the stem and the 5' side of the loop greatly increases complex dissociation, presumably by hindering simultaneous contacts between the RRM and stem and loop nucleotides. While binding of U1A to a single stranded target is much weaker than to U1hpII, it occurs with nanomolar affinity, supporting recent evidence that binding of unstructured RNA by U1A has physiological significance.

  2. Crystal structure, mutational analysis and RNA-dependent ATPase activity of the yeast DEAD-box pre-mRNA splicing factor Prp28

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Jacewicz, Agata; Schwer, Beate; Smith, Paul

    Yeast Prp28 is a DEAD-box pre-mRNA splicing factor implicated in displacing U1 snRNP from the 5' splice site. Here we report that the 588-aa Prp28 protein consists of a trypsin-sensitive 126-aa N-terminal segment (of which aa 1–89 are dispensable for Prp28 function in vivo) fused to a trypsin-resistant C-terminal catalytic domain. Purified recombinant Prp28 and Prp28-(127–588) have an intrinsic RNA-dependent ATPase activity, albeit with a low turnover number. The crystal structure of Prp28-(127–588) comprises two RecA-like domains splayed widely apart. AMPPNP•Mg 2+ is engaged by the proximal domain, with proper and specific contacts from Phe194 and Gln201 (Q motif) tomore » the adenine nucleobase. The triphosphate moiety of AMPPNP•Mg 2+ is not poised for catalysis in the open domain conformation. Guided by the Prp28•AMPPNP structure, and that of the Drosophila Vasa•AMPPNP•Mg 2+•RNA complex, we targeted 20 positions in Prp28 for alanine scanning. ATP-site components Asp341 and Glu342 (motif II) and Arg527 and Arg530 (motif VI) and RNA-site constituent Arg476 (motif Va) are essential for Prp28 activity in vivo. Synthetic lethality of double-alanine mutations highlighted functionally redundant contacts in the ATP-binding (Phe194-Gln201, Gln201-Asp502) and RNA-binding (Arg264-Arg320) sites. As a result, overexpression of defective ATP-site mutants, but not defective RNA-site mutants, elicited severe dominant-negative growth defects.« less

  3. Crystal structure, mutational analysis and RNA-dependent ATPase activity of the yeast DEAD-box pre-mRNA splicing factor Prp28

    DOE PAGES

    Jacewicz, Agata; Schwer, Beate; Smith, Paul; ...

    2014-10-10

    Yeast Prp28 is a DEAD-box pre-mRNA splicing factor implicated in displacing U1 snRNP from the 5' splice site. Here we report that the 588-aa Prp28 protein consists of a trypsin-sensitive 126-aa N-terminal segment (of which aa 1–89 are dispensable for Prp28 function in vivo) fused to a trypsin-resistant C-terminal catalytic domain. Purified recombinant Prp28 and Prp28-(127–588) have an intrinsic RNA-dependent ATPase activity, albeit with a low turnover number. The crystal structure of Prp28-(127–588) comprises two RecA-like domains splayed widely apart. AMPPNP•Mg 2+ is engaged by the proximal domain, with proper and specific contacts from Phe194 and Gln201 (Q motif) tomore » the adenine nucleobase. The triphosphate moiety of AMPPNP•Mg 2+ is not poised for catalysis in the open domain conformation. Guided by the Prp28•AMPPNP structure, and that of the Drosophila Vasa•AMPPNP•Mg 2+•RNA complex, we targeted 20 positions in Prp28 for alanine scanning. ATP-site components Asp341 and Glu342 (motif II) and Arg527 and Arg530 (motif VI) and RNA-site constituent Arg476 (motif Va) are essential for Prp28 activity in vivo. Synthetic lethality of double-alanine mutations highlighted functionally redundant contacts in the ATP-binding (Phe194-Gln201, Gln201-Asp502) and RNA-binding (Arg264-Arg320) sites. As a result, overexpression of defective ATP-site mutants, but not defective RNA-site mutants, elicited severe dominant-negative growth defects.« less

  4. On topological RNA interaction structures.

    PubMed

    Qin, Jing; Reidys, Christian M

    2013-07-01

    Recently a folding algorithm of topological RNA pseudoknot structures was presented in Reidys et al. (2011). This algorithm folds single-stranded γ-structures, that is, RNA structures composed by distinct motifs of bounded topological genus. In this article, we set the theoretical foundations for the folding of the two backbone analogues of γ structures: the RNA γ-interaction structures. These are RNA-RNA interaction structures that are constructed by a finite number of building blocks over two backbones having genus at most γ. Combinatorial properties of γ-interaction structures are of practical interest since they have direct implications for the folding of topological interaction structures. We compute the generating function of γ-interaction structures and show that it is algebraic, which implies that the numbers of interaction structures can be computed recursively. We obtain simple asymptotic formulas for 0- and 1-interaction structures. The simplest class of interaction structures are the 0-interaction structures, which represent the two backbone analogues of secondary structures.

  5. Archaeal homologs of eukaryotic methylation guide small nucleolar RNAs: lessons from the Pyrococcus genomes.

    PubMed

    Gaspin, C; Cavaillé, J; Erauso, G; Bachellerie, J P

    2000-04-07

    Ribose methylation is a prevalent type of nucleotide modification in rRNA. Eukaryotic rRNAs display a complex pattern of ribose methylations, amounting to 55 in yeast Saccharomyces cerevisiae and about 100 in vertebrates. Ribose methylations of eukaryotic rRNAs are each guided by a cognate small RNA, belonging to the family of box C/D antisense snoRNAs, through transient formation of a specific base-pairing at the rRNA modification site. In prokaryotes, the pattern of rRNA ribose methylations has been fully characterized in a single species so far, Escherichia coli, which contains only four ribose methylated rRNA nucleotides. However, the hyperthermophile archaeon Sulfolobus solfataricus contains, like eukaryotes, a large number of (yet unmapped) rRNA ribose methylations and homologs of eukaryotic box C/D small nucleolar ribonuclear proteins have been identified in archaeal genomes. We have therefore searched archaeal genomes for potential homologs of eukaryotic methylation guide small nucleolar RNAs, by combining searches for structured motifs with homology searches. We have identified a family of 46 small RNAs, conserved in the genomes of three hyperthermophile Pyrococcus species, which we have experimentally characterized in Pyrococcus abyssi. The Pyrococcus small RNAs, the first reported homologs of methylation guide small nucleolar RNAs in organisms devoid of a nucleus, appear as a paradigm of minimalist box C/D antisense RNAs. They differ from their eukaryotic homologs by their outstanding structural homogeneity, extended consensus box motifs and the quasi-systematic presence of two (instead of one) rRNA antisense elements. Remarkably, for each small RNA the two antisense elements always match rRNA sequences close to each other in rRNA structure, suggesting an important role in rRNA folding. Only a few of the predicted P. abyssi rRNA ribose methylations have been detected so far. Further analysis of these archaeal small RNAs could provide new insights into the origin and functions of methylation guide small nucleolar RNAs and illuminate the still elusive role of rRNA ribose methylations. Copyright 2000 Academic Press.

  6. RNA Bricks—a database of RNA 3D motifs and their interactions

    PubMed Central

    Chojnowski, Grzegorz; Waleń, Tomasz; Bujnicki, Janusz M.

    2014-01-01

    The RNA Bricks database (http://iimcb.genesilico.pl/rnabricks), stores information about recurrent RNA 3D motifs and their interactions, found in experimentally determined RNA structures and in RNA–protein complexes. In contrast to other similar tools (RNA 3D Motif Atlas, RNA Frabase, Rloom) RNA motifs, i.e. ‘RNA bricks’ are presented in the molecular environment, in which they were determined, including RNA, protein, metal ions, water molecules and ligands. All nucleotide residues in RNA bricks are annotated with structural quality scores that describe real-space correlation coefficients with the electron density data (if available), backbone geometry and possible steric conflicts, which can be used to identify poorly modeled residues. The database is also equipped with an algorithm for 3D motif search and comparison. The algorithm compares spatial positions of backbone atoms of the user-provided query structure and of stored RNA motifs, without relying on sequence or secondary structure information. This enables the identification of local structural similarities among evolutionarily related and unrelated RNA molecules. Besides, the search utility enables searching ‘RNA bricks’ according to sequence similarity, and makes it possible to identify motifs with modified ribonucleotide residues at specific positions. PMID:24220091

  7. Crystallization and X-ray diffraction analysis of an 'all-locked' nucleic acid duplex derived from a tRNA(Ser) microhelix.

    PubMed

    Behling, Katja; Eichert, André; Fürste, Jens P; Betzel, Christian; Erdmann, Volker A; Förster, Charlotte

    2009-08-01

    Modified nucleic acids are of great interest with respect to their nuclease resistance and enhanced thermostability. In therapeutical and diagnostic applications, such molecules can substitute for labile natural nucleic acids that are targeted against particular diseases or applied in gene therapy. The so-called 'locked nucleic acids' contain modified sugar moieties such as 2'-O,4'-C-methylene-bridged beta-D-ribofuranose and are known to be very stable nucleic acid derivatives. The structure of locked nucleic acids in single or multiple LNA-substituted natural nucleic acids and in LNA-DNA or LNA-RNA heteroduplexes has been well investigated, but the X-ray structure of an ;all-locked' nucleic acid double helix has not been described to date. Here, the crystallization and X-ray diffraction data analysis of an 'all-locked' nucleic acid helix, which was designed as an LNA originating from a tRNA(Ser) microhelix RNA structure, is presented. The crystals belonged to space group C2, with unit-cell parameters a = 77.91, b = 40.74, c = 30.06 A, beta = 91.02 degrees . A high-resolution and a low-resolution data set were recorded, with the high-resolution data showing diffraction to 1.9 A resolution. The crystals contained two double helices per asymmetric unit, with a Matthews coefficient of 2.48 A(3) Da(-1) and a solvent content of 66.49% for the merged data.

  8. Exploring the molecular basis of dsRNA recognition by NS1 protein of influenza A virus using molecular dynamics simulation and free energy calculation.

    PubMed

    Pan, Dabo; Sun, Huijun; Shen, Yulin; Liu, Huanxiang; Yao, Xiaojun

    2011-12-01

    The frequent outbreak of influenza pandemic and the limited available anti-influenza drugs highlight the urgent need for the development of new antiviral drugs. The dsRNA-binding surface of nonstructural protein 1 of influenza A virus (NS1A) is a promising target. The detailed understanding of NS1A-dsRNA interaction will be valuable for structure-based anti-influenza drug discovery. To characterize and explore the key interaction features between dsRNA and NS1A, molecular dynamics simulation combined with MM-GBSA calculations were performed. Based on the MM-GBSA calculations, we find that the intermolecular van der Waals interaction and the nonpolar solvation term provide the main driving force for the binding process. Meanwhile, 17 key residues from NS1A were identified to be responsible for the dsRNA binding. Compared with the wild type NS1A, all the studied mutants S42A, T49A, R38A, R35AR46A have obvious reduced binding free energies with dsRNA reflecting in the reduction of the polar and/or nonpolar interactions. In addition, the structural and energy analysis indicate the mutations have a small effect to the backbone structures but the loss of side chain interactions is responsible for the decrease of the binding affinity. The uncovering of NS1A-dsRNA recognition mechanism will provide some useful insights and new chances for the development of anti-influenza drugs. Copyright © 2011 Elsevier B.V. All rights reserved.

  9. High-Resolution Melt Analysis for Rapid Comparison of Bacterial Community Compositions

    PubMed Central

    Hjelmsø, Mathis Hjort; Hansen, Lars Hestbjerg; Bælum, Jacob; Feld, Louise; Holben, William E.

    2014-01-01

    In the study of bacterial community composition, 16S rRNA gene amplicon sequencing is today among the preferred methods of analysis. The cost of nucleotide sequence analysis, including requisite computational and bioinformatic steps, however, takes up a large part of many research budgets. High-resolution melt (HRM) analysis is the study of the melt behavior of specific PCR products. Here we describe a novel high-throughput approach in which we used HRM analysis targeting the 16S rRNA gene to rapidly screen multiple complex samples for differences in bacterial community composition. We hypothesized that HRM analysis of amplified 16S rRNA genes from a soil ecosystem could be used as a screening tool to identify changes in bacterial community structure. This hypothesis was tested using a soil microcosm setup exposed to a total of six treatments representing different combinations of pesticide and fertilization treatments. The HRM analysis identified a shift in the bacterial community composition in two of the treatments, both including the soil fumigant Basamid GR. These results were confirmed with both denaturing gradient gel electrophoresis (DGGE) analysis and 454-based 16S rRNA gene amplicon sequencing. HRM analysis was shown to be a fast, high-throughput technique that can serve as an effective alternative to gel-based screening methods to monitor microbial community composition. PMID:24610853

  10. Composition and Dynamics of Bacterial Communities of a Drinking Water Supply System as Assessed by RNA- and DNA-Based 16S rRNA Gene Fingerprinting

    PubMed Central

    Eichler, Stefan; Christen, Richard; Höltje, Claudia; Westphal, Petra; Bötel, Julia; Brettar, Ingrid; Mehling, Arndt; Höfle, Manfred G.

    2006-01-01

    Bacterial community dynamics of a whole drinking water supply system (DWSS) were studied from source to tap. Raw water for this DWSS is provided by two reservoirs with different water characteristics in the Harz mountains of Northern Germany. Samples were taken after different steps of treatment of raw water (i.e., flocculation, sand filtration, and chlorination) and at different points along the supply system to the tap. RNA and DNA were extracted from the sampled water. The 16S rRNA or its genes were partially amplified by reverse transcription-PCR or PCR and analyzed by single-strand conformation polymorphism community fingerprints. The bacterial community structures of the raw water samples from the two reservoirs were very different, but no major changes of these structures occurred after flocculation and sand filtration. Chlorination of the processed raw water strongly affected bacterial community structure, as reflected by the RNA-based fingerprints. This effect was less pronounced for the DNA-based fingerprints. After chlorination, the bacterial community remained rather constant from the storage containers to the tap. Furthermore, the community structure of the tap water did not change substantially for several months. Community composition was assessed by sequencing of abundant bands and phylogenetic analysis of the sequences obtained. The taxonomic compositions of the bacterial communities from both reservoirs were very different at the species level due to their different limnologies. On the other hand, major taxonomic groups, well known to occur in freshwater, such as Alphaproteobacteria, Betaproteobacteria, and Bacteroidetes, were found in both reservoirs. Significant differences in the detection of the major groups were observed between DNA-based and RNA-based fingerprints irrespective of the reservoir. Chlorination of the drinking water seemed to promote growth of nitrifying bacteria. Detailed analysis of the community dynamics of the whole DWSS revealed a significant influence of both source waters on the overall composition of the drinking water microflora and demonstrated the relevance of the raw water microflora for the drinking water microflora provided to the end user. PMID:16517632

  11. RNA folding: structure prediction, folding kinetics and ion electrostatics.

    PubMed

    Tan, Zhijie; Zhang, Wenbing; Shi, Yazhou; Wang, Fenghua

    2015-01-01

    Beyond the "traditional" functions such as gene storage, transport and protein synthesis, recent discoveries reveal that RNAs have important "new" biological functions including the RNA silence and gene regulation of riboswitch. Such functions of noncoding RNAs are strongly coupled to the RNA structures and proper structure change, which naturally leads to the RNA folding problem including structure prediction and folding kinetics. Due to the polyanionic nature of RNAs, RNA folding structure, stability and kinetics are strongly coupled to the ion condition of solution. The main focus of this chapter is to review the recent progress in the three major aspects in RNA folding problem: structure prediction, folding kinetics and ion electrostatics. This chapter will introduce both the recent experimental and theoretical progress, while emphasize the theoretical modelling on the three aspects in RNA folding.

  12. Efficient Translation of Pelargonium line pattern virus RNAs Relies on a TED-Like 3´-Translational Enhancer that Communicates with the Corresponding 5´-Region through a Long-Distance RNA-RNA Interaction

    PubMed Central

    Blanco-Pérez, Marta; Pérez-Cañamás, Miryam; Ruiz, Leticia; Hernández, Carmen

    2016-01-01

    Cap-independent translational enhancers (CITEs) have been identified at the 3´-terminal regions of distinct plant positive-strand RNA viruses belonging to families Tombusviridae and Luteoviridae. On the bases of their structural and/or functional requirements, at least six classes of CITEs have been defined whose distribution does not correlate with taxonomy. The so-called TED class has been relatively under-studied and its functionality only confirmed in the case of Satellite tobacco necrosis virus, a parasitic subviral agent. The 3´-untranslated region of the monopartite genome of Pelargonium line pattern virus (PLPV), the recommended type member of a tentative new genus (Pelarspovirus) in the family Tombusviridae, was predicted to contain a TED-like CITE. Similar CITEs can be anticipated in some other related viruses though none has been experimentally verified. Here, in the first place, we have performed a reassessment of the structure of the putative PLPV-TED through in silico predictions and in vitro SHAPE analysis with the full-length PLPV genome, which has indicated that the presumed TED element is larger than previously proposed. The extended conformation of the TED is strongly supported by the pattern of natural sequence variation, thus providing comparative structural evidence in support of the structural data obtained by in silico and in vitro approaches. Next, we have obtained experimental evidence demonstrating the in vivo activity of the PLPV-TED in the genomic (g) RNA, and also in the subgenomic (sg) RNA that the virus produces to express 3´-proximal genes. Besides other structural features, the results have highlighted the key role of long-distance kissing-loop interactions between the 3´-CITE and 5´-proximal hairpins for gRNA and sgRNA translation. Bioassays of CITE mutants have confirmed the importance of the identified 5´-3´ RNA communication for viral infectivity and, moreover, have underlined the strong evolutionary constraints that may operate on genome stretches with both regulatory and coding functions. PMID:27043436

  13. Efficient Translation of Pelargonium line pattern virus RNAs Relies on a TED-Like 3´-Translational Enhancer that Communicates with the Corresponding 5´-Region through a Long-Distance RNA-RNA Interaction.

    PubMed

    Blanco-Pérez, Marta; Pérez-Cañamás, Miryam; Ruiz, Leticia; Hernández, Carmen

    2016-01-01

    Cap-independent translational enhancers (CITEs) have been identified at the 3´-terminal regions of distinct plant positive-strand RNA viruses belonging to families Tombusviridae and Luteoviridae. On the bases of their structural and/or functional requirements, at least six classes of CITEs have been defined whose distribution does not correlate with taxonomy. The so-called TED class has been relatively under-studied and its functionality only confirmed in the case of Satellite tobacco necrosis virus, a parasitic subviral agent. The 3´-untranslated region of the monopartite genome of Pelargonium line pattern virus (PLPV), the recommended type member of a tentative new genus (Pelarspovirus) in the family Tombusviridae, was predicted to contain a TED-like CITE. Similar CITEs can be anticipated in some other related viruses though none has been experimentally verified. Here, in the first place, we have performed a reassessment of the structure of the putative PLPV-TED through in silico predictions and in vitro SHAPE analysis with the full-length PLPV genome, which has indicated that the presumed TED element is larger than previously proposed. The extended conformation of the TED is strongly supported by the pattern of natural sequence variation, thus providing comparative structural evidence in support of the structural data obtained by in silico and in vitro approaches. Next, we have obtained experimental evidence demonstrating the in vivo activity of the PLPV-TED in the genomic (g) RNA, and also in the subgenomic (sg) RNA that the virus produces to express 3´-proximal genes. Besides other structural features, the results have highlighted the key role of long-distance kissing-loop interactions between the 3´-CITE and 5´-proximal hairpins for gRNA and sgRNA translation. Bioassays of CITE mutants have confirmed the importance of the identified 5´-3´ RNA communication for viral infectivity and, moreover, have underlined the strong evolutionary constraints that may operate on genome stretches with both regulatory and coding functions.

  14. Structure of Hepatitis C Virus Polymerase in Complex with Primer-Template RNA

    PubMed Central

    Murakami, Eisuke; Lam, Angela M.; Grice, Rena L.; Du, Jinfa; Sofia, Michael J.; Furman, Philip A.; Otto, Michael J.

    2012-01-01

    The replication of the hepatitis C viral (HCV) genome is accomplished by the NS5B RNA-dependent RNA polymerase (RdRp), for which mechanistic understanding and structure-guided drug design efforts have been hampered by its propensity to crystallize in a closed, polymerization-incompetent state. The removal of an autoinhibitory β-hairpin loop from genotype 2a HCV NS5B increases de novo RNA synthesis by >100-fold, promotes RNA binding, and facilitated the determination of the first crystallographic structures of HCV polymerase in complex with RNA primer-template pairs. These crystal structures demonstrate the structural realignment required for primer-template recognition and elongation, provide new insights into HCV RNA synthesis at the molecular level, and may prove useful in the structure-based design of novel antiviral compounds. Additionally, our approach for obtaining the RNA primer-template-bound structure of HCV polymerase may be generally applicable to solving RNA-bound complexes for other viral RdRps that contain similar regulatory β-hairpin loops, including bovine viral diarrhea virus, dengue virus, and West Nile virus. PMID:22496223

  15. Exploration of RNA structure spaces

    NASA Technical Reports Server (NTRS)

    Fox, G. E.

    1991-01-01

    In order to understand the structure of real structure spaces, we are studying the 5S rRNA structure space experimentally. A plasmid containing a synthetic 5S rRNA gene, two rRNA promoters, and transcription terminators has been assembled. Assays are conducted to determine if the foreign 5S rRNA is expressed, and to see whether or not it is incorporated into ribosomes. Evolutionary competition is used to determine the relative fitness of strains containing the foreign 5S rRNA and a control 5S rRNA. By using site directed mutagenesis, a number of mutants can be made in order to study the boundaries of the structure space and how sharply defined they are. By making similar studies in the vicinity of structure space, it will be possible to determine how homogeneous the 5S rRNA structure space is. Useable experimental protocols have been developed, and a number of mutants have already been studied. Initial results suggest an explanation of why single stranded regions of the RNA are less subject to mutation than double stranded regions.

  16. INFO-RNA--a fast approach to inverse RNA folding.

    PubMed

    Busch, Anke; Backofen, Rolf

    2006-08-01

    The structure of RNA molecules is often crucial for their function. Therefore, secondary structure prediction has gained much interest. Here, we consider the inverse RNA folding problem, which means designing RNA sequences that fold into a given structure. We introduce a new algorithm for the inverse folding problem (INFO-RNA) that consists of two parts; a dynamic programming method for good initial sequences and a following improved stochastic local search that uses an effective neighbor selection method. During the initialization, we design a sequence that among all sequences adopts the given structure with the lowest possible energy. For the selection of neighbors during the search, we use a kind of look-ahead of one selection step applying an additional energy-based criterion. Afterwards, the pre-ordered neighbors are tested using the actual optimization criterion of minimizing the structure distance between the target structure and the mfe structure of the considered neighbor. We compared our algorithm to RNAinverse and RNA-SSD for artificial and biological test sets. Using INFO-RNA, we performed better than RNAinverse and in most cases, we gained better results than RNA-SSD, the probably best inverse RNA folding tool on the market. www.bioinf.uni-freiburg.de?Subpages/software.html.

  17. Evolution of RNA-Protein Interactions: Non-Specific Binding Led to RNA Splicing Activity of Fungal Mitochondrial Tyrosyl-tRNA Synthetases

    PubMed Central

    Lamech, Lilian T.; Mallam, Anna L.; Lambowitz, Alan M.

    2014-01-01

    The Neurospora crassa mitochondrial tyrosyl-tRNA synthetase (mtTyrRS; CYT-18 protein) evolved a new function as a group I intron splicing factor by acquiring the ability to bind group I intron RNAs and stabilize their catalytically active RNA structure. Previous studies showed: (i) CYT-18 binds group I introns by using both its N-terminal catalytic domain and flexibly attached C-terminal anticodon-binding domain (CTD); and (ii) the catalytic domain binds group I introns specifically via multiple structural adaptations that occurred during or after the divergence of Peziomycotina and Saccharomycotina. However, the function of the CTD and how it contributed to the evolution of splicing activity have been unclear. Here, small angle X-ray scattering analysis of CYT-18 shows that both CTDs of the homodimeric protein extend outward from the catalytic domain, but move inward to bind opposite ends of a group I intron RNA. Biochemical assays show that the isolated CTD of CYT-18 binds RNAs non-specifically, possibly contributing to its interaction with the structurally different ends of the intron RNA. Finally, we find that the yeast mtTyrRS, which diverged from Pezizomycotina fungal mtTyrRSs prior to the evolution of splicing activity, binds group I intron and other RNAs non-specifically via its CTD, but lacks further adaptations needed for group I intron splicing. Our results suggest a scenario of constructive neutral (i.e., pre-adaptive) evolution in which an initial non-specific interaction between the CTD of an ancestral fungal mtTyrRS and a self-splicing group I intron was “fixed” by an intron RNA mutation that resulted in protein-dependent splicing. Once fixed, this interaction could be elaborated by further adaptive mutations in both the catalytic domain and CTD that enabled specific binding of group I introns. Our results highlight a role for non-specific RNA binding in the evolution of RNA-binding proteins. PMID:25536042

  18. Evolution of RNA-protein interactions: non-specific binding led to RNA splicing activity of fungal mitochondrial tyrosyl-tRNA synthetases.

    PubMed

    Lamech, Lilian T; Mallam, Anna L; Lambowitz, Alan M

    2014-12-01

    The Neurospora crassa mitochondrial tyrosyl-tRNA synthetase (mtTyrRS; CYT-18 protein) evolved a new function as a group I intron splicing factor by acquiring the ability to bind group I intron RNAs and stabilize their catalytically active RNA structure. Previous studies showed: (i) CYT-18 binds group I introns by using both its N-terminal catalytic domain and flexibly attached C-terminal anticodon-binding domain (CTD); and (ii) the catalytic domain binds group I introns specifically via multiple structural adaptations that occurred during or after the divergence of Peziomycotina and Saccharomycotina. However, the function of the CTD and how it contributed to the evolution of splicing activity have been unclear. Here, small angle X-ray scattering analysis of CYT-18 shows that both CTDs of the homodimeric protein extend outward from the catalytic domain, but move inward to bind opposite ends of a group I intron RNA. Biochemical assays show that the isolated CTD of CYT-18 binds RNAs non-specifically, possibly contributing to its interaction with the structurally different ends of the intron RNA. Finally, we find that the yeast mtTyrRS, which diverged from Pezizomycotina fungal mtTyrRSs prior to the evolution of splicing activity, binds group I intron and other RNAs non-specifically via its CTD, but lacks further adaptations needed for group I intron splicing. Our results suggest a scenario of constructive neutral (i.e., pre-adaptive) evolution in which an initial non-specific interaction between the CTD of an ancestral fungal mtTyrRS and a self-splicing group I intron was "fixed" by an intron RNA mutation that resulted in protein-dependent splicing. Once fixed, this interaction could be elaborated by further adaptive mutations in both the catalytic domain and CTD that enabled specific binding of group I introns. Our results highlight a role for non-specific RNA binding in the evolution of RNA-binding proteins.

  19. Revisiting the structure/function relationships of H/ACA(-like) RNAs: a unified model for Euryarchaea and Crenarchaea

    PubMed Central

    Toffano-Nioche, Claire; Gautheret, Daniel; Leclerc, Fabrice

    2015-01-01

    A structural and functional classification of H/ACA and H/ACA-like motifs is obtained from the analysis of the H/ACA guide RNAs which have been identified previously in the genomes of Euryarchaea (Pyrococcus) and Crenarchaea (Pyrobaculum). A unified structure/function model is proposed based on the common structural determinants shared by H/ACA and H/ACA-like motifs in both Euryarchaea and Crenarchaea. Using a computational approach, structural and energetic rules for the guide:target RNA-RNA interactions are derived from structural and functional data on the H/ACA RNP particles. H/ACA(-like) motifs found in Pyrococcus are evaluated through the classification and their biological relevance is discussed. Extra-ribosomal targets found in both Pyrococcus and Pyrobaculum might support the hypothesis of a gene regulation mediated by H/ACA(-like) guide RNAs in archaea. PMID:26240384

  20. Automated and fast building of three-dimensional RNA structures.

    PubMed

    Zhao, Yunjie; Huang, Yangyu; Gong, Zhou; Wang, Yanjie; Man, Jianfen; Xiao, Yi

    2012-01-01

    Building tertiary structures of non-coding RNA is required to understand their functions and design new molecules. Current algorithms of RNA tertiary structure prediction give satisfactory accuracy only for small size and simple topology and many of them need manual manipulation. Here, we present an automated and fast program, 3dRNA, for RNA tertiary structure prediction with reasonable accuracy for RNAs of larger size and complex topology.

  1. Paraspeckles: nuclear bodies built on long noncoding RNA

    PubMed Central

    Bond, Charles S.

    2009-01-01

    Paraspeckles are ribonucleoprotein bodies found in the interchromatin space of mammalian cell nuclei. These structures play a role in regulating the expression of certain genes in differentiated cells by nuclear retention of RNA. The core paraspeckle proteins (PSF/SFPQ, P54NRB/NONO, and PSPC1 [paraspeckle protein 1]) are members of the DBHS (Drosophila melanogaster behavior, human splicing) family. These proteins, together with the long nonprotein-coding RNA NEAT1 (MEN-ϵ/β), associate to form paraspeckles and maintain their integrity. Given the large numbers of long noncoding transcripts currently being discovered through whole transcriptome analysis, paraspeckles may be a paradigm for a class of subnuclear bodies formed around long noncoding RNA. PMID:19720872

  2. A movie of the RNA polymerase nucleotide addition cycle.

    PubMed

    Brueckner, Florian; Ortiz, Julio; Cramer, Patrick

    2009-06-01

    During gene transcription, RNA polymerase (Pol) passes through repetitive cycles of adding a nucleotide to the growing mRNA chain. Here we obtained a movie of the nucleotide addition cycle by combining structural information on different functional states of the Pol II elongation complex (EC). The movie illustrates the two-step loading of the nucleoside triphosphate (NTP) substrate, closure of the active site for catalytic nucleotide incorporation, and the presumed two-step translocation of DNA and RNA, which is accompanied by coordinated conformational changes in the polymerase bridge helix and trigger loop. The movie facilitates teaching and a mechanistic analysis of transcription and can be downloaded from http://www.lmb.uni-muenchen.de/cramer/pr-materials.

  3. Systematic discovery of Xist RNA binding proteins

    PubMed Central

    Chu, Ci; Zhang, Qiangfeng Cliff; da Rocha, Simão Teixeira; Flynn, Ryan A.; Bharadwaj, Maheetha; Calabrese, J. Mauro; Magnuson, Terry; Heard, Edith; Chang, Howard Y.

    2015-01-01

    Summary Noncoding RNAs (ncRNAs) function with associated proteins to effect complex structural and regulatory outcomes. To reveal the composition and dynamics of specific noncoding RNA- protein complexes (RNPs) in vivo, we developed comprehensive identification of RNA-binding proteins by mass spectrometry (ChIRP-MS). ChIRP-MS analysis of four ncRNAs captures key protein interactors, including a U1-specific link to the 3′ RNA processing machinery. Xist, an essential lncRNA for X-chromosome inactivation (XCI), interacts with 81 proteins from chromatin modification, nuclear matrix, and RNA remodeling pathways. The Xist RNA-protein particle assembles in two steps coupled with the transition from pluripotency to differentiation. Specific interactors include HnrnpK that participates in Xist-mediated gene silencing and histone modifications, but not Xist localization and Drosophila Split ends homolog Spen that interacts via the A-repeat domain of Xist and is required for gene silencing. Thus, Xist lncRNA engages with proteins in a modular and developmentally controlled manner to coordinate chromatin spreading and silencing. PMID:25843628

  4. Trans-activation of the Tetrahymena group I intron ribozyme via a non-native RNA-RNA interaction.

    PubMed Central

    Ikawa, Y; Shiraishi, H; Inoue, T

    1999-01-01

    The peripheral P2.1 domain of the Tetrahymena group I intron ribozyme has been shown to be non-essential for splicing. We found, however, that separately prepared P2.1 RNA efficiently accelerates the 3' splice-site-specific hydrolysis reaction of a mutant ribozyme lacking both P2.1 and its upstream region in trans. We report here the unusual properties of this trans-activation. Compensatory mutational analysis revealed that non-native long-range base-pairings between the loop region of P2.1 RNA and L5c region of the mutant ribozyme are needed for the activation in spite of the fact that P2.1 forms base-pairings with P9.1 in the Tetrahymena ribozyme. The trans -activation depends on the non-native RNA-RNA interaction together with the higher order structure of P2.1 RNA. This activation is unique among the known trans-activations that utilize native tertiary interactions or RNA chaperons. PMID:10075996

  5. Recognition of RNA by amide modified backbone nucleic acids: molecular dynamics simulations of DNA-RNA hybrids in aqueous solution.

    PubMed

    Nina, Mafalda; Fonné-Pfister, Raymonde; Beaudegnies, Renaud; Chekatt, Habiba; Jung, Pierre M J; Murphy-Kessabi, Fiona; De Mesmaeker, Alain; Wendeborn, Sebastian

    2005-04-27

    Thermodynamic and structural properties of a chemically modified DNA-RNA hybrid in which a phosphodiester linkage is replaced by a neutral amide-3 linkage (3'-CH(2)-CONH-5') were investigated using UV melting experiments, molecular dynamics simulations in explicit water, and continuum solvent models. van't Hoff analysis of the experimental UV melting curves suggests that the significant increase of the thermodynamic stability of a 15-mer DNA-RNA with seven alternated amide-3 modifications (+11 degrees C) is mainly due to an increased binding enthalpy. To further evaluate the origin in the observed affinities differences, the electrostatic contribution to the binding free energy was calculated by solving the Poisson-Boltzmann equation numerically. The nonelectrostatic contribution was estimated as the product of a hydrophobic surface tension coefficient and the surface area that is buried upon double strand formation. Structures were taken from 10 ns molecular dynamics simulations computed in a consistent fashion using explicit solvent, counterions, and the particle-mesh Ewald procedure. The present preliminary thermodynamic study suggests that the favorable binding free energy of the amide-3 DNA single strand to the complementary RNA is equally driven by electrostatic and nonpolar contributions to the binding compared to their natural analogues. In addition, molecular dynamics simulations in explicit water were performed on an amide-3 DNA single strand and the corresponding natural DNA. Results from the conformations cluster analysis of the simulated amide-3 DNA single strand ensembles suggest that the 25% of the population sampled within 10 ns has a pre-organized conformation where the sugar C3' endo pucker is favored at the 3'-flanking nucleotides. These structural and thermodynamic features contribute to the understanding of the observed increased affinities of the amide-3 DNA-RNA hybrids at the microscopic level.

  6. Quantitation of base substitutions in eukaryotic 5S rRNA: selection for the maintenance of RNA secondary structure.

    PubMed

    Curtiss, W C; Vournakis, J N

    1984-01-01

    Eukaryotic 5S rRNA sequences from 34 diverse species were compared by the following method: (1) The sequences were aligned; (2) the positions of substitutions were located by comparison of all possible pairs of sequences; (3) the substitution sites were mapped to an assumed general base pairing model; and (4) the R-Y model of base stacking was used to study stacking pattern relationships in the structure. An analysis of the sequence and structure variability in each region of the molecule is presented. It was found that the degree of base substitution varies over a wide range, from absolute conservation to occurrence of over 90% of the possible observable substitutions. The substitutions are located primarily in stem regions of the 5S rRNA secondary structure. More than 88% of the substitutions in helical regions maintain base pairing. The disruptive substitutions are primarily located at the edges of helical regions, resulting in shortening of the helical regions and lengthening of the adjacent nonpaired regions. Base stacking patterns determined by the R-Y model are mapped onto the general secondary structure. Intrastrand and interstrand stacking could stabilize alternative coaxial structures and limit the conformational flexibility of nonpaired regions. Two short contiguous regions are 100% conserved in all species. This may reflect evolutionary constraints imposed at the DNA level by the requirement for binding of a 5S gene transcription initiation factor during gene expression.

  7. Discovery of functional elements in 12 Drosophila genomes using evolutionary signatures

    PubMed Central

    Stark, Alexander; Lin, Michael F.; Kheradpour, Pouya; Pedersen, Jakob S.; Parts, Leopold; Carlson, Joseph W.; Crosby, Madeline A.; Rasmussen, Matthew D.; Roy, Sushmita; Deoras, Ameya N.; Ruby, J. Graham; Brennecke, Julius; Hodges, Emily; Hinrichs, Angie S.; Caspi, Anat; Paten, Benedict; Park, Seung-Won; Han, Mira V.; Maeder, Morgan L.; Polansky, Benjamin J.; Robson, Bryanne E.; Aerts, Stein; van Helden, Jacques; Hassan, Bassem; Gilbert, Donald G.; Eastman, Deborah A.; Rice, Michael; Weir, Michael; Hahn, Matthew W.; Park, Yongkyu; Dewey, Colin N.; Pachter, Lior; Kent, W. James; Haussler, David; Lai, Eric C.; Bartel, David P.; Hannon, Gregory J.; Kaufman, Thomas C.; Eisen, Michael B.; Clark, Andrew G.; Smith, Douglas; Celniker, Susan E.; Gelbart, William M.; Kellis, Manolis

    2008-01-01

    Sequencing of multiple related species followed by comparative genomics analysis constitutes a powerful approach for the systematic understanding of any genome. Here, we use the genomes of 12 Drosophila species for the de novo discovery of functional elements in the fly. Each type of functional element shows characteristic patterns of change, or ‘evolutionary signatures’, dictated by its precise selective constraints. Such signatures enable recognition of new protein-coding genes and exons, spurious and incorrect gene annotations, and numerous unusual gene structures, including abundant stop-codon readthrough. Similarly, we predict non-protein-coding RNA genes and structures, and new microRNA (miRNA) genes. We provide evidence of miRNA processing and functionality from both hairpin arms and both DNA strands. We identify several classes of pre- and post-transcriptional regulatory motifs, and predict individual motif instances with high confidence. We also study how discovery power scales with the divergence and number of species compared, and we provide general guidelines for comparative studies. PMID:17994088

  8. Probing RNA Native Conformational Ensembles with Structural Constraints.

    PubMed

    Fonseca, Rasmus; van den Bedem, Henry; Bernauer, Julie

    2016-05-01

    Noncoding ribonucleic acids (RNA) play a critical role in a wide variety of cellular processes, ranging from regulating gene expression to post-translational modification and protein synthesis. Their activity is modulated by highly dynamic exchanges between three-dimensional conformational substates, which are difficult to characterize experimentally and computationally. Here, we present an innovative, entirely kinematic computational procedure to efficiently explore the native ensemble of RNA molecules. Our procedure projects degrees of freedom onto a subspace of conformation space defined by distance constraints in the tertiary structure. The dimensionality reduction enables efficient exploration of conformational space. We show that the conformational distributions obtained with our method broadly sample the conformational landscape observed in NMR experiments. Compared to normal mode analysis-based exploration, our procedure diffuses faster through the experimental ensemble while also accessing conformational substates to greater precision. Our results suggest that conformational sampling with a highly reduced but fully atomistic representation of noncoding RNA expresses key features of their dynamic nature.

  9. Abundant RNA editing sites of chloroplast protein-coding genes in Ginkgo biloba and an evolutionary pattern analysis.

    PubMed

    He, Peng; Huang, Sheng; Xiao, Guanghui; Zhang, Yuzhou; Yu, Jianing

    2016-12-01

    RNA editing is a posttranscriptional modification process that alters the RNA sequence so that it deviates from the genomic DNA sequence. RNA editing mainly occurs in chloroplasts and mitochondrial genomes, and the number of editing sites varies in terrestrial plants. Why and how RNA editing systems evolved remains a mystery. Ginkgo biloba is one of the oldest seed plants and has an important evolutionary position. Determining the patterns and distribution of RNA editing in the ancient plant provides insights into the evolutionary trend of RNA editing, and helping us to further understand their biological significance. In this paper, we investigated 82 protein-coding genes in the chloroplast genome of G. biloba and identified 255 editing sites, which is the highest number of RNA editing events reported in a gymnosperm. All of the editing sites were C-to-U conversions, which mainly occurred in the second codon position, biased towards to the U_A context, and caused an increase in hydrophobic amino acids. RNA editing could change the secondary structures of 82 proteins, and create or eliminate a transmembrane region in five proteins as determined in silico. Finally, the evolutionary tendencies of RNA editing in different gene groups were estimated using the nonsynonymous-synonymous substitution rate selection mode. The G. biloba chloroplast genome possesses the highest number of RNA editing events reported so far in a seed plant. Most of the RNA editing sites can restore amino acid conservation, increase hydrophobicity, and even influence protein structures. Similar purifying selections constitute the dominant evolutionary force at the editing sites of essential genes, such as the psa, some psb and pet groups, and a positive selection occurred in the editing sites of nonessential genes, such as most ndh and a few psb genes.

  10. Aminoglycosylation Can Enhance the G-Quadruplex Binding Activity of Epigallocatechin

    PubMed Central

    Bai, Li-Ping; Ho, Hing-Man; Ma, Dik-Lung; Yang, Hui; Fu, Wai-Chung; Jiang, Zhi-Hong

    2013-01-01

    With the aim of enhancing G-quadruplex binding activity, two new glucosaminosides (16, 18) of penta-methylated epigallocatechin were synthesized by chemical glycosylation. Subsequent ESI-TOF-MS analysis demonstrated that these two glucosaminoside derivatives exhibit much stronger binding activity to human telomeric DNA and RNA G-quadruplexes than their parent structure (i.e., methylated EGC) (14) as well as natural epigallocatechin (EGC, 6). The DNA G-quadruplex binding activity of 16 and 18 is even more potent than strong G-quadruplex binder quercetin, which has a more planar structure. These two synthetic compounds also showed a higher binding strength to human telomeric RNA G-quadruplex than its DNA counterpart. Analysis of the structure-activity relationship revealed that the more basic compound, 16, has a higher binding capacity with DNA and RNA G-quadruplexes than its N-acetyl derivative, 18, suggesting the importance of the basicity of the aminoglycoside for G-quadruplex binding activity. Molecular docking simulation predicted that the aromatic ring of 16 π-stacks with the aromatic ring of guanine nucleotides, with the glucosamine moiety residing in the groove of G-quadruplex. This research indicates that glycosylation of natural products with aminosugar can significantly enhance their G-quadruplex binding activities, thus is an effective way to generate small molecules targeting G-quadruplexes in nucleic acids. In addition, this is the first report that green tea catechin can bind to nucleic acid G-quadruplex structures. PMID:23335983

  11. SimRNAweb: a web server for RNA 3D structure modeling with optional restraints.

    PubMed

    Magnus, Marcin; Boniecki, Michał J; Dawson, Wayne; Bujnicki, Janusz M

    2016-07-08

    RNA function in many biological processes depends on the formation of three-dimensional (3D) structures. However, RNA structure is difficult to determine experimentally, which has prompted the development of predictive computational methods. Here, we introduce a user-friendly online interface for modeling RNA 3D structures using SimRNA, a method that uses a coarse-grained representation of RNA molecules, utilizes the Monte Carlo method to sample the conformational space, and relies on a statistical potential to describe the interactions in the folding process. SimRNAweb makes SimRNA accessible to users who do not normally use high performance computational facilities or are unfamiliar with using the command line tools. The simplest input consists of an RNA sequence to fold RNA de novo. Alternatively, a user can provide a 3D structure in the PDB format, for instance a preliminary model built with some other technique, to jump-start the modeling close to the expected final outcome. The user can optionally provide secondary structure and distance restraints, and can freeze a part of the starting 3D structure. SimRNAweb can be used to model single RNA sequences and RNA-RNA complexes (up to 52 chains). The webserver is available at http://genesilico.pl/SimRNAweb. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  12. Comprehensive analysis of RNA-protein interactions by high-throughput sequencing-RNA affinity profiling.

    PubMed

    Tome, Jacob M; Ozer, Abdullah; Pagano, John M; Gheba, Dan; Schroth, Gary P; Lis, John T

    2014-06-01

    RNA-protein interactions play critical roles in gene regulation, but methods to quantitatively analyze these interactions at a large scale are lacking. We have developed a high-throughput sequencing-RNA affinity profiling (HiTS-RAP) assay by adapting a high-throughput DNA sequencer to quantify the binding of fluorescently labeled protein to millions of RNAs anchored to sequenced cDNA templates. Using HiTS-RAP, we measured the affinity of mutagenized libraries of GFP-binding and NELF-E-binding aptamers to their respective targets and identified critical regions of interaction. Mutations additively affected the affinity of the NELF-E-binding aptamer, whose interaction depended mainly on a single-stranded RNA motif, but not that of the GFP aptamer, whose interaction depended primarily on secondary structure.

  13. Biochemical identification of Argonaute 2 as the sole protein required for RNA-induced silencing complex activity

    PubMed Central

    Rand, Tim A.; Ginalski, Krzysztof; Grishin, Nick V.; Wang, Xiaodong

    2004-01-01

    RNA interference is carried out by the small double-stranded RNA-induced silencing complex (RISC). The RISC-bound small RNA guides the RISC complex to identify and cleave mRNAs with complementary sequences. The proteins that make up the RISC complex and cleave mRNA have not been unequivocally defined. Here, we report the biochemical purification of RISC activity to homogeneity from Drosophila Schnieder 2 cell extracts. Argonaute 2 (Ago-2) is the sole protein component present in the purified, functional RISC. By using a bioinformatics method that combines sequence-profile analysis with predicted protein secondary structure, we found homology between the PIWI domain of Ago-2 and endonuclease V and identified potential active-site amino acid residues within the PIWI domain of Ago-2. PMID:15452342

  14. Biochemical identification of Argonaute 2 as the sole protein required for RNA-induced silencing complex activity.

    PubMed

    Rand, Tim A; Ginalski, Krzysztof; Grishin, Nick V; Wang, Xiaodong

    2004-10-05

    RNA interference is carried out by the small double-stranded RNA-induced silencing complex (RISC). The RISC-bound small RNA guides the RISC complex to identify and cleave mRNAs with complementary sequences. The proteins that make up the RISC complex and cleave mRNA have not been unequivocally defined. Here, we report the biochemical purification of RISC activity to homogeneity from Drosophila Schnieder 2 cell extracts. Argonaute 2 (Ago-2) is the sole protein component present in the purified, functional RISC. By using a bioinformatics method that combines sequence-profile analysis with predicted protein secondary structure, we found homology between the PIWI domain of Ago-2 and endonuclease V and identified potential active-site amino acid residues within the PIWI domain of Ago-2.

  15. Using in-cell SHAPE-Seq and simulations to probe structure-function design principles of RNA transcriptional regulators.

    PubMed

    Takahashi, Melissa K; Watters, Kyle E; Gasper, Paul M; Abbott, Timothy R; Carlson, Paul D; Chen, Alan A; Lucks, Julius B

    2016-06-01

    Antisense RNA-mediated transcriptional regulators are powerful tools for controlling gene expression and creating synthetic gene networks. RNA transcriptional repressors derived from natural mechanisms called attenuators are particularly versatile, though their mechanistic complexity has made them difficult to engineer. Here we identify a new structure-function design principle for attenuators that enables the forward engineering of new RNA transcriptional repressors. Using in-cell SHAPE-Seq to characterize the structures of attenuator variants within Escherichia coli, we show that attenuator hairpins that facilitate interaction with antisense RNAs require interior loops for proper function. Molecular dynamics simulations of these attenuator variants suggest these interior loops impart structural flexibility. We further observe hairpin flexibility in the cellular structures of natural RNA mechanisms that use antisense RNA interactions to repress translation, confirming earlier results from in vitro studies. Finally, we design new transcriptional attenuators in silico using an interior loop as a structural requirement and show that they function as desired in vivo. This work establishes interior loops as an important structural element for designing synthetic RNA gene regulators. We anticipate that the coupling of experimental measurement of cellular RNA structure and function with computational modeling will enable rapid discovery of structure-function design principles for a diverse array of natural and synthetic RNA regulators. © 2016 Takahashi et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  16. A new way to see RNA

    PubMed Central

    Keating, Kevin S.; Humphris, Elisabeth L.; Pyle, Anna Marie

    2015-01-01

    Unlike proteins, the RNA backbone has numerous degrees of freedom (eight, if one counts the sugar pucker), making RNA modeling, structure building and prediction a multidimensional problem of exceptionally high complexity. And yet RNA tertiary structures are not infinite in their structural morphology; rather, they are built from a limited set of discrete units. In order to reduce the dimensionality of the RNA backbone in a physically reasonable way, a shorthand notation was created that reduced the RNA backbone torsion angles to two (η and θ, analogous to ϕ and ψ in proteins). When these torsion angles are calculated for nucleotides in a crystallographic database and plotted against one another, one obtains a plot analogous to a Ramachandran plot (the η/θ plot), with highly populated and unpopulated regions. Nucleotides that occupy proximal positions on the plot have identical structures and are found in the same units of tertiary structure. In this review, we describe the statistical validation of the η/θ formalism and the exploration of features within the η/θ plot. We also describe the application of the η/θ formalism in RNA motif discovery, structural comparison, RNA structure building and tertiary structure prediction. More than a tool, however, the η/θ formalism has provided new insights into RNA structure itself, revealing its fundamental components and the factors underlying RNA architectural form. PMID:21729350

  17. RNA-Puzzles Round III: 3D RNA structure prediction of five riboswitches and one ribozyme

    PubMed Central

    Biesiada, Marcin; Boniecki, Michał J.; Chou, Fang-Chieh; Ferré-D'Amaré, Adrian R.; Das, Rhiju; Dunin-Horkawicz, Stanisław; Geniesse, Caleb; Kappel, Kalli; Kladwang, Wipapat; Krokhotin, Andrey; Łach, Grzegorz E.; Major, François; Mann, Thomas H.; Pachulska-Wieczorek, Katarzyna; Patel, Dinshaw J.; Piccirilli, Joseph A.; Popenda, Mariusz; Purzycka, Katarzyna J.; Ren, Aiming; Rice, Greggory M.; Santalucia, John; Tandon, Arpit; Trausch, Jeremiah J.; Wang, Jian; Weeks, Kevin M.; Williams, Benfeard; Xiao, Yi; Zhang, Dong; Zok, Tomasz

    2017-01-01

    RNA-Puzzles is a collective experiment in blind 3D RNA structure prediction. We report here a third round of RNA-Puzzles. Five puzzles, 4, 8, 12, 13, 14, all structures of riboswitch aptamers and puzzle 7, a ribozyme structure, are included in this round of the experiment. The riboswitch structures include biological binding sites for small molecules (S-adenosyl methionine, cyclic diadenosine monophosphate, 5-amino 4-imidazole carboxamide riboside 5′-triphosphate, glutamine) and proteins (YbxF), and one set describes large conformational changes between ligand-free and ligand-bound states. The Varkud satellite ribozyme is the most recently solved structure of a known large ribozyme. All puzzles have established biological functions and require structural understanding to appreciate their molecular mechanisms. Through the use of fast-track experimental data, including multidimensional chemical mapping, and accurate prediction of RNA secondary structure, a large portion of the contacts in 3D have been predicted correctly leading to similar topologies for the top ranking predictions. Template-based and homology-derived predictions could predict structures to particularly high accuracies. However, achieving biological insights from de novo prediction of RNA 3D structures still depends on the size and complexity of the RNA. Blind computational predictions of RNA structures already appear to provide useful structural information in many cases. Similar to the previous RNA-Puzzles Round II experiment, the prediction of non-Watson–Crick interactions and the observed high atomic clash scores reveal a notable need for an algorithm of improvement. All prediction models and assessment results are available at http://ahsoka.u-strasbg.fr/rnapuzzles/. PMID:28138060

  18. The box C/D sRNP dimeric architecture is conserved across domain Archaea

    PubMed Central

    Bower-Phipps, Kathleen R.; Taylor, David W.; Wang, Hong-Wei; Baserga, Susan J.

    2012-01-01

    Box C/D small (nucleolar) ribonucleoproteins [s(no)RNPs] catalyze RNA-guided 2′-O-ribose methylation in two of the three domains of life. Recent structural studies have led to a controversy over whether box C/D sRNPs functionally assemble as monomeric or dimeric macromolecules. The archaeal box C/D sRNP from Methanococcus jannaschii (Mj) has been shown by glycerol gradient sedimentation, gel filtration chromatography, native gel analysis, and single-particle electron microscopy (EM) to adopt a di-sRNP architecture, containing four copies of each box C/D core protein and two copies of the Mj sR8 sRNA. Subsequently, investigators used a two-stranded artificial guide sRNA, CD45, to assemble a box C/D sRNP from Sulfolobus solfataricus with a short RNA methylation substrate, yielding a crystal structure of a mono-sRNP. To more closely examine box C/D sRNP architecture, we investigate the role of the omnipresent sRNA loop as a structural determinant of sRNP assembly. We show through sRNA mutagenesis, native gel electrophoresis, and single-particle EM that a di-sRNP is the near exclusive architecture obtained when reconstituting box C/D sRNPs with natural or artificial sRNAs containing an internal loop. Our results span three distantly related archaeal species—Sulfolobus solfataricus, Pyrococcus abyssi, and Archaeoglobus fulgidus—indicating that the di-sRNP architecture is broadly conserved across the entire archaeal domain. PMID:22753779

  19. The box C/D sRNP dimeric architecture is conserved across domain Archaea.

    PubMed

    Bower-Phipps, Kathleen R; Taylor, David W; Wang, Hong-Wei; Baserga, Susan J

    2012-08-01

    Box C/D small (nucleolar) ribonucleoproteins [s(no)RNPs] catalyze RNA-guided 2'-O-ribose methylation in two of the three domains of life. Recent structural studies have led to a controversy over whether box C/D sRNPs functionally assemble as monomeric or dimeric macromolecules. The archaeal box C/D sRNP from Methanococcus jannaschii (Mj) has been shown by glycerol gradient sedimentation, gel filtration chromatography, native gel analysis, and single-particle electron microscopy (EM) to adopt a di-sRNP architecture, containing four copies of each box C/D core protein and two copies of the Mj sR8 sRNA. Subsequently, investigators used a two-stranded artificial guide sRNA, CD45, to assemble a box C/D sRNP from Sulfolobus solfataricus with a short RNA methylation substrate, yielding a crystal structure of a mono-sRNP. To more closely examine box C/D sRNP architecture, we investigate the role of the omnipresent sRNA loop as a structural determinant of sRNP assembly. We show through sRNA mutagenesis, native gel electrophoresis, and single-particle EM that a di-sRNP is the near exclusive architecture obtained when reconstituting box C/D sRNPs with natural or artificial sRNAs containing an internal loop. Our results span three distantly related archaeal species--Sulfolobus solfataricus, Pyrococcus abyssi, and Archaeoglobus fulgidus--indicating that the di-sRNP architecture is broadly conserved across the entire archaeal domain.

  20. Structural analysis of the human U3 ribonucleoprotein particle reveal a conserved sequence available for base pairing with pre-rRNA.

    PubMed Central

    Parker, K A; Steitz, J A

    1987-01-01

    The human U3 ribonucleoprotein (RNP) has been analyzed to determine its protein constituents, sites of protein-RNA interaction, and RNA secondary structure. By using anti-U3 RNP antibodies and extracts prepared from HeLa cells labeled in vivo, the RNP was found to contain four nonphosphorylated proteins of 36, 30, 13, and 12.5 kilodaltons and two phosphorylated proteins of 74 and 59 kilodaltons. U3 nucleotides 72-90, 106-121, 154-166, and 190-217 must contain sites that interact with proteins since these regions are immunoprecipitated after treatment of the RNP with RNase A or T1. The secondary structure was probed with specific nucleases and by chemical modification with single-strand-specific reagents that block subsequent reverse transcription. Regions that are single stranded (and therefore potentially able to interact with a substrate RNA) include an evolutionarily conserved sequence at nucleotides 104-112 and nonconserved sequences at nucleotides 65-74, 80-84, and 88-93. Nucleotides 159-168 do not appear to be highly accessible, thus making it unlikely that this U3 sequence base pairs with sequences near the 5.8S rRNA-internal transcribed spacer II junction, as previously proposed. Alternative functions of the U3 RNP are discussed, including the possibility that U3 may participate in a processing event near the 3' end of 28S rRNA. Images PMID:2959855

  1. Structural model of the p14/SF3b155 · branch duplex complex.

    PubMed

    Schellenberg, Matthew J; Dul, Erin L; MacMillan, Andrew M

    2011-01-01

    Human p14 (SF3b14), a component of the spliceosomal U2 snRNP, interacts directly with the pre-mRNA branch adenosine within the context of the bulged duplex formed between the pre-mRNA branch region and U2 snRNA. This association occurs early in spliceosome assembly and persists within the fully assembled spliceosome. Analysis of the crystal structure of a complex containing p14 and a peptide derived from p14-associated SF3b155 combined with the results of cross-linking studies has suggested that the branch nucleotide interacts with a pocket on a non-canonical RNA binding surface formed by the complex. Here we report a structural model of the p14 · bulged duplex interaction based on a combination of X-ray crystallography of an adenine p14/SF3b155 peptide complex, biochemical comparison of a panel of disulfide cross-linked protein-RNA complexes, and small-angle X-ray scattering (SAXS). These studies reveal specific recognition of the branch adenosine within the p14 pocket and establish the orientation of the bulged duplex RNA bound on the protein surface. The intimate association of one surface of the bulged duplex with the p14/SF3b155 peptide complex described by this model buries the branch nucleotide at the interface and suggests that p14 · duplex interaction must be disrupted before the first step of splicing.

  2. Structural model of the p14/SF3b155·branch duplex complex

    PubMed Central

    Schellenberg, Matthew J.; Dul, Erin L.; MacMillan, Andrew M.

    2011-01-01

    Human p14 (SF3b14), a component of the spliceosomal U2 snRNP, interacts directly with the pre-mRNA branch adenosine within the context of the bulged duplex formed between the pre-mRNA branch region and U2 snRNA. This association occurs early in spliceosome assembly and persists within the fully assembled spliceosome. Analysis of the crystal structure of a complex containing p14 and a peptide derived from p14-associated SF3b155 combined with the results of cross-linking studies has suggested that the branch nucleotide interacts with a pocket on a non-canonical RNA binding surface formed by the complex. Here we report a structural model of the p14•bulged duplex interaction based on a combination of X-ray crystallography of an adenine p14/SF3b155 peptide complex, biochemical comparison of a panel of disulfide cross-linked protein–RNA complexes, and small-angle X-ray scattering (SAXS). These studies reveal specific recognition of the branch adenosine within the p14 pocket and establish the orientation of the bulged duplex RNA bound on the protein surface. The intimate association of one surface of the bulged duplex with the p14/SF3b155 peptide complex described by this model buries the branch nucleotide at the interface and suggests that p14•duplex interaction must be disrupted before the first step of splicing. PMID:21062891

  3. Four RNA families with functional transient structures

    PubMed Central

    Zhu, Jing Yun A; Meyer, Irmtraud M

    2015-01-01

    Protein-coding and non-coding RNA transcripts perform a wide variety of cellular functions in diverse organisms. Several of their functional roles are expressed and modulated via RNA structure. A given transcript, however, can have more than a single functional RNA structure throughout its life, a fact which has been previously overlooked. Transient RNA structures, for example, are only present during specific time intervals and cellular conditions. We here introduce four RNA families with transient RNA structures that play distinct and diverse functional roles. Moreover, we show that these transient RNA structures are structurally well-defined and evolutionarily conserved. Since Rfam annotates one structure for each family, there is either no annotation for these transient structures or no such family. Thus, our alignments either significantly update and extend the existing Rfam families or introduce a new RNA family to Rfam. For each of the four RNA families, we compile a multiple-sequence alignment based on experimentally verified transient and dominant (dominant in terms of either the thermodynamic stability and/or attention received so far) RNA secondary structures using a combination of automated search via covariance model and manual curation. The first alignment is the Trp operon leader which regulates the operon transcription in response to tryptophan abundance through alternative structures. The second alignment is the HDV ribozyme which we extend to the 5′ flanking sequence. This flanking sequence is involved in the regulation of the transcript's self-cleavage activity. The third alignment is the 5′ UTR of the maturation protein from Levivirus which contains a transient structure that temporarily postpones the formation of the final inhibitory structure to allow translation of maturation protein. The fourth and last alignment is the SAM riboswitch which regulates the downstream gene expression by assuming alternative structures upon binding of SAM. All transient and dominant structures are mapped to our new alignments introduced here. PMID:25751035

  4. Four RNA families with functional transient structures.

    PubMed

    Zhu, Jing Yun A; Meyer, Irmtraud M

    2015-01-01

    Protein-coding and non-coding RNA transcripts perform a wide variety of cellular functions in diverse organisms. Several of their functional roles are expressed and modulated via RNA structure. A given transcript, however, can have more than a single functional RNA structure throughout its life, a fact which has been previously overlooked. Transient RNA structures, for example, are only present during specific time intervals and cellular conditions. We here introduce four RNA families with transient RNA structures that play distinct and diverse functional roles. Moreover, we show that these transient RNA structures are structurally well-defined and evolutionarily conserved. Since Rfam annotates one structure for each family, there is either no annotation for these transient structures or no such family. Thus, our alignments either significantly update and extend the existing Rfam families or introduce a new RNA family to Rfam. For each of the four RNA families, we compile a multiple-sequence alignment based on experimentally verified transient and dominant (dominant in terms of either the thermodynamic stability and/or attention received so far) RNA secondary structures using a combination of automated search via covariance model and manual curation. The first alignment is the Trp operon leader which regulates the operon transcription in response to tryptophan abundance through alternative structures. The second alignment is the HDV ribozyme which we extend to the 5' flanking sequence. This flanking sequence is involved in the regulation of the transcript's self-cleavage activity. The third alignment is the 5' UTR of the maturation protein from Levivirus which contains a transient structure that temporarily postpones the formation of the final inhibitory structure to allow translation of maturation protein. The fourth and last alignment is the SAM riboswitch which regulates the downstream gene expression by assuming alternative structures upon binding of SAM. All transient and dominant structures are mapped to our new alignments introduced here.

  5. FRASS: the web-server for RNA structural comparison

    PubMed Central

    2010-01-01

    Background The impressive increase of novel RNA structures, during the past few years, demands automated methods for structure comparison. While many algorithms handle only small motifs, few techniques, developed in recent years, (ARTS, DIAL, SARA, SARSA, and LaJolla) are available for the structural comparison of large and intact RNA molecules. Results The FRASS web-server represents a RNA chain with its Gauss integrals and allows one to compare structures of RNA chains and to find similar entries in a database derived from the Protein Data Bank. We observed that FRASS scores correlate well with the ARTS and LaJolla similarity scores. Moreover, the-web server can also reproduce satisfactorily the DARTS classification of RNA 3D structures and the classification of the SCOR functions that was obtained by the SARA method. Conclusions The FRASS web-server can be easily used to detect relationships among RNA molecules and to scan efficiently the rapidly enlarging structural databases. PMID:20553602

  6. Functional 5′ UTR mRNA structures in eukaryotic translation regulation and how to find them

    PubMed Central

    Leppek, Kathrin; Das, Rhiju; Barna, Maria

    2017-01-01

    RNA molecules can fold into intricate shapes that can provide an additional layer of control of gene expression beyond that of their sequence. In this Review, we discuss the current mechanistic understanding of structures in 5′ untranslated regions (UTRs) of eukaryotic mRNAs and the emerging methodologies used to explore them. These structures may regulate cap-dependent translation initiation through helicase-mediated remodelling of RNA structures and higher-order RNA interactions, as well as cap-independent translation initiation through internal ribosome entry sites (IRESs), mRNA modifications and other specialized translation pathways. We discuss known 5′ UTR RNA structures and how new structure probing technologies coupled with prospective validation, particularly compensatory mutagenesis, are likely to identify classes of structured RNA elements that shape post-transcriptional control of gene expression and the development of multicellular organisms. PMID:29165424

  7. JOINT AND INDIVIDUAL VARIATION EXPLAINED (JIVE) FOR INTEGRATED ANALYSIS OF MULTIPLE DATA TYPES.

    PubMed

    Lock, Eric F; Hoadley, Katherine A; Marron, J S; Nobel, Andrew B

    2013-03-01

    Research in several fields now requires the analysis of datasets in which multiple high-dimensional types of data are available for a common set of objects. In particular, The Cancer Genome Atlas (TCGA) includes data from several diverse genomic technologies on the same cancerous tumor samples. In this paper we introduce Joint and Individual Variation Explained (JIVE), a general decomposition of variation for the integrated analysis of such datasets. The decomposition consists of three terms: a low-rank approximation capturing joint variation across data types, low-rank approximations for structured variation individual to each data type, and residual noise. JIVE quantifies the amount of joint variation between data types, reduces the dimensionality of the data, and provides new directions for the visual exploration of joint and individual structure. The proposed method represents an extension of Principal Component Analysis and has clear advantages over popular two-block methods such as Canonical Correlation Analysis and Partial Least Squares. A JIVE analysis of gene expression and miRNA data on Glioblastoma Multiforme tumor samples reveals gene-miRNA associations and provides better characterization of tumor types.

  8. Efficient RNA structure comparison algorithms.

    PubMed

    Arslan, Abdullah N; Anandan, Jithendar; Fry, Eric; Monschke, Keith; Ganneboina, Nitin; Bowerman, Jason

    2017-12-01

    Recently proposed relative addressing-based ([Formula: see text]) RNA secondary structure representation has important features by which an RNA structure database can be stored into a suffix array. A fast substructure search algorithm has been proposed based on binary search on this suffix array. Using this substructure search algorithm, we present a fast algorithm that finds the largest common substructure of given multiple RNA structures in [Formula: see text] format. The multiple RNA structure comparison problem is NP-hard in its general formulation. We introduced a new problem for comparing multiple RNA structures. This problem has more strict similarity definition and objective, and we propose an algorithm that solves this problem efficiently. We also develop another comparison algorithm that iteratively calls this algorithm to locate nonoverlapping large common substructures in compared RNAs. With the new resulting tools, we improved the RNASSAC website (linked from http://faculty.tamuc.edu/aarslan ). This website now also includes two drawing tools: one specialized for preparing RNA substructures that can be used as input by the search tool, and another one for automatically drawing the entire RNA structure from a given structure sequence.

  9. Structure-function analysis of Sua5 protein reveals novel functional motifs required for the biosynthesis of the universal t6A tRNA modification.

    PubMed

    Pichard-Kostuch, Adeline; Zhang, Wenhua; Liger, Dominique; Daugeron, Marie-Claire; Letoquart, Juliette; Li de la Sierra-Gallay, Ines; Forterre, Patrick; Collinet, Bruno; van Tilbeurgh, Herman; Basta, Tamara

    2018-04-12

    N6-threonyl-carbamoyl adenosine (t6A) is a universal tRNA modification found at position 37, next to the anticodon, in almost all tRNAs decoding ANN codons (where N = A, U, G or C). t6A stabilizes the codon-anticodon interaction and hence promotes translation fidelity. The first step of the biosynthesis of t6A, the production of threonyl-carbamoyl adenylate (TC-AMP), is catalyzed by the Sua5/TsaC family of enzymes. While TsaC is a single domain protein, Sua5 enzymes are composed of the TsaC-like domain, a linker and an extra domain called SUA5 of unknown function. In the present study, we report structure-function analysis of Pyrococcus abyssi Sua5 (Pa-Sua5). Crystallographic data revealed binding sites for bicarbonate substrate and pyrophosphate product. The linker of Pa-Sua5 forms a loop structure that folds into the active site gorge and closes it. Using structure-guided mutational analysis we established that the conserved sequence motifs in the linker and the domain-domain interface are essential for the function of Pa-Sua5. We propose that the linker participates actively in the biosynthesis of TC-AMP by binding to ATP/PPi and by stabilizing the N-carboxy-L-threonine intermediate. Hence, TsaC orthologs which lack such a linker and SUA5 domain use different mechanism for TC-AMP synthesis. Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  10. K-Partite RNA Secondary Structures

    NASA Astrophysics Data System (ADS)

    Jiang, Minghui; Tejada, Pedro J.; Lasisi, Ramoni O.; Cheng, Shanhong; Fechser, D. Scott

    RNA secondary structure prediction is a fundamental problem in structural bioinformatics. The prediction problem is difficult because RNA secondary structures may contain pseudoknots formed by crossing base pairs. We introduce k-partite secondary structures as a simple classification of RNA secondary structures with pseudoknots. An RNA secondary structure is k-partite if it is the union of k pseudoknot-free sub-structures. Most known RNA secondary structures are either bipartite or tripartite. We show that there exists a constant number k such that any secondary structure can be modified into a k-partite secondary structure with approximately the same free energy. This offers a partial explanation of the prevalence of k-partite secondary structures with small k. We give a complete characterization of the computational complexities of recognizing k-partite secondary structures for all k ≥ 2, and show that this recognition problem is essentially the same as the k-colorability problem on circle graphs. We present two simple heuristics, iterated peeling and first-fit packing, for finding k-partite RNA secondary structures. For maximizing the number of base pair stackings, our iterated peeling heuristic achieves a constant approximation ratio of at most k for 2 ≤ k ≤ 5, and at most frac6{1-(1-6/k)^k} le frac6{1-e^{-6}} < 6.01491 for k ≥ 6. Experiment on sequences from PseudoBase shows that our first-fit packing heuristic outperforms the leading method HotKnots in predicting RNA secondary structures with pseudoknots. Source code, data set, and experimental results are available at http://www.cs.usu.edu/ mjiang/rna/kpartite/.

  11. In cell mutational interference mapping experiment (in cell MIME) identifies the 5' polyadenylation signal as a dual regulator of HIV-1 genomic RNA production and packaging.

    PubMed

    Smyth, Redmond P; Smith, Maureen R; Jousset, Anne-Caroline; Despons, Laurence; Laumond, Géraldine; Decoville, Thomas; Cattenoz, Pierre; Moog, Christiane; Jossinet, Fabrice; Mougel, Marylène; Paillart, Jean-Christophe; von Kleist, Max; Marquet, Roland

    2018-05-18

    Non-coding RNA regulatory elements are important for viral replication, making them promising targets for therapeutic intervention. However, regulatory RNA is challenging to detect and characterise using classical structure-function assays. Here, we present in cell Mutational Interference Mapping Experiment (in cell MIME) as a way to define RNA regulatory landscapes at single nucleotide resolution under native conditions. In cell MIME is based on (i) random mutation of an RNA target, (ii) expression of mutated RNA in cells, (iii) physical separation of RNA into functional and non-functional populations, and (iv) high-throughput sequencing to identify mutations affecting function. We used in cell MIME to define RNA elements within the 5' region of the HIV-1 genomic RNA (gRNA) that are important for viral replication in cells. We identified three distinct RNA motifs controlling intracellular gRNA production, and two distinct motifs required for gRNA packaging into virions. Our analysis reveals the 73AAUAAA78 polyadenylation motif within the 5' PolyA domain as a dual regulator of gRNA production and gRNA packaging, and demonstrates that a functional polyadenylation signal is required for viral packaging even though it negatively affects gRNA production.

  12. In cell mutational interference mapping experiment (in cell MIME) identifies the 5′ polyadenylation signal as a dual regulator of HIV-1 genomic RNA production and packaging

    PubMed Central

    Smith, Maureen R; Jousset, Anne-Caroline; Despons, Laurence; Laumond, Géraldine; Decoville, Thomas; Cattenoz, Pierre; Moog, Christiane; Jossinet, Fabrice; Mougel, Marylène; Paillart, Jean-Christophe

    2018-01-01

    Abstract Non-coding RNA regulatory elements are important for viral replication, making them promising targets for therapeutic intervention. However, regulatory RNA is challenging to detect and characterise using classical structure-function assays. Here, we present in cell Mutational Interference Mapping Experiment (in cell MIME) as a way to define RNA regulatory landscapes at single nucleotide resolution under native conditions. In cell MIME is based on (i) random mutation of an RNA target, (ii) expression of mutated RNA in cells, (iii) physical separation of RNA into functional and non-functional populations, and (iv) high-throughput sequencing to identify mutations affecting function. We used in cell MIME to define RNA elements within the 5′ region of the HIV-1 genomic RNA (gRNA) that are important for viral replication in cells. We identified three distinct RNA motifs controlling intracellular gRNA production, and two distinct motifs required for gRNA packaging into virions. Our analysis reveals the 73AAUAAA78 polyadenylation motif within the 5′ PolyA domain as a dual regulator of gRNA production and gRNA packaging, and demonstrates that a functional polyadenylation signal is required for viral packaging even though it negatively affects gRNA production. PMID:29514260

  13. Comparative Structural and Functional Analysis of Bunyavirus and Arenavirus Cap-Snatching Endonucleases

    PubMed Central

    Reguera, Juan; Gerlach, Piotr; Rosenthal, Maria; Gaudon, Stephanie; Coscia, Francesca; Günther, Stephan; Cusack, Stephen

    2016-01-01

    Segmented negative strand RNA viruses of the arena-, bunya- and orthomyxovirus families uniquely carry out viral mRNA transcription by the cap-snatching mechanism. This involves cleavage of host mRNAs close to their capped 5′ end by an endonuclease (EN) domain located in the N-terminal region of the viral polymerase. We present the structure of the cap-snatching EN of Hantaan virus, a bunyavirus belonging to hantavirus genus. Hantaan EN has an active site configuration, including a metal co-ordinating histidine, and nuclease activity similar to the previously reported La Crosse virus and Influenza virus ENs (orthobunyavirus and orthomyxovirus respectively), but is more active in cleaving a double stranded RNA substrate. In contrast, Lassa arenavirus EN has only acidic metal co-ordinating residues. We present three high resolution structures of Lassa virus EN with different bound ion configurations and show in comparative biophysical and biochemical experiments with Hantaan, La Crosse and influenza ENs that the isolated Lassa EN is essentially inactive. The results are discussed in the light of EN activation mechanisms revealed by recent structures of full-length influenza virus polymerase. PMID:27304209

  14. Structural analysis of malaria-parasite lysyl-tRNA synthetase provides a platform for drug development.

    PubMed

    Khan, Sameena; Garg, Ankur; Camacho, Noelia; Van Rooyen, Jason; Kumar Pole, Anil; Belrhali, Hassan; Ribas de Pouplana, Lluis; Sharma, Vinay; Sharma, Amit

    2013-05-01

    Aminoacyl-tRNA synthetases are essential enzymes that transmit information from the genetic code to proteins in cells and are targets for antipathogen drug development. Elucidation of the crystal structure of cytoplasmic lysyl-tRNA synthetase from the malaria parasite Plasmodium falciparum (PfLysRS) has allowed direct comparison with human LysRS. The authors' data suggest that PfLysRS is dimeric in solution, whereas the human counterpart can also adopt tetrameric forms. It is shown for the first time that PfLysRS is capable of synthesizing the signalling molecule Ap4a (diadenosine tetraphosphate) using ATP as a substrate. The PfLysRS crystal structure is in the apo form, such that binding to ATP will require rotameric changes in four conserved residues. Differences in the active-site regions of parasite and human LysRSs suggest the possibility of exploiting PfLysRS for selective inhibition. These investigations on PfLysRS further validate malarial LysRSs as attractive antimalarial targets and provide new structural space for the development of inhibitors that target pathogen LysRSs selectively.

  15. Modeling of the structure of ribosomal protein L1 from the archaeon Haloarcula marismortui

    NASA Astrophysics Data System (ADS)

    Nevskaya, N. A.; Kljashtorny, V. G.; Vakhrusheva, A. V.; Garber, M. B.; Nikonov, S. V.

    2017-07-01

    The halophilic archaeon Haloarcula marismortui proliferates in the Dead Sea at extremely high salt concentrations (higher than 3 M). This is the only archaeon, for which the crystal structure of the ribosomal 50S subunit was determined. However, the structure of the functionally important side protuberance containing the abnormally negatively charged protein L1 (HmaL1) was not visualized. Attempts to crystallize HmaL1 in the isolated state or as its complex with RNA using normal salt concentrations (≤500 mM) failed. A theoretical model of HmaL1 was built based on the structural data for homologs of the protein L1 from other organisms, and this model was refined by molecular dynamics methods. Analysis of this model showed that the protein HmaL1 can undergo aggregation due to the presence of a cluster of positive charges unique for proteins L1. This cluster is located at the RNA-protein interface, which interferes with the crystallization of HmaL1 and the binding of the latter to RNA.

  16. Computer-Aided Design of RNA Origami Structures.

    PubMed

    Sparvath, Steffen L; Geary, Cody W; Andersen, Ebbe S

    2017-01-01

    RNA nanostructures can be used as scaffolds to organize, combine, and control molecular functionalities, with great potential for applications in nanomedicine and synthetic biology. The single-stranded RNA origami method allows RNA nanostructures to be folded as they are transcribed by the RNA polymerase. RNA origami structures provide a stable framework that can be decorated with functional RNA elements such as riboswitches, ribozymes, interaction sites, and aptamers for binding small molecules or protein targets. The rich library of RNA structural and functional elements combined with the possibility to attach proteins through aptamer-based binding creates virtually limitless possibilities for constructing advanced RNA-based nanodevices.In this chapter we provide a detailed protocol for the single-stranded RNA origami design method using a simple 2-helix tall structure as an example. The first step involves 3D modeling of a double-crossover between two RNA double helices, followed by decoration with tertiary motifs. The second step deals with the construction of a 2D blueprint describing the secondary structure and sequence constraints that serves as the input for computer programs. In the third step, computer programs are used to design RNA sequences that are compatible with the structure, and the resulting outputs are evaluated and converted into DNA sequences to order.

  17. The DEAH-box helicase Dhr1 dissociates U3 from the pre-rRNA to promote formation of the central pseudoknot.

    PubMed

    Sardana, Richa; Liu, Xin; Granneman, Sander; Zhu, Jieyi; Gill, Michael; Papoulas, Ophelia; Marcotte, Edward M; Tollervey, David; Correll, Carl C; Johnson, Arlen W

    2015-02-01

    In eukaryotes, the highly conserved U3 small nucleolar RNA (snoRNA) base-pairs to multiple sites in the pre-ribosomal RNA (pre-rRNA) to promote early cleavage and folding events. Binding of the U3 box A region to the pre-rRNA is mutually exclusive with folding of the central pseudoknot (CPK), a universally conserved rRNA structure of the small ribosomal subunit essential for protein synthesis. Here, we report that the DEAH-box helicase Dhr1 (Ecm16) is responsible for displacing U3. An active site mutant of Dhr1 blocked release of U3 from the pre-ribosome, thereby trapping a pre-40S particle. This particle had not yet achieved its mature structure because it contained U3, pre-rRNA, and a number of early-acting ribosome synthesis factors but noticeably lacked ribosomal proteins (r-proteins) that surround the CPK. Dhr1 was cross-linked in vivo to the pre-rRNA and to U3 sequences flanking regions that base-pair to the pre-rRNA including those that form the CPK. Point mutations in the box A region of U3 suppressed a cold-sensitive mutation of Dhr1, strongly indicating that U3 is an in vivo substrate of Dhr1. To support the conclusions derived from in vivo analysis we showed that Dhr1 unwinds U3-18S duplexes in vitro by using a mechanism reminiscent of DEAD box proteins.

  18. The DEAH-box Helicase Dhr1 Dissociates U3 from the Pre-rRNA to Promote Formation of the Central Pseudoknot

    PubMed Central

    Granneman, Sander; Zhu, Jieyi; Gill, Michael; Papoulas, Ophelia; Marcotte, Edward M.; Tollervey, David; Correll, Carl C.; Johnson, Arlen W.

    2015-01-01

    In eukaryotes, the highly conserved U3 small nucleolar RNA (snoRNA) base-pairs to multiple sites in the pre-ribosomal RNA (pre-rRNA) to promote early cleavage and folding events. Binding of the U3 box A region to the pre-rRNA is mutually exclusive with folding of the central pseudoknot (CPK), a universally conserved rRNA structure of the small ribosomal subunit essential for protein synthesis. Here, we report that the DEAH-box helicase Dhr1 (Ecm16) is responsible for displacing U3. An active site mutant of Dhr1 blocked release of U3 from the pre-ribosome, thereby trapping a pre-40S particle. This particle had not yet achieved its mature structure because it contained U3, pre-rRNA, and a number of early-acting ribosome synthesis factors but noticeably lacked ribosomal proteins (r-proteins) that surround the CPK. Dhr1 was cross-linked in vivo to the pre-rRNA and to U3 sequences flanking regions that base-pair to the pre-rRNA including those that form the CPK. Point mutations in the box A region of U3 suppressed a cold-sensitive mutation of Dhr1, strongly indicating that U3 is an in vivo substrate of Dhr1. To support the conclusions derived from in vivo analysis we showed that Dhr1 unwinds U3-18S duplexes in vitro by using a mechanism reminiscent of DEAD box proteins. PMID:25710520

  19. How Mg2+ ion and water network affect the stability and structure of non-Watson-Crick base pairs in E. coli loop E of 5S rRNA: a molecular dynamics and reference interaction site model (RISM) study.

    PubMed

    Shanker, Sudhanshu; Bandyopadhyay, Pradipta

    2017-08-01

    The non-Watson-Crick (non-WC) base pairs of Escherichia coli loop E of 5S rRNA are stabilized by Mg 2+ ions through water-mediated interaction. It is important to know the synergic role of Mg 2+ and the water network surrounding Mg 2+ in stabilizing the non-WC base pairs of RNA. For this purpose, free energy change of the system is calculated using molecular dynamics (MD) simulation as Mg 2+ is pulled from RNA, which causes disturbance of the water network. It was found that Mg 2+ remains hexahydrated unless it is close to or far from RNA. In the pentahydrated form, Mg 2+ interacts directly with RNA. Water network has been identified by two complimentary methods; MD followed by a density-based clustering algorithm and three-dimensional-reference interaction site model. These two methods gave similar results. Identification of water network around Mg 2+ and non-WC base pairs gives a clue to the strong effect of water network on the stability of this RNA. Based on sequence analysis of all Eubacteria 5s rRNA, we propose that hexahydrated Mg 2+ is an integral part of this RNA and geometry of base pairs surrounding it adjust to accommodate the [Formula: see text]. Overall the findings from this work can help in understanding the basis of the complex structure and stability of RNA with non-WC base pairs.

  20. Automated 3D structure composition for large RNAs

    PubMed Central

    Popenda, Mariusz; Szachniuk, Marta; Antczak, Maciej; Purzycka, Katarzyna J.; Lukasiak, Piotr; Bartol, Natalia; Blazewicz, Jacek; Adamiak, Ryszard W.

    2012-01-01

    Understanding the numerous functions that RNAs play in living cells depends critically on knowledge of their three-dimensional structure. Due to the difficulties in experimentally assessing structures of large RNAs, there is currently great demand for new high-resolution structure prediction methods. We present the novel method for the fully automated prediction of RNA 3D structures from a user-defined secondary structure. The concept is founded on the machine translation system. The translation engine operates on the RNA FRABASE database tailored to the dictionary relating the RNA secondary structure and tertiary structure elements. The translation algorithm is very fast. Initial 3D structure is composed in a range of seconds on a single processor. The method assures the prediction of large RNA 3D structures of high quality. Our approach needs neither structural templates nor RNA sequence alignment, required for comparative methods. This enables the building of unresolved yet native and artificial RNA structures. The method is implemented in a publicly available, user-friendly server RNAComposer. It works in an interactive mode and a batch mode. The batch mode is designed for large-scale modelling and accepts atomic distance restraints. Presently, the server is set to build RNA structures of up to 500 residues. PMID:22539264

  1. CellTree: an R/bioconductor package to infer the hierarchical structure of cell populations from single-cell RNA-seq data.

    PubMed

    duVerle, David A; Yotsukura, Sohiya; Nomura, Seitaro; Aburatani, Hiroyuki; Tsuda, Koji

    2016-09-13

    Single-cell RNA sequencing is fast becoming one the standard method for gene expression measurement, providing unique insights into cellular processes. A number of methods, based on general dimensionality reduction techniques, have been suggested to help infer and visualise the underlying structure of cell populations from single-cell expression levels, yet their models generally lack proper biological grounding and struggle at identifying complex differentiation paths. Here we introduce cellTree: an R/Bioconductor package that uses a novel statistical approach, based on document analysis techniques, to produce tree structures outlining the hierarchical relationship between single-cell samples, while identifying latent groups of genes that can provide biological insights. With cellTree, we provide experimentalists with an easy-to-use tool, based on statistically and biologically-sound algorithms, to efficiently explore and visualise single-cell RNA data. The cellTree package is publicly available in the online Bionconductor repository at: http://bioconductor.org/packages/cellTree/ .

  2. Methyl Transfer by Substrate Signaling from a Knotted Protein Fold

    PubMed Central

    Christian, Thomas; Sakaguchi, Reiko; Perlinska, Agata P.; Lahoud, Georges; Ito, Takuhiro; Taylor, Erika A.; Yokoyama, Shigeyuki; Sulkowska, Joanna I.; Hou, Ya-Ming

    2017-01-01

    Proteins with knotted configurations are restricted in conformational space relative to unknotted proteins. Little is known if knotted proteins have sufficient dynamics to communicate between spatially separated substrate-binding sites. In bacteria, TrmD is a methyl transferase that uses a knotted protein fold to catalyze methyl transfer from S-adenosyl methionine (AdoMet) to G37-tRNA. The product m1G37-tRNA is essential for life as a determinant to maintain protein synthesis reading-frame. Using an integrated approach of structure, kinetic, and computational analysis, we show here that the structurally constrained TrmD knot is required for its catalytic activity. Unexpectedly, the TrmD knot has complex internal movements that respond to AdoMet binding and signaling. Most of the signaling propagates the free energy of AdoMet binding to stabilize tRNA binding and to assemble the active site. This work demonstrates new principles of knots as an organized structure that captures the free energies of substrate binding to facilitate catalysis. PMID:27571175

  3. A search for H/ACA snoRNAs in yeast using MFE secondary structure prediction.

    PubMed

    Edvardsson, Sverker; Gardner, Paul P; Poole, Anthony M; Hendy, Michael D; Penny, David; Moulton, Vincent

    2003-05-01

    Noncoding RNA genes produce functional RNA molecules rather than coding for proteins. One such family is the H/ACA snoRNAs. Unlike the related C/D snoRNAs these have resisted automated detection to date. We develop an algorithm to screen the yeast genome for novel H/ACA snoRNAs. To achieve this, we introduce some new methods for facilitating the search for noncoding RNAs in genomic sequences which are based on properties of predicted minimum free-energy (MFE) secondary structures. The algorithm has been implemented and can be generalized to enable screening of other eukaryote genomes. We find that use of primary sequence alone is insufficient for identifying novel H/ACA snoRNAs. Only the use of secondary structure filters reduces the number of candidates to a manageable size. From genomic context, we identify three strong H/ACA snoRNA candidates. These together with a further 47 candidates obtained by our analysis are being experimentally screened.

  4. Thioflavin T as an efficient fluorescence sensor for selective recognition of RNA G-quadruplexes

    NASA Astrophysics Data System (ADS)

    Xu, Shujuan; Li, Qian; Xiang, Junfeng; Yang, Qianfan; Sun, Hongxia; Guan, Aijiao; Wang, Lixia; Liu, Yan; Yu, Lijia; Shi, Yunhua; Chen, Hongbo; Tang, Yalin

    2016-04-01

    RNA G-quadruplexes (G4s) play important roles in translational regulation, mRNA processing events and gene expression. Therefore, a fluorescent probe that is capable of efficiently recognizing RNA G-quadruplex structures among other RNA forms is highly desirable. In this study, a water-soluble fluorogenic dye (i.e., Thioflavin T (ThT)) was employed to recognize RNA G-quadruplex structures using UV-Vis absorption spectra, fluorescence spectra and emission lifetime experiments. By stacking on the G-tetrad, the ThT probe exhibited highly specific recognition of RNA G-quadruplex structures with striking fluorescence enhancement compared with other RNA forms. The specific binding demonstrates that ThT is an efficient fluorescence sensor that can distinguish G4 and non-G4 RNA structures.

  5. Crystal-Structure-Guided Design of Self-Assembling RNA Nanotriangles.

    PubMed

    Boerneke, Mark A; Dibrov, Sergey M; Hermann, Thomas

    2016-03-14

    RNA nanotechnology uses RNA structural motifs to build nanosized architectures that assemble through selective base-pair interactions. Herein, we report the crystal-structure-guided design of highly stable RNA nanotriangles that self-assemble cooperatively from short oligonucleotides. The crystal structure of an 81 nucleotide nanotriangle determined at 2.6 Å resolution reveals the so-far smallest circularly closed nanoobject made entirely of double-stranded RNA. The assembly of the nanotriangle architecture involved RNA corner motifs that were derived from ligand-responsive RNA switches, which offer the opportunity to control self-assembly and dissociation. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  6. On the importance of cotranscriptional RNA structure formation

    PubMed Central

    Lai, Daniel; Proctor, Jeff R.; Meyer, Irmtraud M.

    2013-01-01

    The expression of genes, both coding and noncoding, can be significantly influenced by RNA structural features of their corresponding transcripts. There is by now mounting experimental and some theoretical evidence that structure formation in vivo starts during transcription and that this cotranscriptional folding determines the functional RNA structural features that are being formed. Several decades of research in bioinformatics have resulted in a wide range of computational methods for predicting RNA secondary structures. Almost all state-of-the-art methods in terms of prediction accuracy, however, completely ignore the process of structure formation and focus exclusively on the final RNA structure. This review hopes to bridge this gap. We summarize the existing evidence for cotranscriptional folding and then review the different, currently used strategies for RNA secondary-structure prediction. Finally, we propose a range of ideas on how state-of-the-art methods could be potentially improved by explicitly capturing the process of cotranscriptional structure formation. PMID:24131802

  7. ToNER: A tool for identifying nucleotide enrichment signals in feature-enriched RNA-seq data.

    PubMed

    Promworn, Yuttachon; Kaewprommal, Pavita; Shaw, Philip J; Intarapanich, Apichart; Tongsima, Sissades; Piriyapongsa, Jittima

    2017-01-01

    Biochemical methods are available for enriching 5' ends of RNAs in prokaryotes, which are employed in the differential RNA-seq (dRNA-seq) and the more recent Cappable-seq protocols. Computational methods are needed to locate RNA 5' ends from these data by statistical analysis of the enrichment. Although statistical-based analysis methods have been developed for dRNA-seq, they may not be suitable for Cappable-seq data. The more efficient enrichment method employed in Cappable-seq compared with dRNA-seq could affect data distribution and thus algorithm performance. We present Transformation of Nucleotide Enrichment Ratios (ToNER), a tool for statistical modeling of enrichment from RNA-seq data obtained from enriched and unenriched libraries. The tool calculates nucleotide enrichment scores and determines the global transformation for fitting to the normal distribution using the Box-Cox procedure. From the transformed distribution, sites of significant enrichment are identified. To increase power of detection, meta-analysis across experimental replicates is offered. We tested the tool on Cappable-seq and dRNA-seq data for identifying Escherichia coli transcript 5' ends and compared the results with those from the TSSAR tool, which is designed for analyzing dRNA-seq data. When combining results across Cappable-seq replicates, ToNER detects more known transcript 5' ends than TSSAR. In general, the transcript 5' ends detected by ToNER but not TSSAR occur in regions which cannot be locally modeled by TSSAR. ToNER uses a simple yet robust statistical modeling approach, which can be used for detecting RNA 5'ends from Cappable-seq data, in particular when combining information from experimental replicates. The ToNER tool could potentially be applied for analyzing other RNA-seq datasets in which enrichment for other structural features of RNA is employed. The program is freely available for download at ToNER webpage (http://www4a.biotec.or.th/GI/tools/toner) and GitHub repository (https://github.com/PavitaKae/ToNER).

  8. Structure of the Paramyxovirus Parainfluenza Virus 5 Nucleoprotein in Complex with an Amino-Terminal Peptide of the Phosphoprotein

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Aggarwal, Megha; Leser, George P.; Kors, Christopher A.

    Parainfluenza virus 5 (PIV5) belongs to the familyParamyxoviridae, which consists of enveloped viruses with a nonsegmented negative-strand RNA genome encapsidated by the nucleoprotein (N). Paramyxovirus replication is regulated by the phosphoprotein (P) through protein-protein interactions with N and the RNA polymerase (L). The chaperone activity of P is essential to maintain the unassembled RNA-free form of N in order to prevent nonspecific RNA binding and premature N oligomerization. Here, we determined the crystal structure of unassembled PIV5 N in complex with a P peptide (N 0P) derived from the N terminus of P (P50) at 2.65 Å. The PIV5 Nmore » 0P consists of two domains: an N-terminal domain (NTD) and a C-terminal domain (CTD) separated by a hinge region. The cleft at the hinge region of RNA-bound PIV5 N was previously shown to be an RNA binding site. The N 0P structure shows that the P peptide binds to the CTD of N and extends toward the RNA binding site to inhibit N oligomerization and, hence, RNA binding. Binding of P peptide also keeps the PIV5 N in the open form. A molecular dynamics (MD) analysis of both the open and closed forms of N shows the flexibility of the CTD and the preference of the N protein to be in an open conformation. The gradual opening of the hinge region, to release the RNA, was also observed. Together, these results advance our knowledge of the conformational swapping of N required for the highly regulated paramyxovirus replication. IMPORTANCEParamyxovirus replication is regulated by the interaction of P with N and L proteins. Here, we report the crystal structure of unassembled parainfluenza virus 5 (PIV5) N chaperoned with P peptide. Our results provide a detailed understanding of the binding of P to N. The conformational switching of N between closed and open forms during its initial interaction with P, as well as during RNA release, was analyzed. Our data also show the plasticity of the CTD and the importance of domain movement for conformational switching. The results improve our understanding of the mechanism of interchanging N conformations for RNA replication and release.« less

  9. Structure of the Paramyxovirus Parainfluenza Virus 5 Nucleoprotein in Complex with an Amino-Terminal Peptide of the Phosphoprotein.

    PubMed

    Aggarwal, Megha; Leser, George P; Kors, Christopher A; Lamb, Robert A

    2018-03-01

    Parainfluenza virus 5 (PIV5) belongs to the family Paramyxoviridae , which consists of enveloped viruses with a nonsegmented negative-strand RNA genome encapsidated by the nucleoprotein (N). Paramyxovirus replication is regulated by the phosphoprotein (P) through protein-protein interactions with N and the RNA polymerase (L). The chaperone activity of P is essential to maintain the unassembled RNA-free form of N in order to prevent nonspecific RNA binding and premature N oligomerization. Here, we determined the crystal structure of unassembled PIV5 N in complex with a P peptide (N 0 P) derived from the N terminus of P (P50) at 2.65 Å. The PIV5 N 0 P consists of two domains: an N-terminal domain (NTD) and a C-terminal domain (CTD) separated by a hinge region. The cleft at the hinge region of RNA-bound PIV5 N was previously shown to be an RNA binding site. The N 0 P structure shows that the P peptide binds to the CTD of N and extends toward the RNA binding site to inhibit N oligomerization and, hence, RNA binding. Binding of P peptide also keeps the PIV5 N in the open form. A molecular dynamics (MD) analysis of both the open and closed forms of N shows the flexibility of the CTD and the preference of the N protein to be in an open conformation. The gradual opening of the hinge region, to release the RNA, was also observed. Together, these results advance our knowledge of the conformational swapping of N required for the highly regulated paramyxovirus replication. IMPORTANCE Paramyxovirus replication is regulated by the interaction of P with N and L proteins. Here, we report the crystal structure of unassembled parainfluenza virus 5 (PIV5) N chaperoned with P peptide. Our results provide a detailed understanding of the binding of P to N. The conformational switching of N between closed and open forms during its initial interaction with P, as well as during RNA release, was analyzed. Our data also show the plasticity of the CTD and the importance of domain movement for conformational switching. The results improve our understanding of the mechanism of interchanging N conformations for RNA replication and release. Copyright © 2018 American Society for Microbiology.

  10. Structural Requirement in Clostridium perfringens Collagenase mRNA 5′ Leader Sequence for Translational Induction through Small RNA-mRNA Base Pairing

    PubMed Central

    Nomura, Nobuhiko; Nakamura, Kouji

    2013-01-01

    The Gram-positive anaerobic bacterium Clostridium perfringens is pathogenic to humans and animals, and the production of its toxins is strictly regulated during the exponential phase. We recently found that the 5′ leader sequence of the colA transcript encoding collagenase, which is a major toxin of this organism, is processed and stabilized in the presence of the small RNA VR-RNA. The primary colA 5′-untranslated region (5′UTR) forms a long stem-loop structure containing an internal bulge and masks its own ribosomal binding site. Here we found that VR-RNA directly regulates colA expression through base pairing with colA mRNA in vivo. However, when the internal bulge structure was closed by point mutations in colA mRNA, translation ceased despite the presence of VR-RNA. In addition, a mutation disrupting the colA stem-loop structure induced mRNA processing and ColA-FLAG translational activation in the absence of VR-RNA, indicating that the stem-loop and internal bulge structure of the colA 5′ leader sequence is important for regulation by VR-RNA. On the other hand, processing was required for maximal ColA expression but was not essential for VR-RNA-dependent colA regulation. Finally, colA processing and translational activation were induced at a high temperature without VR-RNA. These results suggest that inhibition of the colA 5′ leader structure through base pairing is the primary role of VR-RNA in colA regulation and that the colA 5′ leader structure is a possible thermosensor. PMID:23585542

  11. RNA chaperoning and intrinsic disorder in the core proteins of Flaviviridae.

    PubMed

    Ivanyi-Nagy, Roland; Lavergne, Jean-Pierre; Gabus, Caroline; Ficheux, Damien; Darlix, Jean-Luc

    2008-02-01

    RNA chaperone proteins are essential partners of RNA in living organisms and viruses. They are thought to assist in the correct folding and structural rearrangements of RNA molecules by resolving misfolded RNA species in an ATP-independent manner. RNA chaperoning is probably an entropy-driven process, mediated by the coupled binding and folding of intrinsically disordered protein regions and the kinetically trapped RNA. Previously, we have shown that the core protein of hepatitis C virus (HCV) is a potent RNA chaperone that can drive profound structural modifications of HCV RNA in vitro. We now examined the RNA chaperone activity and the disordered nature of core proteins from different Flaviviridae genera, namely that of HCV, GBV-B (GB virus B), WNV (West Nile virus) and BVDV (bovine viral diarrhoea virus). Despite low-sequence similarities, all four proteins demonstrated general nucleic acid annealing and RNA chaperone activities. Furthermore, heat resistance of core proteins, as well as far-UV circular dichroism spectroscopy suggested that a well-defined 3D protein structure is not necessary for core-induced RNA structural rearrangements. These data provide evidence that RNA chaperoning-possibly mediated by intrinsically disordered protein segments-is conserved in Flaviviridae core proteins. Thus, besides nucleocapsid formation, core proteins may function in RNA structural rearrangements taking place during virus replication.

  12. RNA chaperoning and intrinsic disorder in the core proteins of Flaviviridae

    PubMed Central

    Ivanyi-Nagy, Roland; Lavergne, Jean-Pierre; Gabus, Caroline; Ficheux, Damien; Darlix, Jean-Luc

    2008-01-01

    RNA chaperone proteins are essential partners of RNA in living organisms and viruses. They are thought to assist in the correct folding and structural rearrangements of RNA molecules by resolving misfolded RNA species in an ATP-independent manner. RNA chaperoning is probably an entropy-driven process, mediated by the coupled binding and folding of intrinsically disordered protein regions and the kinetically trapped RNA. Previously, we have shown that the core protein of hepatitis C virus (HCV) is a potent RNA chaperone that can drive profound structural modifications of HCV RNA in vitro. We now examined the RNA chaperone activity and the disordered nature of core proteins from different Flaviviridae genera, namely that of HCV, GBV-B (GB virus B), WNV (West Nile virus) and BVDV (bovine viral diarrhoea virus). Despite low-sequence similarities, all four proteins demonstrated general nucleic acid annealing and RNA chaperone activities. Furthermore, heat resistance of core proteins, as well as far-UV circular dichroism spectroscopy suggested that a well-defined 3D protein structure is not necessary for core-induced RNA structural rearrangements. These data provide evidence that RNA chaperoning—possibly mediated by intrinsically disordered protein segments—is conserved in Flaviviridae core proteins. Thus, besides nucleocapsid formation, core proteins may function in RNA structural rearrangements taking place during virus replication. PMID:18033802

  13. Proteomic Analysis of the Arabidopsis Nucleolus Suggests Novel Nucleolar FunctionsD⃞

    PubMed Central

    Pendle, Alison F.; Clark, Gillian P.; Boon, Reinier; Lewandowska, Dominika; Lam, Yun Wah; Andersen, Jens; Mann, Matthias; Lamond, Angus I.; Brown, John W. S.; Shaw, Peter J.

    2005-01-01

    The eukaryotic nucleolus is involved in ribosome biogenesis and a wide range of other RNA metabolism and cellular functions. An important step in the functional analysis of the nucleolus is to determine the complement of proteins of this nuclear compartment. Here, we describe the first proteomic analysis of plant (Arabidopsis thaliana) nucleoli, in which we have identified 217 proteins. This allows a direct comparison of the proteomes of an important nuclear structure between two widely divergent species: human and Arabidopsis. The comparison identified many common proteins, plant-specific proteins, proteins of unknown function found in both proteomes, and proteins that were nucleolar in plants but nonnucleolar in human. Seventy-two proteins were expressed as GFP fusions and 87% showed nucleolar or nucleolar-associated localization. In a striking and unexpected finding, we have identified six components of the postsplicing exon-junction complex (EJC) involved in mRNA export and nonsense-mediated decay (NMD)/mRNA surveillance. This association was confirmed by GFP-fusion protein localization. These results raise the possibility that in plants, nucleoli may have additional functions in mRNA export or surveillance. PMID:15496452

  14. Fluorescent in situ sequencing (FISSEQ) of RNA for gene expression profiling in intact cells and tissues

    PubMed Central

    Lee, Je Hyuk; Daugharthy, Evan R.; Scheiman, Jonathan; Kalhor, Reza; Ferrante, Thomas C.; Terry, Richard; Turczyk, Brian M.; Yang, Joyce L.; Lee, Ho Suk; Aach, John; Zhang, Kun; Church, George M.

    2014-01-01

    RNA sequencing measures the quantitative change in gene expression over the whole transcriptome, but it lacks spatial context. On the other hand, in situ hybridization provides the location of gene expression, but only for a small number of genes. Here we detail a protocol for genome-wide profiling of gene expression in situ in fixed cells and tissues, in which RNA is converted into cross-linked cDNA amplicons and sequenced manually on a confocal microscope. Unlike traditional RNA-seq our method enriches for context-specific transcripts over house-keeping and/or structural RNA, and it preserves the tissue architecture for RNA localization studies. Our protocol is written for researchers experienced in cell microscopy with minimal computing skills. Library construction and sequencing can be completed within 14 d, with image analysis requiring an additional 2 d. PMID:25675209

  15. A Method to Predict the Structure and Stability of RNA/RNA Complexes.

    PubMed

    Xu, Xiaojun; Chen, Shi-Jie

    2016-01-01

    RNA/RNA interactions are essential for genomic RNA dimerization and regulation of gene expression. Intermolecular loop-loop base pairing is a widespread and functionally important tertiary structure motif in RNA machinery. However, computational prediction of intermolecular loop-loop base pairing is challenged by the entropy and free energy calculation due to the conformational constraint and the intermolecular interactions. In this chapter, we describe a recently developed statistical mechanics-based method for the prediction of RNA/RNA complex structures and stabilities. The method is based on the virtual bond RNA folding model (Vfold). The main emphasis in the method is placed on the evaluation of the entropy and free energy for the loops, especially tertiary kissing loops. The method also uses recursive partition function calculations and two-step screening algorithm for large, complicated structures of RNA/RNA complexes. As case studies, we use the HIV-1 Mal dimer and the siRNA/HIV-1 mutant (T4) to illustrate the method.

  16. Synthesizing topological structures containing RNA

    NASA Astrophysics Data System (ADS)

    Liu, Di; Shao, Yaming; Chen, Gang; Tse-Dinh, Yuk-Ching; Piccirilli, Joseph A.; Weizmann, Yossi

    2017-03-01

    Though knotting and entanglement have been observed in DNA and proteins, their existence in RNA remains an enigma. Synthetic RNA topological structures are significant for understanding the physical and biological properties pertaining to RNA topology, and these properties in turn could facilitate identifying naturally occurring topologically nontrivial RNA molecules. Here we show that topological structures containing single-stranded RNA (ssRNA) free of strong base pairing interactions can be created either by configuring RNA-DNA hybrid four-way junctions or by template-directed synthesis with a single-stranded DNA (ssDNA) topological structure. By using a constructed ssRNA knot as a highly sensitive topological probe, we find that Escherichia coli DNA topoisomerase I has low RNA topoisomerase activity and that the R173A point mutation abolishes the unknotting activity for ssRNA, but not for ssDNA. Furthermore, we discover the topological inhibition of reverse transcription (RT) and obtain different RT-PCR patterns for an ssRNA knot and circle of the same sequence.

  17. Novel Structure and Unexpected RNA-Binding Ability of the C-Terminal Domain of Herpes Simplex Virus 1 Tegument Protein UL21

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Metrick, Claire M.; Heldwein, Ekaterina E.; Sandri-Goldin, R. M.

    Proteins forming the tegument layers of herpesviral virions mediate many essential processes in the viral replication cycle, yet few have been characterized in detail. UL21 is one such multifunctional tegument protein and is conserved among alphaherpesviruses. While UL21 has been implicated in many processes in viral replication, ranging from nuclear egress to virion morphogenesis to cell-cell spread, its precise roles remain unclear. Here we report the 2.7-Å crystal structure of the C-terminal domain of herpes simplex virus 1 (HSV-1) UL21 (UL21C), which has a unique α-helical fold resembling a dragonfly. Analysis of evolutionary conservation patterns and surface electrostatics pinpointed fourmore » regions of potential functional importance on the surface of UL21C to be pursued by mutagenesis. In combination with the previously determined structure of the N-terminal domain of UL21, the structure of UL21C provides a 3-dimensional framework for targeted exploration of the multiple roles of UL21 in the replication and pathogenesis of alphaherpesviruses. Additionally, we describe an unanticipated ability of UL21 to bind RNA, which may hint at a yet unexplored function. IMPORTANCEDue to the limited genomic coding capacity of viruses, viral proteins are often multifunctional, which makes them attractive antiviral targets. Such multifunctionality, however, complicates their study, which often involves constructing and characterizing null mutant viruses. Systematic exploration of these multifunctional proteins requires detailed road maps in the form of 3-dimensional structures. In this work, we determined the crystal structure of the C-terminal domain of UL21, a multifunctional tegument protein that is conserved among alphaherpesviruses. Structural analysis pinpointed surface areas of potential functional importance that provide a starting point for mutagenesis. In addition, the unexpected RNA-binding ability of UL21 may expand its functional repertoire. The structure of UL21C and the observation of its RNA-binding ability are the latest additions to the navigational chart that can guide the exploration of the multiple functions of UL21.« less

  18. Optimal packaging of FIV genomic RNA depends upon a conserved long-range interaction and a palindromic sequence within gag.

    PubMed

    Rizvi, Tahir A; Kenyon, Julia C; Ali, Jahabar; Aktar, Suriya J; Phillip, Pretty S; Ghazawi, Akela; Mustafa, Farah; Lever, Andrew M L

    2010-10-15

    The feline immunodeficiency virus (FIV) is a lentivirus that is related to human immunodeficiency virus (HIV), causing a similar pathology in cats. It is a potential small animal model for AIDS and the FIV-based vectors are also being pursued for human gene therapy. Previous studies have mapped the FIV packaging signal (ψ) to two or more discontinuous regions within the 5' 511 nt of the genomic RNA and structural analyses have determined its secondary structure. The 5' and 3' sequences within ψ region interact through extensive long-range interactions (LRIs), including a conserved heptanucleotide interaction between R/U5 and gag. Other secondary structural elements identified include a conserved 150 nt stem-loop (SL2) and a small palindromic stem-loop within gag open reading frame that might act as a viral dimerization initiation site. We have performed extensive mutational analysis of these sequences and structures and ascertained their importance in FIV packaging using a trans-complementation assay. Disrupting the conserved heptanucleotide LRI to prevent base pairing between R/U5 and gag reduced packaging by 2.8-5.5 fold. Restoration of pairing using an alternative, non-wild type (wt) LRI sequence restored RNA packaging and propagation to wt levels, suggesting that it is the structure of the LRI, rather than its sequence, that is important for FIV packaging. Disrupting the palindrome within gag reduced packaging by 1.5-3-fold, but substitution with a different palindromic sequence did not restore packaging completely, suggesting that the sequence of this region as well as its palindromic nature is important. Mutation of individual regions of SL2 did not have a pronounced effect on FIV packaging, suggesting that either it is the structure of SL2 as a whole that is necessary for optimal packaging, or that there is redundancy within this structure. The mutational analysis presented here has further validated the previously predicted RNA secondary structure of FIV ψ. Copyright © 2010 Elsevier Ltd. All rights reserved.

  19. A novel abundant family of retroposed elements (DAS-SINEs) in the nine-banded armadillo (Dasypus novemcinctus).

    PubMed

    Churakov, Gennady; Smit, Arian F A; Brosius, Jürgen; Schmitz, Jürgen

    2005-04-01

    About half of the mammalian genome is composed of retroposons. Long interspersed elements (LINEs) and short interspersed elements (SINEs) are the most abundant repetitive elements and account for about 21% and 13% of the human genome, respectively. SINEs have been detected in all major mammalian lineages, except for the South American order Xenarthra, also termed Edentata (armadillos, anteaters, and sloths). Investigating this order, we discovered a novel high-copy-number family of tRNA derived SINEs in the nine-banded armadillo Dasypus novemcinctus, a species that successfully crossed the Central American land bridge to North America in the Pliocene. A specific computer algorithm was developed, and we detected and extracted 687 specific SINEs from databases. Termed DAS-SINEs, we further divided them into six distinct subfamilies. We extracted tRNA(Ala)-derived monomers, two types of dimers, and three subfamilies of chimeric fusion products of a tRNA(Ala) domain and an approximately 180-nt sequence of thus far unidentified origin. Comparisons of secondary structures of the DAS-SINEs' tRNA domains suggest selective pressure to maintain a tRNA-like D-arm structure in the respective founder RNAs, as shown by compensatory mutations. By analysis of subfamily-specific genetic variability, comparison of the proportion of direct repeats, and analysis of self-integrations as well as key events of dimerization and deletions or insertions, we were able to delineate the evolutionary history of the DAS-SINE subfamilies.

  20. Reverse Transcription of a Self-Primed Retrotransposon Requires an RNA Structure Similar to the U5-IR Stem-Loop of Retroviruses

    PubMed Central

    Lin, Jia-Hwei; Levin, Henry L.

    1998-01-01

    An inverted repeat (IR) within the U5 region of the Rous sarcoma virus (RSV) mRNA forms a structure composed of a 7-bp stem and a 5-nucleotide (nt) loop. This U5-IR structure has been shown to be required for the initiation of reverse transcription. The mRNA of Tf1, long terminal repeat-containing retrotransposon from fission yeast (Schizosaccharomyces pombe) contains nucleotides with the potential to form a U5-IR stem-loop that is strikingly similar to that of RSV. The putative U5-IR stem-loop of Tf1 consists of a 7-bp stem and a 25-nt loop. Results from mutagenesis studies indicate that the U5-IR stem-loop in the mRNA of Tf1 does form and that it is required for Tf1 transposition. Although the loop is required for transposition, we were surprised that the specific sequence of the nucleotides within the loop was unimportant for function. Additional investigation indicates that the loss of transposition activity due to a reduction in the loop size to 6 nt could be rescued by increasing the GC content of the stem. This result indicates that the large loop in the Tf1 mRNA relative to that of the RSV allows the formation of the relatively weak U5-IR stem. The levels of Tf1 proteins expressed and the amounts of Tf1 RNA packaged into the virus-like particles were not affected by mutations in the U5-IR structure. However, all of the mutations in the U5-IR structure that caused defects in transposition produced low amounts of reverse transcripts. A unique feature in the initiation of Tf1 reverse transcription is that, instead of a tRNA, the first 11 nt of the Tf1 mRNA serve as the minus-strand primer. Analysis of the 5′ end of Tf1 mRNA revealed that the mutations in the U5-IR stem-loop that resulted in defects in reverse transcription caused a reduction in the cleavage activity required to generate the Tf1 primer. Our results indicate that the U5-IR stems of Tf1 and RSV are conserved in size, position, and function. PMID:9774699

Top